2023-11-26 09:35:18   INFO  **********************Start logging**********************
2023-11-26 09:35:18   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-11-26 09:35:18   INFO  total_batch_size: 24
2023-11-26 09:35:18   INFO  cfg_file         cfgs/picture_models/picture_waymo_detection.yaml
2023-11-26 09:35:18   INFO  batch_size       3
2023-11-26 09:35:18   INFO  epochs           24
2023-11-26 09:35:18   INFO  workers          4
2023-11-26 09:35:18   INFO  extra_tag        detection
2023-11-26 09:35:18   INFO  ckpt             None
2023-11-26 09:35:18   INFO  pretrained_model waymo_pretrain_model.pth
2023-11-26 09:35:18   INFO  launcher         pytorch
2023-11-26 09:35:18   INFO  tcp_port         18888
2023-11-26 09:35:18   INFO  sync_bn          True
2023-11-26 09:35:18   INFO  fix_random_seed  False
2023-11-26 09:35:18   INFO  ckpt_save_interval 1
2023-11-26 09:35:18   INFO  local_rank       0
2023-11-26 09:35:18   INFO  max_ckpt_save_num 30
2023-11-26 09:35:18   INFO  merge_all_iters_to_one_epoch False
2023-11-26 09:35:18   INFO  set_cfgs         None
2023-11-26 09:35:18   INFO  max_waiting_mins 0
2023-11-26 09:35:18   INFO  start_epoch      0
2023-11-26 09:35:18   INFO  num_epochs_to_eval 0
2023-11-26 09:35:18   INFO  save_to_file     False
2023-11-26 09:35:18   INFO  use_tqdm_to_record False
2023-11-26 09:35:18   INFO  logger_iter_interval 50
2023-11-26 09:35:18   INFO  ckpt_save_time_interval 300
2023-11-26 09:35:18   INFO  wo_gpu_stat      False
2023-11-26 09:35:18   INFO  fp16             False
2023-11-26 09:35:18   INFO  cfg.LOCAL_RANK: 0
2023-11-26 09:35:18   INFO  cfg.CLASS_NAMES: ['Vehicle', 'Pedestrian', 'Cyclist']
2023-11-26 09:35:18   INFO  
cfg.DATA_CONFIG = edict()
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATASET: WaymoDataset
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/waymo
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.PROCESSED_DATA_TAG: waymo_processed_data_v0_5_0
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-74.88, -74.88, -2, 74.88, 74.88, 4.0]
2023-11-26 09:35:18   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-11-26 09:35:18   INFO  
cfg.DATA_CONFIG.SAMPLED_INTERVAL = edict()
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.train: 1
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.test: 1
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.FILTER_EMPTY_BOXES_FOR_TRAIN: True
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DISABLE_NLZ_FLAG_ON_POINTS: True
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.USE_SHARED_MEMORY: False
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.SHARED_MEMORY_FILE_LIMIT: 35000
2023-11-26 09:35:18   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.AUG_CONFIG_LIST: [{'NAME': 'gt_sampling', 'USE_ROAD_PLANE': False, 'DB_INFO_PATH': ['waymo_processed_data_v0_5_0_waymo_dbinfos_train_sampled_1.pkl'], 'USE_SHARED_MEMORY': True, 'DB_DATA_PATH': ['waymo_processed_data_v0_5_0_gt_database_train_sampled_1_global.npy'], 'BACKUP_DB_INFO': {'DB_INFO_PATH': 'waymo_processed_data_v0_5_0_waymo_dbinfos_train_sampled_1_multiframe_-4_to_0.pkl', 'DB_DATA_PATH': 'waymo_processed_data_v0_5_0_gt_database_train_sampled_1_multiframe_-4_to_0_global.npy', 'NUM_POINT_FEATURES': 6}, 'PREPARE': {'filter_by_min_points': ['Vehicle:5', 'Pedestrian:10', 'Cyclist:10'], 'filter_by_difficulty': [-1]}, 'SAMPLE_GROUPS': ['Vehicle:15', 'Pedestrian:10', 'Cyclist:10'], 'NUM_POINT_FEATURES': 5, 'REMOVE_EXTRA_WIDTH': [0.0, 0.0, 0.0], 'LIMIT_WHOLE_SCENE': True}, {'NAME': 'random_world_flip', 'ALONG_AXIS_LIST': ['x', 'y']}, {'NAME': 'random_world_rotation', 'WORLD_ROT_ANGLE': [-0.78539816, 0.78539816]}, {'NAME': 'random_world_scaling', 'WORLD_SCALE_RANGE': [0.95, 1.05]}, {'NAME': 'random_world_translation', 'NOISE_TRANSLATE_STD': [0.5, 0.5, 0.5]}]
2023-11-26 09:35:18   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_range', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.32, 0.32, 0.1875]}]
2023-11-26 09:35:18   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/waymo_dataset.yaml
2023-11-26 09:35:18   INFO  
cfg.MODEL = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.NAME: CenterPoint
2023-11-26 09:35:18   INFO  
cfg.MODEL.VFE = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.VFE.NAME: DynPillarVFE3D
2023-11-26 09:35:18   INFO  cfg.MODEL.VFE.WITH_DISTANCE: False
2023-11-26 09:35:18   INFO  cfg.MODEL.VFE.USE_ABSLOTE_XYZ: True
2023-11-26 09:35:18   INFO  cfg.MODEL.VFE.USE_NORM: True
2023-11-26 09:35:18   INFO  cfg.MODEL.VFE.NUM_FILTERS: [192, 192]
2023-11-26 09:35:18   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVT
2023-11-26 09:35:18   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 468, 468, 32 ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 192, 192, 192, 192 ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [2, 2, 1]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-11-26 09:35:18   INFO  
cfg.MODEL.BACKBONE_3D.MASK_CONFIG = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 192, 192, 192, 192 ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [468, 468]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 192
2023-11-26 09:35:18   INFO  
cfg.MODEL.MAP_TO_BEV = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.MAP_TO_BEV.NAME: PointPillarScatter3d
2023-11-26 09:35:18   INFO  cfg.MODEL.MAP_TO_BEV.INPUT_SHAPE: [468, 468, 1]
2023-11-26 09:35:18   INFO  cfg.MODEL.MAP_TO_BEV.NUM_BEV_FEATURES: 192
2023-11-26 09:35:18   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.NAME: BaseBEVResBackbone
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.LAYER_NUMS: [1, 2, 2]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.LAYER_STRIDES: [1, 2, 2]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.NUM_FILTERS: [128, 128, 256]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.UPSAMPLE_STRIDES: [1, 2, 4]
2023-11-26 09:35:18   INFO  cfg.MODEL.BACKBONE_2D.NUM_UPSAMPLE_FILTERS: [128, 128, 128]
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.NAME: CenterHead
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.CLASS_NAMES_EACH_HEAD: [['Vehicle', 'Pedestrian', 'Cyclist']]
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SHARED_CONV_CHANNEL: 64
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.USE_BIAS_BEFORE_NORM: False
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.NUM_HM_CONV: 2
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.BN_EPS: 0.001
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.BN_MOM: 0.01
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_ORDER: ['center', 'center_z', 'dim', 'rot']
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT = edict()
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center.out_channels: 2
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center.num_conv: 2
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z.out_channels: 1
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z.num_conv: 2
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim.out_channels: 3
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim.num_conv: 2
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot.out_channels: 2
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot.num_conv: 2
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou.out_channels: 1
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou.num_conv: 2
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.FEATURE_MAP_STRIDE: 1
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.NUM_MAX_OBJS: 500
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.GAUSSIAN_OVERLAP: 0.1
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.MIN_RADIUS: 2
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.IOU_REG_LOSS: True
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.LOSS_CONFIG = edict()
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.cls_weight: 1.0
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.loc_weight: 2.0
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.code_weights: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0]
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.POST_PROCESSING = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.SCORE_THRESH: 0.1
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.POST_CENTER_LIMIT_RANGE: [-80, -80, -10.0, 80, 80, 10.0]
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.MAX_OBJ_PER_SAMPLE: 500
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.USE_IOU_TO_RECTIFY_SCORE: True
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.IOU_RECTIFIER: [0.68, 0.71, 0.65]
2023-11-26 09:35:18   INFO  
cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_TYPE: multi_class_nms
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_THRESH: [0.7, 0.6, 0.55]
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_PRE_MAXSIZE: [4096, 4096, 4096]
2023-11-26 09:35:18   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_POST_MAXSIZE: [500, 500, 500]
2023-11-26 09:35:18   INFO  
cfg.MODEL.POST_PROCESSING = edict()
2023-11-26 09:35:18   INFO  cfg.MODEL.POST_PROCESSING.RECALL_THRESH_LIST: [0.3, 0.5, 0.7]
2023-11-26 09:35:18   INFO  cfg.MODEL.POST_PROCESSING.EVAL_METRIC: waymo
2023-11-26 09:35:18   INFO  
cfg.OPTIMIZATION = edict()
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 3
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 24
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.LR: 0.001
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.PCT_START: 0.1
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 100
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 10
2023-11-26 09:35:18   INFO  cfg.OPTIMIZATION.LOSS_SCALE_FP16: 32.0
2023-11-26 09:35:18   INFO  
cfg.HOOK = edict()
2023-11-26 09:35:18   INFO  
cfg.HOOK.DisableAugmentationHook = edict()
2023-11-26 09:35:18   INFO  cfg.HOOK.DisableAugmentationHook.DISABLE_AUG_LIST: ['gt_sampling', 'random_world_flip', 'random_world_rotation', 'random_world_scaling', 'random_world_translation']
2023-11-26 09:35:18   INFO  cfg.HOOK.DisableAugmentationHook.NUM_LAST_EPOCHS: 1
2023-11-26 09:35:18   INFO  cfg.TAG: picture_waymo_detection
2023-11-26 09:35:18   INFO  cfg.EXP_GROUP_PATH: cfgs/picture_model
2023-11-26 09:35:22   INFO  Database filter by min points Vehicle: 1194364 => 1019919
2023-11-26 09:35:22   INFO  Database filter by min points Pedestrian: 1114091 => 861165
2023-11-26 09:35:22   INFO  Database filter by min points Cyclist: 53344 => 45157
2023-11-26 09:35:22   INFO  Database filter by difficulty Vehicle: 1019919 => 1019919
2023-11-26 09:35:22   INFO  Database filter by difficulty Pedestrian: 861165 => 861165
2023-11-26 09:35:22   INFO  Database filter by difficulty Cyclist: 45157 => 45157
2023-11-26 09:35:25   INFO  Loading GT database to shared memory
2023-11-26 09:35:25   INFO  GT database has been saved to shared memory
2023-11-26 09:35:25   INFO  Loading Waymo dataset
2023-11-26 09:35:31   INFO  Total skipped info 0
2023-11-26 09:35:31   INFO  Total samples for Waymo dataset: 158081
2023-11-26 09:35:31   INFO  DistributedDataParallel(
  (module): CenterPoint(
    (vfe): DynamicPillarVFE_3d(
      (pfn_layers): ModuleList(
        (0): PFNLayerV2(
          (linear): Linear(in_features=11, out_features=96, bias=False)
          (norm): SyncBatchNorm(96, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
        (1): PFNLayerV2(
          (linear): Linear(in_features=192, out_features=192, bias=False)
          (norm): SyncBatchNorm(192, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
      )
    )
    (backbone_3d): DSVTBackboneMAE(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (2): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (3): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (2): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (3): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (3): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
    )
  )
    (map_to_bev_module): PointPillarScatter3d()
    (pfe): None
    (backbone_2d): BaseBEVResBackbone(
      (blocks): ModuleList(
        (0): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(192, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(192, 128, kernel_size=(1, 1), stride=(1, 1), bias=False)
              (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (1): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(128, 128, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (2): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(128, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(128, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
      )
      (deblocks): ModuleList(
        (0): Sequential(
          (0): ConvTranspose2d(128, 128, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (1): Sequential(
          (0): ConvTranspose2d(128, 128, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (2): Sequential(
          (0): ConvTranspose2d(256, 128, kernel_size=(4, 4), stride=(4, 4), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
      )
    )
    (dense_head): CenterHead(
      (shared_conv): Sequential(
        (0): Conv2d(384, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
        (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
      )
      (heads_list): ModuleList(
        (0): SeparateHead(
          (center): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 2, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (center_z): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 1, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (dim): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (rot): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 2, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (iou): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 1, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (hm): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
        )
      )
      (hm_loss_func): FocalLossCenterNet()
      (reg_loss_func): RegLossCenterNet()
    )
    (point_head): None
    (roi_head): None
  )
)
2023-11-26 09:36:09   INFO  Total number of parameters: 9236651
2023-11-26 09:36:09   INFO  **********************Start training cfgs/picture_model/picture_waymo_detection(detection)**********************
2023-11-26 09:37:15   INFO  epoch: 0/24, acc_iter=50, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:10:43, time_cost(all): 0:00:57/2 days, 2:15:22, loss=3.131384386141653, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=4.854509765938582, lr=0.001142325793229088
2023-11-26 09:38:13   INFO  epoch: 0/24, acc_iter=100, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:00:59, time_cost(all): 0:01:55/2 days, 4:11:43, loss=2.998153628976303, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=1.775680346014438, lr=0.001284651586458175
2023-11-26 09:39:11   INFO  epoch: 0/24, acc_iter=150, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:23, time_cost(all): 0:02:53/2 days, 1:37:23, loss=2.864922871810953, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=3.8736335300305464, lr=0.001426977379687263
2023-11-26 09:40:09   INFO  epoch: 0/24, acc_iter=200, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:07:48, time_cost(all): 0:03:51/2 days, 3:20:36, loss=2.731692114645603, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=2.8982095813684197, lr=0.00156930317291635
2023-11-26 09:41:06   INFO  epoch: 0/24, acc_iter=250, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:05:04, time_cost(all): 0:04:48/2 days, 3:53:36, loss=2.598461357480252, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=1.209525247786658, lr=0.001711628966145438
2023-11-26 09:42:04   INFO  epoch: 0/24, acc_iter=300, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:57:29, time_cost(all): 0:05:46/2 days, 4:26:06, loss=2.465230600314902, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=2.136965400218819, lr=0.001853954759374526
2023-11-26 09:43:02   INFO  epoch: 0/24, acc_iter=350, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:02:56, time_cost(all): 0:06:44/2 days, 4:36:38, loss=2.331999843149552, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.4866993422300463, lr=0.001996280552603613
2023-11-26 09:44:00   INFO  epoch: 0/24, acc_iter=400, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:55:51, time_cost(all): 0:07:42/2 days, 4:01:39, loss=2.198769085984202, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=1.5292735778077444, lr=0.002138606345832701
2023-11-26 09:44:57   INFO  epoch: 0/24, acc_iter=450, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:01:54, time_cost(all): 0:08:39/2 days, 3:09:17, loss=2.065538328818852, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.3951900464696636, lr=0.002280932139061788
2023-11-26 09:45:55   INFO  epoch: 0/24, acc_iter=500, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:57:19, time_cost(all): 0:09:37/2 days, 4:54:30, loss=1.932307571653502, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=0.6663517911540457, lr=0.002423257932290876
2023-11-26 09:46:53   INFO  epoch: 0/24, acc_iter=550, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:53:22, time_cost(all): 0:10:35/2 days, 4:40:56, loss=1.799076814488151, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=4.339371041079172, lr=0.002565583725519963
2023-11-26 09:47:51   INFO  epoch: 0/24, acc_iter=600, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:53:52, time_cost(all): 0:11:33/2 days, 0:28:05, loss=1.665846057322801, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.0318888876621863, lr=0.002707909518749051
2023-11-26 09:48:48   INFO  epoch: 0/24, acc_iter=650, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:53:59, time_cost(all): 0:12:30/2 days, 2:41:56, loss=1.532615300157451, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.801099427092617, lr=0.002850235311978139
2023-11-26 09:49:46   INFO  epoch: 0/24, acc_iter=700, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:54:16, time_cost(all): 0:13:28/2 days, 2:26:26, loss=1.399384542992101, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=4.72978907492077, lr=0.002992561105207227
2023-11-26 09:50:44   INFO  epoch: 0/24, acc_iter=750, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:53:56, time_cost(all): 0:14:26/2 days, 2:12:22, loss=1.266153785826751, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=4.8265291423192025, lr=0.003134886898436314
2023-11-26 09:51:42   INFO  epoch: 0/24, acc_iter=800, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:54:35, time_cost(all): 0:15:24/2 days, 4:51:43, loss=1.132923028661401, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=0.8173126137052796, lr=0.003277212691665402
2023-11-26 09:52:39   INFO  epoch: 0/24, acc_iter=850, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:50:08, time_cost(all): 0:16:21/1 day, 23:57:20, loss=0.999692271496051, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.594026182909083, lr=0.003419538484894489
2023-11-26 09:53:37   INFO  epoch: 0/24, acc_iter=900, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:53:08, time_cost(all): 0:17:19/2 days, 1:52:56, loss=0.8664615143307, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=4.663121654303959, lr=0.003561864278123577
2023-11-26 09:54:35   INFO  epoch: 0/24, acc_iter=950, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:44:57, time_cost(all): 0:18:17/2 days, 4:17:10, loss=0.73323075716535, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.7505200664946092, lr=0.003704190071352665
2023-11-26 09:55:33   INFO  epoch: 0/24, acc_iter=1000, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:51:39, time_cost(all): 0:19:15/1 day, 23:56:47, loss=0.627709515150872, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.2033430904537907, lr=0.003846515864581752
2023-11-26 09:56:30   INFO  epoch: 0/24, acc_iter=1050, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:46:46, time_cost(all): 0:20:12/2 days, 1:16:39, loss=0.599892457639897, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=3.5573818971130056, lr=0.00398884165781084
2023-11-26 09:57:28   INFO  epoch: 0/24, acc_iter=1100, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:42:30, time_cost(all): 0:21:10/2 days, 2:19:18, loss=0.599784915279795, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=0.7216571525337829, lr=0.004131167451039927
2023-11-26 09:58:26   INFO  epoch: 0/24, acc_iter=1150, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:49:42, time_cost(all): 0:22:08/2 days, 4:51:51, loss=0.599677372919692, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=4.2452030097156115, lr=0.004273493244269014
2023-11-26 09:59:24   INFO  epoch: 0/24, acc_iter=1200, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:48:30, time_cost(all): 0:23:06/2 days, 2:02:11, loss=0.599569830559589, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=1.2547834232846493, lr=0.004415819037498102
2023-11-26 10:00:21   INFO  epoch: 0/24, acc_iter=1250, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:41:00, time_cost(all): 0:24:03/2 days, 3:19:05, loss=0.599462288199487, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=1.6650507030509565, lr=0.00455814483072719
2023-11-26 10:01:19   INFO  epoch: 0/24, acc_iter=1300, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:40:15, time_cost(all): 0:25:01/2 days, 1:39:56, loss=0.599354745839384, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.7898859299405092, lr=0.004700470623956277
2023-11-26 10:02:17   INFO  epoch: 0/24, acc_iter=1350, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:43:37, time_cost(all): 0:25:59/1 day, 23:47:03, loss=0.599247203479281, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.4213919284141046, lr=0.004842796417185365
2023-11-26 10:03:15   INFO  epoch: 0/24, acc_iter=1400, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:44:14, time_cost(all): 0:26:57/2 days, 3:06:35, loss=0.599139661119179, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.954906358605234, lr=0.004985122210414453
2023-11-26 10:04:12   INFO  epoch: 0/24, acc_iter=1450, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:38:29, time_cost(all): 0:27:54/1 day, 23:56:21, loss=0.599032118759076, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=3.5694966784538704, lr=0.005127448003643541
2023-11-26 10:05:10   INFO  epoch: 0/24, acc_iter=1500, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:36:10, time_cost(all): 0:28:52/2 days, 0:01:52, loss=0.598924576398973, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=3.3756860934968143, lr=0.005269773796872628
2023-11-26 10:06:08   INFO  epoch: 0/24, acc_iter=1550, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:37:41, time_cost(all): 0:29:50/2 days, 3:17:28, loss=0.598817034038871, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=1.6413268203201166, lr=0.005412099590101716
2023-11-26 10:07:06   INFO  epoch: 0/24, acc_iter=1600, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:39:31, time_cost(all): 0:30:48/2 days, 2:37:19, loss=0.598709491678768, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=2.0563143689857757, lr=0.005554425383330804
2023-11-26 10:08:03   INFO  epoch: 0/24, acc_iter=1650, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:32:48, time_cost(all): 0:31:45/2 days, 3:28:31, loss=0.598601949318665, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=1.5731816132737912, lr=0.005696751176559891
2023-11-26 10:09:01   INFO  epoch: 0/24, acc_iter=1700, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:32:20, time_cost(all): 0:32:43/2 days, 2:23:55, loss=0.598494406958562, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.6093577777356214, lr=0.005839076969788979
2023-11-26 10:09:59   INFO  epoch: 0/24, acc_iter=1750, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:33:06, time_cost(all): 0:33:41/2 days, 2:28:55, loss=0.59838686459846, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=1.7399248388977875, lr=0.005981402763018067
2023-11-26 10:10:57   INFO  epoch: 0/24, acc_iter=1800, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:31:20, time_cost(all): 0:34:39/1 day, 23:58:28, loss=0.598279322238357, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=1.5731142540356755, lr=0.006123728556247154
2023-11-26 10:11:54   INFO  epoch: 0/24, acc_iter=1850, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:34:03, time_cost(all): 0:35:36/2 days, 1:54:47, loss=0.598171779878254, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=1.073794130818618, lr=0.006266054349476241
2023-11-26 10:12:52   INFO  epoch: 0/24, acc_iter=1900, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:27:00, time_cost(all): 0:36:34/2 days, 2:33:31, loss=0.598064237518152, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.3567988106584463, lr=0.006408380142705329
2023-11-26 10:13:50   INFO  epoch: 0/24, acc_iter=1950, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:30:35, time_cost(all): 0:37:32/2 days, 0:25:49, loss=0.597956695158049, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.790348549690391, lr=0.006550705935934416
2023-11-26 10:14:48   INFO  epoch: 0/24, acc_iter=2000, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:25:51, time_cost(all): 0:38:30/2 days, 0:10:44, loss=0.597849152797946, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=4.544963469211183, lr=0.006693031729163504
2023-11-26 10:15:45   INFO  epoch: 0/24, acc_iter=2050, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:29:35, time_cost(all): 0:39:27/2 days, 0:29:09, loss=0.597741610437844, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=4.845746973394809, lr=0.006835357522392592
2023-11-26 10:16:43   INFO  epoch: 0/24, acc_iter=2100, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:29:50, time_cost(all): 0:40:25/2 days, 4:22:12, loss=0.597634068077741, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=1.0607698700900663, lr=0.006977683315621679
2023-11-26 10:17:41   INFO  epoch: 0/24, acc_iter=2150, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:28:17, time_cost(all): 0:41:23/2 days, 2:46:11, loss=0.597526525717638, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=3.164056042569995, lr=0.007120009108850767
2023-11-26 10:18:39   INFO  epoch: 0/24, acc_iter=2200, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:26:11, time_cost(all): 0:42:21/2 days, 3:59:55, loss=0.597418983357536, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=2.497842150142882, lr=0.007262334902079854
2023-11-26 10:19:36   INFO  epoch: 0/24, acc_iter=2250, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:23:26, time_cost(all): 0:43:18/2 days, 3:53:44, loss=0.597311440997433, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=3.8533836507571917, lr=0.007404660695308942
2023-11-26 10:20:34   INFO  epoch: 0/24, acc_iter=2300, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:30, time_cost(all): 0:44:16/2 days, 2:16:55, loss=0.59720389863733, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=0.5670760453969418, lr=0.00754698648853803
2023-11-26 10:21:32   INFO  epoch: 0/24, acc_iter=2350, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:17:39, time_cost(all): 0:45:14/2 days, 0:48:13, loss=0.597096356277228, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.9977907037482479, lr=0.007689312281767117
2023-11-26 10:22:30   INFO  epoch: 0/24, acc_iter=2400, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:21:13, time_cost(all): 0:46:12/2 days, 2:55:53, loss=0.596988813917125, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.3268383570898128, lr=0.007831638074996206
2023-11-26 10:23:27   INFO  epoch: 0/24, acc_iter=2450, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:22:02, time_cost(all): 0:47:09/2 days, 2:43:09, loss=0.596881271557022, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=3.447184598161755, lr=0.007973963868225293
2023-11-26 10:24:25   INFO  epoch: 0/24, acc_iter=2500, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:16:18, time_cost(all): 0:48:07/2 days, 3:15:18, loss=0.59677372919692, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.5952851171252052, lr=0.00811628966145438
2023-11-26 10:25:23   INFO  epoch: 0/24, acc_iter=2550, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:15:13, time_cost(all): 0:49:05/2 days, 2:03:14, loss=0.596666186836817, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=3.4427129342194784, lr=0.008258615454683468
2023-11-26 10:26:21   INFO  epoch: 0/24, acc_iter=2600, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:20:32, time_cost(all): 0:50:03/2 days, 3:52:49, loss=0.596558644476714, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=0.6923412560752853, lr=0.008400941247912555
2023-11-26 10:27:18   INFO  epoch: 0/24, acc_iter=2650, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:19:16, time_cost(all): 0:51:00/1 day, 23:48:49, loss=0.596451102116612, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=3.5643915291303374, lr=0.008543267041141642
2023-11-26 10:28:16   INFO  epoch: 0/24, acc_iter=2700, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:13:23, time_cost(all): 0:51:58/2 days, 3:54:38, loss=0.596343559756509, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.5529698159072276, lr=0.008685592834370731
2023-11-26 10:29:14   INFO  epoch: 0/24, acc_iter=2750, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:12:30, time_cost(all): 0:52:56/1 day, 23:51:39, loss=0.596236017396406, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=3.1792820567791518, lr=0.008827918627599816
2023-11-26 10:30:12   INFO  epoch: 0/24, acc_iter=2800, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:41, time_cost(all): 0:53:54/2 days, 0:35:02, loss=0.596128475036304, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.4722859540102091, lr=0.008970244420828905
2023-11-26 10:31:09   INFO  epoch: 0/24, acc_iter=2850, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:08:53, time_cost(all): 0:54:51/2 days, 1:14:46, loss=0.596020932676201, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=2.028117287914766, lr=0.009112570214057994
2023-11-26 10:32:07   INFO  epoch: 0/24, acc_iter=2900, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:09:33, time_cost(all): 0:55:49/2 days, 3:16:57, loss=0.595913390316098, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.8440215422099215, lr=0.009254896007287083
2023-11-26 10:33:05   INFO  epoch: 0/24, acc_iter=2950, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:34, time_cost(all): 0:56:47/1 day, 23:48:55, loss=0.595805847955995, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=4.585892517111612, lr=0.009397221800516168
2023-11-26 10:34:03   INFO  epoch: 0/24, acc_iter=3000, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:06, time_cost(all): 0:57:45/2 days, 4:04:31, loss=0.595698305595893, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.9904332268864837, lr=0.009539547593745257
2023-11-26 10:35:00   INFO  epoch: 0/24, acc_iter=3050, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:09:13, time_cost(all): 0:58:42/2 days, 0:25:05, loss=0.59559076323579, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=0.7091456758821045, lr=0.009681873386974345
2023-11-26 10:35:58   INFO  epoch: 0/24, acc_iter=3100, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:08:43, time_cost(all): 0:59:40/1 day, 23:50:54, loss=0.595483220875688, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=1.5103384068814076, lr=0.00982419918020343
2023-11-26 10:36:56   INFO  epoch: 0/24, acc_iter=3150, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:02:56, time_cost(all): 1:00:38/1 day, 23:51:05, loss=0.595375678515585, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=1.0198270859343186, lr=0.00996652497343252
2023-11-26 10:37:54   INFO  epoch: 0/24, acc_iter=3200, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:13, time_cost(all): 1:01:36/1 day, 23:40:46, loss=0.595268136155482, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=1.5199222091172526, lr=0.010272126916654014
2023-11-26 10:38:51   INFO  epoch: 0/24, acc_iter=3250, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:01:09, time_cost(all): 1:02:33/2 days, 2:36:32, loss=0.595160593795379, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.254089707063967, lr=0.010627941399726733
2023-11-26 10:39:49   INFO  epoch: 0/24, acc_iter=3300, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:15, time_cost(all): 1:03:31/2 days, 0:29:26, loss=0.595053051435277, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=0.7144598253016263, lr=0.010983755882799453
2023-11-26 10:40:47   INFO  epoch: 0/24, acc_iter=3350, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:15, time_cost(all): 1:04:29/2 days, 0:01:46, loss=0.594945509075174, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.9740928001870535, lr=0.011339570365872171
2023-11-26 10:41:45   INFO  epoch: 0/24, acc_iter=3400, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:59:41, time_cost(all): 1:05:27/2 days, 2:19:35, loss=0.594837966715071, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=3.0389800404725715, lr=0.01169538484894489
2023-11-26 10:42:42   INFO  epoch: 0/24, acc_iter=3450, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:59:30, time_cost(all): 1:06:24/2 days, 0:26:01, loss=0.594730424354969, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.2916564178850842, lr=0.01205119933201761
2023-11-26 10:43:40   INFO  epoch: 0/24, acc_iter=3500, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:09, time_cost(all): 1:07:22/2 days, 3:52:23, loss=0.594622881994866, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.4028342505800881, lr=0.012407013815090328
2023-11-26 10:44:38   INFO  epoch: 0/24, acc_iter=3550, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:59:18, time_cost(all): 1:08:20/2 days, 3:17:45, loss=0.594515339634763, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=3.8851790326527356, lr=0.012762828298163047
2023-11-26 10:45:36   INFO  epoch: 0/24, acc_iter=3600, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:05, time_cost(all): 1:09:18/2 days, 2:04:27, loss=0.594407797274661, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=3.7052217589336474, lr=0.013118642781235767
2023-11-26 10:46:33   INFO  epoch: 0/24, acc_iter=3650, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:59:05, time_cost(all): 1:10:15/2 days, 0:59:11, loss=0.594300254914558, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=4.33241910045423, lr=0.013474457264308485
2023-11-26 10:47:31   INFO  epoch: 0/24, acc_iter=3700, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:27, time_cost(all): 1:11:13/2 days, 3:30:19, loss=0.594192712554455, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.258316247124588, lr=0.013830271747381204
2023-11-26 10:48:29   INFO  epoch: 0/24, acc_iter=3750, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:53:36, time_cost(all): 1:12:11/2 days, 0:17:14, loss=0.594085170194353, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=3.7221122840979617, lr=0.014186086230453924
2023-11-26 10:49:27   INFO  epoch: 0/24, acc_iter=3800, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:12, time_cost(all): 1:13:09/2 days, 2:00:12, loss=0.59397762783425, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=3.812389936604319, lr=0.014541900713526642
2023-11-26 10:50:24   INFO  epoch: 0/24, acc_iter=3850, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:54:04, time_cost(all): 1:14:06/1 day, 23:01:48, loss=0.593870085474147, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=3.9581762354631, lr=0.01489771519659936
2023-11-26 10:51:22   INFO  epoch: 0/24, acc_iter=3900, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:14, time_cost(all): 1:15:04/2 days, 0:49:20, loss=0.593762543114045, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.190268513279324, lr=0.01525352967967208
2023-11-26 10:52:20   INFO  epoch: 0/24, acc_iter=3950, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:47, time_cost(all): 1:16:02/2 days, 0:27:41, loss=0.593655000753942, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=3.8137835261942, lr=0.0156093441627448
2023-11-26 10:53:18   INFO  epoch: 0/24, acc_iter=4000, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:48:41, time_cost(all): 1:17:00/2 days, 1:28:27, loss=0.593547458393839, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=4.45178206031699, lr=0.015965158645817518
2023-11-26 10:54:15   INFO  epoch: 0/24, acc_iter=4050, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:48:56, time_cost(all): 1:17:57/1 day, 23:29:01, loss=0.593439916033737, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.216684045530602, lr=0.016320973128890238
2023-11-26 10:55:13   INFO  epoch: 0/24, acc_iter=4100, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:50:11, time_cost(all): 1:18:55/1 day, 23:31:43, loss=0.593332373673634, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=3.330994184512808, lr=0.016676787611962958
2023-11-26 10:56:11   INFO  epoch: 0/24, acc_iter=4150, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:38, time_cost(all): 1:19:53/2 days, 0:38:37, loss=0.593224831313531, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=2.7231441659944036, lr=0.017032602095035675
2023-11-26 10:57:09   INFO  epoch: 0/24, acc_iter=4200, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:50, time_cost(all): 1:20:51/1 day, 23:00:32, loss=0.593117288953429, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.98(1.03), norm=1.2064459440328053, lr=0.017388416578108395
2023-11-26 10:58:06   INFO  epoch: 0/24, acc_iter=4250, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:42:58, time_cost(all): 1:21:48/2 days, 2:01:53, loss=0.593009746593326, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.6131977501206634, lr=0.017744231061181115
2023-11-26 10:59:04   INFO  epoch: 0/24, acc_iter=4300, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:46:00, time_cost(all): 1:22:46/2 days, 1:17:29, loss=0.592902204233223, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=4.671625097160781, lr=0.01810004554425383
2023-11-26 11:00:02   INFO  epoch: 0/24, acc_iter=4350, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:43:04, time_cost(all): 1:23:44/1 day, 23:46:35, loss=0.59279466187312, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=1.7764751951311792, lr=0.018455860027326552
2023-11-26 11:01:00   INFO  epoch: 0/24, acc_iter=4400, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:41:41, time_cost(all): 1:24:42/2 days, 0:09:57, loss=0.592687119513018, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=3.379204969588434, lr=0.018811674510399272
2023-11-26 11:01:57   INFO  epoch: 0/24, acc_iter=4450, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:26, time_cost(all): 1:25:39/2 days, 3:10:26, loss=0.592579577152915, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.682004235968945, lr=0.01916748899347199
2023-11-26 11:02:55   INFO  epoch: 0/24, acc_iter=4500, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:12, time_cost(all): 1:26:37/1 day, 22:54:57, loss=0.592472034792812, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=3.6428818713930755, lr=0.01952330347654471
2023-11-26 11:03:53   INFO  epoch: 0/24, acc_iter=4550, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:59, time_cost(all): 1:27:35/2 days, 3:37:12, loss=0.59236449243271, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=3.467041716497416, lr=0.01987911795961743
2023-11-26 11:04:51   INFO  epoch: 0/24, acc_iter=4600, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:39:21, time_cost(all): 1:28:33/2 days, 1:09:18, loss=0.592256950072607, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.416115383963173, lr=0.020234932442690146
2023-11-26 11:05:48   INFO  epoch: 0/24, acc_iter=4650, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:14, time_cost(all): 1:29:30/2 days, 2:34:19, loss=0.592149407712504, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=3.988174910398851, lr=0.020590746925762866
2023-11-26 11:06:46   INFO  epoch: 0/24, acc_iter=4700, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:36:56, time_cost(all): 1:30:28/2 days, 1:14:51, loss=0.592041865352402, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=4.945456253268264, lr=0.020946561408835586
2023-11-26 11:07:44   INFO  epoch: 0/24, acc_iter=4750, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:37, time_cost(all): 1:31:26/2 days, 0:21:19, loss=0.591934322992299, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=4.875789916962655, lr=0.021302375891908303
2023-11-26 11:08:42   INFO  epoch: 0/24, acc_iter=4800, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:40, time_cost(all): 1:32:24/1 day, 22:44:04, loss=0.591826780632196, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=1.6917005114886727, lr=0.021658190374981026
2023-11-26 11:09:39   INFO  epoch: 0/24, acc_iter=4850, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:33:14, time_cost(all): 1:33:21/1 day, 23:53:32, loss=0.591719238272094, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.170917489604822, lr=0.022014004858053743
2023-11-26 11:10:37   INFO  epoch: 0/24, acc_iter=4900, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:04, time_cost(all): 1:34:19/2 days, 3:08:49, loss=0.591611695911991, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.365961358726515, lr=0.02236981934112646
2023-11-26 11:11:35   INFO  epoch: 0/24, acc_iter=4950, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:39, time_cost(all): 1:35:17/2 days, 1:33:45, loss=0.591504153551888, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=3.119986895792186, lr=0.02272563382419918
2023-11-26 11:12:33   INFO  epoch: 0/24, acc_iter=5000, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:16, time_cost(all): 1:36:15/2 days, 3:15:34, loss=0.591396611191786, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.151900739615993, lr=0.0230814483072719
2023-11-26 11:13:30   INFO  epoch: 0/24, acc_iter=5050, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:31:02, time_cost(all): 1:37:12/2 days, 1:44:00, loss=0.591289068831683, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=3.3321609344370757, lr=0.023437262790344617
2023-11-26 11:14:28   INFO  epoch: 0/24, acc_iter=5100, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:23, time_cost(all): 1:38:10/2 days, 2:34:46, loss=0.59118152647158, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=2.895171814708679, lr=0.023793077273417337
2023-11-26 11:15:26   INFO  epoch: 0/24, acc_iter=5150, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:44, time_cost(all): 1:39:08/2 days, 0:40:36, loss=0.591073984111478, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.3739090900865705, lr=0.024148891756490057
2023-11-26 11:16:24   INFO  epoch: 0/24, acc_iter=5200, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:44, time_cost(all): 1:40:06/1 day, 23:36:40, loss=0.590966441751375, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=4.423068021572192, lr=0.024504706239562773
2023-11-26 11:17:21   INFO  epoch: 0/24, acc_iter=5250, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:57, time_cost(all): 1:41:03/2 days, 1:47:39, loss=0.590858899391272, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.6595852885890228, lr=0.024860520722635494
2023-11-26 11:18:19   INFO  epoch: 0/24, acc_iter=5300, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:23:54, time_cost(all): 1:42:01/1 day, 22:39:09, loss=0.59075135703117, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=2.182561804083268, lr=0.025216335205708214
2023-11-26 11:19:17   INFO  epoch: 0/24, acc_iter=5350, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:30, time_cost(all): 1:42:59/2 days, 1:59:54, loss=0.590643814671067, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=4.751115792575721, lr=0.02557214968878093
2023-11-26 11:20:15   INFO  epoch: 0/24, acc_iter=5400, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:46, time_cost(all): 1:43:57/2 days, 0:46:05, loss=0.590536272310964, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=3.9513307778155613, lr=0.025927964171853654
2023-11-26 11:21:13   INFO  epoch: 0/24, acc_iter=5450, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:55, time_cost(all): 1:44:55/2 days, 0:48:14, loss=0.590428729950862, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.9211026009421546, lr=0.02628377865492637
2023-11-26 11:22:10   INFO  epoch: 0/24, acc_iter=5500, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:49, time_cost(all): 1:45:52/2 days, 1:56:03, loss=0.590321187590759, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=3.0676873383210976, lr=0.026639593137999087
2023-11-26 11:23:08   INFO  epoch: 0/24, acc_iter=5550, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:01, time_cost(all): 1:46:50/2 days, 1:51:36, loss=0.590213645230656, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=1.6898505877793049, lr=0.02699540762107181
2023-11-26 11:24:06   INFO  epoch: 0/24, acc_iter=5600, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:37, time_cost(all): 1:47:48/2 days, 0:28:09, loss=0.590106102870554, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.2348343322988815, lr=0.027351222104144528
2023-11-26 11:25:04   INFO  epoch: 0/24, acc_iter=5650, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:20, time_cost(all): 1:48:46/2 days, 3:07:27, loss=0.589998560510451, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=2.3800836470116256, lr=0.027707036587217244
2023-11-26 11:26:01   INFO  epoch: 0/24, acc_iter=5700, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:23, time_cost(all): 1:49:43/2 days, 2:02:44, loss=0.589891018150348, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=1.548735615201606, lr=0.02806285107028996
2023-11-26 11:26:59   INFO  epoch: 0/24, acc_iter=5750, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:51, time_cost(all): 1:50:41/1 day, 23:51:38, loss=0.589783475790245, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=0.9699579784900668, lr=0.028418665553362685
2023-11-26 11:27:57   INFO  epoch: 0/24, acc_iter=5800, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:42, time_cost(all): 1:51:39/1 day, 22:51:02, loss=0.589675933430143, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=4.001880130391918, lr=0.0287744800364354
2023-11-26 11:28:55   INFO  epoch: 0/24, acc_iter=5850, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:51, time_cost(all): 1:52:37/1 day, 23:34:57, loss=0.58956839107004, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=2.2400486249903127, lr=0.029130294519508118
2023-11-26 11:29:52   INFO  epoch: 0/24, acc_iter=5900, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:04, time_cost(all): 1:53:34/2 days, 1:23:44, loss=0.589460848709937, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=2.1464838522628154, lr=0.02948610900258084
2023-11-26 11:30:50   INFO  epoch: 0/24, acc_iter=5950, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:38, time_cost(all): 1:54:32/2 days, 2:44:46, loss=0.589353306349835, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=2.064842465533152, lr=0.02984192348565356
2023-11-26 11:31:48   INFO  epoch: 0/24, acc_iter=6000, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:20, time_cost(all): 1:55:30/1 day, 22:57:52, loss=0.589245763989732, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=4.8427579939146765, lr=0.030197737968726275
2023-11-26 11:32:46   INFO  epoch: 0/24, acc_iter=6050, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:00, time_cost(all): 1:56:28/1 day, 22:56:38, loss=0.589138221629629, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.686242192611248, lr=0.030553552451799
2023-11-26 11:33:43   INFO  epoch: 0/24, acc_iter=6100, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:08:57, time_cost(all): 1:57:25/2 days, 0:35:42, loss=0.589030679269527, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=1.5540648642471535, lr=0.030909366934871715
2023-11-26 11:34:41   INFO  epoch: 0/24, acc_iter=6150, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:28, time_cost(all): 1:58:23/2 days, 0:29:58, loss=0.588923136909424, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.8989420121559228, lr=0.03126518141794443
2023-11-26 11:35:39   INFO  epoch: 0/24, acc_iter=6200, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:35, time_cost(all): 1:59:21/2 days, 0:44:06, loss=0.588815594549321, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=1.8123123052427887, lr=0.031620995901017156
2023-11-26 11:36:37   INFO  epoch: 0/24, acc_iter=6250, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:43, time_cost(all): 2:00:19/2 days, 1:48:15, loss=0.588708052189219, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=1.8125742694289393, lr=0.03197681038408987
2023-11-26 11:37:34   INFO  epoch: 0/24, acc_iter=6300, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:15, time_cost(all): 2:01:16/2 days, 0:32:56, loss=0.588600509829116, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=4.714623392071654, lr=0.03233262486716259
2023-11-26 11:38:32   INFO  epoch: 0/24, acc_iter=6350, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:32, time_cost(all): 2:02:14/2 days, 2:32:34, loss=0.588492967469013, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.667440025620417, lr=0.03268843935023531
2023-11-26 11:39:30   INFO  epoch: 0/24, acc_iter=6400, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:46, time_cost(all): 2:03:12/2 days, 1:58:07, loss=0.588385425108911, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=4.855712520336087, lr=0.03304425383330803
2023-11-26 11:40:28   INFO  epoch: 0/24, acc_iter=6450, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:33, time_cost(all): 2:04:10/2 days, 2:10:38, loss=0.588277882748808, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=4.909119032763782, lr=0.03340006831638075
2023-11-26 11:41:25   INFO  epoch: 0/24, acc_iter=6500, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:42, time_cost(all): 2:05:07/2 days, 2:05:37, loss=0.588170340388705, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=3.652994446554262, lr=0.03375588279945347
2023-11-26 11:42:23   INFO  epoch: 0/24, acc_iter=6550, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:41, time_cost(all): 2:06:05/1 day, 22:25:28, loss=0.588062798028603, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=0.9722914555105102, lr=0.034111697282526186
2023-11-26 11:43:21   INFO  epoch: 1/24, acc_iter=6637, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:07:26, time_cost(all): 2:07:03/1 day, 23:11:39, loss=0.587875674322024, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=0.7424889972010944, lr=0.03473081448307272
2023-11-26 11:44:19   INFO  epoch: 1/24, acc_iter=6687, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:01:15, time_cost(all): 2:08:01/2 days, 3:00:06, loss=0.587768131961921, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=1.4640717105633878, lr=0.035086628966145436
2023-11-26 11:45:16   INFO  epoch: 1/24, acc_iter=6737, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:30, time_cost(all): 2:08:58/2 days, 2:41:43, loss=0.587660589601819, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.034357886025436, lr=0.03544244344921816
2023-11-26 11:46:14   INFO  epoch: 1/24, acc_iter=6787, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:08:19, time_cost(all): 2:09:56/2 days, 1:06:18, loss=0.587553047241716, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.057890299046345, lr=0.035798257932290876
2023-11-26 11:47:12   INFO  epoch: 1/24, acc_iter=6837, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:57:22, time_cost(all): 2:10:54/2 days, 2:44:14, loss=0.587445504881613, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.416039544174893, lr=0.03615407241536359
2023-11-26 11:48:10   INFO  epoch: 1/24, acc_iter=6887, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:27, time_cost(all): 2:11:52/2 days, 0:14:24, loss=0.587337962521511, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=1.286941765180062, lr=0.03650988689843632
2023-11-26 11:49:07   INFO  epoch: 1/24, acc_iter=6937, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:16, time_cost(all): 2:12:49/1 day, 23:44:20, loss=0.587230420161408, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=3.719313977827058, lr=0.03686570138150903
2023-11-26 11:50:05   INFO  epoch: 1/24, acc_iter=6987, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:56:17, time_cost(all): 2:13:47/1 day, 23:40:06, loss=0.587122877801305, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=1.9812445264421188, lr=0.03722151586458175
2023-11-26 11:51:03   INFO  epoch: 1/24, acc_iter=7037, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:52:22, time_cost(all): 2:14:45/2 days, 2:23:33, loss=0.587015335441203, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.187741893434186, lr=0.037577330347654474
2023-11-26 11:52:01   INFO  epoch: 1/24, acc_iter=7087, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:00:35, time_cost(all): 2:15:43/2 days, 0:53:25, loss=0.5869077930811, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=1.6691654362718498, lr=0.03793314483072719
2023-11-26 11:52:58   INFO  epoch: 1/24, acc_iter=7137, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/2:01:57, time_cost(all): 2:16:40/2 days, 0:13:19, loss=0.586800250720997, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=3.4683816735571784, lr=0.03828895931379991
2023-11-26 11:53:56   INFO  epoch: 1/24, acc_iter=7187, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:56:08, time_cost(all): 2:17:38/1 day, 22:41:24, loss=0.586692708360894, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=4.394773117968533, lr=0.03864477379687263
2023-11-26 11:54:54   INFO  epoch: 1/24, acc_iter=7237, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:53:21, time_cost(all): 2:18:36/2 days, 1:35:33, loss=0.586585166000792, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.138560515219803, lr=0.03900058827994535
2023-11-26 11:55:52   INFO  epoch: 1/24, acc_iter=7287, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:55:46, time_cost(all): 2:19:34/2 days, 2:06:52, loss=0.586477623640689, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=4.576752571963032, lr=0.039356402763018064
2023-11-26 11:56:49   INFO  epoch: 1/24, acc_iter=7337, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:51:18, time_cost(all): 2:20:31/2 days, 1:34:11, loss=0.586370081280586, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=3.5828856840796193, lr=0.03971221724609079
2023-11-26 11:57:47   INFO  epoch: 1/24, acc_iter=7387, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:48:13, time_cost(all): 2:21:29/2 days, 1:08:19, loss=0.586262538920484, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=1.0300355152245686, lr=0.040068031729163504
2023-11-26 11:58:45   INFO  epoch: 1/24, acc_iter=7437, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:45:44, time_cost(all): 2:22:27/1 day, 23:17:29, loss=0.586154996560381, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=2.540040783990328, lr=0.04042384621223622
2023-11-26 11:59:43   INFO  epoch: 1/24, acc_iter=7487, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:52:29, time_cost(all): 2:23:25/2 days, 1:49:09, loss=0.586047454200278, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=3.111497053164814, lr=0.040779660695308945
2023-11-26 12:00:40   INFO  epoch: 1/24, acc_iter=7537, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:44:26, time_cost(all): 2:24:22/1 day, 21:58:18, loss=0.585939911840176, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=2.516832807422688, lr=0.04113547517838166
2023-11-26 12:01:38   INFO  epoch: 1/24, acc_iter=7587, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:43:31, time_cost(all): 2:25:20/2 days, 0:46:13, loss=0.585832369480073, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=3.3685099281845736, lr=0.041491289661454385
2023-11-26 12:02:36   INFO  epoch: 1/24, acc_iter=7637, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:46:49, time_cost(all): 2:26:18/2 days, 2:04:12, loss=0.58572482711997, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=2.811399413687085, lr=0.0418471041445271
2023-11-26 12:03:34   INFO  epoch: 1/24, acc_iter=7687, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:45:52, time_cost(all): 2:27:16/2 days, 1:27:51, loss=0.585617284759868, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=4.518084302512983, lr=0.04220291862759982
2023-11-26 12:04:31   INFO  epoch: 1/24, acc_iter=7737, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:40:30, time_cost(all): 2:28:13/1 day, 23:13:24, loss=0.585509742399765, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.1886224492612467, lr=0.04255873311067254
2023-11-26 12:05:29   INFO  epoch: 1/24, acc_iter=7787, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:43:32, time_cost(all): 2:29:11/2 days, 0:16:57, loss=0.585402200039662, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.5765477410689825, lr=0.04291454759374526
2023-11-26 12:06:27   INFO  epoch: 1/24, acc_iter=7837, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:44:02, time_cost(all): 2:30:09/1 day, 23:37:01, loss=0.58529465767956, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.1972793695484025, lr=0.043270362076817975
2023-11-26 12:07:25   INFO  epoch: 1/24, acc_iter=7887, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:46:26, time_cost(all): 2:31:07/1 day, 22:33:34, loss=0.585187115319457, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.2878026522780575, lr=0.0436261765598907
2023-11-26 12:08:22   INFO  epoch: 1/24, acc_iter=7937, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:38:01, time_cost(all): 2:32:04/2 days, 1:44:37, loss=0.585079572959354, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=3.1993727285165963, lr=0.043981991042963416
2023-11-26 12:09:20   INFO  epoch: 1/24, acc_iter=7987, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:38:58, time_cost(all): 2:33:02/1 day, 22:52:57, loss=0.584972030599252, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=2.2618096597161426, lr=0.04433780552603613
2023-11-26 12:10:18   INFO  epoch: 1/24, acc_iter=8037, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:37:41, time_cost(all): 2:34:00/1 day, 21:57:44, loss=0.584864488239149, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.522909179858584, lr=0.044693620009108856
2023-11-26 12:11:16   INFO  epoch: 1/24, acc_iter=8087, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:37:18, time_cost(all): 2:34:58/2 days, 2:06:09, loss=0.584756945879046, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=0.7231764433135084, lr=0.04504943449218157
2023-11-26 12:12:13   INFO  epoch: 1/24, acc_iter=8137, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:39:27, time_cost(all): 2:35:55/2 days, 0:35:29, loss=0.584649403518944, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=2.1505045944381145, lr=0.04540524897525429
2023-11-26 12:13:11   INFO  epoch: 1/24, acc_iter=8187, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:35:43, time_cost(all): 2:36:53/2 days, 1:18:50, loss=0.584541861158841, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=3.3014365435116386, lr=0.045761063458327006
2023-11-26 12:14:09   INFO  epoch: 1/24, acc_iter=8237, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:31:22, time_cost(all): 2:37:51/1 day, 22:48:12, loss=0.584434318798738, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=2.1893052372103714, lr=0.04611687794139973
2023-11-26 12:15:07   INFO  epoch: 1/24, acc_iter=8287, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:23, time_cost(all): 2:38:49/1 day, 22:25:21, loss=0.584326776438636, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.8187892442308478, lr=0.046472692424472446
2023-11-26 12:16:04   INFO  epoch: 1/24, acc_iter=8337, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:31:44, time_cost(all): 2:39:46/2 days, 0:13:07, loss=0.584219234078533, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=4.4130514814332935, lr=0.04682850690754516
2023-11-26 12:17:02   INFO  epoch: 1/24, acc_iter=8387, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:33:19, time_cost(all): 2:40:44/1 day, 22:56:19, loss=0.58411169171843, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=1.8501402720205178, lr=0.04718432139061789
2023-11-26 12:18:00   INFO  epoch: 1/24, acc_iter=8437, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:33:31, time_cost(all): 2:41:42/2 days, 1:28:18, loss=0.584004149358328, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=4.936581598287575, lr=0.0475401358736906
2023-11-26 12:18:58   INFO  epoch: 1/24, acc_iter=8487, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:30:24, time_cost(all): 2:42:40/1 day, 22:04:23, loss=0.583896606998225, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=2.736127968635474, lr=0.04789595035676332
2023-11-26 12:19:55   INFO  epoch: 1/24, acc_iter=8537, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:25:09, time_cost(all): 2:43:37/1 day, 22:07:27, loss=0.583789064638122, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=4.814904165752365, lr=0.048251764839836044
2023-11-26 12:20:53   INFO  epoch: 1/24, acc_iter=8587, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:31:39, time_cost(all): 2:44:35/1 day, 23:02:13, loss=0.58368152227802, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=4.454532917613947, lr=0.04860757932290876
2023-11-26 12:21:51   INFO  epoch: 1/24, acc_iter=8637, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:28:44, time_cost(all): 2:45:33/1 day, 23:37:23, loss=0.583573979917917, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.232533704832477, lr=0.04896339380598148
2023-11-26 12:22:49   INFO  epoch: 1/24, acc_iter=8687, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:24:04, time_cost(all): 2:46:31/1 day, 23:01:42, loss=0.583466437557814, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=3.363706393807324, lr=0.0493192082890542
2023-11-26 12:23:46   INFO  epoch: 1/24, acc_iter=8737, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:28:38, time_cost(all): 2:47:28/2 days, 1:01:29, loss=0.583358895197711, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=1.3712226034030262, lr=0.04967502277212692
2023-11-26 12:24:44   INFO  epoch: 1/24, acc_iter=8787, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:26:23, time_cost(all): 2:48:26/1 day, 23:36:23, loss=0.583251352837609, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=4.46795124803748, lr=0.050030837255199634
2023-11-26 12:25:42   INFO  epoch: 1/24, acc_iter=8837, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:22:34, time_cost(all): 2:49:24/2 days, 1:06:11, loss=0.583143810477506, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.859439346303901, lr=0.05038665173827236
2023-11-26 12:26:40   INFO  epoch: 1/24, acc_iter=8887, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:24:39, time_cost(all): 2:50:22/1 day, 21:57:22, loss=0.583036268117403, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=4.763483748365356, lr=0.050742466221345074
2023-11-26 12:27:37   INFO  epoch: 1/24, acc_iter=8937, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:24:09, time_cost(all): 2:51:19/2 days, 0:21:07, loss=0.582928725757301, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.35068453898049, lr=0.05109828070441779
2023-11-26 12:28:35   INFO  epoch: 1/24, acc_iter=8987, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:16:36, time_cost(all): 2:52:17/1 day, 23:16:28, loss=0.582821183397198, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=0.6230744167901066, lr=0.051454095187490514
2023-11-26 12:29:33   INFO  epoch: 1/24, acc_iter=9037, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:17:22, time_cost(all): 2:53:15/1 day, 23:12:04, loss=0.582713641037095, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.049464810216799, lr=0.05180990967056323
2023-11-26 12:30:31   INFO  epoch: 1/24, acc_iter=9087, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:15:32, time_cost(all): 2:54:13/1 day, 23:13:04, loss=0.582606098676993, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=4.746741516527349, lr=0.05216572415363595
2023-11-26 12:31:28   INFO  epoch: 1/24, acc_iter=9137, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:19:53, time_cost(all): 2:55:10/2 days, 0:31:41, loss=0.58249855631689, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.0227617929158903, lr=0.05252153863670867
2023-11-26 12:32:26   INFO  epoch: 1/24, acc_iter=9187, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:19:31, time_cost(all): 2:56:08/1 day, 22:10:24, loss=0.582391013956787, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=2.39770063528089, lr=0.05287735311978139
2023-11-26 12:33:24   INFO  epoch: 1/24, acc_iter=9237, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:15:02, time_cost(all): 2:57:06/1 day, 21:26:10, loss=0.582283471596685, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.5449566016006946, lr=0.053233167602854105
2023-11-26 12:34:22   INFO  epoch: 1/24, acc_iter=9287, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:18:18, time_cost(all): 2:58:04/1 day, 23:36:24, loss=0.582175929236582, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=1.5106788013903958, lr=0.05358898208592683
2023-11-26 12:35:19   INFO  epoch: 1/24, acc_iter=9337, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:14:01, time_cost(all): 2:59:01/2 days, 0:48:38, loss=0.582068386876479, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=1.55319564672045, lr=0.053944796568999545
2023-11-26 12:36:17   INFO  epoch: 1/24, acc_iter=9387, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:14:06, time_cost(all): 2:59:59/2 days, 1:18:52, loss=0.581960844516377, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=0.5660803410886895, lr=0.05430061105207226
2023-11-26 12:37:15   INFO  epoch: 1/24, acc_iter=9437, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:11:04, time_cost(all): 3:00:57/1 day, 23:19:32, loss=0.581853302156274, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=1.5561203346820975, lr=0.054656425535144985
2023-11-26 12:38:13   INFO  epoch: 1/24, acc_iter=9487, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:36, time_cost(all): 3:01:55/2 days, 0:02:07, loss=0.581745759796171, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=4.884862564764771, lr=0.05501224001821771
2023-11-26 12:39:10   INFO  epoch: 1/24, acc_iter=9537, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:10:42, time_cost(all): 3:02:52/1 day, 22:09:31, loss=0.581638217436069, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=2.7782334663777783, lr=0.05536805450129042
2023-11-26 12:40:08   INFO  epoch: 1/24, acc_iter=9587, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:08:40, time_cost(all): 3:03:50/2 days, 1:54:04, loss=0.581530675075966, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=0.6133738718136434, lr=0.05572386898436314
2023-11-26 12:41:06   INFO  epoch: 1/24, acc_iter=9637, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:05:30, time_cost(all): 3:04:48/2 days, 0:29:45, loss=0.581423132715863, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=4.075312586084706, lr=0.05607968346743586
2023-11-26 12:42:04   INFO  epoch: 1/24, acc_iter=9687, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:10:07, time_cost(all): 3:05:46/1 day, 23:35:40, loss=0.581315590355761, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=3.2890907867977695, lr=0.05643549795050858
2023-11-26 12:43:01   INFO  epoch: 1/24, acc_iter=9737, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:04:59, time_cost(all): 3:06:43/1 day, 23:33:41, loss=0.581208047995658, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.557844412375428, lr=0.0567913124335813
2023-11-26 12:43:59   INFO  epoch: 1/24, acc_iter=9787, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:23, time_cost(all): 3:07:41/2 days, 0:51:40, loss=0.581100505635555, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=1.6202221875257643, lr=0.05714712691665402
2023-11-26 12:44:57   INFO  epoch: 1/24, acc_iter=9837, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:05:54, time_cost(all): 3:08:39/1 day, 23:46:28, loss=0.580992963275452, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.082900786115071, lr=0.05750294139972673
2023-11-26 12:45:55   INFO  epoch: 1/24, acc_iter=9887, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:00, time_cost(all): 3:09:37/1 day, 22:28:34, loss=0.58088542091535, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=2.852011071044098, lr=0.057858755882799456
2023-11-26 12:46:52   INFO  epoch: 1/24, acc_iter=9937, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/0:59:41, time_cost(all): 3:10:34/1 day, 23:52:58, loss=0.580777878555247, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.5536176683406073, lr=0.05821457036587217
2023-11-26 12:47:50   INFO  epoch: 1/24, acc_iter=9987, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:39, time_cost(all): 3:11:32/2 days, 1:01:56, loss=0.580670336195145, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=4.0029686277075385, lr=0.0585703848489449
2023-11-26 12:48:48   INFO  epoch: 1/24, acc_iter=10037, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:57:43, time_cost(all): 3:12:30/1 day, 21:51:25, loss=0.580562793835042, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.2839397934808607, lr=0.05892619933201761
2023-11-26 12:49:46   INFO  epoch: 1/24, acc_iter=10087, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:23, time_cost(all): 3:13:28/1 day, 21:22:52, loss=0.580455251474939, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.310266349377915, lr=0.05928201381509034
2023-11-26 12:50:43   INFO  epoch: 1/24, acc_iter=10137, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:01:02, time_cost(all): 3:14:25/1 day, 22:53:37, loss=0.580347709114836, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.244893631055585, lr=0.05963782829816305
2023-11-26 12:51:41   INFO  epoch: 1/24, acc_iter=10187, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:07, time_cost(all): 3:15:23/2 days, 1:12:37, loss=0.580240166754734, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=0.7544671058815451, lr=0.05999364278123577
2023-11-26 12:52:39   INFO  epoch: 1/24, acc_iter=10237, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:55:03, time_cost(all): 3:16:21/2 days, 1:20:23, loss=0.580132624394631, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=1.2458432932958265, lr=0.06034945726430849
2023-11-26 12:53:37   INFO  epoch: 1/24, acc_iter=10287, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:26, time_cost(all): 3:17:19/1 day, 21:37:00, loss=0.580025082034528, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=0.7386760729492723, lr=0.06070527174738121
2023-11-26 12:54:34   INFO  epoch: 1/24, acc_iter=10337, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:53:39, time_cost(all): 3:18:16/1 day, 23:41:11, loss=0.579917539674426, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=1.1076263783884692, lr=0.06106108623045393
2023-11-26 12:55:32   INFO  epoch: 1/24, acc_iter=10387, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:15, time_cost(all): 3:19:14/1 day, 23:28:40, loss=0.579809997314323, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=3.4875943598315375, lr=0.06141690071352665
2023-11-26 12:56:30   INFO  epoch: 1/24, acc_iter=10437, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:49, time_cost(all): 3:20:12/1 day, 22:01:27, loss=0.57970245495422, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=4.52400114390682, lr=0.06177271519659936
2023-11-26 12:57:28   INFO  epoch: 1/24, acc_iter=10487, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:24, time_cost(all): 3:21:10/1 day, 21:56:00, loss=0.579594912594118, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=1.0815748142097643, lr=0.062128529679672084
2023-11-26 12:58:25   INFO  epoch: 1/24, acc_iter=10537, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:07, time_cost(all): 3:22:07/2 days, 0:19:35, loss=0.579487370234015, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=4.410140232879547, lr=0.0624843441627448
2023-11-26 12:59:23   INFO  epoch: 1/24, acc_iter=10587, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:47:38, time_cost(all): 3:23:05/1 day, 21:11:39, loss=0.579379827873912, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=3.4277997880336875, lr=0.06284015864581752
2023-11-26 13:00:21   INFO  epoch: 1/24, acc_iter=10637, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:51:00, time_cost(all): 3:24:03/1 day, 21:40:01, loss=0.57927228551381, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=3.6399893656948885, lr=0.06319597312889023
2023-11-26 13:01:19   INFO  epoch: 1/24, acc_iter=10687, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:50:13, time_cost(all): 3:25:01/1 day, 21:47:57, loss=0.579164743153707, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=4.877110020910167, lr=0.06355178761196296
2023-11-26 13:02:16   INFO  epoch: 1/24, acc_iter=10737, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:46:53, time_cost(all): 3:25:58/1 day, 21:58:19, loss=0.579057200793604, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=2.7387934110190537, lr=0.06390760209503567
2023-11-26 13:03:14   INFO  epoch: 1/24, acc_iter=10787, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:44:17, time_cost(all): 3:26:56/1 day, 23:38:04, loss=0.578949658433502, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=3.370676495756149, lr=0.0642634165781084
2023-11-26 13:04:12   INFO  epoch: 1/24, acc_iter=10837, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:43:36, time_cost(all): 3:27:54/1 day, 21:48:24, loss=0.578842116073399, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=4.0289816881791545, lr=0.06461923106118111
2023-11-26 13:05:10   INFO  epoch: 1/24, acc_iter=10887, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:58, time_cost(all): 3:28:52/1 day, 21:50:25, loss=0.578734573713296, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=4.545395021218842, lr=0.06497504554425383
2023-11-26 13:06:08   INFO  epoch: 1/24, acc_iter=10937, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:21, time_cost(all): 3:29:50/2 days, 0:17:40, loss=0.578627031353194, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.433703668914985, lr=0.06533086002732655
2023-11-26 13:07:05   INFO  epoch: 1/24, acc_iter=10987, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:06, time_cost(all): 3:30:47/1 day, 23:41:57, loss=0.578519488993091, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=4.278458410995473, lr=0.06568667451039926
2023-11-26 13:08:03   INFO  epoch: 1/24, acc_iter=11037, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:39:12, time_cost(all): 3:31:45/1 day, 22:08:38, loss=0.578411946632988, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=1.3532914517064272, lr=0.06604248899347198
2023-11-26 13:09:01   INFO  epoch: 1/24, acc_iter=11087, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:59, time_cost(all): 3:32:43/2 days, 1:09:35, loss=0.578304404272886, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=4.714640512904138, lr=0.06639830347654471
2023-11-26 13:09:59   INFO  epoch: 1/24, acc_iter=11137, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:39:25, time_cost(all): 3:33:41/1 day, 21:58:38, loss=0.578196861912783, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=1.6147743532040137, lr=0.06675411795961743
2023-11-26 13:10:56   INFO  epoch: 1/24, acc_iter=11187, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:53, time_cost(all): 3:34:38/1 day, 23:24:48, loss=0.57808931955268, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=1.729273490936496, lr=0.06710993244269015
2023-11-26 13:11:54   INFO  epoch: 1/24, acc_iter=11237, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:35:33, time_cost(all): 3:35:36/1 day, 22:36:35, loss=0.577981777192577, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=4.24103990923958, lr=0.06746574692576286
2023-11-26 13:12:52   INFO  epoch: 1/24, acc_iter=11287, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:09, time_cost(all): 3:36:34/2 days, 0:06:29, loss=0.577874234832475, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=4.87204956505843, lr=0.06782156140883558
2023-11-26 13:13:50   INFO  epoch: 1/24, acc_iter=11337, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:03, time_cost(all): 3:37:32/2 days, 0:52:44, loss=0.577766692472372, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.4588472040668314, lr=0.0681773758919083
2023-11-26 13:14:47   INFO  epoch: 1/24, acc_iter=11387, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:10, time_cost(all): 3:38:29/2 days, 0:55:48, loss=0.577659150112269, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=2.645304967223322, lr=0.06853319037498103
2023-11-26 13:15:45   INFO  epoch: 1/24, acc_iter=11437, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:33:14, time_cost(all): 3:39:27/1 day, 21:33:25, loss=0.577551607752167, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=4.873361265042929, lr=0.06888900485805374
2023-11-26 13:16:43   INFO  epoch: 1/24, acc_iter=11487, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:06, time_cost(all): 3:40:25/2 days, 1:13:30, loss=0.577444065392064, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=1.7190082657625179, lr=0.06924481934112646
2023-11-26 13:17:41   INFO  epoch: 1/24, acc_iter=11537, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:34, time_cost(all): 3:41:23/1 day, 22:52:53, loss=0.577336523031961, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=3.083399540531067, lr=0.06960063382419918
2023-11-26 13:18:38   INFO  epoch: 1/24, acc_iter=11587, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:55, time_cost(all): 3:42:20/1 day, 21:52:41, loss=0.577228980671859, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=3.6399749271553388, lr=0.06995644830727189
2023-11-26 13:19:36   INFO  epoch: 1/24, acc_iter=11637, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:28:17, time_cost(all): 3:43:18/2 days, 0:59:40, loss=0.577121438311756, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.2098973631811143, lr=0.07031226279034462
2023-11-26 13:20:34   INFO  epoch: 1/24, acc_iter=11687, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:20, time_cost(all): 3:44:16/1 day, 21:09:13, loss=0.577013895951653, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.9867742848932926, lr=0.07066807727341734
2023-11-26 13:21:32   INFO  epoch: 1/24, acc_iter=11737, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:49, time_cost(all): 3:45:14/1 day, 23:19:44, loss=0.576906353591551, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.9487515947858673, lr=0.07102389175649006
2023-11-26 13:22:29   INFO  epoch: 1/24, acc_iter=11787, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:25:28, time_cost(all): 3:46:11/2 days, 1:00:42, loss=0.576798811231448, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=2.6888368223140575, lr=0.07137970623956277
2023-11-26 13:23:27   INFO  epoch: 1/24, acc_iter=11837, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:24, time_cost(all): 3:47:09/2 days, 0:35:10, loss=0.576691268871345, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.5318167634915463, lr=0.0717355207226355
2023-11-26 13:24:25   INFO  epoch: 1/24, acc_iter=11887, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:23:45, time_cost(all): 3:48:07/1 day, 22:02:03, loss=0.576583726511243, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.3104147112947657, lr=0.0720913352057082
2023-11-26 13:25:23   INFO  epoch: 1/24, acc_iter=11937, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:10, time_cost(all): 3:49:05/1 day, 22:08:39, loss=0.57647618415114, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=1.9573363514163953, lr=0.07244714968878094
2023-11-26 13:26:20   INFO  epoch: 1/24, acc_iter=11987, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:15, time_cost(all): 3:50:02/1 day, 23:43:12, loss=0.576368641791037, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.320625571371371, lr=0.07280296417185364
2023-11-26 13:27:18   INFO  epoch: 1/24, acc_iter=12037, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:57, time_cost(all): 3:51:00/2 days, 0:45:33, loss=0.576261099430935, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=0.8516904841526928, lr=0.07315877865492637
2023-11-26 13:28:16   INFO  epoch: 1/24, acc_iter=12087, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:57, time_cost(all): 3:51:58/1 day, 23:30:06, loss=0.576153557070832, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=4.618294328630075, lr=0.07351459313799909
2023-11-26 13:29:14   INFO  epoch: 1/24, acc_iter=12137, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:18, time_cost(all): 3:52:56/1 day, 23:12:38, loss=0.576046014710729, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=2.2539348573937765, lr=0.0738704076210718
2023-11-26 13:30:11   INFO  epoch: 1/24, acc_iter=12187, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:12, time_cost(all): 3:53:53/1 day, 20:50:47, loss=0.575938472350627, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=4.277068495437599, lr=0.07422622210414452
2023-11-26 13:31:09   INFO  epoch: 1/24, acc_iter=12237, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:37, time_cost(all): 3:54:51/1 day, 23:35:31, loss=0.575830929990524, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=2.1908538863659266, lr=0.07458203658721725
2023-11-26 13:32:07   INFO  epoch: 1/24, acc_iter=12287, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:14, time_cost(all): 3:55:49/1 day, 23:22:06, loss=0.575723387630421, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=2.6139083263031666, lr=0.07493785107028995
2023-11-26 13:33:05   INFO  epoch: 1/24, acc_iter=12337, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:10, time_cost(all): 3:56:47/1 day, 23:01:21, loss=0.575615845270319, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.2128616400569765, lr=0.07529366555336268
2023-11-26 13:34:02   INFO  epoch: 1/24, acc_iter=12387, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:46, time_cost(all): 3:57:44/1 day, 21:54:41, loss=0.575508302910216, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=3.4779180415847284, lr=0.0756494800364354
2023-11-26 13:35:00   INFO  epoch: 1/24, acc_iter=12437, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:46, time_cost(all): 3:58:42/1 day, 20:31:52, loss=0.575400760550113, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.7579425884223014, lr=0.07600529451950812
2023-11-26 13:35:58   INFO  epoch: 1/24, acc_iter=12487, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:09, time_cost(all): 3:59:40/1 day, 22:06:29, loss=0.575293218190011, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=2.167321329483068, lr=0.07636110900258083
2023-11-26 13:36:56   INFO  epoch: 1/24, acc_iter=12537, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:12, time_cost(all): 4:00:38/2 days, 0:45:45, loss=0.575185675829908, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.633482052684135, lr=0.07671692348565357
2023-11-26 13:37:53   INFO  epoch: 1/24, acc_iter=12587, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:25, time_cost(all): 4:01:35/1 day, 21:43:19, loss=0.575078133469805, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=0.6993656979216566, lr=0.07707273796872627
2023-11-26 13:38:51   INFO  epoch: 1/24, acc_iter=12637, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:42, time_cost(all): 4:02:33/1 day, 22:29:25, loss=0.574970591109703, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=4.642530922495254, lr=0.077428552451799
2023-11-26 13:39:49   INFO  epoch: 1/24, acc_iter=12687, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:08:58, time_cost(all): 4:03:31/1 day, 23:24:44, loss=0.5748630487496, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=1.4712950560627642, lr=0.07778436693487172
2023-11-26 13:40:47   INFO  epoch: 1/24, acc_iter=12737, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:44, time_cost(all): 4:04:29/1 day, 22:07:26, loss=0.574755506389497, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.3447173473568865, lr=0.07814018141794443
2023-11-26 13:41:44   INFO  epoch: 1/24, acc_iter=12787, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:24, time_cost(all): 4:05:26/2 days, 0:24:15, loss=0.574647964029394, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=0.8338709773251194, lr=0.07849599590101715
2023-11-26 13:42:42   INFO  epoch: 1/24, acc_iter=12837, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:43, time_cost(all): 4:06:24/1 day, 20:59:20, loss=0.574540421669292, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=4.779172451817755, lr=0.07885181038408988
2023-11-26 13:43:40   INFO  epoch: 1/24, acc_iter=12887, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:33, time_cost(all): 4:07:22/1 day, 22:37:21, loss=0.574432879309189, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=2.5103246576811533, lr=0.07920762486716258
2023-11-26 13:44:38   INFO  epoch: 1/24, acc_iter=12937, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:35, time_cost(all): 4:08:20/1 day, 21:16:03, loss=0.574325336949086, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=0.750738165341323, lr=0.07956343935023531
2023-11-26 13:45:35   INFO  epoch: 1/24, acc_iter=12987, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:33, time_cost(all): 4:09:17/1 day, 20:24:00, loss=0.574217794588984, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=4.311134490188978, lr=0.07991925383330803
2023-11-26 13:46:33   INFO  epoch: 1/24, acc_iter=13037, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:45, time_cost(all): 4:10:15/1 day, 23:21:19, loss=0.574110252228881, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=0.6487589967724168, lr=0.08027506831638075
2023-11-26 13:47:31   INFO  epoch: 1/24, acc_iter=13087, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:39, time_cost(all): 4:11:13/1 day, 20:38:25, loss=0.574002709868778, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=1.6425044365549577, lr=0.08063088279945346
2023-11-26 13:48:29   INFO  epoch: 1/24, acc_iter=13137, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 4:12:11/1 day, 21:06:35, loss=0.573895167508676, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=2.2874226044939787, lr=0.0809866972825262
2023-11-26 13:49:26   INFO  epoch: 2/24, acc_iter=13224, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:06:04, time_cost(all): 4:13:08/1 day, 22:53:58, loss=0.573708043802097, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.3566730278613552, lr=0.08160581448307272
2023-11-26 13:50:24   INFO  epoch: 2/24, acc_iter=13274, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:09:32, time_cost(all): 4:14:06/1 day, 21:09:24, loss=0.573600501441994, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=0.5357108520989615, lr=0.08196162896614544
2023-11-26 13:51:22   INFO  epoch: 2/24, acc_iter=13324, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:05:20, time_cost(all): 4:15:04/1 day, 21:47:58, loss=0.573492959081892, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=0.8717156445593106, lr=0.08231744344921815
2023-11-26 13:52:20   INFO  epoch: 2/24, acc_iter=13374, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:03:31, time_cost(all): 4:16:02/1 day, 23:56:15, loss=0.573385416721789, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=2.1820122788612837, lr=0.08267325793229087
2023-11-26 13:53:17   INFO  epoch: 2/24, acc_iter=13424, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:59:17, time_cost(all): 4:16:59/1 day, 23:48:36, loss=0.573277874361686, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=1.4525963323472317, lr=0.08302907241536359
2023-11-26 13:54:15   INFO  epoch: 2/24, acc_iter=13474, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:00:39, time_cost(all): 4:17:57/1 day, 21:16:18, loss=0.573170332001584, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.092675218576117, lr=0.08338488689843632
2023-11-26 13:55:13   INFO  epoch: 2/24, acc_iter=13524, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:03:13, time_cost(all): 4:18:55/2 days, 0:06:47, loss=0.573062789641481, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=1.410643089721154, lr=0.08374070138150903
2023-11-26 13:56:11   INFO  epoch: 2/24, acc_iter=13574, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:58:13, time_cost(all): 4:19:53/2 days, 0:15:19, loss=0.572955247281378, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=3.760772363878275, lr=0.08409651586458175
2023-11-26 13:57:08   INFO  epoch: 2/24, acc_iter=13624, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:55:40, time_cost(all): 4:20:50/1 day, 21:02:31, loss=0.572847704921276, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.1512363205881075, lr=0.08445233034765447
2023-11-26 13:58:06   INFO  epoch: 2/24, acc_iter=13674, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:55:58, time_cost(all): 4:21:48/1 day, 22:25:01, loss=0.572740162561173, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=4.005044101496699, lr=0.08480814483072718
2023-11-26 13:59:04   INFO  epoch: 2/24, acc_iter=13724, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:58:18, time_cost(all): 4:22:46/1 day, 23:15:02, loss=0.57263262020107, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.1301183170656612, lr=0.0851639593137999
2023-11-26 14:00:02   INFO  epoch: 2/24, acc_iter=13774, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:59:36, time_cost(all): 4:23:44/2 days, 0:16:48, loss=0.572525077840968, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.3341724782189388, lr=0.08551977379687263
2023-11-26 14:00:59   INFO  epoch: 2/24, acc_iter=13824, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:55:26, time_cost(all): 4:24:41/1 day, 21:51:30, loss=0.572417535480865, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=1.700408834990542, lr=0.08587558827994535
2023-11-26 14:01:57   INFO  epoch: 2/24, acc_iter=13874, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:56:38, time_cost(all): 4:25:39/2 days, 0:08:11, loss=0.572309993120762, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=0.9008125942791299, lr=0.08623140276301806
2023-11-26 14:02:55   INFO  epoch: 2/24, acc_iter=13924, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:57:20, time_cost(all): 4:26:37/1 day, 20:20:31, loss=0.57220245076066, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.2360225498602342, lr=0.08658721724609078
2023-11-26 14:03:53   INFO  epoch: 2/24, acc_iter=13974, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:50:12, time_cost(all): 4:27:35/1 day, 22:37:29, loss=0.572094908400557, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=0.8116072434801227, lr=0.0869430317291635
2023-11-26 14:04:50   INFO  epoch: 2/24, acc_iter=14024, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:50:33, time_cost(all): 4:28:32/1 day, 22:02:58, loss=0.571987366040454, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=2.401108087426019, lr=0.08729884621223621
2023-11-26 14:05:48   INFO  epoch: 2/24, acc_iter=14074, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:50:54, time_cost(all): 4:29:30/1 day, 21:20:40, loss=0.571879823680352, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=3.773730107868764, lr=0.08765466069530894
2023-11-26 14:06:46   INFO  epoch: 2/24, acc_iter=14124, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:52:01, time_cost(all): 4:30:28/1 day, 21:35:15, loss=0.571772281320249, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=3.007690497748557, lr=0.08801047517838166
2023-11-26 14:07:44   INFO  epoch: 2/24, acc_iter=14174, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:52:00, time_cost(all): 4:31:26/1 day, 22:43:38, loss=0.571664738960146, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=2.600703952268406, lr=0.08836628966145438
2023-11-26 14:08:41   INFO  epoch: 2/24, acc_iter=14224, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:50:42, time_cost(all): 4:32:23/1 day, 22:18:58, loss=0.571557196600043, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.4268723651764166, lr=0.0887221041445271
2023-11-26 14:09:39   INFO  epoch: 2/24, acc_iter=14274, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:48:36, time_cost(all): 4:33:21/1 day, 21:08:15, loss=0.571449654239941, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=4.926523793923684, lr=0.08907791862759981
2023-11-26 14:10:37   INFO  epoch: 2/24, acc_iter=14324, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:44:06, time_cost(all): 4:34:19/1 day, 22:16:22, loss=0.571342111879838, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.4996620969144914, lr=0.08943373311067253
2023-11-26 14:11:35   INFO  epoch: 2/24, acc_iter=14374, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:42:45, time_cost(all): 4:35:17/1 day, 21:37:58, loss=0.571234569519735, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=3.439047462037456, lr=0.08978954759374526
2023-11-26 14:12:32   INFO  epoch: 2/24, acc_iter=14424, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:41:03, time_cost(all): 4:36:14/1 day, 23:48:31, loss=0.571127027159633, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=3.990213206529373, lr=0.09014536207681798
2023-11-26 14:13:30   INFO  epoch: 2/24, acc_iter=14474, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:38:27, time_cost(all): 4:37:12/1 day, 21:41:11, loss=0.57101948479953, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=2.793687512129264, lr=0.09050117655989069
2023-11-26 14:14:28   INFO  epoch: 2/24, acc_iter=14524, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:36:05, time_cost(all): 4:38:10/1 day, 22:41:54, loss=0.570911942439427, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.9044321829065416, lr=0.09085699104296341
2023-11-26 14:15:26   INFO  epoch: 2/24, acc_iter=14574, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:42:10, time_cost(all): 4:39:08/2 days, 0:11:04, loss=0.570804400079325, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=2.4870856681751285, lr=0.09121280552603613
2023-11-26 14:16:23   INFO  epoch: 2/24, acc_iter=14624, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:36:39, time_cost(all): 4:40:05/2 days, 0:17:21, loss=0.570696857719222, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.070592513039202, lr=0.09156862000910884
2023-11-26 14:17:21   INFO  epoch: 2/24, acc_iter=14674, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:33:22, time_cost(all): 4:41:03/1 day, 23:04:35, loss=0.570589315359119, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.673203230166861, lr=0.09192443449218157
2023-11-26 14:18:19   INFO  epoch: 2/24, acc_iter=14724, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:33:01, time_cost(all): 4:42:01/1 day, 21:12:44, loss=0.570481772999017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=4.959423260512393, lr=0.09228024897525428
2023-11-26 14:19:17   INFO  epoch: 2/24, acc_iter=14774, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:37:12, time_cost(all): 4:42:59/1 day, 22:32:38, loss=0.570374230638914, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=1.2219592273983753, lr=0.092636063458327
2023-11-26 14:20:14   INFO  epoch: 2/24, acc_iter=14824, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:31:40, time_cost(all): 4:43:56/1 day, 20:35:14, loss=0.570266688278811, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=3.170044656000913, lr=0.09299187794139972
2023-11-26 14:21:12   INFO  epoch: 2/24, acc_iter=14874, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:30:06, time_cost(all): 4:44:54/1 day, 20:59:33, loss=0.570159145918709, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.1735266878914636, lr=0.09334769242447244
2023-11-26 14:22:10   INFO  epoch: 2/24, acc_iter=14924, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:29:37, time_cost(all): 4:45:52/1 day, 20:30:03, loss=0.570051603558606, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=1.71356766367648, lr=0.09370350690754516
2023-11-26 14:23:08   INFO  epoch: 2/24, acc_iter=14974, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:32:15, time_cost(all): 4:46:50/1 day, 20:40:21, loss=0.569944061198503, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.205022004200633, lr=0.09405932139061789
2023-11-26 14:24:05   INFO  epoch: 2/24, acc_iter=15024, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:32:11, time_cost(all): 4:47:47/1 day, 22:21:44, loss=0.569836518838401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=4.833857744883907, lr=0.09441513587369059
2023-11-26 14:25:03   INFO  epoch: 2/24, acc_iter=15074, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:26:17, time_cost(all): 4:48:45/1 day, 23:26:48, loss=0.569728976478298, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=1.6500887739401704, lr=0.09477095035676332
2023-11-26 14:26:01   INFO  epoch: 2/24, acc_iter=15124, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:30:48, time_cost(all): 4:49:43/1 day, 21:08:40, loss=0.569621434118195, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=2.1587047020555605, lr=0.09512676483983604
2023-11-26 14:26:59   INFO  epoch: 2/24, acc_iter=15174, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:26:54, time_cost(all): 4:50:41/1 day, 19:47:18, loss=0.569513891758093, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=1.066345847984188, lr=0.09548257932290875
2023-11-26 14:27:56   INFO  epoch: 2/24, acc_iter=15224, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:27:24, time_cost(all): 4:51:38/1 day, 23:50:16, loss=0.56940634939799, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=4.187037066601507, lr=0.09583839380598147
2023-11-26 14:28:54   INFO  epoch: 2/24, acc_iter=15274, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:46, time_cost(all): 4:52:36/1 day, 23:39:35, loss=0.569298807037887, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=4.899958724602548, lr=0.0961942082890542
2023-11-26 14:29:52   INFO  epoch: 2/24, acc_iter=15324, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:22:53, time_cost(all): 4:53:34/1 day, 19:33:56, loss=0.569191264677785, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=3.414896375246595, lr=0.0965500227721269
2023-11-26 14:30:50   INFO  epoch: 2/24, acc_iter=15374, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:23:30, time_cost(all): 4:54:32/1 day, 21:37:52, loss=0.569083722317682, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.6826054337258367, lr=0.09690583725519963
2023-11-26 14:31:47   INFO  epoch: 2/24, acc_iter=15424, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:19:33, time_cost(all): 4:55:29/1 day, 23:15:07, loss=0.568976179957579, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.390476257268145, lr=0.09726165173827235
2023-11-26 14:32:45   INFO  epoch: 2/24, acc_iter=15474, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:25:31, time_cost(all): 4:56:27/1 day, 19:53:24, loss=0.568868637597477, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=4.031296106016455, lr=0.09761746622134507
2023-11-26 14:33:43   INFO  epoch: 2/24, acc_iter=15524, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:19, time_cost(all): 4:57:25/1 day, 20:53:26, loss=0.568761095237374, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.0425085778992327, lr=0.09797328070441778
2023-11-26 14:34:41   INFO  epoch: 2/24, acc_iter=15574, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:18:57, time_cost(all): 4:58:23/1 day, 21:55:42, loss=0.568653552877271, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=1.7287916742161666, lr=0.09832909518749051
2023-11-26 14:35:38   INFO  epoch: 2/24, acc_iter=15624, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:17:57, time_cost(all): 4:59:20/1 day, 20:45:42, loss=0.568546010517168, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=0.8700867872002852, lr=0.09868490967056323
2023-11-26 14:36:36   INFO  epoch: 2/24, acc_iter=15674, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:18:06, time_cost(all): 5:00:18/1 day, 20:23:24, loss=0.568438468157066, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.53523851358104, lr=0.09904072415363595
2023-11-26 14:37:34   INFO  epoch: 2/24, acc_iter=15724, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:18:15, time_cost(all): 5:01:16/1 day, 20:31:38, loss=0.568330925796963, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=4.06215902668644, lr=0.09939653863670866
2023-11-26 14:38:32   INFO  epoch: 2/24, acc_iter=15774, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:30, time_cost(all): 5:02:14/1 day, 23:50:55, loss=0.56822338343686, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=3.7463496733507875, lr=0.09975235311978138
2023-11-26 14:39:29   INFO  epoch: 2/24, acc_iter=15824, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:12:10, time_cost(all): 5:03:11/1 day, 22:35:24, loss=0.568115841076758, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=0.6321356222982977, lr=0.09998781210108687
2023-11-26 14:40:27   INFO  epoch: 2/24, acc_iter=15874, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:32, time_cost(all): 5:04:09/1 day, 19:39:09, loss=0.568008298716655, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=0.5128095604615608, lr=0.09994772032834628
2023-11-26 14:41:25   INFO  epoch: 2/24, acc_iter=15924, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:14:59, time_cost(all): 5:05:07/1 day, 20:27:31, loss=0.567900756356552, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.4540174145056903, lr=0.09990762855560568
2023-11-26 14:42:23   INFO  epoch: 2/24, acc_iter=15974, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:11:09, time_cost(all): 5:06:05/1 day, 22:48:37, loss=0.56779321399645, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=1.8086472773818676, lr=0.0998675367828651
2023-11-26 14:43:20   INFO  epoch: 2/24, acc_iter=16024, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:27, time_cost(all): 5:07:02/1 day, 21:14:26, loss=0.567685671636347, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=4.942308674961261, lr=0.09982744501012451
2023-11-26 14:44:18   INFO  epoch: 2/24, acc_iter=16074, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:52, time_cost(all): 5:08:00/1 day, 21:42:58, loss=0.567578129276244, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=2.3135246982536364, lr=0.09978735323738393
2023-11-26 14:45:16   INFO  epoch: 2/24, acc_iter=16124, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:52, time_cost(all): 5:08:58/1 day, 19:41:40, loss=0.567470586916142, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=2.9758884347164525, lr=0.09974726146464334
2023-11-26 14:46:14   INFO  epoch: 2/24, acc_iter=16174, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:12:24, time_cost(all): 5:09:56/1 day, 20:41:34, loss=0.567363044556039, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=1.4155734698569895, lr=0.09970716969190276
2023-11-26 14:47:12   INFO  epoch: 2/24, acc_iter=16224, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:09:26, time_cost(all): 5:10:54/1 day, 22:04:13, loss=0.567255502195936, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=3.354027423193648, lr=0.09966707791916217
2023-11-26 14:48:09   INFO  epoch: 2/24, acc_iter=16274, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:06:44, time_cost(all): 5:11:51/1 day, 22:50:39, loss=0.567147959835834, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=4.456749428846454, lr=0.09962698614642157
2023-11-26 14:49:07   INFO  epoch: 2/24, acc_iter=16324, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:32, time_cost(all): 5:12:49/1 day, 19:25:38, loss=0.567040417475731, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=3.6653043080519114, lr=0.09958689437368098
2023-11-26 14:50:05   INFO  epoch: 2/24, acc_iter=16374, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:05:19, time_cost(all): 5:13:47/1 day, 23:18:26, loss=0.566932875115628, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=3.964245685349417, lr=0.0995468026009404
2023-11-26 14:51:03   INFO  epoch: 2/24, acc_iter=16424, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:01:58, time_cost(all): 5:14:45/1 day, 21:01:09, loss=0.566825332755526, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=4.413351094252915, lr=0.09950671082819981
2023-11-26 14:52:00   INFO  epoch: 2/24, acc_iter=16474, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:05:41, time_cost(all): 5:15:42/1 day, 22:38:39, loss=0.566717790395423, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.921120950482552, lr=0.09946661905545923
2023-11-26 14:52:58   INFO  epoch: 2/24, acc_iter=16524, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:05, time_cost(all): 5:16:40/1 day, 21:45:51, loss=0.56661024803532, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=3.4061849633296886, lr=0.09942652728271864
2023-11-26 14:53:56   INFO  epoch: 2/24, acc_iter=16574, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:45, time_cost(all): 5:17:38/1 day, 21:09:23, loss=0.566502705675218, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=3.83742287670828, lr=0.09938643550997804
2023-11-26 14:54:54   INFO  epoch: 2/24, acc_iter=16624, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:59:49, time_cost(all): 5:18:36/1 day, 20:23:30, loss=0.566395163315115, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.7250623646089336, lr=0.09934634373723746
2023-11-26 14:55:51   INFO  epoch: 2/24, acc_iter=16674, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:00:20, time_cost(all): 5:19:33/1 day, 19:15:41, loss=0.566287620955012, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.532357484513549, lr=0.09930625196449687
2023-11-26 14:56:49   INFO  epoch: 2/24, acc_iter=16724, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:56:59, time_cost(all): 5:20:31/1 day, 20:58:53, loss=0.566180078594909, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.212232078498183, lr=0.09926616019175628
2023-11-26 14:57:47   INFO  epoch: 2/24, acc_iter=16774, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:58:00, time_cost(all): 5:21:29/1 day, 22:32:57, loss=0.566072536234807, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=2.167764162394877, lr=0.0992260684190157
2023-11-26 14:58:45   INFO  epoch: 2/24, acc_iter=16824, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:56:39, time_cost(all): 5:22:27/1 day, 22:03:25, loss=0.565964993874704, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=1.8159667334681542, lr=0.09918597664627511
2023-11-26 14:59:42   INFO  epoch: 2/24, acc_iter=16874, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:51, time_cost(all): 5:23:24/1 day, 21:44:37, loss=0.565857451514602, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=1.9963310461231338, lr=0.09914588487353451
2023-11-26 15:00:40   INFO  epoch: 2/24, acc_iter=16924, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:53:44, time_cost(all): 5:24:22/1 day, 19:59:49, loss=0.565749909154499, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=3.6837411220784553, lr=0.09910579310079393
2023-11-26 15:01:38   INFO  epoch: 2/24, acc_iter=16974, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:55:11, time_cost(all): 5:25:20/1 day, 22:22:37, loss=0.565642366794396, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=2.2924353946440936, lr=0.09906570132805334
2023-11-26 15:02:36   INFO  epoch: 2/24, acc_iter=17024, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:00, time_cost(all): 5:26:18/1 day, 19:03:34, loss=0.565534824434293, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=2.3880895258131933, lr=0.09902560955531275
2023-11-26 15:03:33   INFO  epoch: 2/24, acc_iter=17074, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:53:13, time_cost(all): 5:27:15/1 day, 23:13:34, loss=0.565427282074191, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=0.8145573607060126, lr=0.09898551778257217
2023-11-26 15:04:31   INFO  epoch: 2/24, acc_iter=17124, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:49, time_cost(all): 5:28:13/1 day, 20:54:43, loss=0.565319739714088, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.882156872562362, lr=0.09894542600983158
2023-11-26 15:05:29   INFO  epoch: 2/24, acc_iter=17174, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:49:43, time_cost(all): 5:29:11/1 day, 22:16:32, loss=0.565212197353985, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=0.5211864094528883, lr=0.098905334237091
2023-11-26 15:06:27   INFO  epoch: 2/24, acc_iter=17224, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:49:47, time_cost(all): 5:30:09/1 day, 20:25:39, loss=0.565104654993883, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.444284891472288, lr=0.0988652424643504
2023-11-26 15:07:24   INFO  epoch: 2/24, acc_iter=17274, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:00, time_cost(all): 5:31:06/1 day, 19:20:31, loss=0.56499711263378, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=2.3292121389442273, lr=0.09882515069160981
2023-11-26 15:08:22   INFO  epoch: 2/24, acc_iter=17324, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:26, time_cost(all): 5:32:04/1 day, 22:52:47, loss=0.564889570273677, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.1786134930528531, lr=0.09878505891886923
2023-11-26 15:09:20   INFO  epoch: 2/24, acc_iter=17374, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:59, time_cost(all): 5:33:02/1 day, 21:51:31, loss=0.564782027913575, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.8078897351898435, lr=0.09874496714612864
2023-11-26 15:10:18   INFO  epoch: 2/24, acc_iter=17424, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:50, time_cost(all): 5:34:00/1 day, 22:46:54, loss=0.564674485553472, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=1.9573648215673236, lr=0.09870487537338805
2023-11-26 15:11:15   INFO  epoch: 2/24, acc_iter=17474, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:46:10, time_cost(all): 5:34:57/1 day, 22:57:57, loss=0.564566943193369, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=4.827387469797765, lr=0.09866478360064747
2023-11-26 15:12:13   INFO  epoch: 2/24, acc_iter=17524, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:45:02, time_cost(all): 5:35:55/1 day, 20:23:15, loss=0.564459400833267, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.888890245326654, lr=0.09862469182790687
2023-11-26 15:13:11   INFO  epoch: 2/24, acc_iter=17574, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:00, time_cost(all): 5:36:53/1 day, 23:02:55, loss=0.564351858473164, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=0.6305155503645161, lr=0.09858460005516628
2023-11-26 15:14:09   INFO  epoch: 2/24, acc_iter=17624, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:51, time_cost(all): 5:37:51/1 day, 22:13:33, loss=0.564244316113061, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.210935073266716, lr=0.0985445082824257
2023-11-26 15:15:06   INFO  epoch: 2/24, acc_iter=17674, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:39:04, time_cost(all): 5:38:48/1 day, 21:47:58, loss=0.564136773752959, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.522279205043248, lr=0.09850441650968511
2023-11-26 15:16:04   INFO  epoch: 2/24, acc_iter=17724, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:02, time_cost(all): 5:39:46/1 day, 20:32:54, loss=0.564029231392856, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=0.6140392668487955, lr=0.09846432473694453
2023-11-26 15:17:02   INFO  epoch: 2/24, acc_iter=17774, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:21, time_cost(all): 5:40:44/1 day, 20:02:25, loss=0.563921689032753, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=2.671294692736822, lr=0.09842423296420394
2023-11-26 15:18:00   INFO  epoch: 2/24, acc_iter=17824, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:11, time_cost(all): 5:41:42/1 day, 23:03:40, loss=0.563814146672651, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=1.8119344258303738, lr=0.09838414119146334
2023-11-26 15:18:57   INFO  epoch: 2/24, acc_iter=17874, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:33, time_cost(all): 5:42:39/1 day, 20:08:46, loss=0.563706604312548, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=4.741208716538695, lr=0.09834404941872275
2023-11-26 15:19:55   INFO  epoch: 2/24, acc_iter=17924, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:54, time_cost(all): 5:43:37/1 day, 21:48:26, loss=0.563599061952445, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=1.4543579021204651, lr=0.09830395764598217
2023-11-26 15:20:53   INFO  epoch: 2/24, acc_iter=17974, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:48, time_cost(all): 5:44:35/1 day, 20:49:45, loss=0.563491519592343, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=2.5035013313902508, lr=0.09826386587324158
2023-11-26 15:21:51   INFO  epoch: 2/24, acc_iter=18024, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:53, time_cost(all): 5:45:33/1 day, 19:07:52, loss=0.56338397723224, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=4.3134692294191925, lr=0.098223774100501
2023-11-26 15:22:48   INFO  epoch: 2/24, acc_iter=18074, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:06, time_cost(all): 5:46:30/1 day, 22:24:35, loss=0.563276434872137, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.1081686964478568, lr=0.09818368232776041
2023-11-26 15:23:46   INFO  epoch: 2/24, acc_iter=18124, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:49, time_cost(all): 5:47:28/1 day, 21:27:27, loss=0.563168892512035, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=1.968169416068203, lr=0.09814359055501981
2023-11-26 15:24:44   INFO  epoch: 2/24, acc_iter=18174, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:29:28, time_cost(all): 5:48:26/1 day, 19:53:12, loss=0.563061350151932, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.2876279787837515, lr=0.09810349878227922
2023-11-26 15:25:42   INFO  epoch: 2/24, acc_iter=18224, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:18, time_cost(all): 5:49:24/1 day, 19:42:02, loss=0.562953807791829, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=4.168934759899717, lr=0.09806340700953864
2023-11-26 15:26:39   INFO  epoch: 2/24, acc_iter=18274, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:52, time_cost(all): 5:50:21/1 day, 19:58:35, loss=0.562846265431726, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=0.5225397557815219, lr=0.09802331523679805
2023-11-26 15:27:37   INFO  epoch: 2/24, acc_iter=18324, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:38, time_cost(all): 5:51:19/1 day, 22:29:23, loss=0.562738723071624, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=1.6243827986439316, lr=0.09798322346405747
2023-11-26 15:28:35   INFO  epoch: 2/24, acc_iter=18374, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:33, time_cost(all): 5:52:17/1 day, 19:36:52, loss=0.562631180711521, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=3.703945009025674, lr=0.09794313169131688
2023-11-26 15:29:33   INFO  epoch: 2/24, acc_iter=18424, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:37, time_cost(all): 5:53:15/1 day, 19:27:17, loss=0.562523638351418, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=3.570774914663131, lr=0.0979030399185763
2023-11-26 15:30:30   INFO  epoch: 2/24, acc_iter=18474, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:58, time_cost(all): 5:54:12/1 day, 22:06:58, loss=0.562416095991316, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=0.5361170238619384, lr=0.0978629481458357
2023-11-26 15:31:28   INFO  epoch: 2/24, acc_iter=18524, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:48, time_cost(all): 5:55:10/1 day, 21:02:15, loss=0.562308553631213, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=4.826525859760269, lr=0.09782285637309511
2023-11-26 15:32:26   INFO  epoch: 2/24, acc_iter=18574, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:50, time_cost(all): 5:56:08/1 day, 19:23:58, loss=0.56220101127111, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=4.901222923550124, lr=0.09778276460035452
2023-11-26 15:33:24   INFO  epoch: 2/24, acc_iter=18624, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:56, time_cost(all): 5:57:06/1 day, 22:59:29, loss=0.562093468911008, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=4.7720528894489505, lr=0.09774267282761394
2023-11-26 15:34:21   INFO  epoch: 2/24, acc_iter=18674, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:15, time_cost(all): 5:58:03/1 day, 22:01:11, loss=0.561985926550905, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=4.484673734995715, lr=0.09770258105487335
2023-11-26 15:35:19   INFO  epoch: 2/24, acc_iter=18724, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:53, time_cost(all): 5:59:01/1 day, 20:25:05, loss=0.561878384190802, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=1.286852072803314, lr=0.09766248928213277
2023-11-26 15:36:17   INFO  epoch: 2/24, acc_iter=18774, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:12, time_cost(all): 5:59:59/1 day, 19:01:39, loss=0.5617708418307, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.373342047050216, lr=0.09762239750939217
2023-11-26 15:37:15   INFO  epoch: 2/24, acc_iter=18824, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:11, time_cost(all): 6:00:57/1 day, 20:05:06, loss=0.561663299470597, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=0.9403576268584489, lr=0.09758230573665158
2023-11-26 15:38:12   INFO  epoch: 2/24, acc_iter=18874, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:11, time_cost(all): 6:01:54/1 day, 19:47:52, loss=0.561555757110494, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=3.813469930152624, lr=0.097542213963911
2023-11-26 15:39:10   INFO  epoch: 2/24, acc_iter=18924, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:38, time_cost(all): 6:02:52/1 day, 21:07:56, loss=0.561448214750392, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=3.404813395990013, lr=0.09750212219117041
2023-11-26 15:40:08   INFO  epoch: 2/24, acc_iter=18974, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:23, time_cost(all): 6:03:50/1 day, 22:33:49, loss=0.561340672390289, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.4283951380463096, lr=0.09746203041842982
2023-11-26 15:41:06   INFO  epoch: 2/24, acc_iter=19024, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:57, time_cost(all): 6:04:48/1 day, 18:34:33, loss=0.561233130030186, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=3.4938074801594583, lr=0.09742193864568924
2023-11-26 15:42:03   INFO  epoch: 2/24, acc_iter=19074, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:33, time_cost(all): 6:05:45/1 day, 18:55:40, loss=0.561125587670084, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.5063617789417165, lr=0.09738184687294864
2023-11-26 15:43:01   INFO  epoch: 2/24, acc_iter=19124, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:09, time_cost(all): 6:06:43/1 day, 21:06:20, loss=0.561018045309981, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=3.851977026954769, lr=0.09734175510020805
2023-11-26 15:43:59   INFO  epoch: 2/24, acc_iter=19174, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:45, time_cost(all): 6:07:41/1 day, 18:24:06, loss=0.560910502949878, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=2.1632481191508974, lr=0.09730166332746747
2023-11-26 15:44:57   INFO  epoch: 2/24, acc_iter=19224, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:51, time_cost(all): 6:08:39/1 day, 18:45:11, loss=0.560802960589776, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.137127986026709, lr=0.09726157155472688
2023-11-26 15:45:54   INFO  epoch: 2/24, acc_iter=19274, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:05, time_cost(all): 6:09:36/1 day, 21:59:57, loss=0.560695418229673, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=3.526661222270471, lr=0.0972214797819863
2023-11-26 15:46:52   INFO  epoch: 2/24, acc_iter=19324, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:25, time_cost(all): 6:10:34/1 day, 19:14:27, loss=0.56058787586957, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=3.61344692423508, lr=0.09718138800924571
2023-11-26 15:47:50   INFO  epoch: 2/24, acc_iter=19374, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:11, time_cost(all): 6:11:32/1 day, 20:25:48, loss=0.560480333509468, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=4.012319670587331, lr=0.09714129623650511
2023-11-26 15:48:48   INFO  epoch: 2/24, acc_iter=19424, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:34, time_cost(all): 6:12:30/1 day, 19:45:27, loss=0.560372791149365, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=1.1924948188127071, lr=0.09710120446376452
2023-11-26 15:49:45   INFO  epoch: 2/24, acc_iter=19474, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:43, time_cost(all): 6:13:27/1 day, 18:42:37, loss=0.560265248789262, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.2506463979461717, lr=0.09706111269102394
2023-11-26 15:50:43   INFO  epoch: 2/24, acc_iter=19524, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:32, time_cost(all): 6:14:25/1 day, 22:00:56, loss=0.56015770642916, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.129691026611046, lr=0.09702102091828335
2023-11-26 15:51:41   INFO  epoch: 2/24, acc_iter=19574, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:43, time_cost(all): 6:15:23/1 day, 18:35:25, loss=0.560050164069057, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=1.961666664447679, lr=0.09698092914554277
2023-11-26 15:52:39   INFO  epoch: 2/24, acc_iter=19624, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:34, time_cost(all): 6:16:21/1 day, 19:22:34, loss=0.559942621708954, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=3.0084158808967625, lr=0.09694083737280218
2023-11-26 15:53:36   INFO  epoch: 2/24, acc_iter=19674, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:37, time_cost(all): 6:17:18/1 day, 21:41:36, loss=0.559835079348851, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=4.421089383070898, lr=0.0969007456000616
2023-11-26 15:54:34   INFO  epoch: 2/24, acc_iter=19724, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 6:18:16/1 day, 21:17:01, loss=0.559727536988749, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=2.667898109605143, lr=0.096860653827321
2023-11-26 15:55:32   INFO  epoch: 3/24, acc_iter=19811, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:10:07, time_cost(all): 6:19:14/1 day, 19:40:45, loss=0.55954041328217, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.66414257997182, lr=0.09679089414275238
2023-11-26 15:56:30   INFO  epoch: 3/24, acc_iter=19861, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:06:12, time_cost(all): 6:20:12/1 day, 21:07:38, loss=0.559432870922068, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=4.194585976569519, lr=0.09675080237001178
2023-11-26 15:57:27   INFO  epoch: 3/24, acc_iter=19911, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:25, time_cost(all): 6:21:09/1 day, 21:50:32, loss=0.559325328561965, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=2.025898375787949, lr=0.0967107105972712
2023-11-26 15:58:25   INFO  epoch: 3/24, acc_iter=19961, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:00:30, time_cost(all): 6:22:07/1 day, 21:41:27, loss=0.559217786201862, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.2636157410424946, lr=0.09667061882453061
2023-11-26 15:59:23   INFO  epoch: 3/24, acc_iter=20011, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:03:43, time_cost(all): 6:23:05/1 day, 18:59:02, loss=0.559110243841759, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=2.068183953140121, lr=0.09663052705179002
2023-11-26 16:00:21   INFO  epoch: 3/24, acc_iter=20061, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:57:14, time_cost(all): 6:24:03/1 day, 20:50:02, loss=0.559002701481657, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=1.134354221394583, lr=0.09659043527904944
2023-11-26 16:01:18   INFO  epoch: 3/24, acc_iter=20111, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:08, time_cost(all): 6:25:00/1 day, 19:13:21, loss=0.558895159121554, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.64851044164738, lr=0.09655034350630885
2023-11-26 16:02:16   INFO  epoch: 3/24, acc_iter=20161, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:56:45, time_cost(all): 6:25:58/1 day, 22:23:36, loss=0.558787616761451, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.82796558666048, lr=0.09651025173356825
2023-11-26 16:03:14   INFO  epoch: 3/24, acc_iter=20211, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:58:17, time_cost(all): 6:26:56/1 day, 21:36:15, loss=0.558680074401349, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=0.9673038629199322, lr=0.09647015996082767
2023-11-26 16:04:12   INFO  epoch: 3/24, acc_iter=20261, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:51:38, time_cost(all): 6:27:54/1 day, 19:23:01, loss=0.558572532041246, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=0.6830675937159207, lr=0.09643006818808708
2023-11-26 16:05:09   INFO  epoch: 3/24, acc_iter=20311, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:54:04, time_cost(all): 6:28:51/1 day, 20:10:39, loss=0.558464989681143, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=4.344906404331498, lr=0.0963899764153465
2023-11-26 16:06:07   INFO  epoch: 3/24, acc_iter=20361, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:55:40, time_cost(all): 6:29:49/1 day, 19:59:18, loss=0.558357447321041, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=1.9145795300902155, lr=0.09634988464260591
2023-11-26 16:07:05   INFO  epoch: 3/24, acc_iter=20411, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:49:04, time_cost(all): 6:30:47/1 day, 19:32:00, loss=0.558249904960938, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=4.4792497833170195, lr=0.09630979286986532
2023-11-26 16:08:03   INFO  epoch: 3/24, acc_iter=20461, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:57:09, time_cost(all): 6:31:45/1 day, 20:37:42, loss=0.558142362600835, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.6272153676696526, lr=0.09626970109712474
2023-11-26 16:09:00   INFO  epoch: 3/24, acc_iter=20511, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:52:10, time_cost(all): 6:32:42/1 day, 18:26:18, loss=0.558034820240733, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.3049332508791744, lr=0.09622960932438414
2023-11-26 16:09:58   INFO  epoch: 3/24, acc_iter=20561, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:54:21, time_cost(all): 6:33:40/1 day, 19:25:06, loss=0.55792727788063, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.2(1.03), norm=4.2289831935806, lr=0.09618951755164355
2023-11-26 16:10:56   INFO  epoch: 3/24, acc_iter=20611, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:50:10, time_cost(all): 6:34:38/1 day, 22:15:14, loss=0.557819735520527, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=0.8886307803661981, lr=0.09614942577890297
2023-11-26 16:11:54   INFO  epoch: 3/24, acc_iter=20661, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:51:42, time_cost(all): 6:35:36/1 day, 20:03:52, loss=0.557712193160425, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=1.5407242205238467, lr=0.09610933400616238
2023-11-26 16:12:51   INFO  epoch: 3/24, acc_iter=20711, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:43:59, time_cost(all): 6:36:33/1 day, 18:28:36, loss=0.557604650800322, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=4.655813464556087, lr=0.0960692422334218
2023-11-26 16:13:49   INFO  epoch: 3/24, acc_iter=20761, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:49:32, time_cost(all): 6:37:31/1 day, 17:55:52, loss=0.557497108440219, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=4.214996263604881, lr=0.09602915046068121
2023-11-26 16:14:47   INFO  epoch: 3/24, acc_iter=20811, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:50:35, time_cost(all): 6:38:29/1 day, 18:35:20, loss=0.557389566080117, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=2.096459951815441, lr=0.09598905868794061
2023-11-26 16:15:45   INFO  epoch: 3/24, acc_iter=20861, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:45:26, time_cost(all): 6:39:27/1 day, 20:44:46, loss=0.557282023720014, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=0.7389313521221943, lr=0.09594896691520002
2023-11-26 16:16:42   INFO  epoch: 3/24, acc_iter=20911, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:43:49, time_cost(all): 6:40:24/1 day, 18:07:38, loss=0.557174481359911, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.451616916794439, lr=0.09590887514245944
2023-11-26 16:17:40   INFO  epoch: 3/24, acc_iter=20961, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:48:45, time_cost(all): 6:41:22/1 day, 18:05:21, loss=0.557066938999809, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=0.5906562146613699, lr=0.09586878336971885
2023-11-26 16:18:38   INFO  epoch: 3/24, acc_iter=21011, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:39:58, time_cost(all): 6:42:20/1 day, 21:53:14, loss=0.556959396639706, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=3.1117028913780436, lr=0.09582869159697827
2023-11-26 16:19:36   INFO  epoch: 3/24, acc_iter=21061, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:45:54, time_cost(all): 6:43:18/1 day, 21:58:55, loss=0.556851854279603, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=3.347853953759071, lr=0.09578859982423768
2023-11-26 16:20:33   INFO  epoch: 3/24, acc_iter=21111, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:40:49, time_cost(all): 6:44:15/1 day, 21:03:42, loss=0.5567443119195, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=0.7626965055641113, lr=0.09574850805149708
2023-11-26 16:21:31   INFO  epoch: 3/24, acc_iter=21161, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:40:50, time_cost(all): 6:45:13/1 day, 22:09:25, loss=0.556636769559398, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=0.755904097834466, lr=0.0957084162787565
2023-11-26 16:22:29   INFO  epoch: 3/24, acc_iter=21211, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:36:19, time_cost(all): 6:46:11/1 day, 20:38:29, loss=0.556529227199295, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=4.3087625338027085, lr=0.09566832450601591
2023-11-26 16:23:27   INFO  epoch: 3/24, acc_iter=21261, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:40:10, time_cost(all): 6:47:09/1 day, 22:04:01, loss=0.556421684839192, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=4.885625715287811, lr=0.09562823273327532
2023-11-26 16:24:24   INFO  epoch: 3/24, acc_iter=21311, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:41:29, time_cost(all): 6:48:06/1 day, 20:59:22, loss=0.55631414247909, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.537864313344426, lr=0.09558814096053474
2023-11-26 16:25:22   INFO  epoch: 3/24, acc_iter=21361, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:37:29, time_cost(all): 6:49:04/1 day, 17:47:22, loss=0.556206600118987, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.866408022623353, lr=0.09554804918779415
2023-11-26 16:26:20   INFO  epoch: 3/24, acc_iter=21411, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:34:01, time_cost(all): 6:50:02/1 day, 22:01:17, loss=0.556099057758884, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=0.7899667249561061, lr=0.09550795741505355
2023-11-26 16:27:18   INFO  epoch: 3/24, acc_iter=21461, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:32:34, time_cost(all): 6:51:00/1 day, 21:40:11, loss=0.555991515398782, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=0.9465170690936895, lr=0.09546786564231297
2023-11-26 16:28:15   INFO  epoch: 3/24, acc_iter=21511, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:36:57, time_cost(all): 6:51:57/1 day, 20:51:52, loss=0.555883973038679, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=2.545806945657964, lr=0.09542777386957238
2023-11-26 16:29:13   INFO  epoch: 3/24, acc_iter=21561, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:30:10, time_cost(all): 6:52:55/1 day, 20:31:24, loss=0.555776430678576, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=0.8963115447880519, lr=0.0953876820968318
2023-11-26 16:30:11   INFO  epoch: 3/24, acc_iter=21611, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:30:50, time_cost(all): 6:53:53/1 day, 17:38:42, loss=0.555668888318474, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=1.597590573790394, lr=0.09534759032409121
2023-11-26 16:31:09   INFO  epoch: 3/24, acc_iter=21661, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:27:48, time_cost(all): 6:54:51/1 day, 18:14:44, loss=0.555561345958371, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=2.621142365491825, lr=0.09530749855135062
2023-11-26 16:32:07   INFO  epoch: 3/24, acc_iter=21711, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:23, time_cost(all): 6:55:49/1 day, 20:19:09, loss=0.555453803598268, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=3.2833954171850412, lr=0.09526740677861004
2023-11-26 16:33:04   INFO  epoch: 3/24, acc_iter=21761, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:28:29, time_cost(all): 6:56:46/1 day, 19:15:56, loss=0.555346261238166, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.978628952435474, lr=0.09522731500586944
2023-11-26 16:34:02   INFO  epoch: 3/24, acc_iter=21811, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:25:29, time_cost(all): 6:57:44/1 day, 21:14:22, loss=0.555238718878063, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.3022149901551092, lr=0.09518722323312885
2023-11-26 16:35:00   INFO  epoch: 3/24, acc_iter=21861, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:29:08, time_cost(all): 6:58:42/1 day, 18:07:48, loss=0.55513117651796, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=0.6804613697015484, lr=0.09514713146038827
2023-11-26 16:35:58   INFO  epoch: 3/24, acc_iter=21911, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:24:03, time_cost(all): 6:59:40/1 day, 18:06:12, loss=0.555023634157858, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=1.6903016669249218, lr=0.09510703968764768
2023-11-26 16:36:55   INFO  epoch: 3/24, acc_iter=21961, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:26:33, time_cost(all): 7:00:37/1 day, 18:30:53, loss=0.554916091797755, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=3.5990656904489366, lr=0.0950669479149071
2023-11-26 16:37:53   INFO  epoch: 3/24, acc_iter=22011, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:19:48, time_cost(all): 7:01:35/1 day, 21:51:26, loss=0.554808549437652, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=4.5917744944187096, lr=0.09502685614216651
2023-11-26 16:38:51   INFO  epoch: 3/24, acc_iter=22061, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:23:16, time_cost(all): 7:02:33/1 day, 21:08:21, loss=0.55470100707755, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=0.7754596977125026, lr=0.09498676436942591
2023-11-26 16:39:49   INFO  epoch: 3/24, acc_iter=22111, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:24:49, time_cost(all): 7:03:31/1 day, 21:36:50, loss=0.554593464717447, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=3.030576445454845, lr=0.09494667259668532
2023-11-26 16:40:46   INFO  epoch: 3/24, acc_iter=22161, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:24:14, time_cost(all): 7:04:28/1 day, 18:31:23, loss=0.554485922357344, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=1.2234570227492378, lr=0.09490658082394474
2023-11-26 16:41:44   INFO  epoch: 3/24, acc_iter=22211, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:22:20, time_cost(all): 7:05:26/1 day, 20:37:24, loss=0.554378379997242, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=2.865752797159708, lr=0.09486648905120415
2023-11-26 16:42:42   INFO  epoch: 3/24, acc_iter=22261, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:15:15, time_cost(all): 7:06:24/1 day, 17:26:24, loss=0.554270837637139, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=2.2272244501877307, lr=0.09482639727846356
2023-11-26 16:43:40   INFO  epoch: 3/24, acc_iter=22311, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:15:01, time_cost(all): 7:07:22/1 day, 21:13:56, loss=0.554163295277036, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=1.0727521224835672, lr=0.09478630550572298
2023-11-26 16:44:37   INFO  epoch: 3/24, acc_iter=22361, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:17:55, time_cost(all): 7:08:19/1 day, 18:10:09, loss=0.554055752916934, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.7187089689505894, lr=0.09474621373298238
2023-11-26 16:45:35   INFO  epoch: 3/24, acc_iter=22411, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:15:34, time_cost(all): 7:09:17/1 day, 17:50:13, loss=0.553948210556831, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.3383012460853525, lr=0.0947061219602418
2023-11-26 16:46:33   INFO  epoch: 3/24, acc_iter=22461, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:49, time_cost(all): 7:10:15/1 day, 20:44:18, loss=0.553840668196728, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.38058954361924, lr=0.09466603018750121
2023-11-26 16:47:31   INFO  epoch: 3/24, acc_iter=22511, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:10:36, time_cost(all): 7:11:13/1 day, 19:18:54, loss=0.553733125836625, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=4.033906315852343, lr=0.09462593841476062
2023-11-26 16:48:28   INFO  epoch: 3/24, acc_iter=22561, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:53, time_cost(all): 7:12:10/1 day, 21:05:27, loss=0.553625583476523, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=1.7000592183171617, lr=0.09458584664202004
2023-11-26 16:49:26   INFO  epoch: 3/24, acc_iter=22611, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:10:09, time_cost(all): 7:13:08/1 day, 20:03:52, loss=0.55351804111642, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=0.9409057163099042, lr=0.09454575486927945
2023-11-26 16:50:24   INFO  epoch: 3/24, acc_iter=22661, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:14:29, time_cost(all): 7:14:06/1 day, 18:03:12, loss=0.553410498756317, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.9112769897662174, lr=0.09450566309653885
2023-11-26 16:51:22   INFO  epoch: 3/24, acc_iter=22711, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:10, time_cost(all): 7:15:04/1 day, 20:18:39, loss=0.553302956396215, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=4.693413745638251, lr=0.09446557132379826
2023-11-26 16:52:19   INFO  epoch: 3/24, acc_iter=22761, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:10:08, time_cost(all): 7:16:01/1 day, 19:32:27, loss=0.553195414036112, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=1.9033248537788336, lr=0.09442547955105768
2023-11-26 16:53:17   INFO  epoch: 3/24, acc_iter=22811, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:09:53, time_cost(all): 7:16:59/1 day, 21:07:14, loss=0.553087871676009, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.4144654952810929, lr=0.09438538777831709
2023-11-26 16:54:15   INFO  epoch: 3/24, acc_iter=22861, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:09:51, time_cost(all): 7:17:57/1 day, 20:12:39, loss=0.552980329315907, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=3.2108435579651298, lr=0.0943452960055765
2023-11-26 16:55:13   INFO  epoch: 3/24, acc_iter=22911, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:03:40, time_cost(all): 7:18:55/1 day, 20:04:39, loss=0.552872786955804, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.25787306708828, lr=0.09430520423283592
2023-11-26 16:56:10   INFO  epoch: 3/24, acc_iter=22961, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:02:16, time_cost(all): 7:19:52/1 day, 21:31:08, loss=0.552765244595701, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=3.159173213236995, lr=0.09426511246009533
2023-11-26 16:57:08   INFO  epoch: 3/24, acc_iter=23011, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:06:37, time_cost(all): 7:20:50/1 day, 19:54:47, loss=0.552657702235599, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.16672836004495, lr=0.09422502068735474
2023-11-26 16:58:06   INFO  epoch: 3/24, acc_iter=23061, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:02:32, time_cost(all): 7:21:48/1 day, 18:58:00, loss=0.552550159875496, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=0.6802123665429032, lr=0.09418492891461415
2023-11-26 16:59:04   INFO  epoch: 3/24, acc_iter=23111, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:02:23, time_cost(all): 7:22:46/1 day, 19:15:45, loss=0.552442617515393, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=0.842505754933313, lr=0.09414483714187356
2023-11-26 17:00:01   INFO  epoch: 3/24, acc_iter=23161, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:59:35, time_cost(all): 7:23:43/1 day, 19:58:05, loss=0.552335075155291, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=1.7427909454524717, lr=0.09410474536913298
2023-11-26 17:00:59   INFO  epoch: 3/24, acc_iter=23211, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:57:35, time_cost(all): 7:24:41/1 day, 17:25:13, loss=0.552227532795188, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=2.022804931550699, lr=0.09406465359639239
2023-11-26 17:01:57   INFO  epoch: 3/24, acc_iter=23261, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:01:08, time_cost(all): 7:25:39/1 day, 20:20:53, loss=0.552119990435085, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=3.623167266214779, lr=0.0940245618236518
2023-11-26 17:02:55   INFO  epoch: 3/24, acc_iter=23311, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:56:06, time_cost(all): 7:26:37/1 day, 17:38:36, loss=0.552012448074983, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.567710410890543, lr=0.0939844700509112
2023-11-26 17:03:52   INFO  epoch: 3/24, acc_iter=23361, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:25, time_cost(all): 7:27:34/1 day, 20:57:14, loss=0.55190490571488, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.621243955756779, lr=0.09394437827817062
2023-11-26 17:04:50   INFO  epoch: 3/24, acc_iter=23411, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:12, time_cost(all): 7:28:32/1 day, 17:18:07, loss=0.551797363354777, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.371055015833723, lr=0.09390428650543003
2023-11-26 17:05:48   INFO  epoch: 3/24, acc_iter=23461, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:41, time_cost(all): 7:29:30/1 day, 20:27:42, loss=0.551689820994675, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=2.23207745151849, lr=0.09386419473268945
2023-11-26 17:06:46   INFO  epoch: 3/24, acc_iter=23511, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:52:53, time_cost(all): 7:30:28/1 day, 20:40:36, loss=0.551582278634572, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=4.336016178124751, lr=0.09382410295994886
2023-11-26 17:07:43   INFO  epoch: 3/24, acc_iter=23561, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:53:03, time_cost(all): 7:31:25/1 day, 19:22:31, loss=0.551474736274469, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.7500713126112233, lr=0.09378401118720828
2023-11-26 17:08:41   INFO  epoch: 3/24, acc_iter=23611, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:50:57, time_cost(all): 7:32:23/1 day, 18:19:41, loss=0.551367193914367, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=2.8645771218524643, lr=0.09374391941446769
2023-11-26 17:09:39   INFO  epoch: 3/24, acc_iter=23661, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:50:44, time_cost(all): 7:33:21/1 day, 17:34:37, loss=0.551259651554264, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.6126314562437334, lr=0.09370382764172709
2023-11-26 17:10:37   INFO  epoch: 3/24, acc_iter=23711, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:15, time_cost(all): 7:34:19/1 day, 17:11:21, loss=0.551152109194161, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=2.453712789295179, lr=0.0936637358689865
2023-11-26 17:11:34   INFO  epoch: 3/24, acc_iter=23761, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:50:43, time_cost(all): 7:35:16/1 day, 19:11:20, loss=0.551044566834058, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.0937726436672115, lr=0.09362364409624592
2023-11-26 17:12:32   INFO  epoch: 3/24, acc_iter=23811, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:50:15, time_cost(all): 7:36:14/1 day, 19:37:39, loss=0.550937024473956, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=1.8027621796394793, lr=0.09358355232350533
2023-11-26 17:13:30   INFO  epoch: 3/24, acc_iter=23861, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:50:04, time_cost(all): 7:37:12/1 day, 19:43:37, loss=0.550829482113853, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.231916540187882, lr=0.09354346055076475
2023-11-26 17:14:28   INFO  epoch: 3/24, acc_iter=23911, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:48:48, time_cost(all): 7:38:10/1 day, 20:09:49, loss=0.550721939753751, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=3.2647876159430265, lr=0.09350336877802415
2023-11-26 17:15:25   INFO  epoch: 3/24, acc_iter=23961, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:09, time_cost(all): 7:39:07/1 day, 19:50:42, loss=0.550614397393648, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=1.333721755333198, lr=0.09346327700528356
2023-11-26 17:16:23   INFO  epoch: 3/24, acc_iter=24011, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:43:29, time_cost(all): 7:40:05/1 day, 17:29:00, loss=0.550506855033545, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.867063384753976, lr=0.09342318523254298
2023-11-26 17:17:21   INFO  epoch: 3/24, acc_iter=24061, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:44:34, time_cost(all): 7:41:03/1 day, 20:53:14, loss=0.550399312673442, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.437101071516202, lr=0.09338309345980239
2023-11-26 17:18:19   INFO  epoch: 3/24, acc_iter=24111, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:06, time_cost(all): 7:42:01/1 day, 20:50:37, loss=0.55029177031334, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=3.7469435720503763, lr=0.0933430016870618
2023-11-26 17:19:16   INFO  epoch: 3/24, acc_iter=24161, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:40, time_cost(all): 7:42:58/1 day, 19:03:42, loss=0.550184227953237, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.2451500526916295, lr=0.09330290991432122
2023-11-26 17:20:14   INFO  epoch: 3/24, acc_iter=24211, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:43:10, time_cost(all): 7:43:56/1 day, 18:06:45, loss=0.550076685593134, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=4.188488321259669, lr=0.09326281814158063
2023-11-26 17:21:12   INFO  epoch: 3/24, acc_iter=24261, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:38:11, time_cost(all): 7:44:54/1 day, 21:06:51, loss=0.549969143233032, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=0.5886080393319175, lr=0.09322272636884003
2023-11-26 17:22:10   INFO  epoch: 3/24, acc_iter=24311, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:38:27, time_cost(all): 7:45:52/1 day, 19:42:34, loss=0.549861600872929, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=3.652425160490852, lr=0.09318263459609945
2023-11-26 17:23:07   INFO  epoch: 3/24, acc_iter=24361, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:03, time_cost(all): 7:46:49/1 day, 20:59:56, loss=0.549754058512826, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=2.2944644509074417, lr=0.09314254282335886
2023-11-26 17:24:05   INFO  epoch: 3/24, acc_iter=24411, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:44, time_cost(all): 7:47:47/1 day, 20:09:37, loss=0.549646516152724, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.4214344069694147, lr=0.09310245105061828
2023-11-26 17:25:03   INFO  epoch: 3/24, acc_iter=24461, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:14, time_cost(all): 7:48:45/1 day, 20:19:41, loss=0.549538973792621, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=3.4024349105497596, lr=0.09306235927787769
2023-11-26 17:26:01   INFO  epoch: 3/24, acc_iter=24511, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:58, time_cost(all): 7:49:43/1 day, 20:51:28, loss=0.549431431432518, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=0.9612473983479168, lr=0.0930222675051371
2023-11-26 17:26:58   INFO  epoch: 3/24, acc_iter=24561, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:57, time_cost(all): 7:50:40/1 day, 19:27:49, loss=0.549323889072416, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=3.7028712634419882, lr=0.0929821757323965
2023-11-26 17:27:56   INFO  epoch: 3/24, acc_iter=24611, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:33:32, time_cost(all): 7:51:38/1 day, 17:41:52, loss=0.549216346712313, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=0.9778240522768964, lr=0.09294208395965592
2023-11-26 17:28:54   INFO  epoch: 3/24, acc_iter=24661, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:10, time_cost(all): 7:52:36/1 day, 20:30:45, loss=0.54910880435221, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.439884432627993, lr=0.09290199218691533
2023-11-26 17:29:52   INFO  epoch: 3/24, acc_iter=24711, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:48, time_cost(all): 7:53:34/1 day, 19:46:18, loss=0.549001261992108, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.07(1.03), norm=2.3571754979304975, lr=0.09286190041417475
2023-11-26 17:30:49   INFO  epoch: 3/24, acc_iter=24761, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:32, time_cost(all): 7:54:31/1 day, 17:21:48, loss=0.548893719632005, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=3.319873488805705, lr=0.09282180864143416
2023-11-26 17:31:47   INFO  epoch: 3/24, acc_iter=24811, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:19, time_cost(all): 7:55:29/1 day, 16:51:17, loss=0.548786177271902, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=2.7194733144148393, lr=0.09278171686869358
2023-11-26 17:32:45   INFO  epoch: 3/24, acc_iter=24861, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:04, time_cost(all): 7:56:27/1 day, 19:46:16, loss=0.5486786349118, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=2.263245672301008, lr=0.09274162509595298
2023-11-26 17:33:43   INFO  epoch: 3/24, acc_iter=24911, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:25, time_cost(all): 7:57:25/1 day, 19:14:29, loss=0.548571092551697, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=3.129021398535365, lr=0.09270153332321239
2023-11-26 17:34:40   INFO  epoch: 3/24, acc_iter=24961, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:15, time_cost(all): 7:58:22/1 day, 18:08:31, loss=0.548463550191594, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.7345027708266045, lr=0.0926614415504718
2023-11-26 17:35:38   INFO  epoch: 3/24, acc_iter=25011, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:49, time_cost(all): 7:59:20/1 day, 17:31:48, loss=0.548356007831492, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.8420303616735243, lr=0.09262134977773122
2023-11-26 17:36:36   INFO  epoch: 3/24, acc_iter=25061, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:54, time_cost(all): 8:00:18/1 day, 16:49:11, loss=0.548248465471389, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=0.8004366203605006, lr=0.09258125800499063
2023-11-26 17:37:34   INFO  epoch: 3/24, acc_iter=25111, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:53, time_cost(all): 8:01:16/1 day, 18:30:26, loss=0.548140923111286, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=3.6222472332291065, lr=0.09254116623225005
2023-11-26 17:38:31   INFO  epoch: 3/24, acc_iter=25161, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:08, time_cost(all): 8:02:13/1 day, 18:17:39, loss=0.548033380751183, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=4.194099866684876, lr=0.09250107445950945
2023-11-26 17:39:29   INFO  epoch: 3/24, acc_iter=25211, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:26, time_cost(all): 8:03:11/1 day, 20:10:33, loss=0.547925838391081, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=1.985550175033074, lr=0.09246098268676886
2023-11-26 17:40:27   INFO  epoch: 3/24, acc_iter=25261, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:03, time_cost(all): 8:04:09/1 day, 16:51:34, loss=0.547818296030978, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=3.1877983991404175, lr=0.09242089091402828
2023-11-26 17:41:25   INFO  epoch: 3/24, acc_iter=25311, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:32, time_cost(all): 8:05:07/1 day, 17:16:11, loss=0.547710753670875, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=2.0603159989244, lr=0.09238079914128769
2023-11-26 17:42:22   INFO  epoch: 3/24, acc_iter=25361, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:50, time_cost(all): 8:06:04/1 day, 20:39:03, loss=0.547603211310773, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.510640760368896, lr=0.0923407073685471
2023-11-26 17:43:20   INFO  epoch: 3/24, acc_iter=25411, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:07, time_cost(all): 8:07:02/1 day, 19:36:08, loss=0.54749566895067, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=2.6531315024054334, lr=0.09230061559580652
2023-11-26 17:44:18   INFO  epoch: 3/24, acc_iter=25461, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:55, time_cost(all): 8:08:00/1 day, 19:07:31, loss=0.547388126590567, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.138290382766815, lr=0.09226052382306593
2023-11-26 17:45:16   INFO  epoch: 3/24, acc_iter=25511, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:36, time_cost(all): 8:08:58/1 day, 17:25:46, loss=0.547280584230465, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=3.083164163690796, lr=0.09222043205032533
2023-11-26 17:46:13   INFO  epoch: 3/24, acc_iter=25561, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:32, time_cost(all): 8:09:55/1 day, 19:09:53, loss=0.547173041870362, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=2.5699864574437306, lr=0.09218034027758475
2023-11-26 17:47:11   INFO  epoch: 3/24, acc_iter=25611, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:18, time_cost(all): 8:10:53/1 day, 19:35:49, loss=0.547065499510259, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=3.0306016674871543, lr=0.09214024850484416
2023-11-26 17:48:09   INFO  epoch: 3/24, acc_iter=25661, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:21, time_cost(all): 8:11:51/1 day, 19:37:25, loss=0.546957957150157, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.7033546422633896, lr=0.09210015673210357
2023-11-26 17:49:07   INFO  epoch: 3/24, acc_iter=25711, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:01, time_cost(all): 8:12:49/1 day, 17:25:31, loss=0.546850414790054, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=1.9107069622115789, lr=0.09206006495936299
2023-11-26 17:50:04   INFO  epoch: 3/24, acc_iter=25761, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:24, time_cost(all): 8:13:46/1 day, 16:24:28, loss=0.546742872429951, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=1.8026957260867729, lr=0.0920199731866224
2023-11-26 17:51:02   INFO  epoch: 3/24, acc_iter=25811, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:35, time_cost(all): 8:14:44/1 day, 18:23:20, loss=0.546635330069849, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=3.362531263762086, lr=0.0919798814138818
2023-11-26 17:52:00   INFO  epoch: 3/24, acc_iter=25861, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:08:56, time_cost(all): 8:15:42/1 day, 18:25:15, loss=0.546527787709746, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=0.9367507021716579, lr=0.09193978964114122
2023-11-26 17:52:58   INFO  epoch: 3/24, acc_iter=25911, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:48, time_cost(all): 8:16:40/1 day, 16:52:10, loss=0.546420245349643, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=3.195475831376168, lr=0.09189969786840063
2023-11-26 17:53:55   INFO  epoch: 3/24, acc_iter=25961, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:17, time_cost(all): 8:17:37/1 day, 17:08:27, loss=0.546312702989541, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=3.4396798852777635, lr=0.09185960609566005
2023-11-26 17:54:53   INFO  epoch: 3/24, acc_iter=26011, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:20, time_cost(all): 8:18:35/1 day, 20:31:48, loss=0.546205160629438, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=3.8610652068596125, lr=0.09181951432291946
2023-11-26 17:55:51   INFO  epoch: 3/24, acc_iter=26061, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:19, time_cost(all): 8:19:33/1 day, 17:15:53, loss=0.546097618269335, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=3.256371867548296, lr=0.09177942255017887
2023-11-26 17:56:49   INFO  epoch: 3/24, acc_iter=26111, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:21, time_cost(all): 8:20:31/1 day, 17:24:20, loss=0.545990075909233, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.7868387536777508, lr=0.09173933077743829
2023-11-26 17:57:46   INFO  epoch: 3/24, acc_iter=26161, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:32, time_cost(all): 8:21:28/1 day, 19:12:35, loss=0.54588253354913, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=4.852533022396336, lr=0.09169923900469769
2023-11-26 17:58:44   INFO  epoch: 3/24, acc_iter=26211, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:44, time_cost(all): 8:22:26/1 day, 18:47:22, loss=0.545774991189027, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.569099699826123, lr=0.0916591472319571
2023-11-26 17:59:42   INFO  epoch: 3/24, acc_iter=26261, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:37, time_cost(all): 8:23:24/1 day, 17:23:01, loss=0.545667448828925, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=2.6404381744271683, lr=0.09161905545921652
2023-11-26 18:00:40   INFO  epoch: 3/24, acc_iter=26311, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 8:24:22/1 day, 19:33:12, loss=0.545559906468822, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=0.764769152967241, lr=0.09157896368647593
2023-11-26 18:01:37   INFO  epoch: 4/24, acc_iter=26398, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:06:22, time_cost(all): 8:25:19/1 day, 16:19:27, loss=0.545372782762243, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.741315640022663, lr=0.0915092040019073
2023-11-26 18:02:35   INFO  epoch: 4/24, acc_iter=26448, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:06:38, time_cost(all): 8:26:17/1 day, 16:28:18, loss=0.545265240402141, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.4227340500459116, lr=0.09146911222916672
2023-11-26 18:03:33   INFO  epoch: 4/24, acc_iter=26498, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:46, time_cost(all): 8:27:15/1 day, 18:34:34, loss=0.545157698042038, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=0.6465108220977169, lr=0.09142902045642613
2023-11-26 18:04:31   INFO  epoch: 4/24, acc_iter=26548, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:00:25, time_cost(all): 8:28:13/1 day, 18:51:07, loss=0.545050155681935, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.2717353924253376, lr=0.09138892868368555
2023-11-26 18:05:28   INFO  epoch: 4/24, acc_iter=26598, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:01:13, time_cost(all): 8:29:10/1 day, 17:14:16, loss=0.544942613321832, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=2.235372470536104, lr=0.09134883691094495
2023-11-26 18:06:26   INFO  epoch: 4/24, acc_iter=26648, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:05:55, time_cost(all): 8:30:08/1 day, 18:45:22, loss=0.54483507096173, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.377972312270904, lr=0.09130874513820436
2023-11-26 18:07:24   INFO  epoch: 4/24, acc_iter=26698, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:57:34, time_cost(all): 8:31:06/1 day, 19:09:57, loss=0.544727528601627, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=0.6411334710570461, lr=0.09126865336546378
2023-11-26 18:08:22   INFO  epoch: 4/24, acc_iter=26748, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:02:15, time_cost(all): 8:32:04/1 day, 19:01:06, loss=0.544619986241525, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=0.8120660109765749, lr=0.09122856159272319
2023-11-26 18:09:19   INFO  epoch: 4/24, acc_iter=26798, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:53:20, time_cost(all): 8:33:01/1 day, 16:51:09, loss=0.544512443881422, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=4.215831809043676, lr=0.0911884698199826
2023-11-26 18:10:17   INFO  epoch: 4/24, acc_iter=26848, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:01:56, time_cost(all): 8:33:59/1 day, 17:21:21, loss=0.544404901521319, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.3082481058310442, lr=0.09114837804724202
2023-11-26 18:11:15   INFO  epoch: 4/24, acc_iter=26898, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:57:45, time_cost(all): 8:34:57/1 day, 19:18:45, loss=0.544297359161216, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=1.1828173790654335, lr=0.09110828627450143
2023-11-26 18:12:13   INFO  epoch: 4/24, acc_iter=26948, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:57:54, time_cost(all): 8:35:55/1 day, 19:24:30, loss=0.544189816801114, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=2.145450919741325, lr=0.09106819450176083
2023-11-26 18:13:11   INFO  epoch: 4/24, acc_iter=26998, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:50:13, time_cost(all): 8:36:53/1 day, 20:09:33, loss=0.544082274441011, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.815462401595237, lr=0.09102810272902025
2023-11-26 18:14:08   INFO  epoch: 4/24, acc_iter=27048, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:52:29, time_cost(all): 8:37:50/1 day, 16:33:13, loss=0.543974732080908, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.5972126820885086, lr=0.09098801095627966
2023-11-26 18:15:06   INFO  epoch: 4/24, acc_iter=27098, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:50:59, time_cost(all): 8:38:48/1 day, 18:48:50, loss=0.543867189720806, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=4.80495973345997, lr=0.09094791918353907
2023-11-26 18:16:04   INFO  epoch: 4/24, acc_iter=27148, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:51:00, time_cost(all): 8:39:46/1 day, 15:59:46, loss=0.543759647360703, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=1.079603391738149, lr=0.09090782741079849
2023-11-26 18:17:02   INFO  epoch: 4/24, acc_iter=27198, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:46:21, time_cost(all): 8:40:44/1 day, 16:01:24, loss=0.5436521050006, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.957665171723951, lr=0.09086773563805789
2023-11-26 18:17:59   INFO  epoch: 4/24, acc_iter=27248, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:52:49, time_cost(all): 8:41:41/1 day, 17:08:44, loss=0.543544562640498, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=4.366137008217416, lr=0.0908276438653173
2023-11-26 18:18:57   INFO  epoch: 4/24, acc_iter=27298, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:47:06, time_cost(all): 8:42:39/1 day, 19:00:43, loss=0.543437020280395, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.7050394718059945, lr=0.09078755209257672
2023-11-26 18:19:55   INFO  epoch: 4/24, acc_iter=27348, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:45:07, time_cost(all): 8:43:37/1 day, 19:07:56, loss=0.543329477920292, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=2.5179792377031562, lr=0.09074746031983613
2023-11-26 18:20:53   INFO  epoch: 4/24, acc_iter=27398, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:46:21, time_cost(all): 8:44:35/1 day, 18:56:58, loss=0.54322193556019, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=1.3045136359304665, lr=0.09070736854709555
2023-11-26 18:21:50   INFO  epoch: 4/24, acc_iter=27448, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:45:32, time_cost(all): 8:45:32/1 day, 18:05:12, loss=0.543114393200087, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=3.7239606324398657, lr=0.09066727677435496
2023-11-26 18:22:48   INFO  epoch: 4/24, acc_iter=27498, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:43:24, time_cost(all): 8:46:30/1 day, 16:25:54, loss=0.543006850839984, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=2.0501318915697313, lr=0.09062718500161437
2023-11-26 18:23:46   INFO  epoch: 4/24, acc_iter=27548, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:44:51, time_cost(all): 8:47:28/1 day, 17:15:54, loss=0.542899308479882, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.2393260347340846, lr=0.09058709322887377
2023-11-26 18:24:44   INFO  epoch: 4/24, acc_iter=27598, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:38:59, time_cost(all): 8:48:26/1 day, 19:03:58, loss=0.542791766119779, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.661289908475957, lr=0.09054700145613319
2023-11-26 18:25:41   INFO  epoch: 4/24, acc_iter=27648, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:43:24, time_cost(all): 8:49:23/1 day, 16:46:52, loss=0.542684223759676, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.845059585149806, lr=0.0905069096833926
2023-11-26 18:26:39   INFO  epoch: 4/24, acc_iter=27698, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:36:24, time_cost(all): 8:50:21/1 day, 16:44:28, loss=0.542576681399574, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=3.3996772195392637, lr=0.09046681791065202
2023-11-26 18:27:37   INFO  epoch: 4/24, acc_iter=27748, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:39:23, time_cost(all): 8:51:19/1 day, 16:40:55, loss=0.542469139039471, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.3435359462702876, lr=0.09042672613791143
2023-11-26 18:28:35   INFO  epoch: 4/24, acc_iter=27798, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:36:45, time_cost(all): 8:52:17/1 day, 16:48:36, loss=0.542361596679368, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=1.8662964709843999, lr=0.09038663436517085
2023-11-26 18:29:32   INFO  epoch: 4/24, acc_iter=27848, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:42:13, time_cost(all): 8:53:14/1 day, 19:53:38, loss=0.542254054319266, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=0.6995590181752493, lr=0.09034654259243025
2023-11-26 18:30:30   INFO  epoch: 4/24, acc_iter=27898, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:32:50, time_cost(all): 8:54:12/1 day, 17:39:21, loss=0.542146511959163, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=1.2357024947155222, lr=0.09030645081968966
2023-11-26 18:31:28   INFO  epoch: 4/24, acc_iter=27948, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:38:49, time_cost(all): 8:55:10/1 day, 18:55:46, loss=0.54203896959906, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.858960493192779, lr=0.09026635904694907
2023-11-26 18:32:26   INFO  epoch: 4/24, acc_iter=27998, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:37:18, time_cost(all): 8:56:08/1 day, 19:30:41, loss=0.541931427238957, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=4.240257270937594, lr=0.09022626727420849
2023-11-26 18:33:23   INFO  epoch: 4/24, acc_iter=28048, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:48, time_cost(all): 8:57:05/1 day, 16:04:14, loss=0.541823884878855, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=2.3201926301117917, lr=0.0901861755014679
2023-11-26 18:34:21   INFO  epoch: 4/24, acc_iter=28098, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:34:09, time_cost(all): 8:58:03/1 day, 18:24:48, loss=0.541716342518752, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=1.146042789318936, lr=0.09014608372872732
2023-11-26 18:35:19   INFO  epoch: 4/24, acc_iter=28148, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:33:49, time_cost(all): 8:59:01/1 day, 19:48:36, loss=0.541608800158649, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=3.6928110411649806, lr=0.09010599195598673
2023-11-26 18:36:17   INFO  epoch: 4/24, acc_iter=28198, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:34:31, time_cost(all): 8:59:59/1 day, 18:43:54, loss=0.541501257798547, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.3141677222177925, lr=0.09006590018324613
2023-11-26 18:37:14   INFO  epoch: 4/24, acc_iter=28248, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:29:08, time_cost(all): 9:00:56/1 day, 19:14:24, loss=0.541393715438444, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=4.502516731799496, lr=0.09002580841050555
2023-11-26 18:38:12   INFO  epoch: 4/24, acc_iter=28298, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:35, time_cost(all): 9:01:54/1 day, 16:46:11, loss=0.541286173078341, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=0.7845512965061111, lr=0.08998571663776496
2023-11-26 18:39:10   INFO  epoch: 4/24, acc_iter=28348, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:25:57, time_cost(all): 9:02:52/1 day, 16:23:44, loss=0.541178630718239, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.406913641131068, lr=0.08994562486502437
2023-11-26 18:40:08   INFO  epoch: 4/24, acc_iter=28398, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:28:05, time_cost(all): 9:03:50/1 day, 17:13:21, loss=0.541071088358136, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=0.5851047254324283, lr=0.08990553309228379
2023-11-26 18:41:05   INFO  epoch: 4/24, acc_iter=28448, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:24:22, time_cost(all): 9:04:47/1 day, 18:20:38, loss=0.540963545998033, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.6599709104688314, lr=0.0898654413195432
2023-11-26 18:42:03   INFO  epoch: 4/24, acc_iter=28498, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:24:12, time_cost(all): 9:05:45/1 day, 15:41:40, loss=0.540856003637931, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.4452393869763016, lr=0.0898253495468026
2023-11-26 18:43:01   INFO  epoch: 4/24, acc_iter=28548, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:54, time_cost(all): 9:06:43/1 day, 16:31:31, loss=0.540748461277828, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.3401343852496725, lr=0.08978525777406202
2023-11-26 18:43:59   INFO  epoch: 4/24, acc_iter=28598, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:25:42, time_cost(all): 9:07:41/1 day, 15:49:06, loss=0.540640918917725, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=0.6627424109466737, lr=0.08974516600132143
2023-11-26 18:44:56   INFO  epoch: 4/24, acc_iter=28648, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:26:06, time_cost(all): 9:08:38/1 day, 16:22:44, loss=0.540533376557623, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=4.828423333497602, lr=0.08970507422858084
2023-11-26 18:45:54   INFO  epoch: 4/24, acc_iter=28698, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:29, time_cost(all): 9:09:36/1 day, 19:26:48, loss=0.54042583419752, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.3153727440509169, lr=0.08966498245584026
2023-11-26 18:46:52   INFO  epoch: 4/24, acc_iter=28748, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:20:02, time_cost(all): 9:10:34/1 day, 17:30:19, loss=0.540318291837417, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=3.643719754471637, lr=0.08962489068309967
2023-11-26 18:47:50   INFO  epoch: 4/24, acc_iter=28798, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:22:02, time_cost(all): 9:11:32/1 day, 19:10:56, loss=0.540210749477315, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=3.2255430146752224, lr=0.08958479891035907
2023-11-26 18:48:47   INFO  epoch: 4/24, acc_iter=28848, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:52, time_cost(all): 9:12:29/1 day, 18:28:24, loss=0.540103207117212, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.705896728991953, lr=0.08954470713761849
2023-11-26 18:49:45   INFO  epoch: 4/24, acc_iter=28898, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:40, time_cost(all): 9:13:27/1 day, 17:55:53, loss=0.539995664757109, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=2.3924112329876177, lr=0.0895046153648779
2023-11-26 18:50:43   INFO  epoch: 4/24, acc_iter=28948, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:40, time_cost(all): 9:14:25/1 day, 15:31:42, loss=0.539888122397007, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=4.2047682217726745, lr=0.08946452359213732
2023-11-26 18:51:41   INFO  epoch: 4/24, acc_iter=28998, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:12:32, time_cost(all): 9:15:23/1 day, 19:19:18, loss=0.539780580036904, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.244080032599995, lr=0.08942443181939673
2023-11-26 18:52:38   INFO  epoch: 4/24, acc_iter=29048, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:17:59, time_cost(all): 9:16:20/1 day, 15:51:19, loss=0.539673037676801, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=2.603372154184049, lr=0.08938434004665614
2023-11-26 18:53:36   INFO  epoch: 4/24, acc_iter=29098, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:15:16, time_cost(all): 9:17:18/1 day, 19:02:39, loss=0.539565495316699, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=4.94306354336746, lr=0.08934424827391554
2023-11-26 18:54:34   INFO  epoch: 4/24, acc_iter=29148, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:12:26, time_cost(all): 9:18:16/1 day, 17:10:55, loss=0.539457952956596, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=2.5328560369696556, lr=0.08930415650117496
2023-11-26 18:55:32   INFO  epoch: 4/24, acc_iter=29198, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:14:53, time_cost(all): 9:19:14/1 day, 16:24:39, loss=0.539350410596493, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=2.3973244633359125, lr=0.08926406472843437
2023-11-26 18:56:29   INFO  epoch: 4/24, acc_iter=29248, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:36, time_cost(all): 9:20:11/1 day, 17:10:23, loss=0.539242868236391, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=2.2250436379277714, lr=0.08922397295569379
2023-11-26 18:57:27   INFO  epoch: 4/24, acc_iter=29298, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:11:47, time_cost(all): 9:21:09/1 day, 18:06:50, loss=0.539135325876288, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.6810471131411475, lr=0.0891838811829532
2023-11-26 18:58:25   INFO  epoch: 4/24, acc_iter=29348, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:06:54, time_cost(all): 9:22:07/1 day, 16:19:56, loss=0.539027783516185, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=1.10412341945552, lr=0.08914378941021261
2023-11-26 18:59:23   INFO  epoch: 4/24, acc_iter=29398, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:05:17, time_cost(all): 9:23:05/1 day, 16:12:04, loss=0.538920241156083, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.0684644778262014, lr=0.08910369763747203
2023-11-26 19:00:20   INFO  epoch: 4/24, acc_iter=29448, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:03:54, time_cost(all): 9:24:02/1 day, 16:58:35, loss=0.53881269879598, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=0.8040236163954124, lr=0.08906360586473143
2023-11-26 19:01:18   INFO  epoch: 4/24, acc_iter=29498, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:02:59, time_cost(all): 9:25:00/1 day, 16:46:20, loss=0.538705156435877, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.9349222529134806, lr=0.08902351409199084
2023-11-26 19:02:16   INFO  epoch: 4/24, acc_iter=29548, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:02:13, time_cost(all): 9:25:58/1 day, 17:46:08, loss=0.538597614075774, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.875874767023506, lr=0.08898342231925026
2023-11-26 19:03:14   INFO  epoch: 4/24, acc_iter=29598, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:05:18, time_cost(all): 9:26:56/1 day, 15:15:08, loss=0.538490071715672, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=2.2209223954877912, lr=0.08894333054650967
2023-11-26 19:04:11   INFO  epoch: 4/24, acc_iter=29648, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:04:03, time_cost(all): 9:27:53/1 day, 15:55:49, loss=0.538382529355569, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=2.51574365457171, lr=0.08890323877376909
2023-11-26 19:05:09   INFO  epoch: 4/24, acc_iter=29698, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:10, time_cost(all): 9:28:51/1 day, 17:33:44, loss=0.538274986995466, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.694055705867466, lr=0.08886314700102849
2023-11-26 19:06:07   INFO  epoch: 4/24, acc_iter=29748, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:03:00, time_cost(all): 9:29:49/1 day, 17:53:40, loss=0.538167444635364, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=1.2758594512089863, lr=0.0888230552282879
2023-11-26 19:07:05   INFO  epoch: 4/24, acc_iter=29798, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:02:49, time_cost(all): 9:30:47/1 day, 18:09:34, loss=0.538059902275261, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=4.752397403244184, lr=0.08878296345554731
2023-11-26 19:08:02   INFO  epoch: 4/24, acc_iter=29848, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:58:16, time_cost(all): 9:31:44/1 day, 18:36:50, loss=0.537952359915158, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=1.7897035636831333, lr=0.08874287168280673
2023-11-26 19:09:00   INFO  epoch: 4/24, acc_iter=29898, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:01:07, time_cost(all): 9:32:42/1 day, 18:02:40, loss=0.537844817555056, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=1.3257376770278477, lr=0.08870277991006614
2023-11-26 19:09:58   INFO  epoch: 4/24, acc_iter=29948, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:17, time_cost(all): 9:33:40/1 day, 17:08:56, loss=0.537737275194953, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.211139464730054, lr=0.08866268813732556
2023-11-26 19:10:56   INFO  epoch: 4/24, acc_iter=29998, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:54:47, time_cost(all): 9:34:38/1 day, 16:08:39, loss=0.53762973283485, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.3490636753465495, lr=0.08862259636458497
2023-11-26 19:11:53   INFO  epoch: 4/24, acc_iter=30048, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:14, time_cost(all): 9:35:35/1 day, 17:47:37, loss=0.537522190474748, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.4989218092900756, lr=0.08858250459184439
2023-11-26 19:12:51   INFO  epoch: 4/24, acc_iter=30098, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:52:01, time_cost(all): 9:36:33/1 day, 17:53:44, loss=0.537414648114645, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=3.8978594429846964, lr=0.08854241281910379
2023-11-26 19:13:49   INFO  epoch: 4/24, acc_iter=30148, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:54:25, time_cost(all): 9:37:31/1 day, 16:27:52, loss=0.537307105754542, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=1.007762382091559, lr=0.0885023210463632
2023-11-26 19:14:47   INFO  epoch: 4/24, acc_iter=30198, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:53:25, time_cost(all): 9:38:29/1 day, 16:11:53, loss=0.53719956339444, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.112095297387809, lr=0.08846222927362261
2023-11-26 19:15:44   INFO  epoch: 4/24, acc_iter=30248, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:50:09, time_cost(all): 9:39:26/1 day, 16:54:29, loss=0.537092021034337, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=0.5648902474028992, lr=0.08842213750088203
2023-11-26 19:16:42   INFO  epoch: 4/24, acc_iter=30298, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:35, time_cost(all): 9:40:24/1 day, 18:23:36, loss=0.536984478674234, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.7554302289587131, lr=0.08838204572814144
2023-11-26 19:17:40   INFO  epoch: 4/24, acc_iter=30348, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:51:37, time_cost(all): 9:41:22/1 day, 16:11:16, loss=0.536876936314132, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=0.9753551560717297, lr=0.08834195395540084
2023-11-26 19:18:38   INFO  epoch: 4/24, acc_iter=30398, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:47:49, time_cost(all): 9:42:20/1 day, 17:21:32, loss=0.536769393954029, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.1543325118868741, lr=0.08830186218266026
2023-11-26 19:19:35   INFO  epoch: 4/24, acc_iter=30448, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:19, time_cost(all): 9:43:17/1 day, 15:37:41, loss=0.536661851593926, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=3.6965174762776987, lr=0.08826177040991967
2023-11-26 19:20:33   INFO  epoch: 4/24, acc_iter=30498, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:17, time_cost(all): 9:44:15/1 day, 15:25:52, loss=0.536554309233824, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.884972760836314, lr=0.08822167863717909
2023-11-26 19:21:31   INFO  epoch: 4/24, acc_iter=30548, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:47:52, time_cost(all): 9:45:13/1 day, 15:26:31, loss=0.536446766873721, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=4.162397217778427, lr=0.0881815868644385
2023-11-26 19:22:29   INFO  epoch: 4/24, acc_iter=30598, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:08, time_cost(all): 9:46:11/1 day, 17:57:05, loss=0.536339224513618, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=3.055700759899338, lr=0.08814149509169791
2023-11-26 19:23:26   INFO  epoch: 4/24, acc_iter=30648, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:26, time_cost(all): 9:47:08/1 day, 17:42:56, loss=0.536231682153516, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=3.2505128606305385, lr=0.08810140331895733
2023-11-26 19:24:24   INFO  epoch: 4/24, acc_iter=30698, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:37, time_cost(all): 9:48:06/1 day, 15:56:13, loss=0.536124139793413, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=4.21752821062597, lr=0.08806131154621673
2023-11-26 19:25:22   INFO  epoch: 4/24, acc_iter=30748, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:22, time_cost(all): 9:49:04/1 day, 18:52:14, loss=0.53601659743331, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=1.7234303276753442, lr=0.08802121977347614
2023-11-26 19:26:20   INFO  epoch: 4/24, acc_iter=30798, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:39:22, time_cost(all): 9:50:02/1 day, 16:22:18, loss=0.535909055073208, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=0.5415540972988302, lr=0.08798112800073556
2023-11-26 19:27:17   INFO  epoch: 4/24, acc_iter=30848, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:43, time_cost(all): 9:50:59/1 day, 16:55:48, loss=0.535801512713105, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=0.9727700092187964, lr=0.08794103622799497
2023-11-26 19:28:15   INFO  epoch: 4/24, acc_iter=30898, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:38:26, time_cost(all): 9:51:57/1 day, 17:28:15, loss=0.535693970353002, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.83(1.03), norm=2.8472500556706044, lr=0.08790094445525438
2023-11-26 19:29:13   INFO  epoch: 4/24, acc_iter=30948, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:14, time_cost(all): 9:52:55/1 day, 15:56:56, loss=0.535586427992899, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.5088497789127819, lr=0.0878608526825138
2023-11-26 19:30:11   INFO  epoch: 4/24, acc_iter=30998, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:10, time_cost(all): 9:53:53/1 day, 18:15:05, loss=0.535478885632797, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=1.2745948918029577, lr=0.0878207609097732
2023-11-26 19:31:08   INFO  epoch: 4/24, acc_iter=31048, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:36:56, time_cost(all): 9:54:50/1 day, 15:53:08, loss=0.535371343272694, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.263516606992743, lr=0.08778066913703261
2023-11-26 19:32:06   INFO  epoch: 4/24, acc_iter=31098, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:25, time_cost(all): 9:55:48/1 day, 17:50:45, loss=0.535263800912591, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=0.732189461150966, lr=0.08774057736429203
2023-11-26 19:33:04   INFO  epoch: 4/24, acc_iter=31148, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:39, time_cost(all): 9:56:46/1 day, 17:22:42, loss=0.535156258552489, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.25423661375789, lr=0.08770048559155144
2023-11-26 19:34:02   INFO  epoch: 4/24, acc_iter=31198, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:15, time_cost(all): 9:57:44/1 day, 16:35:19, loss=0.535048716192386, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=2.0766764780127835, lr=0.08766039381881086
2023-11-26 19:34:59   INFO  epoch: 4/24, acc_iter=31248, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:42, time_cost(all): 9:58:41/1 day, 17:51:57, loss=0.534941173832283, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=0.6778132285454073, lr=0.08762030204607027
2023-11-26 19:35:57   INFO  epoch: 4/24, acc_iter=31298, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:26, time_cost(all): 9:59:39/1 day, 16:46:27, loss=0.534833631472181, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.164189826975151, lr=0.08758021027332968
2023-11-26 19:36:55   INFO  epoch: 4/24, acc_iter=31348, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:50, time_cost(all): 10:00:37/1 day, 15:44:51, loss=0.534726089112078, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=4.6934727066578645, lr=0.08754011850058908
2023-11-26 19:37:53   INFO  epoch: 4/24, acc_iter=31398, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:11, time_cost(all): 10:01:35/1 day, 15:34:59, loss=0.534618546751975, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.4762438951416232, lr=0.0875000267278485
2023-11-26 19:38:50   INFO  epoch: 4/24, acc_iter=31448, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:05, time_cost(all): 10:02:32/1 day, 18:04:29, loss=0.534511004391873, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=3.547663552673081, lr=0.08745993495510791
2023-11-26 19:39:48   INFO  epoch: 4/24, acc_iter=31498, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:48, time_cost(all): 10:03:30/1 day, 14:55:26, loss=0.53440346203177, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=1.4289250185075795, lr=0.08741984318236733
2023-11-26 19:40:46   INFO  epoch: 4/24, acc_iter=31548, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:15, time_cost(all): 10:04:28/1 day, 14:56:49, loss=0.534295919671667, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=2.864074830407431, lr=0.08737975140962674
2023-11-26 19:41:44   INFO  epoch: 4/24, acc_iter=31598, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:07, time_cost(all): 10:05:26/1 day, 17:06:39, loss=0.534188377311565, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=1.259819811973808, lr=0.08733965963688614
2023-11-26 19:42:41   INFO  epoch: 4/24, acc_iter=31648, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:26, time_cost(all): 10:06:23/1 day, 18:31:18, loss=0.534080834951462, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=3.599463762680944, lr=0.08729956786414556
2023-11-26 19:43:39   INFO  epoch: 4/24, acc_iter=31698, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:42, time_cost(all): 10:07:21/1 day, 15:02:46, loss=0.533973292591359, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=2.627663308983511, lr=0.08725947609140497
2023-11-26 19:44:37   INFO  epoch: 4/24, acc_iter=31748, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:53, time_cost(all): 10:08:19/1 day, 17:43:39, loss=0.533865750231257, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=2.0646159674646265, lr=0.08721938431866438
2023-11-26 19:45:35   INFO  epoch: 4/24, acc_iter=31798, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:20:58, time_cost(all): 10:09:17/1 day, 17:22:22, loss=0.533758207871154, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.769541268566386, lr=0.0871792925459238
2023-11-26 19:46:32   INFO  epoch: 4/24, acc_iter=31848, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:37, time_cost(all): 10:10:14/1 day, 16:12:59, loss=0.533650665511051, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.4230474027132143, lr=0.08713920077318321
2023-11-26 19:47:30   INFO  epoch: 4/24, acc_iter=31898, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:08, time_cost(all): 10:11:12/1 day, 17:27:15, loss=0.533543123150949, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=0.8874311366468548, lr=0.08709910900044263
2023-11-26 19:48:28   INFO  epoch: 4/24, acc_iter=31948, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:33, time_cost(all): 10:12:10/1 day, 14:31:51, loss=0.533435580790846, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=0.8990419711228742, lr=0.08705901722770203
2023-11-26 19:49:26   INFO  epoch: 4/24, acc_iter=31998, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:57, time_cost(all): 10:13:08/1 day, 16:00:39, loss=0.533328038430743, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=2.874778371903496, lr=0.08701892545496144
2023-11-26 19:50:23   INFO  epoch: 4/24, acc_iter=32048, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:37, time_cost(all): 10:14:05/1 day, 16:35:25, loss=0.53322049607064, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=1.2851573259013382, lr=0.08697883368222085
2023-11-26 19:51:21   INFO  epoch: 4/24, acc_iter=32098, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:21, time_cost(all): 10:15:03/1 day, 16:27:00, loss=0.533112953710538, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.226378479077644, lr=0.08693874190948027
2023-11-26 19:52:19   INFO  epoch: 4/24, acc_iter=32148, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:58, time_cost(all): 10:16:01/1 day, 17:50:31, loss=0.533005411350435, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.6422867434443618, lr=0.08689865013673968
2023-11-26 19:53:17   INFO  epoch: 4/24, acc_iter=32198, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:59, time_cost(all): 10:16:59/1 day, 18:24:08, loss=0.532897868990333, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=3.9845865939768395, lr=0.0868585583639991
2023-11-26 19:54:14   INFO  epoch: 4/24, acc_iter=32248, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:08, time_cost(all): 10:17:56/1 day, 16:51:27, loss=0.53279032663023, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=3.989694402726146, lr=0.0868184665912585
2023-11-26 19:55:12   INFO  epoch: 4/24, acc_iter=32298, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:56, time_cost(all): 10:18:54/1 day, 17:20:46, loss=0.532682784270127, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=2.1181168719831596, lr=0.08677837481851791
2023-11-26 19:56:10   INFO  epoch: 4/24, acc_iter=32348, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:19, time_cost(all): 10:19:52/1 day, 15:25:38, loss=0.532575241910024, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=4.650569241998075, lr=0.08673828304577733
2023-11-26 19:57:08   INFO  epoch: 4/24, acc_iter=32398, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:39, time_cost(all): 10:20:50/1 day, 17:57:18, loss=0.532467699549922, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=1.6487031614936887, lr=0.08669819127303674
2023-11-26 19:58:06   INFO  epoch: 4/24, acc_iter=32448, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:49, time_cost(all): 10:21:48/1 day, 17:25:54, loss=0.532360157189819, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=0.8205781264686514, lr=0.08665809950029615
2023-11-26 19:59:03   INFO  epoch: 4/24, acc_iter=32498, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:18, time_cost(all): 10:22:45/1 day, 16:23:10, loss=0.532252614829716, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=1.930584613710805, lr=0.08661800772755557
2023-11-26 20:00:01   INFO  epoch: 4/24, acc_iter=32548, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:15, time_cost(all): 10:23:43/1 day, 17:40:38, loss=0.532145072469614, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=1.0238524185029236, lr=0.08657791595481498
2023-11-26 20:00:59   INFO  epoch: 4/24, acc_iter=32598, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:13, time_cost(all): 10:24:41/1 day, 15:23:49, loss=0.532037530109511, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=3.506562449237484, lr=0.08653782418207438
2023-11-26 20:01:57   INFO  epoch: 4/24, acc_iter=32648, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:38, time_cost(all): 10:25:39/1 day, 16:35:11, loss=0.531929987749408, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.646942393477331, lr=0.0864977324093338
2023-11-26 20:02:54   INFO  epoch: 4/24, acc_iter=32698, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:46, time_cost(all): 10:26:36/1 day, 15:00:18, loss=0.531822445389306, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.503376317777704, lr=0.08645764063659321
2023-11-26 20:03:52   INFO  epoch: 4/24, acc_iter=32748, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:38, time_cost(all): 10:27:34/1 day, 15:48:16, loss=0.531714903029203, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=4.174449646149599, lr=0.08641754886385263
2023-11-26 20:04:50   INFO  epoch: 4/24, acc_iter=32798, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:33, time_cost(all): 10:28:32/1 day, 14:44:39, loss=0.5316073606691, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=2.65389595321897, lr=0.08637745709111204
2023-11-26 20:05:48   INFO  epoch: 4/24, acc_iter=32848, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:44, time_cost(all): 10:29:30/1 day, 18:06:37, loss=0.531499818308998, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=1.0967771696377582, lr=0.08633736531837144
2023-11-26 20:06:45   INFO  epoch: 4/24, acc_iter=32898, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 10:30:27/1 day, 17:58:49, loss=0.531392275948895, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.776836404411652, lr=0.08629727354563085
2023-11-26 20:07:43   INFO  epoch: 5/24, acc_iter=32985, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:05:23, time_cost(all): 10:31:25/1 day, 16:54:17, loss=0.531205152242316, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=2.794947992955316, lr=0.08622751386106224
2023-11-26 20:08:41   INFO  epoch: 5/24, acc_iter=33035, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:10:51, time_cost(all): 10:32:23/1 day, 14:33:47, loss=0.531097609882214, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=4.001223236504834, lr=0.08618742208832166
2023-11-26 20:09:39   INFO  epoch: 5/24, acc_iter=33085, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:08:41, time_cost(all): 10:33:21/1 day, 14:44:55, loss=0.530990067522111, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=4.615946775162946, lr=0.08614733031558106
2023-11-26 20:10:36   INFO  epoch: 5/24, acc_iter=33135, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:03:09, time_cost(all): 10:34:18/1 day, 15:09:10, loss=0.530882525162008, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=2.919740488396475, lr=0.08610723854284047
2023-11-26 20:11:34   INFO  epoch: 5/24, acc_iter=33185, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:58:14, time_cost(all): 10:35:16/1 day, 15:51:53, loss=0.530774982801906, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=3.593281940158248, lr=0.08606714677009988
2023-11-26 20:12:32   INFO  epoch: 5/24, acc_iter=33235, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:57:14, time_cost(all): 10:36:14/1 day, 14:07:45, loss=0.530667440441803, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=0.512265654734555, lr=0.0860270549973593
2023-11-26 20:13:30   INFO  epoch: 5/24, acc_iter=33285, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:00:44, time_cost(all): 10:37:12/1 day, 16:05:56, loss=0.5305598980817, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=2.240717292560568, lr=0.08598696322461871
2023-11-26 20:14:27   INFO  epoch: 5/24, acc_iter=33335, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:59:02, time_cost(all): 10:38:09/1 day, 16:10:49, loss=0.530452355721598, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=2.439824285164969, lr=0.08594687145187813
2023-11-26 20:15:25   INFO  epoch: 5/24, acc_iter=33385, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:02:39, time_cost(all): 10:39:07/1 day, 14:43:17, loss=0.530344813361495, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.1070177476272565, lr=0.08590677967913753
2023-11-26 20:16:23   INFO  epoch: 5/24, acc_iter=33435, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:56:33, time_cost(all): 10:40:05/1 day, 16:07:03, loss=0.530237271001392, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.325139192095565, lr=0.08586668790639694
2023-11-26 20:17:21   INFO  epoch: 5/24, acc_iter=33485, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:53:06, time_cost(all): 10:41:03/1 day, 16:56:01, loss=0.530129728641289, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=0.6347271985045382, lr=0.08582659613365635
2023-11-26 20:18:18   INFO  epoch: 5/24, acc_iter=33535, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:55:28, time_cost(all): 10:42:00/1 day, 17:02:57, loss=0.530022186281187, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.4537086160254242, lr=0.08578650436091577
2023-11-26 20:19:16   INFO  epoch: 5/24, acc_iter=33585, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:52:18, time_cost(all): 10:42:58/1 day, 14:55:53, loss=0.529914643921084, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.5142315767424648, lr=0.08574641258817518
2023-11-26 20:20:14   INFO  epoch: 5/24, acc_iter=33635, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:54:32, time_cost(all): 10:43:56/1 day, 17:38:46, loss=0.529807101560982, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.12(1.03), norm=2.8172350235688786, lr=0.0857063208154346
2023-11-26 20:21:12   INFO  epoch: 5/24, acc_iter=33685, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:49:24, time_cost(all): 10:44:54/1 day, 15:37:15, loss=0.529699559200879, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=3.377574720290409, lr=0.085666229042694
2023-11-26 20:22:09   INFO  epoch: 5/24, acc_iter=33735, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:46:09, time_cost(all): 10:45:51/1 day, 14:51:26, loss=0.529592016840776, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=1.4505844052943357, lr=0.08562613726995341
2023-11-26 20:23:07   INFO  epoch: 5/24, acc_iter=33785, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:46:36, time_cost(all): 10:46:49/1 day, 16:22:26, loss=0.529484474480673, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.283930307788556, lr=0.08558604549721283
2023-11-26 20:24:05   INFO  epoch: 5/24, acc_iter=33835, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:50:31, time_cost(all): 10:47:47/1 day, 14:29:46, loss=0.529376932120571, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.18048015297586, lr=0.08554595372447224
2023-11-26 20:25:03   INFO  epoch: 5/24, acc_iter=33885, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:49:56, time_cost(all): 10:48:45/1 day, 14:19:45, loss=0.529269389760468, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.160182224540158, lr=0.08550586195173165
2023-11-26 20:26:00   INFO  epoch: 5/24, acc_iter=33935, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:42:47, time_cost(all): 10:49:42/1 day, 15:42:16, loss=0.529161847400365, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=4.238757566763752, lr=0.08546577017899107
2023-11-26 20:26:58   INFO  epoch: 5/24, acc_iter=33985, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:45, time_cost(all): 10:50:40/1 day, 14:14:17, loss=0.529054305040263, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=2.8525895995765107, lr=0.08542567840625047
2023-11-26 20:27:56   INFO  epoch: 5/24, acc_iter=34035, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:40:45, time_cost(all): 10:51:38/1 day, 16:00:38, loss=0.52894676268016, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=3.658187767213914, lr=0.08538558663350988
2023-11-26 20:28:54   INFO  epoch: 5/24, acc_iter=34085, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:40:42, time_cost(all): 10:52:36/1 day, 13:54:28, loss=0.528839220320057, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.715920239515194, lr=0.0853454948607693
2023-11-26 20:29:51   INFO  epoch: 5/24, acc_iter=34135, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:41:06, time_cost(all): 10:53:33/1 day, 16:46:02, loss=0.528731677959955, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.7436412171634132, lr=0.08530540308802871
2023-11-26 20:30:49   INFO  epoch: 5/24, acc_iter=34185, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:47:35, time_cost(all): 10:54:31/1 day, 17:42:42, loss=0.528624135599852, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.3788373289215592, lr=0.08526531131528813
2023-11-26 20:31:47   INFO  epoch: 5/24, acc_iter=34235, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:45:18, time_cost(all): 10:55:29/1 day, 16:59:47, loss=0.528516593239749, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.457578813648747, lr=0.08522521954254754
2023-11-26 20:32:45   INFO  epoch: 5/24, acc_iter=34285, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:42:22, time_cost(all): 10:56:27/1 day, 15:24:40, loss=0.528409050879647, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=1.1380284160345013, lr=0.08518512776980694
2023-11-26 20:33:42   INFO  epoch: 5/24, acc_iter=34335, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:35:21, time_cost(all): 10:57:24/1 day, 15:53:09, loss=0.528301508519544, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=1.8313445057613844, lr=0.08514503599706635
2023-11-26 20:34:40   INFO  epoch: 5/24, acc_iter=34385, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:43:36, time_cost(all): 10:58:22/1 day, 17:41:09, loss=0.528193966159441, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.7343866077677825, lr=0.08510494422432577
2023-11-26 20:35:38   INFO  epoch: 5/24, acc_iter=34435, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:41:37, time_cost(all): 10:59:20/1 day, 15:41:10, loss=0.528086423799339, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.7359665435828653, lr=0.08506485245158518
2023-11-26 20:36:36   INFO  epoch: 5/24, acc_iter=34485, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:40:27, time_cost(all): 11:00:18/1 day, 16:36:36, loss=0.527978881439236, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.6613623207950021, lr=0.0850247606788446
2023-11-26 20:37:33   INFO  epoch: 5/24, acc_iter=34535, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:37:14, time_cost(all): 11:01:15/1 day, 13:57:47, loss=0.527871339079133, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=1.8103796971110495, lr=0.08498466890610401
2023-11-26 20:38:31   INFO  epoch: 5/24, acc_iter=34585, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:31:09, time_cost(all): 11:02:13/1 day, 17:04:14, loss=0.527763796719031, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=2.426675791672518, lr=0.08494457713336342
2023-11-26 20:39:29   INFO  epoch: 5/24, acc_iter=34635, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:38:27, time_cost(all): 11:03:11/1 day, 14:05:31, loss=0.527656254358928, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=1.313267496598504, lr=0.08490448536062283
2023-11-26 20:40:27   INFO  epoch: 5/24, acc_iter=34685, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:32:29, time_cost(all): 11:04:09/1 day, 15:09:53, loss=0.527548711998825, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.7898729333927021, lr=0.08486439358788224
2023-11-26 20:41:24   INFO  epoch: 5/24, acc_iter=34735, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:33:08, time_cost(all): 11:05:06/1 day, 16:48:37, loss=0.527441169638723, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=3.8261193385662553, lr=0.08482430181514165
2023-11-26 20:42:22   INFO  epoch: 5/24, acc_iter=34785, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:32:04, time_cost(all): 11:06:04/1 day, 15:04:40, loss=0.52733362727862, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=1.5852098176949285, lr=0.08478421004240107
2023-11-26 20:43:20   INFO  epoch: 5/24, acc_iter=34835, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:25:55, time_cost(all): 11:07:02/1 day, 13:41:14, loss=0.527226084918517, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=1.7536910017305163, lr=0.08474411826966048
2023-11-26 20:44:18   INFO  epoch: 5/24, acc_iter=34885, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:30:25, time_cost(all): 11:08:00/1 day, 15:27:14, loss=0.527118542558415, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=1.9884046276772336, lr=0.08470402649691988
2023-11-26 20:45:15   INFO  epoch: 5/24, acc_iter=34935, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:27:26, time_cost(all): 11:08:57/1 day, 15:35:40, loss=0.527011000198312, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=4.014515184875855, lr=0.0846639347241793
2023-11-26 20:46:13   INFO  epoch: 5/24, acc_iter=34985, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:27:10, time_cost(all): 11:09:55/1 day, 16:54:39, loss=0.526903457838209, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.303964410700409, lr=0.08462384295143871
2023-11-26 20:47:11   INFO  epoch: 5/24, acc_iter=35035, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:24:00, time_cost(all): 11:10:53/1 day, 15:39:24, loss=0.526795915478106, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=1.3272662985764803, lr=0.08458375117869812
2023-11-26 20:48:09   INFO  epoch: 5/24, acc_iter=35085, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:25:14, time_cost(all): 11:11:51/1 day, 14:37:58, loss=0.526688373118004, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=0.8465944280533548, lr=0.08454365940595754
2023-11-26 20:49:06   INFO  epoch: 5/24, acc_iter=35135, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:00, time_cost(all): 11:12:48/1 day, 16:06:21, loss=0.526580830757901, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=4.116159094657425, lr=0.08450356763321695
2023-11-26 20:50:04   INFO  epoch: 5/24, acc_iter=35185, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:19:30, time_cost(all): 11:13:46/1 day, 13:53:41, loss=0.526473288397799, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=2.2392243052037903, lr=0.08446347586047637
2023-11-26 20:51:02   INFO  epoch: 5/24, acc_iter=35235, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:09, time_cost(all): 11:14:44/1 day, 15:42:12, loss=0.526365746037696, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.473166946837429, lr=0.08442338408773577
2023-11-26 20:52:00   INFO  epoch: 5/24, acc_iter=35285, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:18:47, time_cost(all): 11:15:42/1 day, 14:00:28, loss=0.526258203677593, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=2.797108118940526, lr=0.08438329231499518
2023-11-26 20:52:57   INFO  epoch: 5/24, acc_iter=35335, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:21:29, time_cost(all): 11:16:39/1 day, 14:45:38, loss=0.52615066131749, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=4.394548579263689, lr=0.0843432005422546
2023-11-26 20:53:55   INFO  epoch: 5/24, acc_iter=35385, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:23:18, time_cost(all): 11:17:37/1 day, 16:45:08, loss=0.526043118957388, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=3.430599836903425, lr=0.08430310876951401
2023-11-26 20:54:53   INFO  epoch: 5/24, acc_iter=35435, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:00, time_cost(all): 11:18:35/1 day, 16:09:20, loss=0.525935576597285, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.3198220857470706, lr=0.08426301699677342
2023-11-26 20:55:51   INFO  epoch: 5/24, acc_iter=35485, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:31, time_cost(all): 11:19:33/1 day, 13:45:13, loss=0.525828034237182, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.6549199229867817, lr=0.08422292522403284
2023-11-26 20:56:48   INFO  epoch: 5/24, acc_iter=35535, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:45, time_cost(all): 11:20:30/1 day, 16:34:49, loss=0.52572049187708, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=1.743950339392247, lr=0.08418283345129224
2023-11-26 20:57:46   INFO  epoch: 5/24, acc_iter=35585, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:30, time_cost(all): 11:21:28/1 day, 15:24:01, loss=0.525612949516977, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.9794025773159, lr=0.08414274167855165
2023-11-26 20:58:44   INFO  epoch: 5/24, acc_iter=35635, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:54, time_cost(all): 11:22:26/1 day, 14:00:40, loss=0.525505407156874, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.395630228000375, lr=0.08410264990581107
2023-11-26 20:59:42   INFO  epoch: 5/24, acc_iter=35685, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:17:20, time_cost(all): 11:23:24/1 day, 16:14:38, loss=0.525397864796772, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=1.5057633006055662, lr=0.08406255813307048
2023-11-26 21:00:39   INFO  epoch: 5/24, acc_iter=35735, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:10:31, time_cost(all): 11:24:21/1 day, 15:37:49, loss=0.525290322436669, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=4.453118562897997, lr=0.0840224663603299
2023-11-26 21:01:37   INFO  epoch: 5/24, acc_iter=35785, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:14:02, time_cost(all): 11:25:19/1 day, 15:40:36, loss=0.525182780076566, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=1.8844916111087533, lr=0.08398237458758931
2023-11-26 21:02:35   INFO  epoch: 5/24, acc_iter=35835, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:08:37, time_cost(all): 11:26:17/1 day, 15:05:39, loss=0.525075237716464, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.2282729007958877, lr=0.08394228281484872
2023-11-26 21:03:33   INFO  epoch: 5/24, acc_iter=35885, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:10:56, time_cost(all): 11:27:15/1 day, 15:34:23, loss=0.524967695356361, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=3.234948424098418, lr=0.08390219104210812
2023-11-26 21:04:30   INFO  epoch: 5/24, acc_iter=35935, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:06:34, time_cost(all): 11:28:12/1 day, 14:36:51, loss=0.524860152996258, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=0.695186381790196, lr=0.08386209926936754
2023-11-26 21:05:28   INFO  epoch: 5/24, acc_iter=35985, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:06:14, time_cost(all): 11:29:10/1 day, 16:55:29, loss=0.524752610636156, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=4.849512996525529, lr=0.08382200749662695
2023-11-26 21:06:26   INFO  epoch: 5/24, acc_iter=36035, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:06:27, time_cost(all): 11:30:08/1 day, 16:58:20, loss=0.524645068276053, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.506334894089636, lr=0.08378191572388637
2023-11-26 21:07:24   INFO  epoch: 5/24, acc_iter=36085, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:09, time_cost(all): 11:31:06/1 day, 16:06:52, loss=0.52453752591595, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=2.182487167203139, lr=0.08374182395114578
2023-11-26 21:08:21   INFO  epoch: 5/24, acc_iter=36135, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:03:07, time_cost(all): 11:32:03/1 day, 14:28:32, loss=0.524429983555848, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.0552479626269657, lr=0.08370173217840518
2023-11-26 21:09:19   INFO  epoch: 5/24, acc_iter=36185, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:01:31, time_cost(all): 11:33:01/1 day, 13:42:15, loss=0.524322441195745, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=4.492896307737615, lr=0.0836616404056646
2023-11-26 21:10:17   INFO  epoch: 5/24, acc_iter=36235, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:02:56, time_cost(all): 11:33:59/1 day, 13:56:39, loss=0.524214898835642, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.0803712191496007, lr=0.08362154863292401
2023-11-26 21:11:15   INFO  epoch: 5/24, acc_iter=36285, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:18, time_cost(all): 11:34:57/1 day, 16:31:32, loss=0.52410735647554, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=1.8781686907183113, lr=0.08358145686018342
2023-11-26 21:12:12   INFO  epoch: 5/24, acc_iter=36335, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:09, time_cost(all): 11:35:54/1 day, 13:41:13, loss=0.523999814115437, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=2.112125178369599, lr=0.08354136508744284
2023-11-26 21:13:10   INFO  epoch: 5/24, acc_iter=36385, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:02:20, time_cost(all): 11:36:52/1 day, 15:38:45, loss=0.523892271755334, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=1.6729916582607482, lr=0.08350127331470225
2023-11-26 21:14:08   INFO  epoch: 5/24, acc_iter=36435, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:56:33, time_cost(all): 11:37:50/1 day, 14:17:11, loss=0.523784729395231, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=1.1487990136511217, lr=0.08346118154196167
2023-11-26 21:15:06   INFO  epoch: 5/24, acc_iter=36485, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:59:04, time_cost(all): 11:38:48/1 day, 16:11:56, loss=0.523677187035129, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=4.160529112163326, lr=0.08342108976922108
2023-11-26 21:16:03   INFO  epoch: 5/24, acc_iter=36535, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:56:30, time_cost(all): 11:39:45/1 day, 16:51:41, loss=0.523569644675026, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.3614255923928922, lr=0.08338099799648048
2023-11-26 21:17:01   INFO  epoch: 5/24, acc_iter=36585, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:05, time_cost(all): 11:40:43/1 day, 14:27:44, loss=0.523462102314923, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=2.549111808485738, lr=0.0833409062237399
2023-11-26 21:17:59   INFO  epoch: 5/24, acc_iter=36635, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:52, time_cost(all): 11:41:41/1 day, 15:32:14, loss=0.523354559954821, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.956794009967462, lr=0.08330081445099931
2023-11-26 21:18:57   INFO  epoch: 5/24, acc_iter=36685, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:55:32, time_cost(all): 11:42:39/1 day, 13:50:53, loss=0.523247017594718, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=0.9289996397441446, lr=0.08326072267825872
2023-11-26 21:19:54   INFO  epoch: 5/24, acc_iter=36735, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:52:26, time_cost(all): 11:43:36/1 day, 13:06:06, loss=0.523139475234615, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=1.4443509688005705, lr=0.08322063090551814
2023-11-26 21:20:52   INFO  epoch: 5/24, acc_iter=36785, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:52:14, time_cost(all): 11:44:34/1 day, 14:05:39, loss=0.523031932874513, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=1.7995566704288932, lr=0.08318053913277754
2023-11-26 21:21:50   INFO  epoch: 5/24, acc_iter=36835, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:51:08, time_cost(all): 11:45:32/1 day, 16:53:35, loss=0.52292439051441, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=3.8169951205738384, lr=0.08314044736003695
2023-11-26 21:22:48   INFO  epoch: 5/24, acc_iter=36885, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:02, time_cost(all): 11:46:30/1 day, 13:08:47, loss=0.522816848154307, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=4.691203497891522, lr=0.08310035558729637
2023-11-26 21:23:45   INFO  epoch: 5/24, acc_iter=36935, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:48:36, time_cost(all): 11:47:27/1 day, 15:49:39, loss=0.522709305794205, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=1.0762678698301074, lr=0.08306026381455578
2023-11-26 21:24:43   INFO  epoch: 5/24, acc_iter=36985, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:53, time_cost(all): 11:48:25/1 day, 13:19:35, loss=0.522601763434102, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.598104356122675, lr=0.0830201720418152
2023-11-26 21:25:41   INFO  epoch: 5/24, acc_iter=37035, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:49:31, time_cost(all): 11:49:23/1 day, 16:38:50, loss=0.522494221073999, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=2.8176475094748463, lr=0.08298008026907461
2023-11-26 21:26:39   INFO  epoch: 5/24, acc_iter=37085, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:44:44, time_cost(all): 11:50:21/1 day, 15:59:00, loss=0.522386678713897, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.6388980946771692, lr=0.08293998849633402
2023-11-26 21:27:36   INFO  epoch: 5/24, acc_iter=37135, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:43:50, time_cost(all): 11:51:18/1 day, 13:30:31, loss=0.522279136353794, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=4.1196155977626425, lr=0.08289989672359342
2023-11-26 21:28:34   INFO  epoch: 5/24, acc_iter=37185, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:11, time_cost(all): 11:52:16/1 day, 14:16:38, loss=0.522171593993691, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.0804516385386687, lr=0.08285980495085284
2023-11-26 21:29:32   INFO  epoch: 5/24, acc_iter=37235, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:42:40, time_cost(all): 11:53:14/1 day, 13:38:52, loss=0.522064051633589, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=1.5868196099033731, lr=0.08281971317811225
2023-11-26 21:30:30   INFO  epoch: 5/24, acc_iter=37285, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:43:55, time_cost(all): 11:54:12/1 day, 16:16:53, loss=0.521956509273486, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=3.5728568586121643, lr=0.08277962140537166
2023-11-26 21:31:27   INFO  epoch: 5/24, acc_iter=37335, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:22, time_cost(all): 11:55:09/1 day, 13:58:28, loss=0.521848966913383, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=2.6929245020159964, lr=0.08273952963263108
2023-11-26 21:32:25   INFO  epoch: 5/24, acc_iter=37385, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:40, time_cost(all): 11:56:07/1 day, 15:19:35, loss=0.521741424553281, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=4.001245863122813, lr=0.08269943785989049
2023-11-26 21:33:23   INFO  epoch: 5/24, acc_iter=37435, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:39:24, time_cost(all): 11:57:05/1 day, 14:18:45, loss=0.521633882193178, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=2.837810943400382, lr=0.0826593460871499
2023-11-26 21:34:21   INFO  epoch: 5/24, acc_iter=37485, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:37:18, time_cost(all): 11:58:03/1 day, 15:37:47, loss=0.521526339833075, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=2.3822011452362037, lr=0.08261925431440931
2023-11-26 21:35:18   INFO  epoch: 5/24, acc_iter=37535, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:12, time_cost(all): 11:59:00/1 day, 15:01:03, loss=0.521418797472972, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=4.037962205244199, lr=0.08257916254166872
2023-11-26 21:36:16   INFO  epoch: 5/24, acc_iter=37585, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:12, time_cost(all): 11:59:58/1 day, 16:38:39, loss=0.52131125511287, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.5787386395715073, lr=0.08253907076892814
2023-11-26 21:37:14   INFO  epoch: 5/24, acc_iter=37635, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:39, time_cost(all): 12:00:56/1 day, 13:34:43, loss=0.521203712752767, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=3.5842476738207405, lr=0.08249897899618755
2023-11-26 21:38:12   INFO  epoch: 5/24, acc_iter=37685, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:58, time_cost(all): 12:01:54/1 day, 15:44:04, loss=0.521096170392665, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=1.9221066790642305, lr=0.08245888722344696
2023-11-26 21:39:09   INFO  epoch: 5/24, acc_iter=37735, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:02, time_cost(all): 12:02:51/1 day, 13:41:52, loss=0.520988628032562, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=3.525573899752952, lr=0.08241879545070638
2023-11-26 21:40:07   INFO  epoch: 5/24, acc_iter=37785, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:46, time_cost(all): 12:03:49/1 day, 13:56:42, loss=0.520881085672459, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=3.0143437221918186, lr=0.08237870367796578
2023-11-26 21:41:05   INFO  epoch: 5/24, acc_iter=37835, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:33:38, time_cost(all): 12:04:47/1 day, 13:39:39, loss=0.520773543312356, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.3355817813045734, lr=0.08233861190522519
2023-11-26 21:42:03   INFO  epoch: 5/24, acc_iter=37885, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:59, time_cost(all): 12:05:45/1 day, 14:54:19, loss=0.520666000952254, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=1.3416740746619522, lr=0.0822985201324846
2023-11-26 21:43:01   INFO  epoch: 5/24, acc_iter=37935, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:16, time_cost(all): 12:06:43/1 day, 14:27:57, loss=0.520558458592151, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=2.826851308856315, lr=0.08225842835974402
2023-11-26 21:43:58   INFO  epoch: 5/24, acc_iter=37985, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:37, time_cost(all): 12:07:40/1 day, 13:09:56, loss=0.520450916232048, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=1.424040497738998, lr=0.08221833658700343
2023-11-26 21:44:56   INFO  epoch: 5/24, acc_iter=38035, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:25, time_cost(all): 12:08:38/1 day, 15:57:18, loss=0.520343373871946, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=4.722174283349151, lr=0.08217824481426284
2023-11-26 21:45:54   INFO  epoch: 5/24, acc_iter=38085, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:32, time_cost(all): 12:09:36/1 day, 16:08:34, loss=0.520235831511843, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=1.8665671043239978, lr=0.08213815304152225
2023-11-26 21:46:52   INFO  epoch: 5/24, acc_iter=38135, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:12, time_cost(all): 12:10:34/1 day, 15:57:13, loss=0.52012828915174, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=2.6180225269860924, lr=0.08209806126878166
2023-11-26 21:47:49   INFO  epoch: 5/24, acc_iter=38185, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:33, time_cost(all): 12:11:31/1 day, 14:51:26, loss=0.520020746791638, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.472464632092483, lr=0.08205796949604108
2023-11-26 21:48:47   INFO  epoch: 5/24, acc_iter=38235, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:04, time_cost(all): 12:12:29/1 day, 13:49:47, loss=0.519913204431535, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=2.091530746206596, lr=0.08201787772330049
2023-11-26 21:49:45   INFO  epoch: 5/24, acc_iter=38285, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:36, time_cost(all): 12:13:27/1 day, 15:30:10, loss=0.519805662071432, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.4776744908634332, lr=0.0819777859505599
2023-11-26 21:50:43   INFO  epoch: 5/24, acc_iter=38335, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:44, time_cost(all): 12:14:25/1 day, 13:18:00, loss=0.51969811971133, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.131486877132084, lr=0.08193769417781932
2023-11-26 21:51:40   INFO  epoch: 5/24, acc_iter=38385, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:06, time_cost(all): 12:15:22/1 day, 13:35:49, loss=0.519590577351227, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.785842709130485, lr=0.08189760240507873
2023-11-26 21:52:38   INFO  epoch: 5/24, acc_iter=38435, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:24, time_cost(all): 12:16:20/1 day, 14:58:22, loss=0.519483034991124, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.686584807994363, lr=0.08185751063233813
2023-11-26 21:53:36   INFO  epoch: 5/24, acc_iter=38485, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:35, time_cost(all): 12:17:18/1 day, 14:46:19, loss=0.519375492631022, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=2.170990399051969, lr=0.08181741885959755
2023-11-26 21:54:34   INFO  epoch: 5/24, acc_iter=38535, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:36, time_cost(all): 12:18:16/1 day, 15:46:19, loss=0.519267950270919, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=0.7715653124095703, lr=0.08177732708685696
2023-11-26 21:55:31   INFO  epoch: 5/24, acc_iter=38585, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:31, time_cost(all): 12:19:13/1 day, 14:04:02, loss=0.519160407910816, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.841309456390475, lr=0.08173723531411638
2023-11-26 21:56:29   INFO  epoch: 5/24, acc_iter=38635, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:47, time_cost(all): 12:20:11/1 day, 15:04:13, loss=0.519052865550714, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.2020830313935607, lr=0.08169714354137579
2023-11-26 21:57:27   INFO  epoch: 5/24, acc_iter=38685, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:31, time_cost(all): 12:21:09/1 day, 13:17:17, loss=0.518945323190611, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=4.10134941793328, lr=0.08165705176863519
2023-11-26 21:58:25   INFO  epoch: 5/24, acc_iter=38735, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:43, time_cost(all): 12:22:07/1 day, 12:56:37, loss=0.518837780830508, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.7120814089477485, lr=0.0816169599958946
2023-11-26 21:59:22   INFO  epoch: 5/24, acc_iter=38785, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:25, time_cost(all): 12:23:04/1 day, 14:22:18, loss=0.518730238470406, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.2491825866507895, lr=0.08157686822315402
2023-11-26 22:00:20   INFO  epoch: 5/24, acc_iter=38835, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:00, time_cost(all): 12:24:02/1 day, 14:36:17, loss=0.518622696110303, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=4.91867923670677, lr=0.08153677645041343
2023-11-26 22:01:18   INFO  epoch: 5/24, acc_iter=38885, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:01, time_cost(all): 12:25:00/1 day, 15:44:07, loss=0.5185151537502, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=4.403003362348154, lr=0.08149668467767285
2023-11-26 22:02:16   INFO  epoch: 5/24, acc_iter=38935, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:46, time_cost(all): 12:25:58/1 day, 16:01:59, loss=0.518407611390098, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=2.109746474794413, lr=0.08145659290493226
2023-11-26 22:03:13   INFO  epoch: 5/24, acc_iter=38985, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:44, time_cost(all): 12:26:55/1 day, 13:15:50, loss=0.518300069029995, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=0.6031080765915231, lr=0.08141650113219168
2023-11-26 22:04:11   INFO  epoch: 5/24, acc_iter=39035, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:00, time_cost(all): 12:27:53/1 day, 14:03:35, loss=0.518192526669892, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=0.8748492650680924, lr=0.08137640935945108
2023-11-26 22:05:09   INFO  epoch: 5/24, acc_iter=39085, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:35, time_cost(all): 12:28:51/1 day, 12:37:08, loss=0.518084984309789, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=4.787933203633747, lr=0.08133631758671049
2023-11-26 22:06:07   INFO  epoch: 5/24, acc_iter=39135, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:47, time_cost(all): 12:29:49/1 day, 13:46:03, loss=0.517977441949687, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=3.7536290550515967, lr=0.0812962258139699
2023-11-26 22:07:04   INFO  epoch: 5/24, acc_iter=39185, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:11, time_cost(all): 12:30:46/1 day, 13:54:02, loss=0.517869899589584, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=3.1663552549949046, lr=0.08125613404122932
2023-11-26 22:08:02   INFO  epoch: 5/24, acc_iter=39235, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:35, time_cost(all): 12:31:44/1 day, 15:11:06, loss=0.517762357229481, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=0.5832571755211975, lr=0.08121604226848873
2023-11-26 22:09:00   INFO  epoch: 5/24, acc_iter=39285, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:37, time_cost(all): 12:32:42/1 day, 12:37:40, loss=0.517654814869379, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=4.500531131083535, lr=0.08117595049574813
2023-11-26 22:09:58   INFO  epoch: 5/24, acc_iter=39335, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:25, time_cost(all): 12:33:40/1 day, 13:37:14, loss=0.517547272509276, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.658731086765812, lr=0.08113585872300755
2023-11-26 22:10:55   INFO  epoch: 5/24, acc_iter=39385, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:30, time_cost(all): 12:34:37/1 day, 15:22:24, loss=0.517439730149173, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=0.9172075204470884, lr=0.08109576695026696
2023-11-26 22:11:53   INFO  epoch: 5/24, acc_iter=39435, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:37, time_cost(all): 12:35:35/1 day, 14:50:57, loss=0.517332187789071, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=1.258519067284177, lr=0.08105567517752638
2023-11-26 22:12:51   INFO  epoch: 5/24, acc_iter=39485, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:41, time_cost(all): 12:36:33/1 day, 14:58:03, loss=0.517224645428968, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=2.7267354372150963, lr=0.08101558340478579
2023-11-26 22:13:49   INFO  epoch: 6/24, acc_iter=39572, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:00:30, time_cost(all): 12:37:31/1 day, 13:25:12, loss=0.517037521722389, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=0.8078703120745517, lr=0.08094582372021716
2023-11-26 22:14:46   INFO  epoch: 6/24, acc_iter=39622, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:03:07, time_cost(all): 12:38:28/1 day, 13:36:13, loss=0.516929979362287, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.5965663833963704, lr=0.08090573194747658
2023-11-26 22:15:44   INFO  epoch: 6/24, acc_iter=39672, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:01:25, time_cost(all): 12:39:26/1 day, 15:38:22, loss=0.516822437002184, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.328650259439641, lr=0.08086564017473599
2023-11-26 22:16:42   INFO  epoch: 6/24, acc_iter=39722, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:04:55, time_cost(all): 12:40:24/1 day, 13:19:38, loss=0.516714894642081, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.1000445798311342, lr=0.0808255484019954
2023-11-26 22:17:40   INFO  epoch: 6/24, acc_iter=39772, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:08:03, time_cost(all): 12:41:22/1 day, 12:43:27, loss=0.516607352281979, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.92(1.03), norm=3.3371644298489187, lr=0.08078545662925482
2023-11-26 22:18:37   INFO  epoch: 6/24, acc_iter=39822, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:13, time_cost(all): 12:42:19/1 day, 15:02:01, loss=0.516499809921876, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=0.9021080794438363, lr=0.08074536485651422
2023-11-26 22:19:35   INFO  epoch: 6/24, acc_iter=39872, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:00:12, time_cost(all): 12:43:17/1 day, 13:39:10, loss=0.516392267561773, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.9599828750373214, lr=0.08070527308377363
2023-11-26 22:20:33   INFO  epoch: 6/24, acc_iter=39922, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:55:27, time_cost(all): 12:44:15/1 day, 13:10:40, loss=0.516284725201671, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=1.4116799789274532, lr=0.08066518131103305
2023-11-26 22:21:31   INFO  epoch: 6/24, acc_iter=39972, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:00:50, time_cost(all): 12:45:13/1 day, 13:22:44, loss=0.516177182841568, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.4011679889287785, lr=0.08062508953829246
2023-11-26 22:22:28   INFO  epoch: 6/24, acc_iter=40022, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:56:09, time_cost(all): 12:46:10/1 day, 14:44:37, loss=0.516069640481465, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=3.8477588324610146, lr=0.08058499776555188
2023-11-26 22:23:26   INFO  epoch: 6/24, acc_iter=40072, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/2:01:45, time_cost(all): 12:47:08/1 day, 12:42:08, loss=0.515962098121363, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.4877983565407202, lr=0.08054490599281128
2023-11-26 22:24:24   INFO  epoch: 6/24, acc_iter=40122, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:51:34, time_cost(all): 12:48:06/1 day, 12:51:04, loss=0.51585455576126, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=0.5865809582781221, lr=0.08050481422007069
2023-11-26 22:25:22   INFO  epoch: 6/24, acc_iter=40172, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:55:04, time_cost(all): 12:49:04/1 day, 14:22:18, loss=0.515747013401157, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=2.703380947897184, lr=0.0804647224473301
2023-11-26 22:26:19   INFO  epoch: 6/24, acc_iter=40222, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:58:02, time_cost(all): 12:50:01/1 day, 14:49:26, loss=0.515639471041055, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=4.54075247555288, lr=0.08042463067458952
2023-11-26 22:27:17   INFO  epoch: 6/24, acc_iter=40272, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:50:43, time_cost(all): 12:50:59/1 day, 12:55:44, loss=0.515531928680952, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.072480195451516, lr=0.08038453890184893
2023-11-26 22:28:15   INFO  epoch: 6/24, acc_iter=40322, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:47:28, time_cost(all): 12:51:57/1 day, 14:08:01, loss=0.515424386320849, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=1.4231090697180053, lr=0.08034444712910835
2023-11-26 22:29:13   INFO  epoch: 6/24, acc_iter=40372, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:45:12, time_cost(all): 12:52:55/1 day, 14:28:31, loss=0.515316843960747, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.8902637533219755, lr=0.08030435535636776
2023-11-26 22:30:10   INFO  epoch: 6/24, acc_iter=40422, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:52:08, time_cost(all): 12:53:52/1 day, 15:40:17, loss=0.515209301600644, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=0.8528778469267524, lr=0.08026426358362718
2023-11-26 22:31:08   INFO  epoch: 6/24, acc_iter=40472, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:50:33, time_cost(all): 12:54:50/1 day, 12:54:13, loss=0.515101759240541, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=4.396190674229896, lr=0.08022417181088658
2023-11-26 22:32:06   INFO  epoch: 6/24, acc_iter=40522, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:44:28, time_cost(all): 12:55:48/1 day, 14:31:12, loss=0.514994216880438, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=0.8884767270286467, lr=0.08018408003814599
2023-11-26 22:33:04   INFO  epoch: 6/24, acc_iter=40572, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:04, time_cost(all): 12:56:46/1 day, 15:04:50, loss=0.514886674520336, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=2.8170530952935344, lr=0.0801439882654054
2023-11-26 22:34:01   INFO  epoch: 6/24, acc_iter=40622, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:44:10, time_cost(all): 12:57:43/1 day, 14:36:43, loss=0.514779132160233, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=4.196646702204891, lr=0.08010389649266482
2023-11-26 22:34:59   INFO  epoch: 6/24, acc_iter=40672, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:47:01, time_cost(all): 12:58:41/1 day, 13:12:20, loss=0.514671589800131, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.117975306617723, lr=0.08006380471992423
2023-11-26 22:35:57   INFO  epoch: 6/24, acc_iter=40722, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:41:54, time_cost(all): 12:59:39/1 day, 14:09:18, loss=0.514564047440028, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.2015582296007032, lr=0.08002371294718363
2023-11-26 22:36:55   INFO  epoch: 6/24, acc_iter=40772, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:40:10, time_cost(all): 13:00:37/1 day, 13:13:24, loss=0.514456505079925, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=2.9269349823735786, lr=0.07998362117444305
2023-11-26 22:37:52   INFO  epoch: 6/24, acc_iter=40822, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:46:48, time_cost(all): 13:01:34/1 day, 11:56:00, loss=0.514348962719822, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=0.5238457896456752, lr=0.07994352940170246
2023-11-26 22:38:50   INFO  epoch: 6/24, acc_iter=40872, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:36:52, time_cost(all): 13:02:32/1 day, 12:38:28, loss=0.51424142035972, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=2.476733250118483, lr=0.07990343762896188
2023-11-26 22:39:48   INFO  epoch: 6/24, acc_iter=40922, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:36:34, time_cost(all): 13:03:30/1 day, 13:05:24, loss=0.514133877999617, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=3.7954632944059825, lr=0.07986334585622129
2023-11-26 22:40:46   INFO  epoch: 6/24, acc_iter=40972, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:37:04, time_cost(all): 13:04:28/1 day, 14:37:49, loss=0.514026335639514, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=4.129650896569752, lr=0.0798232540834807
2023-11-26 22:41:43   INFO  epoch: 6/24, acc_iter=41022, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:39:19, time_cost(all): 13:05:25/1 day, 14:19:07, loss=0.513918793279412, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=0.7255519819332703, lr=0.07978316231074012
2023-11-26 22:42:41   INFO  epoch: 6/24, acc_iter=41072, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:35:37, time_cost(all): 13:06:23/1 day, 15:28:24, loss=0.513811250919309, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=1.3374327892265252, lr=0.07974307053799952
2023-11-26 22:43:39   INFO  epoch: 6/24, acc_iter=41122, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:32:55, time_cost(all): 13:07:21/1 day, 13:19:47, loss=0.513703708559206, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=1.090989490129471, lr=0.07970297876525893
2023-11-26 22:44:37   INFO  epoch: 6/24, acc_iter=41172, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:30:28, time_cost(all): 13:08:19/1 day, 13:23:20, loss=0.513596166199104, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=1.8643699688333473, lr=0.07966288699251835
2023-11-26 22:45:34   INFO  epoch: 6/24, acc_iter=41222, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:30:52, time_cost(all): 13:09:16/1 day, 13:40:40, loss=0.513488623839001, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=1.0887901062105827, lr=0.07962279521977776
2023-11-26 22:46:32   INFO  epoch: 6/24, acc_iter=41272, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:29:18, time_cost(all): 13:10:14/1 day, 11:46:26, loss=0.513381081478898, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=1.6571079017750547, lr=0.07958270344703718
2023-11-26 22:47:30   INFO  epoch: 6/24, acc_iter=41322, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:28:01, time_cost(all): 13:11:12/1 day, 11:57:39, loss=0.513273539118796, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.4876868474117493, lr=0.07954261167429659
2023-11-26 22:48:28   INFO  epoch: 6/24, acc_iter=41372, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:30:03, time_cost(all): 13:12:10/1 day, 12:51:09, loss=0.513165996758693, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=4.768587373134222, lr=0.07950251990155599
2023-11-26 22:49:25   INFO  epoch: 6/24, acc_iter=41422, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:32:42, time_cost(all): 13:13:07/1 day, 14:55:20, loss=0.51305845439859, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=0.6963183505170999, lr=0.0794624281288154
2023-11-26 22:50:23   INFO  epoch: 6/24, acc_iter=41472, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:28:50, time_cost(all): 13:14:05/1 day, 12:22:36, loss=0.512950912038488, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=1.3164392642122027, lr=0.07942233635607482
2023-11-26 22:51:21   INFO  epoch: 6/24, acc_iter=41522, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:24:49, time_cost(all): 13:15:03/1 day, 11:47:24, loss=0.512843369678385, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=1.7120246877360183, lr=0.07938224458333423
2023-11-26 22:52:19   INFO  epoch: 6/24, acc_iter=41572, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:23:01, time_cost(all): 13:16:01/1 day, 12:00:39, loss=0.512735827318282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=4.665313054068815, lr=0.07934215281059365
2023-11-26 22:53:16   INFO  epoch: 6/24, acc_iter=41622, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:25:01, time_cost(all): 13:16:58/1 day, 13:27:33, loss=0.51262828495818, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.976620109215561, lr=0.07930206103785306
2023-11-26 22:54:14   INFO  epoch: 6/24, acc_iter=41672, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:25:46, time_cost(all): 13:17:56/1 day, 11:57:16, loss=0.512520742598077, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=2.8778344295474683, lr=0.07926196926511248
2023-11-26 22:55:12   INFO  epoch: 6/24, acc_iter=41722, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:22:10, time_cost(all): 13:18:54/1 day, 14:18:41, loss=0.512413200237974, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=1.905822551899325, lr=0.07922187749237188
2023-11-26 22:56:10   INFO  epoch: 6/24, acc_iter=41772, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:23:28, time_cost(all): 13:19:52/1 day, 12:18:09, loss=0.512305657877872, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.1942351303674246, lr=0.07918178571963129
2023-11-26 22:57:07   INFO  epoch: 6/24, acc_iter=41822, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:25:34, time_cost(all): 13:20:49/1 day, 14:15:28, loss=0.512198115517769, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=1.7733068556778409, lr=0.0791416939468907
2023-11-26 22:58:05   INFO  epoch: 6/24, acc_iter=41872, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:19, time_cost(all): 13:21:47/1 day, 12:59:46, loss=0.512090573157666, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=2.10879291181672, lr=0.07910160217415012
2023-11-26 22:59:03   INFO  epoch: 6/24, acc_iter=41922, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:24:33, time_cost(all): 13:22:45/1 day, 14:06:51, loss=0.511983030797563, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.065841334083286, lr=0.07906151040140953
2023-11-26 23:00:01   INFO  epoch: 6/24, acc_iter=41972, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:18:13, time_cost(all): 13:23:43/1 day, 13:57:38, loss=0.511875488437461, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=0.8927175971956074, lr=0.07902141862866893
2023-11-26 23:00:58   INFO  epoch: 6/24, acc_iter=42022, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:47, time_cost(all): 13:24:40/1 day, 14:50:04, loss=0.511767946077358, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.9888199080995017, lr=0.07898132685592835
2023-11-26 23:01:56   INFO  epoch: 6/24, acc_iter=42072, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:20:50, time_cost(all): 13:25:38/1 day, 12:44:34, loss=0.511660403717255, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=0.6450085763928362, lr=0.07894123508318776
2023-11-26 23:02:54   INFO  epoch: 6/24, acc_iter=42122, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:20:31, time_cost(all): 13:26:36/1 day, 13:45:04, loss=0.511552861357153, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.2743841549619181, lr=0.07890114331044717
2023-11-26 23:03:52   INFO  epoch: 6/24, acc_iter=42172, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:19:05, time_cost(all): 13:27:34/1 day, 12:33:43, loss=0.51144531899705, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.405659914099215, lr=0.07886105153770659
2023-11-26 23:04:49   INFO  epoch: 6/24, acc_iter=42222, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:11, time_cost(all): 13:28:31/1 day, 12:05:34, loss=0.511337776636947, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.9326832153021716, lr=0.078820959764966
2023-11-26 23:05:47   INFO  epoch: 6/24, acc_iter=42272, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:15:51, time_cost(all): 13:29:29/1 day, 12:27:20, loss=0.511230234276845, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.616417123322231, lr=0.07878086799222542
2023-11-26 23:06:45   INFO  epoch: 6/24, acc_iter=42322, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:13:52, time_cost(all): 13:30:27/1 day, 12:41:39, loss=0.511122691916742, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.37444718411553, lr=0.07874077621948483
2023-11-26 23:07:43   INFO  epoch: 6/24, acc_iter=42372, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:10:56, time_cost(all): 13:31:25/1 day, 14:11:05, loss=0.511015149556639, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=2.296325282863946, lr=0.07870068444674423
2023-11-26 23:08:40   INFO  epoch: 6/24, acc_iter=42422, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:40, time_cost(all): 13:32:22/1 day, 14:34:46, loss=0.510907607196537, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=4.829287763898728, lr=0.07866059267400365
2023-11-26 23:09:38   INFO  epoch: 6/24, acc_iter=42472, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:09:51, time_cost(all): 13:33:20/1 day, 11:47:32, loss=0.510800064836434, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=1.1907904661125386, lr=0.07862050090126306
2023-11-26 23:10:36   INFO  epoch: 6/24, acc_iter=42522, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:08:18, time_cost(all): 13:34:18/1 day, 11:47:13, loss=0.510692522476331, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=0.5006991238330072, lr=0.07858040912852247
2023-11-26 23:11:34   INFO  epoch: 6/24, acc_iter=42572, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:06:57, time_cost(all): 13:35:16/1 day, 11:44:29, loss=0.510584980116229, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=3.387010148560312, lr=0.07854031735578187
2023-11-26 23:12:31   INFO  epoch: 6/24, acc_iter=42622, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:08:22, time_cost(all): 13:36:13/1 day, 13:16:27, loss=0.510477437756126, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.913946852443497, lr=0.07850022558304129
2023-11-26 23:13:29   INFO  epoch: 6/24, acc_iter=42672, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:50, time_cost(all): 13:37:11/1 day, 12:10:18, loss=0.510369895396023, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=3.815729733227692, lr=0.0784601338103007
2023-11-26 23:14:27   INFO  epoch: 6/24, acc_iter=42722, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:08:14, time_cost(all): 13:38:09/1 day, 12:27:00, loss=0.510262353035921, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=0.6354949595709227, lr=0.07842004203756012
2023-11-26 23:15:25   INFO  epoch: 6/24, acc_iter=42772, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:01:01, time_cost(all): 13:39:07/1 day, 14:41:43, loss=0.510154810675818, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=4.346189922632439, lr=0.07837995026481953
2023-11-26 23:16:22   INFO  epoch: 6/24, acc_iter=42822, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:00:34, time_cost(all): 13:40:04/1 day, 12:45:26, loss=0.510047268315715, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=1.2675815575292078, lr=0.07833985849207895
2023-11-26 23:17:20   INFO  epoch: 6/24, acc_iter=42872, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:44, time_cost(all): 13:41:02/1 day, 13:56:34, loss=0.509939725955613, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=3.4014944335743382, lr=0.07829976671933836
2023-11-26 23:18:18   INFO  epoch: 6/24, acc_iter=42922, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:05, time_cost(all): 13:42:00/1 day, 11:56:32, loss=0.50983218359551, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.2453947530099345, lr=0.07825967494659777
2023-11-26 23:19:16   INFO  epoch: 6/24, acc_iter=42972, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:27, time_cost(all): 13:42:58/1 day, 13:43:49, loss=0.509724641235407, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=3.492564796028844, lr=0.07821958317385717
2023-11-26 23:20:13   INFO  epoch: 6/24, acc_iter=43022, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:02:17, time_cost(all): 13:43:55/1 day, 13:21:03, loss=0.509617098875305, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=0.812208445277194, lr=0.07817949140111659
2023-11-26 23:21:11   INFO  epoch: 6/24, acc_iter=43072, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:00:27, time_cost(all): 13:44:53/1 day, 14:34:56, loss=0.509509556515202, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.174473880709364, lr=0.078139399628376
2023-11-26 23:22:09   INFO  epoch: 6/24, acc_iter=43122, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:58:06, time_cost(all): 13:45:51/1 day, 12:07:33, loss=0.509402014155099, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=2.9848080522265463, lr=0.07809930785563542
2023-11-26 23:23:07   INFO  epoch: 6/24, acc_iter=43172, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:56:01, time_cost(all): 13:46:49/1 day, 12:48:38, loss=0.509294471794997, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=3.4635055805395116, lr=0.07805921608289482
2023-11-26 23:24:05   INFO  epoch: 6/24, acc_iter=43222, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:37, time_cost(all): 13:47:47/1 day, 12:48:32, loss=0.509186929434894, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=0.6246143512693623, lr=0.07801912431015423
2023-11-26 23:25:02   INFO  epoch: 6/24, acc_iter=43272, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:22, time_cost(all): 13:48:44/1 day, 14:30:19, loss=0.509079387074791, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=4.217320392331075, lr=0.07797903253741365
2023-11-26 23:26:00   INFO  epoch: 6/24, acc_iter=43322, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:55:49, time_cost(all): 13:49:42/1 day, 13:33:37, loss=0.508971844714688, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=1.1518166038662865, lr=0.07793894076467306
2023-11-26 23:26:58   INFO  epoch: 6/24, acc_iter=43372, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:50, time_cost(all): 13:50:40/1 day, 11:07:21, loss=0.508864302354586, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.457003977310121, lr=0.07789884899193247
2023-11-26 23:27:56   INFO  epoch: 6/24, acc_iter=43422, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:38, time_cost(all): 13:51:38/1 day, 12:12:53, loss=0.508756759994483, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.845115319080717, lr=0.07785875721919189
2023-11-26 23:28:53   INFO  epoch: 6/24, acc_iter=43472, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:51, time_cost(all): 13:52:35/1 day, 12:56:50, loss=0.50864921763438, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=1.822216693695901, lr=0.0778186654464513
2023-11-26 23:29:51   INFO  epoch: 6/24, acc_iter=43522, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:47:54, time_cost(all): 13:53:33/1 day, 12:33:11, loss=0.508541675274278, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=1.1957366125356175, lr=0.07777857367371072
2023-11-26 23:30:49   INFO  epoch: 6/24, acc_iter=43572, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:50:38, time_cost(all): 13:54:31/1 day, 11:29:35, loss=0.508434132914175, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=2.263735190966469, lr=0.07773848190097013
2023-11-26 23:31:47   INFO  epoch: 6/24, acc_iter=43622, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:46:06, time_cost(all): 13:55:29/1 day, 13:02:58, loss=0.508326590554072, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=0.9935721565311157, lr=0.07769839012822953
2023-11-26 23:32:44   INFO  epoch: 6/24, acc_iter=43672, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:45:17, time_cost(all): 13:56:26/1 day, 11:20:55, loss=0.50821904819397, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=2.679198866233727, lr=0.07765829835548894
2023-11-26 23:33:42   INFO  epoch: 6/24, acc_iter=43722, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:27, time_cost(all): 13:57:24/1 day, 12:18:45, loss=0.508111505833867, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.955239799394235, lr=0.07761820658274836
2023-11-26 23:34:40   INFO  epoch: 6/24, acc_iter=43772, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:43:38, time_cost(all): 13:58:22/1 day, 14:27:10, loss=0.508003963473764, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=2.022697669992559, lr=0.07757811481000777
2023-11-26 23:35:38   INFO  epoch: 6/24, acc_iter=43822, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:45:09, time_cost(all): 13:59:20/1 day, 12:14:55, loss=0.507896421113662, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.8295351240069573, lr=0.07753802303726717
2023-11-26 23:36:35   INFO  epoch: 6/24, acc_iter=43872, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:45:11, time_cost(all): 14:00:17/1 day, 13:50:17, loss=0.507788878753559, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=4.817892088660164, lr=0.07749793126452659
2023-11-26 23:37:33   INFO  epoch: 6/24, acc_iter=43922, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:43:04, time_cost(all): 14:01:15/1 day, 12:12:05, loss=0.507681336393456, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.2457031752145276, lr=0.077457839491786
2023-11-26 23:38:31   INFO  epoch: 6/24, acc_iter=43972, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:50, time_cost(all): 14:02:13/1 day, 11:11:35, loss=0.507573794033354, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=1.5832932771418864, lr=0.07741774771904542
2023-11-26 23:39:29   INFO  epoch: 6/24, acc_iter=44022, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:41, time_cost(all): 14:03:11/1 day, 12:38:38, loss=0.507466251673251, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.419720270907374, lr=0.07737765594630483
2023-11-26 23:40:26   INFO  epoch: 6/24, acc_iter=44072, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:30, time_cost(all): 14:04:08/1 day, 13:50:56, loss=0.507358709313148, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=1.8117349289488063, lr=0.07733756417356424
2023-11-26 23:41:24   INFO  epoch: 6/24, acc_iter=44122, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:39:51, time_cost(all): 14:05:06/1 day, 12:48:08, loss=0.507251166953046, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=2.1529227018244685, lr=0.07729747240082366
2023-11-26 23:42:22   INFO  epoch: 6/24, acc_iter=44172, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:24, time_cost(all): 14:06:04/1 day, 13:37:18, loss=0.507143624592943, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.983744890758891, lr=0.07725738062808307
2023-11-26 23:43:20   INFO  epoch: 6/24, acc_iter=44222, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:36:21, time_cost(all): 14:07:02/1 day, 11:54:59, loss=0.50703608223284, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=1.5166394912274854, lr=0.07721728885534247
2023-11-26 23:44:17   INFO  epoch: 6/24, acc_iter=44272, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:37:03, time_cost(all): 14:07:59/1 day, 13:08:09, loss=0.506928539872738, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=1.831729390976271, lr=0.07717719708260189
2023-11-26 23:45:15   INFO  epoch: 6/24, acc_iter=44322, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:17, time_cost(all): 14:08:57/1 day, 13:43:42, loss=0.506820997512635, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=3.789383131618338, lr=0.0771371053098613
2023-11-26 23:46:13   INFO  epoch: 6/24, acc_iter=44372, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:06, time_cost(all): 14:09:55/1 day, 13:19:48, loss=0.506713455152532, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=4.023429903887296, lr=0.07709701353712071
2023-11-26 23:47:11   INFO  epoch: 6/24, acc_iter=44422, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:33, time_cost(all): 14:10:53/1 day, 14:04:52, loss=0.50660591279243, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=3.6522671378250515, lr=0.07705692176438013
2023-11-26 23:48:08   INFO  epoch: 6/24, acc_iter=44472, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:47, time_cost(all): 14:11:50/1 day, 13:10:07, loss=0.506498370432327, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=4.93493073089853, lr=0.07701682999163953
2023-11-26 23:49:06   INFO  epoch: 6/24, acc_iter=44522, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:23, time_cost(all): 14:12:48/1 day, 12:25:04, loss=0.506390828072224, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=3.2296810223489674, lr=0.07697673821889894
2023-11-26 23:50:04   INFO  epoch: 6/24, acc_iter=44572, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:26, time_cost(all): 14:13:46/1 day, 13:54:03, loss=0.506283285712122, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.3563297284167515, lr=0.07693664644615836
2023-11-26 23:51:02   INFO  epoch: 6/24, acc_iter=44622, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:31, time_cost(all): 14:14:44/1 day, 11:38:16, loss=0.506175743352019, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=4.419955690356767, lr=0.07689655467341777
2023-11-26 23:51:59   INFO  epoch: 6/24, acc_iter=44672, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:53, time_cost(all): 14:15:41/1 day, 12:36:27, loss=0.506068200991916, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.9656422399463676, lr=0.07685646290067719
2023-11-26 23:52:57   INFO  epoch: 6/24, acc_iter=44722, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:49, time_cost(all): 14:16:39/1 day, 12:02:32, loss=0.505960658631813, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=4.256039623674974, lr=0.0768163711279366
2023-11-26 23:53:55   INFO  epoch: 6/24, acc_iter=44772, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:10, time_cost(all): 14:17:37/1 day, 12:34:20, loss=0.505853116271711, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=1.223492173261257, lr=0.07677627935519601
2023-11-26 23:54:53   INFO  epoch: 6/24, acc_iter=44822, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:54, time_cost(all): 14:18:35/1 day, 11:55:36, loss=0.505745573911608, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.1259981528703085, lr=0.07673618758245543
2023-11-26 23:55:50   INFO  epoch: 6/24, acc_iter=44872, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:46, time_cost(all): 14:19:32/1 day, 11:50:00, loss=0.505638031551505, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=1.288472419689268, lr=0.07669609580971483
2023-11-26 23:56:48   INFO  epoch: 6/24, acc_iter=44922, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:59, time_cost(all): 14:20:30/1 day, 14:07:54, loss=0.505530489191403, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=1.4054943131457178, lr=0.07665600403697424
2023-11-26 23:57:46   INFO  epoch: 6/24, acc_iter=44972, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:08, time_cost(all): 14:21:28/1 day, 11:56:22, loss=0.5054229468313, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.5988808979444116, lr=0.07661591226423366
2023-11-26 23:58:44   INFO  epoch: 6/24, acc_iter=45022, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:40, time_cost(all): 14:22:26/1 day, 10:32:26, loss=0.505315404471197, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=2.1749990180843817, lr=0.07657582049149306
2023-11-26 23:59:41   INFO  epoch: 6/24, acc_iter=45072, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:37, time_cost(all): 14:23:23/1 day, 12:47:04, loss=0.505207862111095, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=2.9282459734118964, lr=0.07653572871875249
2023-11-27 00:00:39   INFO  epoch: 6/24, acc_iter=45122, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:16, time_cost(all): 14:24:21/1 day, 13:53:00, loss=0.505100319750992, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=1.5841322710858459, lr=0.07649563694601189
2023-11-27 00:01:37   INFO  epoch: 6/24, acc_iter=45172, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:07, time_cost(all): 14:25:19/1 day, 13:24:15, loss=0.504992777390889, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=2.0471958576191716, lr=0.0764555451732713
2023-11-27 00:02:35   INFO  epoch: 6/24, acc_iter=45222, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:52, time_cost(all): 14:26:17/1 day, 12:58:58, loss=0.504885235030787, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=2.821983000273075, lr=0.07641545340053071
2023-11-27 00:03:32   INFO  epoch: 6/24, acc_iter=45272, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:58, time_cost(all): 14:27:14/1 day, 13:02:40, loss=0.504777692670684, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=2.4160223035001525, lr=0.07637536162779013
2023-11-27 00:04:30   INFO  epoch: 6/24, acc_iter=45322, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:28, time_cost(all): 14:28:12/1 day, 10:47:01, loss=0.504670150310581, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=4.3899308935104955, lr=0.07633526985504954
2023-11-27 00:05:28   INFO  epoch: 6/24, acc_iter=45372, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:17, time_cost(all): 14:29:10/1 day, 12:41:50, loss=0.504562607950479, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=1.5407653272874349, lr=0.07629517808230896
2023-11-27 00:06:26   INFO  epoch: 6/24, acc_iter=45422, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:12:34, time_cost(all): 14:30:08/1 day, 13:09:51, loss=0.504455065590376, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=3.3118972566181313, lr=0.07625508630956837
2023-11-27 00:07:23   INFO  epoch: 6/24, acc_iter=45472, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:09, time_cost(all): 14:31:05/1 day, 12:32:07, loss=0.504347523230273, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.1015979540254555, lr=0.07621499453682777
2023-11-27 00:08:21   INFO  epoch: 6/24, acc_iter=45522, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:44, time_cost(all): 14:32:03/1 day, 10:39:02, loss=0.504239980870171, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=1.0896740825328743, lr=0.07617490276408719
2023-11-27 00:09:19   INFO  epoch: 6/24, acc_iter=45572, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:49, time_cost(all): 14:33:01/1 day, 10:56:55, loss=0.504132438510068, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=1.30289685626858, lr=0.0761348109913466
2023-11-27 00:10:17   INFO  epoch: 6/24, acc_iter=45622, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:24, time_cost(all): 14:33:59/1 day, 13:31:49, loss=0.504024896149965, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=2.863961042484643, lr=0.07609471921860601
2023-11-27 00:11:14   INFO  epoch: 6/24, acc_iter=45672, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:38, time_cost(all): 14:34:56/1 day, 11:21:33, loss=0.503917353789863, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=1.81679815848127, lr=0.07605462744586541
2023-11-27 00:12:12   INFO  epoch: 6/24, acc_iter=45722, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:11, time_cost(all): 14:35:54/1 day, 13:37:45, loss=0.50380981142976, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=4.711475133384217, lr=0.07601453567312484
2023-11-27 00:13:10   INFO  epoch: 6/24, acc_iter=45772, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:20, time_cost(all): 14:36:52/1 day, 10:58:24, loss=0.503702269069657, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.6507688103541627, lr=0.07597444390038424
2023-11-27 00:14:08   INFO  epoch: 6/24, acc_iter=45822, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:26, time_cost(all): 14:37:50/1 day, 11:03:17, loss=0.503594726709555, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.0267544450639345, lr=0.07593435212764366
2023-11-27 00:15:05   INFO  epoch: 6/24, acc_iter=45872, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:31, time_cost(all): 14:38:47/1 day, 11:27:50, loss=0.503487184349452, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=2.953846303145214, lr=0.07589426035490307
2023-11-27 00:16:03   INFO  epoch: 6/24, acc_iter=45922, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:43, time_cost(all): 14:39:45/1 day, 13:26:36, loss=0.503379641989349, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=2.9182675390827337, lr=0.07585416858216248
2023-11-27 00:17:01   INFO  epoch: 6/24, acc_iter=45972, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:36, time_cost(all): 14:40:43/1 day, 11:46:11, loss=0.503272099629246, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.196338594655895, lr=0.0758140768094219
2023-11-27 00:17:59   INFO  epoch: 6/24, acc_iter=46022, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:42, time_cost(all): 14:41:41/1 day, 12:23:28, loss=0.503164557269144, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.4214922407268435, lr=0.07577398503668131
2023-11-27 00:18:56   INFO  epoch: 6/24, acc_iter=46072, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 14:42:38/1 day, 13:12:34, loss=0.503057014909041, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=3.786783816790252, lr=0.07573389326394071
2023-11-27 00:19:54   INFO  epoch: 7/24, acc_iter=46159, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/1:59:53, time_cost(all): 14:43:36/1 day, 12:24:52, loss=0.502869891202463, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=3.6452919401989963, lr=0.0756641335793721
2023-11-27 00:20:52   INFO  epoch: 7/24, acc_iter=46209, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:03:51, time_cost(all): 14:44:34/1 day, 13:32:24, loss=0.50276234884236, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=1.4487083057991925, lr=0.07562404180663151
2023-11-27 00:21:50   INFO  epoch: 7/24, acc_iter=46259, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:05:18, time_cost(all): 14:45:32/1 day, 12:29:18, loss=0.502654806482257, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=3.3676963786958076, lr=0.07558395003389093
2023-11-27 00:22:47   INFO  epoch: 7/24, acc_iter=46309, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/1:59:49, time_cost(all): 14:46:29/1 day, 12:19:13, loss=0.502547264122154, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=1.8901348905275768, lr=0.07554385826115033
2023-11-27 00:23:45   INFO  epoch: 7/24, acc_iter=46359, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:00:13, time_cost(all): 14:47:27/1 day, 11:37:17, loss=0.502439721762052, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=3.506435253732495, lr=0.07550376648840974
2023-11-27 00:24:43   INFO  epoch: 7/24, acc_iter=46409, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:58:08, time_cost(all): 14:48:25/1 day, 11:49:35, loss=0.502332179401949, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=0.8338287696829195, lr=0.07546367471566916
2023-11-27 00:25:41   INFO  epoch: 7/24, acc_iter=46459, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:05:42, time_cost(all): 14:49:23/1 day, 12:30:52, loss=0.502224637041846, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=4.139718275613044, lr=0.07542358294292857
2023-11-27 00:26:38   INFO  epoch: 7/24, acc_iter=46509, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:54:21, time_cost(all): 14:50:20/1 day, 10:58:35, loss=0.502117094681744, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=1.006376692528495, lr=0.07538349117018797
2023-11-27 00:27:36   INFO  epoch: 7/24, acc_iter=46559, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:03:05, time_cost(all): 14:51:18/1 day, 12:53:33, loss=0.502009552321641, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.327592474955831, lr=0.07534339939744739
2023-11-27 00:28:34   INFO  epoch: 7/24, acc_iter=46609, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:58:14, time_cost(all): 14:52:16/1 day, 12:26:50, loss=0.501902009961538, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=3.943518811345166, lr=0.0753033076247068
2023-11-27 00:29:32   INFO  epoch: 7/24, acc_iter=46659, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:55:00, time_cost(all): 14:53:14/1 day, 10:40:59, loss=0.501794467601436, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=1.0648706369503815, lr=0.07526321585196621
2023-11-27 00:30:29   INFO  epoch: 7/24, acc_iter=46709, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:54:31, time_cost(all): 14:54:11/1 day, 10:51:47, loss=0.501686925241333, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=1.6876280278775404, lr=0.07522312407922563
2023-11-27 00:31:27   INFO  epoch: 7/24, acc_iter=46759, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:55:29, time_cost(all): 14:55:09/1 day, 13:09:00, loss=0.50157938288123, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.0042345497482055, lr=0.07518303230648504
2023-11-27 00:32:25   INFO  epoch: 7/24, acc_iter=46809, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:53:01, time_cost(all): 14:56:07/1 day, 12:02:27, loss=0.501471840521128, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=4.750312684271945, lr=0.07514294053374446
2023-11-27 00:33:23   INFO  epoch: 7/24, acc_iter=46859, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:53:40, time_cost(all): 14:57:05/1 day, 13:21:25, loss=0.501364298161025, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=2.644784949671885, lr=0.07510284876100386
2023-11-27 00:34:20   INFO  epoch: 7/24, acc_iter=46909, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:53:52, time_cost(all): 14:58:02/1 day, 10:19:02, loss=0.501256755800922, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=2.1923243256586646, lr=0.07506275698826327
2023-11-27 00:35:18   INFO  epoch: 7/24, acc_iter=46959, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:53:21, time_cost(all): 14:59:00/1 day, 12:01:58, loss=0.50114921344082, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=3.665204198247406, lr=0.07502266521552269
2023-11-27 00:36:16   INFO  epoch: 7/24, acc_iter=47009, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:48:24, time_cost(all): 14:59:58/1 day, 12:39:58, loss=0.501041671080717, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.40000945876662, lr=0.0749825734427821
2023-11-27 00:37:14   INFO  epoch: 7/24, acc_iter=47059, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:49:00, time_cost(all): 15:00:56/1 day, 12:19:02, loss=0.500934128720614, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=3.6563511694351236, lr=0.07494248167004151
2023-11-27 00:38:11   INFO  epoch: 7/24, acc_iter=47109, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:47:11, time_cost(all): 15:01:53/1 day, 13:22:18, loss=0.500826586360512, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=2.9691788921299302, lr=0.07490238989730093
2023-11-27 00:39:09   INFO  epoch: 7/24, acc_iter=47159, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:47:29, time_cost(all): 15:02:51/1 day, 10:55:09, loss=0.500719044000409, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=0.6092864948046476, lr=0.07486229812456033
2023-11-27 00:40:07   INFO  epoch: 7/24, acc_iter=47209, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:46:22, time_cost(all): 15:03:49/1 day, 11:03:35, loss=0.500611501640306, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.124758100285921, lr=0.07482220635181974
2023-11-27 00:41:05   INFO  epoch: 7/24, acc_iter=47259, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:49:25, time_cost(all): 15:04:47/1 day, 10:57:55, loss=0.500503959280204, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=1.226109272039149, lr=0.07478211457907916
2023-11-27 00:42:02   INFO  epoch: 7/24, acc_iter=47309, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:39:39, time_cost(all): 15:05:44/1 day, 12:50:51, loss=0.500396416920101, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.4984776886521205, lr=0.07474202280633857
2023-11-27 00:43:00   INFO  epoch: 7/24, acc_iter=47359, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:42:16, time_cost(all): 15:06:42/1 day, 12:12:35, loss=0.500288874559998, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=3.5697475105220464, lr=0.07470193103359798
2023-11-27 00:43:58   INFO  epoch: 7/24, acc_iter=47409, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:44:30, time_cost(all): 15:07:40/1 day, 12:25:35, loss=0.500181332199895, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.3526292393721264, lr=0.0746618392608574
2023-11-27 00:44:56   INFO  epoch: 7/24, acc_iter=47459, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:36:50, time_cost(all): 15:08:38/1 day, 13:09:02, loss=0.500073789839793, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=2.7471374386568885, lr=0.07462174748811681
2023-11-27 00:45:53   INFO  epoch: 7/24, acc_iter=47509, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:43:41, time_cost(all): 15:09:35/1 day, 11:58:06, loss=0.49996624747969, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=2.92489562281079, lr=0.07458165571537621
2023-11-27 00:46:51   INFO  epoch: 7/24, acc_iter=47559, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:37:28, time_cost(all): 15:10:33/1 day, 11:15:45, loss=0.499858705119587, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=1.7810149555137533, lr=0.07454156394263563
2023-11-27 00:47:49   INFO  epoch: 7/24, acc_iter=47609, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:41:57, time_cost(all): 15:11:31/1 day, 13:15:13, loss=0.499751162759485, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=3.6938351863333705, lr=0.07450147216989504
2023-11-27 00:48:47   INFO  epoch: 7/24, acc_iter=47659, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:34:31, time_cost(all): 15:12:29/1 day, 11:56:00, loss=0.499643620399382, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.5881815575384373, lr=0.07446138039715446
2023-11-27 00:49:44   INFO  epoch: 7/24, acc_iter=47709, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:32:50, time_cost(all): 15:13:26/1 day, 11:09:04, loss=0.499536078039279, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=0.7934110529735265, lr=0.07442128862441386
2023-11-27 00:50:42   INFO  epoch: 7/24, acc_iter=47759, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:35:15, time_cost(all): 15:14:24/1 day, 12:19:03, loss=0.499428535679177, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=0.9018106952961731, lr=0.07438119685167328
2023-11-27 00:51:40   INFO  epoch: 7/24, acc_iter=47809, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:32:42, time_cost(all): 15:15:22/1 day, 11:46:33, loss=0.499320993319074, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=1.5712727034188605, lr=0.07434110507893268
2023-11-27 00:52:38   INFO  epoch: 7/24, acc_iter=47859, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:31:26, time_cost(all): 15:16:20/1 day, 11:32:04, loss=0.499213450958971, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.847489218389694, lr=0.0743010133061921
2023-11-27 00:53:35   INFO  epoch: 7/24, acc_iter=47909, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:35:59, time_cost(all): 15:17:17/1 day, 9:44:55, loss=0.499105908598869, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=1.7793995629611703, lr=0.07426092153345151
2023-11-27 00:54:33   INFO  epoch: 7/24, acc_iter=47959, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:29:50, time_cost(all): 15:18:15/1 day, 12:52:26, loss=0.498998366238766, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=4.471691556071646, lr=0.07422082976071093
2023-11-27 00:55:31   INFO  epoch: 7/24, acc_iter=48009, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:25:46, time_cost(all): 15:19:13/1 day, 9:42:46, loss=0.498890823878663, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=2.070994321225802, lr=0.07418073798797034
2023-11-27 00:56:29   INFO  epoch: 7/24, acc_iter=48059, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:29, time_cost(all): 15:20:11/1 day, 9:49:33, loss=0.498783281518561, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=1.0000035107215886, lr=0.07414064621522976
2023-11-27 00:57:26   INFO  epoch: 7/24, acc_iter=48109, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:29:00, time_cost(all): 15:21:08/1 day, 10:18:49, loss=0.498675739158458, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.0005519050586686, lr=0.07410055444248917
2023-11-27 00:58:24   INFO  epoch: 7/24, acc_iter=48159, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:26:04, time_cost(all): 15:22:06/1 day, 10:26:51, loss=0.498568196798355, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=2.0923909485459022, lr=0.07406046266974857
2023-11-27 00:59:22   INFO  epoch: 7/24, acc_iter=48209, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:27:19, time_cost(all): 15:23:04/1 day, 11:09:40, loss=0.498460654438253, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=1.5939683978739598, lr=0.07402037089700798
2023-11-27 01:00:20   INFO  epoch: 7/24, acc_iter=48259, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:26:06, time_cost(all): 15:24:02/1 day, 11:19:32, loss=0.49835311207815, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=4.927031789909987, lr=0.0739802791242674
2023-11-27 01:01:17   INFO  epoch: 7/24, acc_iter=48309, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:21:04, time_cost(all): 15:24:59/1 day, 12:23:06, loss=0.498245569718047, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.348485605883796, lr=0.07394018735152681
2023-11-27 01:02:15   INFO  epoch: 7/24, acc_iter=48359, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:24:13, time_cost(all): 15:25:57/1 day, 11:26:21, loss=0.498138027357945, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=0.6802830051269743, lr=0.07390009557878621
2023-11-27 01:03:13   INFO  epoch: 7/24, acc_iter=48409, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:24:06, time_cost(all): 15:26:55/1 day, 10:16:16, loss=0.498030484997842, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=0.7799127314047121, lr=0.07386000380604563
2023-11-27 01:04:11   INFO  epoch: 7/24, acc_iter=48459, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:09, time_cost(all): 15:27:53/1 day, 10:52:01, loss=0.497922942637739, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.93746570481038, lr=0.07381991203330504
2023-11-27 01:05:08   INFO  epoch: 7/24, acc_iter=48509, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:17:06, time_cost(all): 15:28:50/1 day, 9:38:25, loss=0.497815400277637, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=3.8716071656899125, lr=0.07377982026056445
2023-11-27 01:06:06   INFO  epoch: 7/24, acc_iter=48559, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:18:29, time_cost(all): 15:29:48/1 day, 10:38:21, loss=0.497707857917534, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=2.869428909420943, lr=0.07373972848782387
2023-11-27 01:07:04   INFO  epoch: 7/24, acc_iter=48609, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:32, time_cost(all): 15:30:46/1 day, 10:36:05, loss=0.497600315557431, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=3.540421000148713, lr=0.07369963671508328
2023-11-27 01:08:02   INFO  epoch: 7/24, acc_iter=48659, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:45, time_cost(all): 15:31:44/1 day, 9:37:39, loss=0.497492773197329, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=2.8817416211989664, lr=0.0736595449423427
2023-11-27 01:09:00   INFO  epoch: 7/24, acc_iter=48709, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:14:28, time_cost(all): 15:32:42/1 day, 12:45:12, loss=0.497385230837226, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=2.4987471737735305, lr=0.07361945316960211
2023-11-27 01:09:57   INFO  epoch: 7/24, acc_iter=48759, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:15:32, time_cost(all): 15:33:39/1 day, 11:32:50, loss=0.497277688477123, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=1.0299526423381073, lr=0.07357936139686151
2023-11-27 01:10:55   INFO  epoch: 7/24, acc_iter=48809, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:16:07, time_cost(all): 15:34:37/1 day, 12:34:16, loss=0.49717014611702, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=0.563809466894738, lr=0.07353926962412093
2023-11-27 01:11:53   INFO  epoch: 7/24, acc_iter=48859, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:17:02, time_cost(all): 15:35:35/1 day, 9:52:32, loss=0.497062603756918, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=1.4171028245949249, lr=0.07349917785138034
2023-11-27 01:12:51   INFO  epoch: 7/24, acc_iter=48909, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:16:25, time_cost(all): 15:36:33/1 day, 11:45:28, loss=0.496955061396815, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=1.1010750305207284, lr=0.07345908607863975
2023-11-27 01:13:48   INFO  epoch: 7/24, acc_iter=48959, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:21, time_cost(all): 15:37:30/1 day, 11:14:25, loss=0.496847519036712, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=4.158900109269158, lr=0.07341899430589917
2023-11-27 01:14:46   INFO  epoch: 7/24, acc_iter=49009, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:12, time_cost(all): 15:38:28/1 day, 11:10:22, loss=0.49673997667661, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=3.014138715050053, lr=0.07337890253315857
2023-11-27 01:15:44   INFO  epoch: 7/24, acc_iter=49059, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:12:16, time_cost(all): 15:39:26/1 day, 12:44:06, loss=0.496632434316507, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.336497200695149, lr=0.07333881076041798
2023-11-27 01:16:42   INFO  epoch: 7/24, acc_iter=49109, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:12:04, time_cost(all): 15:40:24/1 day, 10:10:10, loss=0.496524891956404, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=2.753409436753194, lr=0.0732987189876774
2023-11-27 01:17:39   INFO  epoch: 7/24, acc_iter=49159, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:11:20, time_cost(all): 15:41:21/1 day, 10:50:14, loss=0.496417349596302, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=1.866081511728401, lr=0.07325862721493681
2023-11-27 01:18:37   INFO  epoch: 7/24, acc_iter=49209, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:08:26, time_cost(all): 15:42:19/1 day, 11:43:02, loss=0.496309807236199, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.907397695521823, lr=0.07321853544219623
2023-11-27 01:19:35   INFO  epoch: 7/24, acc_iter=49259, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:05:57, time_cost(all): 15:43:17/1 day, 9:48:17, loss=0.496202264876096, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=1.1327148109046246, lr=0.07317844366945564
2023-11-27 01:20:33   INFO  epoch: 7/24, acc_iter=49309, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:08:06, time_cost(all): 15:44:15/1 day, 11:27:36, loss=0.496094722515994, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=3.3452289215810787, lr=0.07313835189671505
2023-11-27 01:21:30   INFO  epoch: 7/24, acc_iter=49359, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:02:49, time_cost(all): 15:45:12/1 day, 11:56:56, loss=0.495987180155891, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=1.4414829737133368, lr=0.07309826012397447
2023-11-27 01:22:28   INFO  epoch: 7/24, acc_iter=49409, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:04:33, time_cost(all): 15:46:10/1 day, 11:59:36, loss=0.495879637795788, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=3.081360536253745, lr=0.07305816835123387
2023-11-27 01:23:26   INFO  epoch: 7/24, acc_iter=49459, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:04:55, time_cost(all): 15:47:08/1 day, 9:21:30, loss=0.495772095435686, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.8129571083430087, lr=0.07301807657849328
2023-11-27 01:24:24   INFO  epoch: 7/24, acc_iter=49509, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:00, time_cost(all): 15:48:06/1 day, 9:56:35, loss=0.495664553075583, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.056357087229178, lr=0.0729779848057527
2023-11-27 01:25:21   INFO  epoch: 7/24, acc_iter=49559, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:41, time_cost(all): 15:49:03/1 day, 9:17:00, loss=0.49555701071548, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.821807883375011, lr=0.07293789303301211
2023-11-27 01:26:19   INFO  epoch: 7/24, acc_iter=49609, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:57:57, time_cost(all): 15:50:01/1 day, 11:39:27, loss=0.495449468355378, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.429832838107985, lr=0.07289780126027152
2023-11-27 01:27:17   INFO  epoch: 7/24, acc_iter=49659, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:58:22, time_cost(all): 15:50:59/1 day, 11:39:26, loss=0.495341925995275, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=1.8475297958748562, lr=0.07285770948753093
2023-11-27 01:28:15   INFO  epoch: 7/24, acc_iter=49709, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:59:48, time_cost(all): 15:51:57/1 day, 11:03:40, loss=0.495234383635172, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=0.953918887938756, lr=0.07281761771479034
2023-11-27 01:29:12   INFO  epoch: 7/24, acc_iter=49759, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:56:27, time_cost(all): 15:52:54/1 day, 12:25:39, loss=0.49512684127507, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.4867835963033098, lr=0.07277752594204975
2023-11-27 01:30:10   INFO  epoch: 7/24, acc_iter=49809, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:54:40, time_cost(all): 15:53:52/1 day, 10:03:30, loss=0.495019298914967, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=3.1000838413083316, lr=0.07273743416930917
2023-11-27 01:31:08   INFO  epoch: 7/24, acc_iter=49859, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:55:05, time_cost(all): 15:54:50/1 day, 11:13:10, loss=0.494911756554864, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=3.2271299578740344, lr=0.07269734239656858
2023-11-27 01:32:06   INFO  epoch: 7/24, acc_iter=49909, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:55:25, time_cost(all): 15:55:48/1 day, 9:55:16, loss=0.494804214194762, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=3.1551505566503826, lr=0.072657250623828
2023-11-27 01:33:03   INFO  epoch: 7/24, acc_iter=49959, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:37, time_cost(all): 15:56:45/1 day, 9:43:36, loss=0.494696671834659, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=1.332774255079816, lr=0.0726171588510874
2023-11-27 01:34:01   INFO  epoch: 7/24, acc_iter=50009, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:24, time_cost(all): 15:57:43/1 day, 11:57:19, loss=0.494589129474556, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.967374014364738, lr=0.07257706707834682
2023-11-27 01:34:59   INFO  epoch: 7/24, acc_iter=50059, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:05, time_cost(all): 15:58:41/1 day, 12:09:17, loss=0.494481587114454, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=2.5543157195033834, lr=0.07253697530560622
2023-11-27 01:35:57   INFO  epoch: 7/24, acc_iter=50109, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:49:35, time_cost(all): 15:59:39/1 day, 10:21:07, loss=0.494374044754351, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=1.7638288950531193, lr=0.07249688353286564
2023-11-27 01:36:54   INFO  epoch: 7/24, acc_iter=50159, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:44, time_cost(all): 16:00:36/1 day, 9:30:41, loss=0.494266502394248, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=1.9714746155925662, lr=0.07245679176012505
2023-11-27 01:37:52   INFO  epoch: 7/24, acc_iter=50209, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:29, time_cost(all): 16:01:34/1 day, 11:55:15, loss=0.494158960034146, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=1.9541440819988005, lr=0.07241669998738445
2023-11-27 01:38:50   INFO  epoch: 7/24, acc_iter=50259, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:46:43, time_cost(all): 16:02:32/1 day, 9:24:53, loss=0.494051417674043, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=4.209463970249315, lr=0.07237660821464388
2023-11-27 01:39:48   INFO  epoch: 7/24, acc_iter=50309, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:15, time_cost(all): 16:03:30/1 day, 10:17:19, loss=0.49394387531394, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=4.098732579234668, lr=0.07233651644190328
2023-11-27 01:40:45   INFO  epoch: 7/24, acc_iter=50359, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:01, time_cost(all): 16:04:27/1 day, 9:35:28, loss=0.493836332953837, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=1.6718584289116263, lr=0.0722964246691627
2023-11-27 01:41:43   INFO  epoch: 7/24, acc_iter=50409, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:44:07, time_cost(all): 16:05:25/1 day, 9:03:09, loss=0.493728790593735, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=3.9304015613457475, lr=0.07225633289642211
2023-11-27 01:42:41   INFO  epoch: 7/24, acc_iter=50459, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:21, time_cost(all): 16:06:23/1 day, 12:07:16, loss=0.493621248233632, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=3.8990682526125964, lr=0.07221624112368152
2023-11-27 01:43:39   INFO  epoch: 7/24, acc_iter=50509, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:41:59, time_cost(all): 16:07:21/1 day, 9:23:58, loss=0.493513705873529, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.6462507619000775, lr=0.07217614935094094
2023-11-27 01:44:36   INFO  epoch: 7/24, acc_iter=50559, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:37, time_cost(all): 16:08:18/1 day, 9:35:44, loss=0.493406163513427, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.1608352407991944, lr=0.07213605757820035
2023-11-27 01:45:34   INFO  epoch: 7/24, acc_iter=50609, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:51, time_cost(all): 16:09:16/1 day, 11:37:31, loss=0.493298621153324, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=4.209379232276882, lr=0.07209596580545975
2023-11-27 01:46:32   INFO  epoch: 7/24, acc_iter=50659, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:42, time_cost(all): 16:10:14/1 day, 9:32:28, loss=0.493191078793221, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=1.250638795850473, lr=0.07205587403271917
2023-11-27 01:47:30   INFO  epoch: 7/24, acc_iter=50709, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:26, time_cost(all): 16:11:12/1 day, 9:13:39, loss=0.493083536433119, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=0.5518854685747432, lr=0.07201578225997858
2023-11-27 01:48:27   INFO  epoch: 7/24, acc_iter=50759, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:06, time_cost(all): 16:12:09/1 day, 10:50:11, loss=0.492975994073016, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=2.8312421588742773, lr=0.071975690487238
2023-11-27 01:49:25   INFO  epoch: 7/24, acc_iter=50809, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:55, time_cost(all): 16:13:07/1 day, 9:23:46, loss=0.492868451712913, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=3.8025290555527045, lr=0.07193559871449741
2023-11-27 01:50:23   INFO  epoch: 7/24, acc_iter=50859, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:05, time_cost(all): 16:14:05/1 day, 12:01:15, loss=0.492760909352811, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=0.6721062823317316, lr=0.07189550694175681
2023-11-27 01:51:21   INFO  epoch: 7/24, acc_iter=50909, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:19, time_cost(all): 16:15:03/1 day, 10:50:17, loss=0.492653366992708, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=3.2610169200429566, lr=0.07185541516901622
2023-11-27 01:52:18   INFO  epoch: 7/24, acc_iter=50959, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:15, time_cost(all): 16:16:00/1 day, 8:47:37, loss=0.492545824632605, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.692399802595778, lr=0.07181532339627564
2023-11-27 01:53:16   INFO  epoch: 7/24, acc_iter=51009, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:47, time_cost(all): 16:16:58/1 day, 9:28:42, loss=0.492438282272503, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=2.2003123626047856, lr=0.07177523162353505
2023-11-27 01:54:14   INFO  epoch: 7/24, acc_iter=51059, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:00, time_cost(all): 16:17:56/1 day, 8:50:24, loss=0.4923307399124, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.68131834899505, lr=0.07173513985079447
2023-11-27 01:55:12   INFO  epoch: 7/24, acc_iter=51109, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:06, time_cost(all): 16:18:54/1 day, 9:26:06, loss=0.492223197552297, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.8581351508347548, lr=0.07169504807805388
2023-11-27 01:56:09   INFO  epoch: 7/24, acc_iter=51159, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:28:43, time_cost(all): 16:19:51/1 day, 11:18:55, loss=0.492115655192195, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=0.8810364977473504, lr=0.0716549563053133
2023-11-27 01:57:07   INFO  epoch: 7/24, acc_iter=51209, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:09, time_cost(all): 16:20:49/1 day, 9:36:41, loss=0.492008112832092, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=4.965395985881761, lr=0.07161486453257271
2023-11-27 01:58:05   INFO  epoch: 7/24, acc_iter=51259, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:48, time_cost(all): 16:21:47/1 day, 9:49:16, loss=0.491900570471989, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=2.9256498824230968, lr=0.07157477275983211
2023-11-27 01:59:03   INFO  epoch: 7/24, acc_iter=51309, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:32, time_cost(all): 16:22:45/1 day, 9:57:32, loss=0.491793028111887, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=1.7901396589031815, lr=0.07153468098709152
2023-11-27 02:00:00   INFO  epoch: 7/24, acc_iter=51359, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:16, time_cost(all): 16:23:42/1 day, 9:50:25, loss=0.491685485751784, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.7625930815960054, lr=0.07149458921435094
2023-11-27 02:00:58   INFO  epoch: 7/24, acc_iter=51409, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:23, time_cost(all): 16:24:40/1 day, 10:37:33, loss=0.491577943391681, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=3.4409842307865994, lr=0.07145449744161035
2023-11-27 02:01:56   INFO  epoch: 7/24, acc_iter=51459, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:08, time_cost(all): 16:25:38/1 day, 11:59:32, loss=0.491470401031579, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.7052625011833675, lr=0.07141440566886977
2023-11-27 02:02:54   INFO  epoch: 7/24, acc_iter=51509, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:21:49, time_cost(all): 16:26:36/1 day, 10:23:41, loss=0.491362858671476, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=4.88830306634452, lr=0.07137431389612917
2023-11-27 02:03:51   INFO  epoch: 7/24, acc_iter=51559, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:50, time_cost(all): 16:27:33/1 day, 10:25:37, loss=0.491255316311373, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=0.7448159073757137, lr=0.07133422212338858
2023-11-27 02:04:49   INFO  epoch: 7/24, acc_iter=51609, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:34, time_cost(all): 16:28:31/1 day, 9:40:11, loss=0.491147773951271, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.5265796387134776, lr=0.071294130350648
2023-11-27 02:05:47   INFO  epoch: 7/24, acc_iter=51659, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:00, time_cost(all): 16:29:29/1 day, 11:20:32, loss=0.491040231591168, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=3.4563254860986063, lr=0.07125403857790741
2023-11-27 02:06:45   INFO  epoch: 7/24, acc_iter=51709, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:17, time_cost(all): 16:30:27/1 day, 8:50:05, loss=0.490932689231065, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=3.0197294904668723, lr=0.07121394680516682
2023-11-27 02:07:42   INFO  epoch: 7/24, acc_iter=51759, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:54, time_cost(all): 16:31:24/1 day, 10:48:35, loss=0.490825146870962, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=1.419075963898355, lr=0.07117385503242624
2023-11-27 02:08:40   INFO  epoch: 7/24, acc_iter=51809, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:32, time_cost(all): 16:32:22/1 day, 11:07:44, loss=0.49071760451086, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=2.7477765188081698, lr=0.07113376325968565
2023-11-27 02:09:38   INFO  epoch: 7/24, acc_iter=51859, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:03, time_cost(all): 16:33:20/1 day, 8:47:52, loss=0.490610062150757, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.8367204662376486, lr=0.07109367148694506
2023-11-27 02:10:36   INFO  epoch: 7/24, acc_iter=51909, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:57, time_cost(all): 16:34:18/1 day, 9:46:44, loss=0.490502519790654, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=0.5239415191457577, lr=0.07105357971420447
2023-11-27 02:11:33   INFO  epoch: 7/24, acc_iter=51959, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:53, time_cost(all): 16:35:15/1 day, 11:21:18, loss=0.490394977430552, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=3.981566194429513, lr=0.07101348794146388
2023-11-27 02:12:31   INFO  epoch: 7/24, acc_iter=52009, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:01, time_cost(all): 16:36:13/1 day, 11:06:31, loss=0.490287435070449, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=2.283599139069934, lr=0.0709733961687233
2023-11-27 02:13:29   INFO  epoch: 7/24, acc_iter=52059, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:42, time_cost(all): 16:37:11/1 day, 10:59:21, loss=0.490179892710346, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.968866390400907, lr=0.07093330439598271
2023-11-27 02:14:27   INFO  epoch: 7/24, acc_iter=52109, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:50, time_cost(all): 16:38:09/1 day, 9:54:49, loss=0.490072350350244, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.952251783793, lr=0.07089321262324212
2023-11-27 02:15:24   INFO  epoch: 7/24, acc_iter=52159, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:07, time_cost(all): 16:39:06/1 day, 10:41:30, loss=0.489964807990141, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=4.235271355510647, lr=0.07085312085050152
2023-11-27 02:16:22   INFO  epoch: 7/24, acc_iter=52209, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:12, time_cost(all): 16:40:04/1 day, 11:04:19, loss=0.489857265630038, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=2.188783602658895, lr=0.07081302907776094
2023-11-27 02:17:20   INFO  epoch: 7/24, acc_iter=52259, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:16, time_cost(all): 16:41:02/1 day, 9:52:44, loss=0.489749723269936, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=3.390663999834816, lr=0.07077293730502035
2023-11-27 02:18:18   INFO  epoch: 7/24, acc_iter=52309, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:31, time_cost(all): 16:42:00/1 day, 9:29:52, loss=0.489642180909833, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=1.5847264729148505, lr=0.07073284553227976
2023-11-27 02:19:15   INFO  epoch: 7/24, acc_iter=52359, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:31, time_cost(all): 16:42:57/1 day, 8:35:47, loss=0.48953463854973, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.0357210259957603, lr=0.07069275375953918
2023-11-27 02:20:13   INFO  epoch: 7/24, acc_iter=52409, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:19, time_cost(all): 16:43:55/1 day, 11:26:08, loss=0.489427096189628, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=0.8276398401925968, lr=0.07065266198679859
2023-11-27 02:21:11   INFO  epoch: 7/24, acc_iter=52459, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:22, time_cost(all): 16:44:53/1 day, 10:40:04, loss=0.489319553829525, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=4.228191594019149, lr=0.070612570214058
2023-11-27 02:22:09   INFO  epoch: 7/24, acc_iter=52509, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:32, time_cost(all): 16:45:51/1 day, 11:25:56, loss=0.489212011469422, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.254184908007237, lr=0.07057247844131742
2023-11-27 02:23:06   INFO  epoch: 7/24, acc_iter=52559, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:41, time_cost(all): 16:46:48/1 day, 11:34:15, loss=0.48910446910932, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=2.432707691614324, lr=0.07053238666857682
2023-11-27 02:24:04   INFO  epoch: 7/24, acc_iter=52609, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:45, time_cost(all): 16:47:46/1 day, 8:53:34, loss=0.488996926749217, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.0061746915839476, lr=0.07049229489583624
2023-11-27 02:25:02   INFO  epoch: 7/24, acc_iter=52659, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 16:48:44/1 day, 9:04:40, loss=0.488889384389114, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=4.2517796189022246, lr=0.07045220312309565
2023-11-27 02:26:00   INFO  epoch: 8/24, acc_iter=52746, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:11:18, time_cost(all): 16:49:42/1 day, 9:38:33, loss=0.488702260682536, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.630511211912967, lr=0.07038244343852702
2023-11-27 02:26:57   INFO  epoch: 8/24, acc_iter=52796, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:05:25, time_cost(all): 16:50:39/1 day, 8:33:20, loss=0.488594718322433, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=0.5225609442109723, lr=0.07034235166578644
2023-11-27 02:27:55   INFO  epoch: 8/24, acc_iter=52846, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:35, time_cost(all): 16:51:37/1 day, 8:31:14, loss=0.48848717596233, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=1.1191511242827574, lr=0.07030225989304585
2023-11-27 02:28:53   INFO  epoch: 8/24, acc_iter=52896, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:05:26, time_cost(all): 16:52:35/1 day, 10:05:26, loss=0.488379633602228, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=2.5879527447042334, lr=0.07026216812030525
2023-11-27 02:29:51   INFO  epoch: 8/24, acc_iter=52946, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:06:01, time_cost(all): 16:53:33/1 day, 10:03:48, loss=0.488272091242125, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.5203501085546627, lr=0.07022207634756468
2023-11-27 02:30:48   INFO  epoch: 8/24, acc_iter=52996, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:58:12, time_cost(all): 16:54:30/1 day, 9:47:06, loss=0.488164548882022, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=2.119175355483466, lr=0.07018198457482408
2023-11-27 02:31:46   INFO  epoch: 8/24, acc_iter=53046, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:05:36, time_cost(all): 16:55:28/1 day, 11:09:30, loss=0.488057006521919, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.9524632421602088, lr=0.0701418928020835
2023-11-27 02:32:44   INFO  epoch: 8/24, acc_iter=53096, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:57:03, time_cost(all): 16:56:26/1 day, 8:35:33, loss=0.487949464161817, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=1.0571937035598795, lr=0.07010180102934291
2023-11-27 02:33:42   INFO  epoch: 8/24, acc_iter=53146, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:53:27, time_cost(all): 16:57:24/1 day, 10:35:46, loss=0.487841921801714, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=2.046580206946783, lr=0.07006170925660232
2023-11-27 02:34:39   INFO  epoch: 8/24, acc_iter=53196, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:56:08, time_cost(all): 16:58:21/1 day, 8:48:22, loss=0.487734379441611, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=3.920415125269853, lr=0.07002161748386174
2023-11-27 02:35:37   INFO  epoch: 8/24, acc_iter=53246, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:51:12, time_cost(all): 16:59:19/1 day, 10:17:49, loss=0.487626837081509, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=4.83440555463472, lr=0.06998152571112115
2023-11-27 02:36:35   INFO  epoch: 8/24, acc_iter=53296, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:51:15, time_cost(all): 17:00:17/1 day, 9:49:24, loss=0.487519294721406, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=1.6699236051393367, lr=0.06994143393838055
2023-11-27 02:37:33   INFO  epoch: 8/24, acc_iter=53346, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:49:06, time_cost(all): 17:01:15/1 day, 9:53:40, loss=0.487411752361303, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.110205868892659, lr=0.06990134216563997
2023-11-27 02:38:30   INFO  epoch: 8/24, acc_iter=53396, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:49:34, time_cost(all): 17:02:12/1 day, 8:45:44, loss=0.487304210001201, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=4.926834834583777, lr=0.06986125039289938
2023-11-27 02:39:28   INFO  epoch: 8/24, acc_iter=53446, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:50:28, time_cost(all): 17:03:10/1 day, 8:27:35, loss=0.487196667641098, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=0.8399899276830887, lr=0.0698211586201588
2023-11-27 02:40:26   INFO  epoch: 8/24, acc_iter=53496, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:50:38, time_cost(all): 17:04:08/1 day, 9:39:31, loss=0.487089125280995, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.534937602744596, lr=0.06978106684741821
2023-11-27 02:41:24   INFO  epoch: 8/24, acc_iter=53546, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:52:38, time_cost(all): 17:05:06/1 day, 10:50:02, loss=0.486981582920893, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=1.4853999455643863, lr=0.06974097507467761
2023-11-27 02:42:21   INFO  epoch: 8/24, acc_iter=53596, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:47:37, time_cost(all): 17:06:03/1 day, 8:10:57, loss=0.48687404056079, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.208205563402137, lr=0.06970088330193702
2023-11-27 02:43:19   INFO  epoch: 8/24, acc_iter=53646, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:53:53, time_cost(all): 17:07:01/1 day, 8:15:00, loss=0.486766498200687, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.9541376163211526, lr=0.06966079152919644
2023-11-27 02:44:17   INFO  epoch: 8/24, acc_iter=53696, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:46:17, time_cost(all): 17:07:59/1 day, 8:18:17, loss=0.486658955840585, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.673928179809926, lr=0.06962069975645585
2023-11-27 02:45:15   INFO  epoch: 8/24, acc_iter=53746, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:42:32, time_cost(all): 17:08:57/1 day, 10:29:07, loss=0.486551413480482, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.3307789341652765, lr=0.06958060798371526
2023-11-27 02:46:12   INFO  epoch: 8/24, acc_iter=53796, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:42:23, time_cost(all): 17:09:54/1 day, 10:44:31, loss=0.486443871120379, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.257417560901186, lr=0.06954051621097468
2023-11-27 02:47:10   INFO  epoch: 8/24, acc_iter=53846, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:44:03, time_cost(all): 17:10:52/1 day, 10:35:15, loss=0.486336328760277, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=3.5817158234143425, lr=0.06950042443823409
2023-11-27 02:48:08   INFO  epoch: 8/24, acc_iter=53896, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:40:39, time_cost(all): 17:11:50/1 day, 8:01:47, loss=0.486228786400174, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=4.887285388461455, lr=0.06946033266549351
2023-11-27 02:49:06   INFO  epoch: 8/24, acc_iter=53946, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:43:36, time_cost(all): 17:12:48/1 day, 9:41:24, loss=0.486121244040071, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.1481415678689624, lr=0.06942024089275291
2023-11-27 02:50:04   INFO  epoch: 8/24, acc_iter=53996, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:41:16, time_cost(all): 17:13:46/1 day, 8:37:05, loss=0.486013701679969, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=4.445256436551787, lr=0.06938014912001232
2023-11-27 02:51:01   INFO  epoch: 8/24, acc_iter=54046, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:42:01, time_cost(all): 17:14:43/1 day, 10:50:33, loss=0.485906159319866, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=3.9001987752081306, lr=0.06934005734727174
2023-11-27 02:51:59   INFO  epoch: 8/24, acc_iter=54096, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:41:16, time_cost(all): 17:15:41/1 day, 8:20:37, loss=0.485798616959763, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=3.70153286303771, lr=0.06929996557453115
2023-11-27 02:52:57   INFO  epoch: 8/24, acc_iter=54146, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:39:55, time_cost(all): 17:16:39/1 day, 9:36:45, loss=0.485691074599661, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=1.0860801284079593, lr=0.06925987380179056
2023-11-27 02:53:55   INFO  epoch: 8/24, acc_iter=54196, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:42:08, time_cost(all): 17:17:37/1 day, 9:19:35, loss=0.485583532239558, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=3.3651660017848846, lr=0.06921978202904996
2023-11-27 02:54:52   INFO  epoch: 8/24, acc_iter=54246, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:41:05, time_cost(all): 17:18:34/1 day, 7:57:27, loss=0.485475989879455, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.3424634350937785, lr=0.06917969025630938
2023-11-27 02:55:50   INFO  epoch: 8/24, acc_iter=54296, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:32:30, time_cost(all): 17:19:32/1 day, 10:34:01, loss=0.485368447519353, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=2.4978905527750896, lr=0.06913959848356879
2023-11-27 02:56:48   INFO  epoch: 8/24, acc_iter=54346, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:35:25, time_cost(all): 17:20:30/1 day, 8:02:04, loss=0.48526090515925, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=1.413721550976056, lr=0.0690995067108282
2023-11-27 02:57:46   INFO  epoch: 8/24, acc_iter=54396, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:36:43, time_cost(all): 17:21:28/1 day, 8:59:17, loss=0.485153362799147, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=2.7189854094351658, lr=0.06905941493808762
2023-11-27 02:58:43   INFO  epoch: 8/24, acc_iter=54446, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:33:09, time_cost(all): 17:22:25/1 day, 8:43:36, loss=0.485045820439044, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=1.544694370371957, lr=0.06901932316534704
2023-11-27 02:59:41   INFO  epoch: 8/24, acc_iter=54496, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:28:23, time_cost(all): 17:23:23/1 day, 8:21:37, loss=0.484938278078942, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=2.352282424846208, lr=0.06897923139260645
2023-11-27 03:00:39   INFO  epoch: 8/24, acc_iter=54546, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:29:42, time_cost(all): 17:24:21/1 day, 7:43:09, loss=0.484830735718839, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=3.914363002563927, lr=0.06893913961986586
2023-11-27 03:01:37   INFO  epoch: 8/24, acc_iter=54596, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:29:48, time_cost(all): 17:25:19/1 day, 9:18:37, loss=0.484723193358736, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=4.233280799552746, lr=0.06889904784712526
2023-11-27 03:02:34   INFO  epoch: 8/24, acc_iter=54646, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:29:07, time_cost(all): 17:26:16/1 day, 7:55:49, loss=0.484615650998634, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=3.6190899633400697, lr=0.06885895607438468
2023-11-27 03:03:32   INFO  epoch: 8/24, acc_iter=54696, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:32:35, time_cost(all): 17:27:14/1 day, 9:49:07, loss=0.484508108638531, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=2.9861712721805, lr=0.06881886430164409
2023-11-27 03:04:30   INFO  epoch: 8/24, acc_iter=54746, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:28:20, time_cost(all): 17:28:12/1 day, 10:20:48, loss=0.484400566278428, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=4.9402992638765415, lr=0.0687787725289035
2023-11-27 03:05:28   INFO  epoch: 8/24, acc_iter=54796, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:28:58, time_cost(all): 17:29:10/1 day, 8:07:21, loss=0.484293023918326, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=4.41952304870137, lr=0.0687386807561629
2023-11-27 03:06:25   INFO  epoch: 8/24, acc_iter=54846, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:24:47, time_cost(all): 17:30:07/1 day, 8:32:02, loss=0.484185481558223, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=1.5658112196177372, lr=0.06869858898342232
2023-11-27 03:07:23   INFO  epoch: 8/24, acc_iter=54896, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:22:48, time_cost(all): 17:31:05/1 day, 7:48:41, loss=0.48407793919812, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=2.503308911805527, lr=0.06865849721068173
2023-11-27 03:08:21   INFO  epoch: 8/24, acc_iter=54946, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:25:47, time_cost(all): 17:32:03/1 day, 10:32:21, loss=0.483970396838018, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=0.9100183401650153, lr=0.06861840543794115
2023-11-27 03:09:19   INFO  epoch: 8/24, acc_iter=54996, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:40, time_cost(all): 17:33:01/1 day, 8:19:36, loss=0.483862854477915, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=4.567659612984086, lr=0.06857831366520056
2023-11-27 03:10:16   INFO  epoch: 8/24, acc_iter=55046, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:31, time_cost(all): 17:33:58/1 day, 7:39:46, loss=0.483755312117812, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=0.8754026862935984, lr=0.06853822189245998
2023-11-27 03:11:14   INFO  epoch: 8/24, acc_iter=55096, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:22:10, time_cost(all): 17:34:56/1 day, 8:10:59, loss=0.48364776975771, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=4.899546531384689, lr=0.06849813011971939
2023-11-27 03:12:12   INFO  epoch: 8/24, acc_iter=55146, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:18:38, time_cost(all): 17:35:54/1 day, 8:06:18, loss=0.483540227397607, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=3.661954932150658, lr=0.0684580383469788
2023-11-27 03:13:10   INFO  epoch: 8/24, acc_iter=55196, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:17:15, time_cost(all): 17:36:52/1 day, 9:12:41, loss=0.483432685037504, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.5332851943419572, lr=0.06841794657423822
2023-11-27 03:14:07   INFO  epoch: 8/24, acc_iter=55246, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:19:28, time_cost(all): 17:37:49/1 day, 10:21:39, loss=0.483325142677402, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.9064204874471833, lr=0.06837785480149762
2023-11-27 03:15:05   INFO  epoch: 8/24, acc_iter=55296, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:11, time_cost(all): 17:38:47/1 day, 8:53:21, loss=0.483217600317299, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=4.059545463271114, lr=0.06833776302875703
2023-11-27 03:16:03   INFO  epoch: 8/24, acc_iter=55346, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:58, time_cost(all): 17:39:45/1 day, 8:56:00, loss=0.483110057957196, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=4.407508659139511, lr=0.06829767125601643
2023-11-27 03:17:01   INFO  epoch: 8/24, acc_iter=55396, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:15:47, time_cost(all): 17:40:43/1 day, 9:06:30, loss=0.483002515597094, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=1.7506767116830428, lr=0.06825757948327586
2023-11-27 03:17:58   INFO  epoch: 8/24, acc_iter=55446, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:10:23, time_cost(all): 17:41:40/1 day, 9:06:00, loss=0.482894973236991, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.407404881465891, lr=0.06821748771053526
2023-11-27 03:18:56   INFO  epoch: 8/24, acc_iter=55496, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:16, time_cost(all): 17:42:38/1 day, 9:02:48, loss=0.482787430876888, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.2043498087009818, lr=0.06817739593779468
2023-11-27 03:19:54   INFO  epoch: 8/24, acc_iter=55546, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:11:59, time_cost(all): 17:43:36/1 day, 8:16:33, loss=0.482679888516786, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=4.939284136815688, lr=0.06813730416505409
2023-11-27 03:20:52   INFO  epoch: 8/24, acc_iter=55596, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:56, time_cost(all): 17:44:34/1 day, 7:51:03, loss=0.482572346156683, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=4.537927712900886, lr=0.0680972123923135
2023-11-27 03:21:49   INFO  epoch: 8/24, acc_iter=55646, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:14, time_cost(all): 17:45:31/1 day, 8:02:29, loss=0.48246480379658, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.363255526176622, lr=0.06805712061957292
2023-11-27 03:22:47   INFO  epoch: 8/24, acc_iter=55696, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:50, time_cost(all): 17:46:29/1 day, 8:17:55, loss=0.482357261436478, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=0.5111346622705282, lr=0.06801702884683233
2023-11-27 03:23:45   INFO  epoch: 8/24, acc_iter=55746, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:10:28, time_cost(all): 17:47:27/1 day, 7:37:02, loss=0.482249719076375, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=4.224840453714787, lr=0.06797693707409175
2023-11-27 03:24:43   INFO  epoch: 8/24, acc_iter=55796, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:09:12, time_cost(all): 17:48:25/1 day, 7:53:20, loss=0.482142176716272, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=4.470583306809831, lr=0.06793684530135115
2023-11-27 03:25:40   INFO  epoch: 8/24, acc_iter=55846, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:57, time_cost(all): 17:49:22/1 day, 7:57:32, loss=0.48203463435617, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=2.6613812656092795, lr=0.06789675352861058
2023-11-27 03:26:38   INFO  epoch: 8/24, acc_iter=55896, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:03:22, time_cost(all): 17:50:20/1 day, 7:53:39, loss=0.481927091996067, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=4.322368167205612, lr=0.06785666175586998
2023-11-27 03:27:36   INFO  epoch: 8/24, acc_iter=55946, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:05:17, time_cost(all): 17:51:18/1 day, 7:34:58, loss=0.481819549635964, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=0.9308680041123754, lr=0.06781656998312939
2023-11-27 03:28:34   INFO  epoch: 8/24, acc_iter=55996, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:12, time_cost(all): 17:52:16/1 day, 8:29:50, loss=0.481712007275861, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=0.9651456193877308, lr=0.06777647821038879
2023-11-27 03:29:31   INFO  epoch: 8/24, acc_iter=56046, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:03:58, time_cost(all): 17:53:13/1 day, 7:51:39, loss=0.481604464915759, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.4183690396322455, lr=0.0677363864376482
2023-11-27 03:30:29   INFO  epoch: 8/24, acc_iter=56096, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:58:52, time_cost(all): 17:54:11/1 day, 8:41:18, loss=0.481496922555656, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=1.3951443072748078, lr=0.06769629466490762
2023-11-27 03:31:27   INFO  epoch: 8/24, acc_iter=56146, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:25, time_cost(all): 17:55:09/1 day, 9:12:58, loss=0.481389380195553, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.898439075300848, lr=0.06765620289216703
2023-11-27 03:32:25   INFO  epoch: 8/24, acc_iter=56196, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:01:28, time_cost(all): 17:56:07/1 day, 8:41:02, loss=0.481281837835451, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=0.8106570839557032, lr=0.06761611111942645
2023-11-27 03:33:22   INFO  epoch: 8/24, acc_iter=56246, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:55:54, time_cost(all): 17:57:04/1 day, 9:18:22, loss=0.481174295475348, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.0075808692600778, lr=0.06757601934668586
2023-11-27 03:34:20   INFO  epoch: 8/24, acc_iter=56296, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:59:46, time_cost(all): 17:58:02/1 day, 10:16:02, loss=0.481066753115245, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=4.688621146080619, lr=0.06753592757394528
2023-11-27 03:35:18   INFO  epoch: 8/24, acc_iter=56346, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:58:41, time_cost(all): 17:59:00/1 day, 7:43:48, loss=0.480959210755143, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.6117196911314617, lr=0.06749583580120469
2023-11-27 03:36:16   INFO  epoch: 8/24, acc_iter=56396, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:57:42, time_cost(all): 17:59:58/1 day, 8:12:12, loss=0.48085166839504, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=3.4857415620013597, lr=0.0674557440284641
2023-11-27 03:37:13   INFO  epoch: 8/24, acc_iter=56446, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:49, time_cost(all): 18:00:55/1 day, 8:01:45, loss=0.480744126034937, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=3.9668805779375957, lr=0.0674156522557235
2023-11-27 03:38:11   INFO  epoch: 8/24, acc_iter=56496, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:54:54, time_cost(all): 18:01:53/1 day, 7:42:23, loss=0.480636583674835, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=1.6832954110993483, lr=0.06737556048298292
2023-11-27 03:39:09   INFO  epoch: 8/24, acc_iter=56546, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:53:24, time_cost(all): 18:02:51/1 day, 9:15:09, loss=0.480529041314732, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.9152548658244117, lr=0.06733546871024233
2023-11-27 03:40:07   INFO  epoch: 8/24, acc_iter=56596, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:28, time_cost(all): 18:03:49/1 day, 10:14:37, loss=0.480421498954629, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=2.714043708417982, lr=0.06729537693750175
2023-11-27 03:41:04   INFO  epoch: 8/24, acc_iter=56646, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:11, time_cost(all): 18:04:46/1 day, 7:17:01, loss=0.480313956594527, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=1.756445320137876, lr=0.06725528516476115
2023-11-27 03:42:02   INFO  epoch: 8/24, acc_iter=56696, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:47:33, time_cost(all): 18:05:44/1 day, 8:02:58, loss=0.480206414234424, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=4.05135383492296, lr=0.06721519339202056
2023-11-27 03:43:00   INFO  epoch: 8/24, acc_iter=56746, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:48:17, time_cost(all): 18:06:42/1 day, 8:02:02, loss=0.480098871874321, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.4969833726062594, lr=0.06717510161927998
2023-11-27 03:43:58   INFO  epoch: 8/24, acc_iter=56796, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:15, time_cost(all): 18:07:40/1 day, 9:40:39, loss=0.479991329514219, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=0.7380472174001147, lr=0.06713500984653939
2023-11-27 03:44:55   INFO  epoch: 8/24, acc_iter=56846, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:45:51, time_cost(all): 18:08:37/1 day, 6:58:50, loss=0.479883787154116, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=0.8813609779606398, lr=0.0670949180737988
2023-11-27 03:45:53   INFO  epoch: 8/24, acc_iter=56896, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:06, time_cost(all): 18:09:35/1 day, 9:58:12, loss=0.479776244794013, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.336493567047391, lr=0.06705482630105822
2023-11-27 03:46:51   INFO  epoch: 8/24, acc_iter=56946, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:37, time_cost(all): 18:10:33/1 day, 7:27:22, loss=0.479668702433911, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=0.9237004364482865, lr=0.06701473452831763
2023-11-27 03:47:49   INFO  epoch: 8/24, acc_iter=56996, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:44:49, time_cost(all): 18:11:31/1 day, 9:55:47, loss=0.479561160073808, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=2.751187874024126, lr=0.06697464275557705
2023-11-27 03:48:46   INFO  epoch: 8/24, acc_iter=57046, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:04, time_cost(all): 18:12:28/1 day, 9:45:00, loss=0.479453617713705, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.529754788855571, lr=0.06693455098283646
2023-11-27 03:49:44   INFO  epoch: 8/24, acc_iter=57096, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:43:45, time_cost(all): 18:13:26/1 day, 8:10:14, loss=0.479346075353603, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=3.3827129618664964, lr=0.06689445921009586
2023-11-27 03:50:42   INFO  epoch: 8/24, acc_iter=57146, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:43:01, time_cost(all): 18:14:24/1 day, 7:31:14, loss=0.4792385329935, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=2.6956909018220943, lr=0.06685436743735527
2023-11-27 03:51:40   INFO  epoch: 8/24, acc_iter=57196, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:16, time_cost(all): 18:15:22/1 day, 6:54:29, loss=0.479130990633397, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.840014770948491, lr=0.06681427566461469
2023-11-27 03:52:37   INFO  epoch: 8/24, acc_iter=57246, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:38:30, time_cost(all): 18:16:19/1 day, 9:11:57, loss=0.479023448273294, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=0.5999326102383091, lr=0.0667741838918741
2023-11-27 03:53:35   INFO  epoch: 8/24, acc_iter=57296, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:12, time_cost(all): 18:17:17/1 day, 9:26:42, loss=0.478915905913192, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=2.086945707023086, lr=0.0667340921191335
2023-11-27 03:54:33   INFO  epoch: 8/24, acc_iter=57346, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:40, time_cost(all): 18:18:15/1 day, 9:46:28, loss=0.478808363553089, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=2.8892192434851456, lr=0.06669400034639292
2023-11-27 03:55:31   INFO  epoch: 8/24, acc_iter=57396, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:18, time_cost(all): 18:19:13/1 day, 8:50:41, loss=0.478700821192986, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.40933023277382, lr=0.06665390857365233
2023-11-27 03:56:28   INFO  epoch: 8/24, acc_iter=57446, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:37:06, time_cost(all): 18:20:10/1 day, 7:29:38, loss=0.478593278832884, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=2.7268616156762855, lr=0.06661381680091175
2023-11-27 03:57:26   INFO  epoch: 8/24, acc_iter=57496, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:27, time_cost(all): 18:21:08/1 day, 8:22:58, loss=0.478485736472781, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=2.990681469991761, lr=0.06657372502817116
2023-11-27 03:58:24   INFO  epoch: 8/24, acc_iter=57546, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:06, time_cost(all): 18:22:06/1 day, 7:28:22, loss=0.478378194112678, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=4.622676661713732, lr=0.06653363325543057
2023-11-27 03:59:22   INFO  epoch: 8/24, acc_iter=57596, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:45, time_cost(all): 18:23:04/1 day, 9:51:10, loss=0.478270651752576, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=1.4981807126266482, lr=0.06649354148268999
2023-11-27 04:00:19   INFO  epoch: 8/24, acc_iter=57646, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:37, time_cost(all): 18:24:01/1 day, 7:20:31, loss=0.478163109392473, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=1.8138915599816827, lr=0.0664534497099494
2023-11-27 04:01:17   INFO  epoch: 8/24, acc_iter=57696, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:03, time_cost(all): 18:24:59/1 day, 8:10:12, loss=0.47805556703237, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=0.7758935267072871, lr=0.06641335793720882
2023-11-27 04:02:15   INFO  epoch: 8/24, acc_iter=57746, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:05, time_cost(all): 18:25:57/1 day, 7:31:14, loss=0.477948024672268, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.043693694859365, lr=0.06637326616446822
2023-11-27 04:03:13   INFO  epoch: 8/24, acc_iter=57796, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:57, time_cost(all): 18:26:55/1 day, 6:51:04, loss=0.477840482312165, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=3.1003156314317963, lr=0.06633317439172763
2023-11-27 04:04:10   INFO  epoch: 8/24, acc_iter=57846, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:10, time_cost(all): 18:27:52/1 day, 8:17:09, loss=0.477732939952062, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=0.8684592998602216, lr=0.06629308261898705
2023-11-27 04:05:08   INFO  epoch: 8/24, acc_iter=57896, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:31, time_cost(all): 18:28:50/1 day, 7:33:09, loss=0.47762539759196, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=1.7373370897605376, lr=0.06625299084624646
2023-11-27 04:06:06   INFO  epoch: 8/24, acc_iter=57946, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:26, time_cost(all): 18:29:48/1 day, 9:03:28, loss=0.477517855231857, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=3.8560142814330254, lr=0.06621289907350586
2023-11-27 04:07:04   INFO  epoch: 8/24, acc_iter=57996, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:29, time_cost(all): 18:30:46/1 day, 7:56:07, loss=0.477410312871754, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.492112630606538, lr=0.06617280730076527
2023-11-27 04:08:01   INFO  epoch: 8/24, acc_iter=58046, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:53, time_cost(all): 18:31:43/1 day, 9:21:51, loss=0.477302770511652, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=0.8774899589366972, lr=0.06613271552802469
2023-11-27 04:08:59   INFO  epoch: 8/24, acc_iter=58096, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:23, time_cost(all): 18:32:41/1 day, 7:46:25, loss=0.477195228151549, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.464555515318591, lr=0.0660926237552841
2023-11-27 04:09:57   INFO  epoch: 8/24, acc_iter=58146, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:49, time_cost(all): 18:33:39/1 day, 8:14:25, loss=0.477087685791446, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=1.8334535196772812, lr=0.06605253198254352
2023-11-27 04:10:55   INFO  epoch: 8/24, acc_iter=58196, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:22, time_cost(all): 18:34:37/1 day, 7:13:15, loss=0.476980143431344, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.789644345124926, lr=0.06601244020980293
2023-11-27 04:11:52   INFO  epoch: 8/24, acc_iter=58246, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:37, time_cost(all): 18:35:34/1 day, 7:42:04, loss=0.476872601071241, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.02(1.03), norm=1.121454201295577, lr=0.06597234843706234
2023-11-27 04:12:50   INFO  epoch: 8/24, acc_iter=58296, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:48, time_cost(all): 18:36:32/1 day, 9:14:31, loss=0.476765058711138, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=4.200331868745388, lr=0.06593225666432176
2023-11-27 04:13:48   INFO  epoch: 8/24, acc_iter=58346, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:55, time_cost(all): 18:37:30/1 day, 8:11:09, loss=0.476657516351036, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=0.7005928402652711, lr=0.06589216489158117
2023-11-27 04:14:46   INFO  epoch: 8/24, acc_iter=58396, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:29, time_cost(all): 18:38:28/1 day, 6:53:49, loss=0.476549973990933, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.8869883333375412, lr=0.06585207311884057
2023-11-27 04:15:43   INFO  epoch: 8/24, acc_iter=58446, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:35, time_cost(all): 18:39:25/1 day, 7:19:26, loss=0.47644243163083, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=4.7834722647045815, lr=0.06581198134609999
2023-11-27 04:16:41   INFO  epoch: 8/24, acc_iter=58496, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:31, time_cost(all): 18:40:23/1 day, 9:02:37, loss=0.476334889270728, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=3.5590394472945666, lr=0.06577188957335939
2023-11-27 04:17:39   INFO  epoch: 8/24, acc_iter=58546, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:48, time_cost(all): 18:41:21/1 day, 9:18:52, loss=0.476227346910625, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=1.3640605358495044, lr=0.0657317978006188
2023-11-27 04:18:37   INFO  epoch: 8/24, acc_iter=58596, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:38, time_cost(all): 18:42:19/1 day, 7:41:06, loss=0.476119804550522, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.1064567569820825, lr=0.06569170602787822
2023-11-27 04:19:34   INFO  epoch: 8/24, acc_iter=58646, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:55, time_cost(all): 18:43:16/1 day, 8:35:22, loss=0.476012262190419, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.1207282130613412, lr=0.06565161425513763
2023-11-27 04:20:32   INFO  epoch: 8/24, acc_iter=58696, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:12, time_cost(all): 18:44:14/1 day, 8:05:12, loss=0.475904719830317, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=3.249099720949547, lr=0.06561152248239704
2023-11-27 04:21:30   INFO  epoch: 8/24, acc_iter=58746, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:22, time_cost(all): 18:45:12/1 day, 7:38:04, loss=0.475797177470214, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.336627067988613, lr=0.06557143070965646
2023-11-27 04:22:28   INFO  epoch: 8/24, acc_iter=58796, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:12, time_cost(all): 18:46:10/1 day, 7:56:59, loss=0.475689635110111, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.13390576149663, lr=0.06553133893691587
2023-11-27 04:23:25   INFO  epoch: 8/24, acc_iter=58846, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:14, time_cost(all): 18:47:07/1 day, 6:37:11, loss=0.475582092750009, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.9277988653427225, lr=0.06549124716417529
2023-11-27 04:24:23   INFO  epoch: 8/24, acc_iter=58896, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:30, time_cost(all): 18:48:05/1 day, 6:56:32, loss=0.475474550389906, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=1.6527764058586236, lr=0.0654511553914347
2023-11-27 04:25:21   INFO  epoch: 8/24, acc_iter=58946, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:14, time_cost(all): 18:49:03/1 day, 8:39:02, loss=0.475367008029803, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=1.0423681324857033, lr=0.0654110636186941
2023-11-27 04:26:19   INFO  epoch: 8/24, acc_iter=58996, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:41, time_cost(all): 18:50:01/1 day, 7:11:47, loss=0.475259465669701, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=4.14252691146952, lr=0.06537097184595353
2023-11-27 04:27:16   INFO  epoch: 8/24, acc_iter=59046, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:41, time_cost(all): 18:50:58/1 day, 7:54:55, loss=0.475151923309598, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=2.9755035790621296, lr=0.06533088007321293
2023-11-27 04:28:14   INFO  epoch: 8/24, acc_iter=59096, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:34, time_cost(all): 18:51:56/1 day, 7:54:40, loss=0.475044380949495, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=2.611254384995029, lr=0.06529078830047234
2023-11-27 04:29:12   INFO  epoch: 8/24, acc_iter=59146, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:30, time_cost(all): 18:52:54/1 day, 8:54:38, loss=0.474936838589393, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=3.552829395871294, lr=0.06525069652773174
2023-11-27 04:30:10   INFO  epoch: 8/24, acc_iter=59196, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:43, time_cost(all): 18:53:52/1 day, 6:44:54, loss=0.47482929622929, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=2.827048669504393, lr=0.06521060475499116
2023-11-27 04:31:07   INFO  epoch: 8/24, acc_iter=59246, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 18:54:49/1 day, 7:27:43, loss=0.474721753869187, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=1.3122894355992836, lr=0.06517051298225057
2023-11-27 04:32:05   INFO  epoch: 9/24, acc_iter=59333, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:09:33, time_cost(all): 18:55:47/1 day, 6:56:33, loss=0.474534630162609, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.716617181331446, lr=0.06510075329768195
2023-11-27 04:33:03   INFO  epoch: 9/24, acc_iter=59383, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:04:38, time_cost(all): 18:56:45/1 day, 6:47:53, loss=0.474427087802506, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.28513728072539, lr=0.06506066152494136
2023-11-27 04:34:01   INFO  epoch: 9/24, acc_iter=59433, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:05:10, time_cost(all): 18:57:43/1 day, 6:56:01, loss=0.474319545442403, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=0.7712204793763164, lr=0.06502056975220077
2023-11-27 04:34:59   INFO  epoch: 9/24, acc_iter=59483, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:04:21, time_cost(all): 18:58:41/1 day, 7:36:46, loss=0.474212003082301, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=2.1711665144781658, lr=0.06498047797946019
2023-11-27 04:35:56   INFO  epoch: 9/24, acc_iter=59533, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:56:59, time_cost(all): 18:59:38/1 day, 6:57:43, loss=0.474104460722198, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=3.3090500722584832, lr=0.0649403862067196
2023-11-27 04:36:54   INFO  epoch: 9/24, acc_iter=59583, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:06:59, time_cost(all): 19:00:36/1 day, 9:06:08, loss=0.473996918362095, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=2.556881762865212, lr=0.06490029443397902
2023-11-27 04:37:52   INFO  epoch: 9/24, acc_iter=59633, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:03:32, time_cost(all): 19:01:34/1 day, 7:39:29, loss=0.473889376001993, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=2.4009878543758605, lr=0.06486020266123843
2023-11-27 04:38:50   INFO  epoch: 9/24, acc_iter=59683, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:54:23, time_cost(all): 19:02:32/1 day, 6:38:51, loss=0.47378183364189, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=4.7038379060928435, lr=0.06482011088849784
2023-11-27 04:39:47   INFO  epoch: 9/24, acc_iter=59733, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:03:41, time_cost(all): 19:03:29/1 day, 8:56:34, loss=0.473674291281787, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.0860272730623857, lr=0.06478001911575726
2023-11-27 04:40:45   INFO  epoch: 9/24, acc_iter=59783, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:58:35, time_cost(all): 19:04:27/1 day, 7:53:42, loss=0.473566748921685, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=2.8398404580442125, lr=0.06473992734301666
2023-11-27 04:41:43   INFO  epoch: 9/24, acc_iter=59833, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:50:53, time_cost(all): 19:05:25/1 day, 8:22:12, loss=0.473459206561582, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=1.8503262501662894, lr=0.06469983557027607
2023-11-27 04:42:41   INFO  epoch: 9/24, acc_iter=59883, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:52:42, time_cost(all): 19:06:23/1 day, 7:21:46, loss=0.473351664201479, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=3.296598014786557, lr=0.06465974379753549
2023-11-27 04:43:38   INFO  epoch: 9/24, acc_iter=59933, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:56:12, time_cost(all): 19:07:20/1 day, 7:20:08, loss=0.473244121841377, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=4.206251272998543, lr=0.0646196520247949
2023-11-27 04:44:36   INFO  epoch: 9/24, acc_iter=59983, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:56:46, time_cost(all): 19:08:18/1 day, 8:51:00, loss=0.473136579481274, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=2.2224509100557066, lr=0.0645795602520543
2023-11-27 04:45:34   INFO  epoch: 9/24, acc_iter=60033, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:48:54, time_cost(all): 19:09:16/1 day, 6:09:59, loss=0.473029037121171, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=2.8823447011123173, lr=0.06453946847931372
2023-11-27 04:46:32   INFO  epoch: 9/24, acc_iter=60083, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:47:27, time_cost(all): 19:10:14/1 day, 8:58:09, loss=0.472921494761068, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.81785105996489, lr=0.06449937670657313
2023-11-27 04:47:29   INFO  epoch: 9/24, acc_iter=60133, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:55:36, time_cost(all): 19:11:11/1 day, 6:48:13, loss=0.472813952400966, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=3.5511695067160156, lr=0.06445928493383254
2023-11-27 04:48:27   INFO  epoch: 9/24, acc_iter=60183, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:53:40, time_cost(all): 19:12:09/1 day, 7:29:56, loss=0.472706410040863, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=1.139856832027841, lr=0.06441919316109196
2023-11-27 04:49:25   INFO  epoch: 9/24, acc_iter=60233, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:51:36, time_cost(all): 19:13:07/1 day, 7:18:27, loss=0.47259886768076, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=4.387042784733186, lr=0.06437910138835137
2023-11-27 04:50:23   INFO  epoch: 9/24, acc_iter=60283, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:48:58, time_cost(all): 19:14:05/1 day, 6:53:19, loss=0.472491325320658, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=0.9626850503398114, lr=0.06433900961561079
2023-11-27 04:51:20   INFO  epoch: 9/24, acc_iter=60333, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:48, time_cost(all): 19:15:02/1 day, 7:16:11, loss=0.472383782960555, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=2.8708619842325906, lr=0.0642989178428702
2023-11-27 04:52:18   INFO  epoch: 9/24, acc_iter=60383, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:44:16, time_cost(all): 19:16:00/1 day, 8:49:05, loss=0.472276240600452, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.381393441696089, lr=0.06425882607012962
2023-11-27 04:53:16   INFO  epoch: 9/24, acc_iter=60433, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:44:42, time_cost(all): 19:16:58/1 day, 6:05:48, loss=0.47216869824035, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=4.408904433730979, lr=0.06421873429738902
2023-11-27 04:54:14   INFO  epoch: 9/24, acc_iter=60483, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:48:20, time_cost(all): 19:17:56/1 day, 6:47:05, loss=0.472061155880247, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=2.5769665322825244, lr=0.06417864252464843
2023-11-27 04:55:11   INFO  epoch: 9/24, acc_iter=60533, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:40:46, time_cost(all): 19:18:53/1 day, 8:47:35, loss=0.471953613520144, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=2.2241837555223603, lr=0.06413855075190783
2023-11-27 04:56:09   INFO  epoch: 9/24, acc_iter=60583, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:45:02, time_cost(all): 19:19:51/1 day, 6:58:28, loss=0.471846071160042, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=1.1815697342605438, lr=0.06409845897916724
2023-11-27 04:57:07   INFO  epoch: 9/24, acc_iter=60633, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:42:14, time_cost(all): 19:20:49/1 day, 6:28:42, loss=0.471738528799939, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=3.8320160046850034, lr=0.06405836720642666
2023-11-27 04:58:05   INFO  epoch: 9/24, acc_iter=60683, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:41:49, time_cost(all): 19:21:47/1 day, 5:48:22, loss=0.471630986439836, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=3.6137791029912134, lr=0.06401827543368607
2023-11-27 04:59:02   INFO  epoch: 9/24, acc_iter=60733, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:37:27, time_cost(all): 19:22:44/1 day, 8:43:15, loss=0.471523444079734, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=4.456407124821154, lr=0.06397818366094549
2023-11-27 05:00:00   INFO  epoch: 9/24, acc_iter=60783, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:34:03, time_cost(all): 19:23:42/1 day, 6:48:26, loss=0.471415901719631, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=4.211789163743557, lr=0.0639380918882049
2023-11-27 05:00:58   INFO  epoch: 9/24, acc_iter=60833, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:38:07, time_cost(all): 19:24:40/1 day, 6:11:35, loss=0.471308359359528, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=4.402633855356921, lr=0.06389800011546432
2023-11-27 05:01:56   INFO  epoch: 9/24, acc_iter=60883, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:33:20, time_cost(all): 19:25:38/1 day, 6:30:50, loss=0.471200816999426, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.6278331704038185, lr=0.06385790834272373
2023-11-27 05:02:53   INFO  epoch: 9/24, acc_iter=60933, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:31:52, time_cost(all): 19:26:35/1 day, 5:51:50, loss=0.471093274639323, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=4.203963981679135, lr=0.06381781656998314
2023-11-27 05:03:51   INFO  epoch: 9/24, acc_iter=60983, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:35:58, time_cost(all): 19:27:33/1 day, 7:25:19, loss=0.47098573227922, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=3.379552373271329, lr=0.06377772479724254
2023-11-27 05:04:49   INFO  epoch: 9/24, acc_iter=61033, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:36:11, time_cost(all): 19:28:31/1 day, 6:49:26, loss=0.470878189919118, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=1.3318650751699768, lr=0.06373763302450197
2023-11-27 05:05:47   INFO  epoch: 9/24, acc_iter=61083, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:34:36, time_cost(all): 19:29:29/1 day, 6:05:40, loss=0.470770647559015, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=4.173694728018807, lr=0.06369754125176137
2023-11-27 05:06:44   INFO  epoch: 9/24, acc_iter=61133, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:28:22, time_cost(all): 19:30:26/1 day, 7:42:49, loss=0.470663105198912, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=2.480428036735289, lr=0.06365744947902079
2023-11-27 05:07:42   INFO  epoch: 9/24, acc_iter=61183, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:31:28, time_cost(all): 19:31:24/1 day, 7:44:17, loss=0.47055556283881, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=1.061056551808656, lr=0.06361735770628019
2023-11-27 05:08:40   INFO  epoch: 9/24, acc_iter=61233, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:29:13, time_cost(all): 19:32:22/1 day, 6:26:53, loss=0.470448020478707, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=1.2051571244741475, lr=0.0635772659335396
2023-11-27 05:09:38   INFO  epoch: 9/24, acc_iter=61283, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:32:20, time_cost(all): 19:33:20/1 day, 6:18:52, loss=0.470340478118604, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=2.6651646441028216, lr=0.06353717416079901
2023-11-27 05:10:35   INFO  epoch: 9/24, acc_iter=61333, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:27:22, time_cost(all): 19:34:17/1 day, 6:54:16, loss=0.470232935758502, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=1.6157583637543356, lr=0.06349708238805843
2023-11-27 05:11:33   INFO  epoch: 9/24, acc_iter=61383, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:25:56, time_cost(all): 19:35:15/1 day, 5:51:38, loss=0.470125393398399, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=3.8424634738193237, lr=0.06345699061531784
2023-11-27 05:12:31   INFO  epoch: 9/24, acc_iter=61433, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:29:02, time_cost(all): 19:36:13/1 day, 7:53:16, loss=0.470017851038296, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=3.7321005837946073, lr=0.06341689884257726
2023-11-27 05:13:29   INFO  epoch: 9/24, acc_iter=61483, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:22:38, time_cost(all): 19:37:11/1 day, 6:49:47, loss=0.469910308678194, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=3.761113339007836, lr=0.06337680706983667
2023-11-27 05:14:26   INFO  epoch: 9/24, acc_iter=61533, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:27:33, time_cost(all): 19:38:08/1 day, 5:43:55, loss=0.469802766318091, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=1.6684005891383462, lr=0.06333671529709609
2023-11-27 05:15:24   INFO  epoch: 9/24, acc_iter=61583, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:24:13, time_cost(all): 19:39:06/1 day, 8:31:15, loss=0.469695223957988, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=1.5909494572892395, lr=0.0632966235243555
2023-11-27 05:16:22   INFO  epoch: 9/24, acc_iter=61633, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:19:20, time_cost(all): 19:40:04/1 day, 6:50:26, loss=0.469587681597885, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=2.549444915632234, lr=0.0632565317516149
2023-11-27 05:17:20   INFO  epoch: 9/24, acc_iter=61683, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:21:47, time_cost(all): 19:41:02/1 day, 7:56:12, loss=0.469480139237783, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=0.6281468440135856, lr=0.06321643997887431
2023-11-27 05:18:17   INFO  epoch: 9/24, acc_iter=61733, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:23:04, time_cost(all): 19:41:59/1 day, 6:46:42, loss=0.46937259687768, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=2.612689900428184, lr=0.06317634820613373
2023-11-27 05:19:15   INFO  epoch: 9/24, acc_iter=61783, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:01, time_cost(all): 19:42:57/1 day, 8:01:25, loss=0.469265054517577, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=0.7328536114598089, lr=0.06313625643339314
2023-11-27 05:20:13   INFO  epoch: 9/24, acc_iter=61833, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:18:09, time_cost(all): 19:43:55/1 day, 8:15:54, loss=0.469157512157475, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=1.9051635969450036, lr=0.06309616466065254
2023-11-27 05:21:11   INFO  epoch: 9/24, acc_iter=61883, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:33, time_cost(all): 19:44:53/1 day, 7:18:47, loss=0.469049969797372, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.3405814559966744, lr=0.06305607288791196
2023-11-27 05:22:08   INFO  epoch: 9/24, acc_iter=61933, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:41, time_cost(all): 19:45:50/1 day, 7:29:08, loss=0.468942427437269, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.6210758065306927, lr=0.06301598111517137
2023-11-27 05:23:06   INFO  epoch: 9/24, acc_iter=61983, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:28, time_cost(all): 19:46:48/1 day, 6:12:11, loss=0.468834885077167, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=0.8732217527066832, lr=0.06297588934243079
2023-11-27 05:24:04   INFO  epoch: 9/24, acc_iter=62033, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:15:01, time_cost(all): 19:47:46/1 day, 5:41:22, loss=0.468727342717064, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=4.60859687667888, lr=0.0629357975696902
2023-11-27 05:25:02   INFO  epoch: 9/24, acc_iter=62083, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:14:48, time_cost(all): 19:48:44/1 day, 6:36:15, loss=0.468619800356961, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=3.7102980059788924, lr=0.06289570579694961
2023-11-27 05:25:59   INFO  epoch: 9/24, acc_iter=62133, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:08:59, time_cost(all): 19:49:41/1 day, 6:47:17, loss=0.468512257996859, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.9668092529736065, lr=0.06285561402420903
2023-11-27 05:26:57   INFO  epoch: 9/24, acc_iter=62183, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:08:21, time_cost(all): 19:50:39/1 day, 7:57:34, loss=0.468404715636756, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=2.890839864373261, lr=0.06281552225146844
2023-11-27 05:27:55   INFO  epoch: 9/24, acc_iter=62233, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:54, time_cost(all): 19:51:37/1 day, 5:37:05, loss=0.468297173276653, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=3.3459958733493305, lr=0.06277543047872786
2023-11-27 05:28:53   INFO  epoch: 9/24, acc_iter=62283, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:10:50, time_cost(all): 19:52:35/1 day, 6:17:23, loss=0.468189630916551, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=3.8796982904305484, lr=0.06273533870598726
2023-11-27 05:29:50   INFO  epoch: 9/24, acc_iter=62333, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:06:00, time_cost(all): 19:53:32/1 day, 7:33:47, loss=0.468082088556448, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=2.707464160989391, lr=0.06269524693324667
2023-11-27 05:30:48   INFO  epoch: 9/24, acc_iter=62383, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:09:46, time_cost(all): 19:54:30/1 day, 5:38:57, loss=0.467974546196345, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.1150486414751175, lr=0.06265515516050608
2023-11-27 05:31:46   INFO  epoch: 9/24, acc_iter=62433, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:09:03, time_cost(all): 19:55:28/1 day, 6:28:01, loss=0.467867003836243, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=4.680765094295065, lr=0.06261506338776548
2023-11-27 05:32:44   INFO  epoch: 9/24, acc_iter=62483, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:06:51, time_cost(all): 19:56:26/1 day, 6:54:39, loss=0.46775946147614, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=0.7759456742342606, lr=0.0625749716150249
2023-11-27 05:33:41   INFO  epoch: 9/24, acc_iter=62533, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:06:58, time_cost(all): 19:57:23/1 day, 7:41:44, loss=0.467651919116037, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=3.7501462143760627, lr=0.06253487984228431
2023-11-27 05:34:39   INFO  epoch: 9/24, acc_iter=62583, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:02:41, time_cost(all): 19:58:21/1 day, 6:12:43, loss=0.467544376755935, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=1.0414189670847764, lr=0.062494788069543734
2023-11-27 05:35:37   INFO  epoch: 9/24, acc_iter=62633, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/0:59:27, time_cost(all): 19:59:19/1 day, 5:15:28, loss=0.467436834395832, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=2.69640284957181, lr=0.06245469629680314
2023-11-27 05:36:35   INFO  epoch: 9/24, acc_iter=62683, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:24, time_cost(all): 20:00:17/1 day, 6:44:05, loss=0.467329292035729, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=2.673885785556918, lr=0.062414604524062556
2023-11-27 05:37:32   INFO  epoch: 9/24, acc_iter=62733, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:02:52, time_cost(all): 20:01:14/1 day, 7:35:01, loss=0.467221749675626, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=2.8896248849519233, lr=0.06237451275132196
2023-11-27 05:38:30   INFO  epoch: 9/24, acc_iter=62783, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:00:30, time_cost(all): 20:02:12/1 day, 5:34:24, loss=0.467114207315524, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=2.232137639049502, lr=0.062334420978581384
2023-11-27 05:39:28   INFO  epoch: 9/24, acc_iter=62833, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:55:32, time_cost(all): 20:03:10/1 day, 5:57:28, loss=0.467006664955421, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=4.165443321616995, lr=0.06229432920584079
2023-11-27 05:40:26   INFO  epoch: 9/24, acc_iter=62883, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:54:38, time_cost(all): 20:04:08/1 day, 6:40:09, loss=0.466899122595318, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.181763024389102, lr=0.062254237433100205
2023-11-27 05:41:23   INFO  epoch: 9/24, acc_iter=62933, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:13, time_cost(all): 20:05:05/1 day, 7:26:19, loss=0.466791580235216, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=2.6011986246633634, lr=0.06221414566035961
2023-11-27 05:42:21   INFO  epoch: 9/24, acc_iter=62983, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:57:02, time_cost(all): 20:06:03/1 day, 8:05:07, loss=0.466684037875113, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=3.6872503098260783, lr=0.06217405388761903
2023-11-27 05:43:19   INFO  epoch: 9/24, acc_iter=63033, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:29, time_cost(all): 20:07:01/1 day, 7:43:35, loss=0.46657649551501, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=0.8757131400019553, lr=0.06213396211487844
2023-11-27 05:44:17   INFO  epoch: 9/24, acc_iter=63083, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:56:08, time_cost(all): 20:07:59/1 day, 6:18:53, loss=0.466468953154908, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.969679942171396, lr=0.06209387034213785
2023-11-27 05:45:14   INFO  epoch: 9/24, acc_iter=63133, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:52:38, time_cost(all): 20:08:56/1 day, 8:02:25, loss=0.466361410794805, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=4.43424268261, lr=0.06205377856939726
2023-11-27 05:46:12   INFO  epoch: 9/24, acc_iter=63183, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:54:12, time_cost(all): 20:09:54/1 day, 5:02:15, loss=0.466253868434702, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=2.3875504266333167, lr=0.06201368679665667
2023-11-27 05:47:10   INFO  epoch: 9/24, acc_iter=63233, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:48:16, time_cost(all): 20:10:52/1 day, 7:21:16, loss=0.4661463260746, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=0.855521094894483, lr=0.06197359502391609
2023-11-27 05:48:08   INFO  epoch: 9/24, acc_iter=63283, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:51:36, time_cost(all): 20:11:50/1 day, 7:47:46, loss=0.466038783714497, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=1.751744266430273, lr=0.0619335032511755
2023-11-27 05:49:05   INFO  epoch: 9/24, acc_iter=63333, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:48:43, time_cost(all): 20:12:47/1 day, 7:31:09, loss=0.465931241354394, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=3.024840132714717, lr=0.06189341147843491
2023-11-27 05:50:03   INFO  epoch: 9/24, acc_iter=63383, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:49:18, time_cost(all): 20:13:45/1 day, 7:11:06, loss=0.465823698994292, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=2.6909464942856984, lr=0.06185331970569432
2023-11-27 05:51:01   INFO  epoch: 9/24, acc_iter=63433, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:45:28, time_cost(all): 20:14:43/1 day, 7:35:41, loss=0.465716156634189, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.3085881897314735, lr=0.06181322793295373
2023-11-27 05:51:59   INFO  epoch: 9/24, acc_iter=63483, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:38, time_cost(all): 20:15:41/1 day, 5:19:53, loss=0.465608614274086, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.22663037211079, lr=0.06177313616021315
2023-11-27 05:52:56   INFO  epoch: 9/24, acc_iter=63533, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:23, time_cost(all): 20:16:38/1 day, 6:17:52, loss=0.465501071913984, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=3.1634888542775155, lr=0.061733044387472555
2023-11-27 05:53:54   INFO  epoch: 9/24, acc_iter=63583, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:42:12, time_cost(all): 20:17:36/1 day, 7:30:31, loss=0.465393529553881, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=1.7591355456809274, lr=0.06169295261473197
2023-11-27 05:54:52   INFO  epoch: 9/24, acc_iter=63633, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:45:04, time_cost(all): 20:18:34/1 day, 7:00:13, loss=0.465285987193778, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=2.3985388229025637, lr=0.06165286084199138
2023-11-27 05:55:50   INFO  epoch: 9/24, acc_iter=63683, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:08, time_cost(all): 20:19:32/1 day, 5:16:50, loss=0.465178444833676, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=3.730922481591964, lr=0.0616127690692508
2023-11-27 05:56:47   INFO  epoch: 9/24, acc_iter=63733, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:54, time_cost(all): 20:20:29/1 day, 7:13:53, loss=0.465070902473573, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=4.572127331114366, lr=0.061572677296510205
2023-11-27 05:57:45   INFO  epoch: 9/24, acc_iter=63783, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:19, time_cost(all): 20:21:27/1 day, 7:51:57, loss=0.46496336011347, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=2.6582492868342613, lr=0.06153258552376962
2023-11-27 05:58:43   INFO  epoch: 9/24, acc_iter=63833, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:38:43, time_cost(all): 20:22:25/1 day, 7:50:52, loss=0.464855817753368, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=2.828303301349439, lr=0.061492493751029026
2023-11-27 05:59:41   INFO  epoch: 9/24, acc_iter=63883, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:45, time_cost(all): 20:23:23/1 day, 5:24:02, loss=0.464748275393265, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=0.6653728141005728, lr=0.06145240197828844
2023-11-27 06:00:38   INFO  epoch: 9/24, acc_iter=63933, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:07, time_cost(all): 20:24:20/1 day, 6:05:29, loss=0.464640733033162, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=0.9994405675093476, lr=0.061412310205547854
2023-11-27 06:01:36   INFO  epoch: 9/24, acc_iter=63983, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:00, time_cost(all): 20:25:18/1 day, 7:14:37, loss=0.46453319067306, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=0.9219761315232747, lr=0.06137221843280727
2023-11-27 06:02:34   INFO  epoch: 9/24, acc_iter=64033, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:49, time_cost(all): 20:26:16/1 day, 6:14:14, loss=0.464425648312957, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.4009647445398734, lr=0.061332126660066676
2023-11-27 06:03:32   INFO  epoch: 9/24, acc_iter=64083, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:24, time_cost(all): 20:27:14/1 day, 6:24:00, loss=0.464318105952854, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=3.1091042090441046, lr=0.06129203488732609
2023-11-27 06:04:29   INFO  epoch: 9/24, acc_iter=64133, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:56, time_cost(all): 20:28:11/1 day, 6:29:30, loss=0.464210563592751, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=2.4211384638102293, lr=0.061251943114585504
2023-11-27 06:05:27   INFO  epoch: 9/24, acc_iter=64183, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:24, time_cost(all): 20:29:09/1 day, 6:45:33, loss=0.464103021232649, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.4114729152932464, lr=0.06121185134184491
2023-11-27 06:06:25   INFO  epoch: 9/24, acc_iter=64233, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:37, time_cost(all): 20:30:07/1 day, 6:11:58, loss=0.463995478872546, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=0.8183263786288606, lr=0.061171759569104325
2023-11-27 06:07:23   INFO  epoch: 9/24, acc_iter=64283, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:34, time_cost(all): 20:31:05/1 day, 5:44:36, loss=0.463887936512443, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=1.47068772185198, lr=0.06113166779636373
2023-11-27 06:08:20   INFO  epoch: 9/24, acc_iter=64333, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:40, time_cost(all): 20:32:02/1 day, 7:13:01, loss=0.463780394152341, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.5978195738986611, lr=0.06109157602362315
2023-11-27 06:09:18   INFO  epoch: 9/24, acc_iter=64383, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:59, time_cost(all): 20:33:00/1 day, 6:02:51, loss=0.463672851792238, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=4.258804671470606, lr=0.06105148425088256
2023-11-27 06:10:16   INFO  epoch: 9/24, acc_iter=64433, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:15, time_cost(all): 20:33:58/1 day, 7:01:48, loss=0.463565309432135, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=4.749558475743161, lr=0.061011392478141975
2023-11-27 06:11:14   INFO  epoch: 9/24, acc_iter=64483, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:25:53, time_cost(all): 20:34:56/1 day, 6:50:50, loss=0.463457767072033, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=3.3026204027623103, lr=0.06097130070540138
2023-11-27 06:12:11   INFO  epoch: 9/24, acc_iter=64533, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:24:37, time_cost(all): 20:35:53/1 day, 7:00:40, loss=0.46335022471193, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.672874947074067, lr=0.060931208932660796
2023-11-27 06:13:09   INFO  epoch: 9/24, acc_iter=64583, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:23:59, time_cost(all): 20:36:51/1 day, 7:11:29, loss=0.463242682351827, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=2.9735010886776263, lr=0.06089111715992021
2023-11-27 06:14:07   INFO  epoch: 9/24, acc_iter=64633, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:11, time_cost(all): 20:37:49/1 day, 7:08:40, loss=0.463135139991725, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=0.6824124306961513, lr=0.06085102538717962
2023-11-27 06:15:05   INFO  epoch: 9/24, acc_iter=64683, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:31, time_cost(all): 20:38:47/1 day, 7:10:45, loss=0.463027597631622, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=3.052898369943693, lr=0.06081093361443903
2023-11-27 06:16:02   INFO  epoch: 9/24, acc_iter=64733, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:00, time_cost(all): 20:39:44/1 day, 5:17:38, loss=0.462920055271519, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.230728347253572, lr=0.060770841841698446
2023-11-27 06:17:00   INFO  epoch: 9/24, acc_iter=64783, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:46, time_cost(all): 20:40:42/1 day, 5:12:45, loss=0.462812512911417, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.2383927074272256, lr=0.06073075006895785
2023-11-27 06:17:58   INFO  epoch: 9/24, acc_iter=64833, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:48, time_cost(all): 20:41:40/1 day, 5:57:10, loss=0.462704970551314, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=2.205458310548912, lr=0.06069065829621727
2023-11-27 06:18:56   INFO  epoch: 9/24, acc_iter=64883, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:36, time_cost(all): 20:42:38/1 day, 4:36:28, loss=0.462597428191211, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.6798202789593462, lr=0.06065056652347668
2023-11-27 06:19:54   INFO  epoch: 9/24, acc_iter=64933, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:26, time_cost(all): 20:43:36/1 day, 4:47:56, loss=0.462489885831109, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=3.422268417362548, lr=0.06061047475073609
2023-11-27 06:20:51   INFO  epoch: 9/24, acc_iter=64983, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:06, time_cost(all): 20:44:33/1 day, 6:06:15, loss=0.462382343471006, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.0698634727420155, lr=0.0605703829779955
2023-11-27 06:21:49   INFO  epoch: 9/24, acc_iter=65033, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:52, time_cost(all): 20:45:31/1 day, 5:32:00, loss=0.462274801110903, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.678154858542811, lr=0.06053029120525492
2023-11-27 06:22:47   INFO  epoch: 9/24, acc_iter=65083, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:16, time_cost(all): 20:46:29/1 day, 5:41:53, loss=0.462167258750801, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=4.219755134834873, lr=0.06049019943251433
2023-11-27 06:23:45   INFO  epoch: 9/24, acc_iter=65133, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:24, time_cost(all): 20:47:27/1 day, 5:43:44, loss=0.462059716390698, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.0439076411903327, lr=0.06045010765977374
2023-11-27 06:24:42   INFO  epoch: 9/24, acc_iter=65183, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:12:52, time_cost(all): 20:48:24/1 day, 7:24:33, loss=0.461952174030595, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=1.563830956007068, lr=0.06041001588703315
2023-11-27 06:25:40   INFO  epoch: 9/24, acc_iter=65233, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:52, time_cost(all): 20:49:22/1 day, 7:23:18, loss=0.461844631670493, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=2.0643062820883777, lr=0.06036992411429256
2023-11-27 06:26:38   INFO  epoch: 9/24, acc_iter=65283, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:08, time_cost(all): 20:50:20/1 day, 7:12:25, loss=0.46173708931039, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=4.92350778975029, lr=0.060329832341551974
2023-11-27 06:27:36   INFO  epoch: 9/24, acc_iter=65333, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:51, time_cost(all): 20:51:18/1 day, 4:54:45, loss=0.461629546950287, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.2788266542177582, lr=0.06028974056881139
2023-11-27 06:28:33   INFO  epoch: 9/24, acc_iter=65383, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:09, time_cost(all): 20:52:15/1 day, 5:46:20, loss=0.461522004590185, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=0.5416874334021841, lr=0.060249648796070795
2023-11-27 06:29:31   INFO  epoch: 9/24, acc_iter=65433, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:35, time_cost(all): 20:53:13/1 day, 5:40:24, loss=0.461414462230082, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=4.6533264186772145, lr=0.06020955702333021
2023-11-27 06:30:29   INFO  epoch: 9/24, acc_iter=65483, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:21, time_cost(all): 20:54:11/1 day, 6:58:49, loss=0.461306919869979, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=1.5056113026105296, lr=0.060169465250589624
2023-11-27 06:31:27   INFO  epoch: 9/24, acc_iter=65533, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:37, time_cost(all): 20:55:09/1 day, 6:37:16, loss=0.461199377509876, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.057415002771739, lr=0.06012937347784904
2023-11-27 06:32:24   INFO  epoch: 9/24, acc_iter=65583, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:45, time_cost(all): 20:56:06/1 day, 5:51:45, loss=0.461091835149774, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=4.283826411594138, lr=0.060089281705108445
2023-11-27 06:33:22   INFO  epoch: 9/24, acc_iter=65633, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:21, time_cost(all): 20:57:04/1 day, 5:34:59, loss=0.460984292789671, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.6105900940199733, lr=0.06004918993236786
2023-11-27 06:34:20   INFO  epoch: 9/24, acc_iter=65683, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:30, time_cost(all): 20:58:02/1 day, 6:26:49, loss=0.460876750429568, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.795482670521841, lr=0.060009098159627274
2023-11-27 06:35:18   INFO  epoch: 9/24, acc_iter=65733, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:41, time_cost(all): 20:59:00/1 day, 4:53:00, loss=0.460769208069466, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=0.7153003365186985, lr=0.05996900638688669
2023-11-27 06:36:15   INFO  epoch: 9/24, acc_iter=65783, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:38, time_cost(all): 20:59:57/1 day, 6:45:46, loss=0.460661665709363, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=1.8580050891299407, lr=0.059928914614146095
2023-11-27 06:37:13   INFO  epoch: 9/24, acc_iter=65833, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 21:00:55/1 day, 5:22:18, loss=0.46055412334926, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=3.6527765140306374, lr=0.05988882284140551
2023-11-27 06:38:11   INFO  epoch: 10/24, acc_iter=65920, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:04:11, time_cost(all): 21:01:53/1 day, 6:49:48, loss=0.460366999642682, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.8077185738591015, lr=0.05981906315683688
2023-11-27 06:39:09   INFO  epoch: 10/24, acc_iter=65970, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:02:47, time_cost(all): 21:02:51/1 day, 6:34:05, loss=0.460259457282579, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=1.8552858231480875, lr=0.0597789713840963
2023-11-27 06:40:06   INFO  epoch: 10/24, acc_iter=66020, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:07:53, time_cost(all): 21:03:48/1 day, 6:31:48, loss=0.460151914922476, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=2.7291314737432617, lr=0.05973887961135571
2023-11-27 06:41:04   INFO  epoch: 10/24, acc_iter=66070, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:03:21, time_cost(all): 21:04:46/1 day, 5:38:09, loss=0.460044372562374, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=3.790212254883445, lr=0.05969878783861512
2023-11-27 06:42:02   INFO  epoch: 10/24, acc_iter=66120, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:05:12, time_cost(all): 21:05:44/1 day, 4:31:52, loss=0.459936830202271, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.541300799186651, lr=0.05965869606587453
2023-11-27 06:43:00   INFO  epoch: 10/24, acc_iter=66170, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:13, time_cost(all): 21:06:42/1 day, 4:50:03, loss=0.459829287842168, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=3.4370307010656367, lr=0.059618604293133946
2023-11-27 06:43:57   INFO  epoch: 10/24, acc_iter=66220, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:55:07, time_cost(all): 21:07:39/1 day, 5:54:13, loss=0.459721745482066, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=1.1131696508406927, lr=0.059578512520393354
2023-11-27 06:44:55   INFO  epoch: 10/24, acc_iter=66270, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:59:26, time_cost(all): 21:08:37/1 day, 6:26:40, loss=0.459614203121963, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.3634857222973182, lr=0.05953842074765277
2023-11-27 06:45:53   INFO  epoch: 10/24, acc_iter=66320, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:54:02, time_cost(all): 21:09:35/1 day, 4:36:46, loss=0.45950666076186, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=3.3255730009579128, lr=0.059498328974912175
2023-11-27 06:46:51   INFO  epoch: 10/24, acc_iter=66370, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:57:45, time_cost(all): 21:10:33/1 day, 6:56:17, loss=0.459399118401758, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.293533293157742, lr=0.05945823720217159
2023-11-27 06:47:48   INFO  epoch: 10/24, acc_iter=66420, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:58:19, time_cost(all): 21:11:30/1 day, 5:14:53, loss=0.459291576041655, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.5464309332019543, lr=0.059418145429431
2023-11-27 06:48:46   INFO  epoch: 10/24, acc_iter=66470, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:54:10, time_cost(all): 21:12:28/1 day, 4:08:27, loss=0.459184033681552, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.895079366420034, lr=0.05937805365669042
2023-11-27 06:49:44   INFO  epoch: 10/24, acc_iter=66520, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:50:55, time_cost(all): 21:13:26/1 day, 5:02:40, loss=0.45907649132145, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=3.6364375694535442, lr=0.059337961883949825
2023-11-27 06:50:42   INFO  epoch: 10/24, acc_iter=66570, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:57:43, time_cost(all): 21:14:24/1 day, 5:25:42, loss=0.458968948961347, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=2.240063818070934, lr=0.05929787011120924
2023-11-27 06:51:39   INFO  epoch: 10/24, acc_iter=66620, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:49:48, time_cost(all): 21:15:21/1 day, 4:05:57, loss=0.458861406601244, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=1.1491889512923001, lr=0.05925777833846865
2023-11-27 06:52:37   INFO  epoch: 10/24, acc_iter=66670, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:46:42, time_cost(all): 21:16:19/1 day, 5:13:17, loss=0.458753864241142, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=2.524504718712474, lr=0.05921768656572807
2023-11-27 06:53:35   INFO  epoch: 10/24, acc_iter=66720, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:45:50, time_cost(all): 21:17:17/1 day, 6:29:44, loss=0.458646321881039, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=4.505049406105001, lr=0.059177594792987474
2023-11-27 06:54:33   INFO  epoch: 10/24, acc_iter=66770, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:52:30, time_cost(all): 21:18:15/1 day, 5:20:11, loss=0.458538779520936, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.316958101970588, lr=0.05913750302024689
2023-11-27 06:55:30   INFO  epoch: 10/24, acc_iter=66820, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:47:57, time_cost(all): 21:19:12/1 day, 4:20:30, loss=0.458431237160834, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=1.260713573401236, lr=0.059097411247506296
2023-11-27 06:56:28   INFO  epoch: 10/24, acc_iter=66870, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:50:33, time_cost(all): 21:20:10/1 day, 4:11:48, loss=0.458323694800731, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.7126279762128434, lr=0.05905731947476571
2023-11-27 06:57:26   INFO  epoch: 10/24, acc_iter=66920, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:45:03, time_cost(all): 21:21:08/1 day, 5:45:45, loss=0.458216152440628, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=0.8509490371586528, lr=0.059017227702025124
2023-11-27 06:58:24   INFO  epoch: 10/24, acc_iter=66970, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:43:55, time_cost(all): 21:22:06/1 day, 4:14:29, loss=0.458108610080526, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=0.7996854445021355, lr=0.05897713592928453
2023-11-27 06:59:21   INFO  epoch: 10/24, acc_iter=67020, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:42:14, time_cost(all): 21:23:03/1 day, 4:24:44, loss=0.458001067720423, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=4.29338043128418, lr=0.058937044156543945
2023-11-27 07:00:19   INFO  epoch: 10/24, acc_iter=67070, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:47:03, time_cost(all): 21:24:01/1 day, 4:58:06, loss=0.45789352536032, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.1511252204405427, lr=0.05889695238380336
2023-11-27 07:01:17   INFO  epoch: 10/24, acc_iter=67120, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:38:44, time_cost(all): 21:24:59/1 day, 6:04:19, loss=0.457785983000217, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.5918151090900674, lr=0.058856860611062774
2023-11-27 07:02:15   INFO  epoch: 10/24, acc_iter=67170, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:38:47, time_cost(all): 21:25:57/1 day, 6:05:55, loss=0.457678440640115, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=4.216497925955203, lr=0.05881676883832218
2023-11-27 07:03:12   INFO  epoch: 10/24, acc_iter=67220, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:39:17, time_cost(all): 21:26:54/1 day, 4:12:07, loss=0.457570898280012, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=0.5625349906857366, lr=0.058776677065581595
2023-11-27 07:04:10   INFO  epoch: 10/24, acc_iter=67270, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:37:53, time_cost(all): 21:27:52/1 day, 4:46:51, loss=0.457463355919909, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=3.616047168158122, lr=0.058736585292841
2023-11-27 07:05:08   INFO  epoch: 10/24, acc_iter=67320, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:39:46, time_cost(all): 21:28:50/1 day, 6:19:44, loss=0.457355813559807, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=2.32196305979907, lr=0.058696493520100416
2023-11-27 07:06:06   INFO  epoch: 10/24, acc_iter=67370, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:39:25, time_cost(all): 21:29:48/1 day, 5:07:58, loss=0.457248271199704, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=3.400979411529687, lr=0.05865640174735983
2023-11-27 07:07:03   INFO  epoch: 10/24, acc_iter=67420, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:38:52, time_cost(all): 21:30:45/1 day, 5:55:01, loss=0.457140728839601, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=1.2523266632962253, lr=0.058616309974619245
2023-11-27 07:08:01   INFO  epoch: 10/24, acc_iter=67470, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:31:54, time_cost(all): 21:31:43/1 day, 6:10:11, loss=0.457033186479499, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=0.9044550960741414, lr=0.05857621820187865
2023-11-27 07:08:59   INFO  epoch: 10/24, acc_iter=67520, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:37:52, time_cost(all): 21:32:41/1 day, 5:12:38, loss=0.456925644119396, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=4.257556944247121, lr=0.058536126429138066
2023-11-27 07:09:57   INFO  epoch: 10/24, acc_iter=67570, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:38:34, time_cost(all): 21:33:39/1 day, 4:16:18, loss=0.456818101759293, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=3.829538394040627, lr=0.05849603465639748
2023-11-27 07:10:54   INFO  epoch: 10/24, acc_iter=67620, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:28:57, time_cost(all): 21:34:36/1 day, 6:18:50, loss=0.456710559399191, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.0532418800203942, lr=0.05845594288365689
2023-11-27 07:11:52   INFO  epoch: 10/24, acc_iter=67670, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:33:34, time_cost(all): 21:35:34/1 day, 6:28:30, loss=0.456603017039088, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.0508860452812923, lr=0.0584158511109163
2023-11-27 07:12:50   INFO  epoch: 10/24, acc_iter=67720, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:33:35, time_cost(all): 21:36:32/1 day, 4:35:03, loss=0.456495474678985, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=2.5450992286674374, lr=0.05837575933817571
2023-11-27 07:13:48   INFO  epoch: 10/24, acc_iter=67770, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:34:15, time_cost(all): 21:37:30/1 day, 3:55:43, loss=0.456387932318883, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=4.436675505609683, lr=0.05833566756543513
2023-11-27 07:14:45   INFO  epoch: 10/24, acc_iter=67820, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:33:07, time_cost(all): 21:38:27/1 day, 5:52:07, loss=0.45628038995878, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=4.545356519169433, lr=0.05829557579269454
2023-11-27 07:15:43   INFO  epoch: 10/24, acc_iter=67870, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:29:40, time_cost(all): 21:39:25/1 day, 4:13:00, loss=0.456172847598677, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=0.7043136160614873, lr=0.05825548401995395
2023-11-27 07:16:41   INFO  epoch: 10/24, acc_iter=67920, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:24:55, time_cost(all): 21:40:23/1 day, 5:49:40, loss=0.456065305238575, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.1540536520747557, lr=0.05821539224721336
2023-11-27 07:17:39   INFO  epoch: 10/24, acc_iter=67970, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:08, time_cost(all): 21:41:21/1 day, 6:17:00, loss=0.455957762878472, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.646642156529922, lr=0.05817530047447277
2023-11-27 07:18:36   INFO  epoch: 10/24, acc_iter=68020, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:29:26, time_cost(all): 21:42:18/1 day, 3:34:01, loss=0.455850220518369, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=2.9758322017340144, lr=0.05813520870173219
2023-11-27 07:19:34   INFO  epoch: 10/24, acc_iter=68070, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:23:37, time_cost(all): 21:43:16/1 day, 6:15:17, loss=0.455742678158267, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.200101877649351, lr=0.058095116928991594
2023-11-27 07:20:32   INFO  epoch: 10/24, acc_iter=68120, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:23:26, time_cost(all): 21:44:14/1 day, 4:39:18, loss=0.455635135798164, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=1.1066551968801028, lr=0.05805502515625101
2023-11-27 07:21:30   INFO  epoch: 10/24, acc_iter=68170, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:40, time_cost(all): 21:45:12/1 day, 4:11:04, loss=0.455527593438061, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.7936735676322564, lr=0.058014933383510416
2023-11-27 07:22:27   INFO  epoch: 10/24, acc_iter=68220, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:21:44, time_cost(all): 21:46:09/1 day, 4:27:11, loss=0.455420051077959, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=0.7752646827601004, lr=0.05797484161076984
2023-11-27 07:23:25   INFO  epoch: 10/24, acc_iter=68270, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:22:44, time_cost(all): 21:47:07/1 day, 3:49:59, loss=0.455312508717856, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=1.318794401867124, lr=0.057934749838029244
2023-11-27 07:24:23   INFO  epoch: 10/24, acc_iter=68320, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:20:37, time_cost(all): 21:48:05/1 day, 4:40:11, loss=0.455204966357753, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=4.809456002958243, lr=0.05789465806528866
2023-11-27 07:25:21   INFO  epoch: 10/24, acc_iter=68370, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:25, time_cost(all): 21:49:03/1 day, 4:35:10, loss=0.455097423997651, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=4.4719975324233685, lr=0.057854566292548065
2023-11-27 07:26:18   INFO  epoch: 10/24, acc_iter=68420, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:45, time_cost(all): 21:50:00/1 day, 4:20:12, loss=0.454989881637548, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=2.1196683032934738, lr=0.05781447451980748
2023-11-27 07:27:16   INFO  epoch: 10/24, acc_iter=68470, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:50, time_cost(all): 21:50:58/1 day, 4:20:21, loss=0.454882339277445, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=2.4146997393797753, lr=0.057774382747066894
2023-11-27 07:28:14   INFO  epoch: 10/24, acc_iter=68520, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:59, time_cost(all): 21:51:56/1 day, 5:15:33, loss=0.454774796917342, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=1.9384048581086972, lr=0.05773429097432631
2023-11-27 07:29:12   INFO  epoch: 10/24, acc_iter=68570, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:13:06, time_cost(all): 21:52:54/1 day, 5:50:55, loss=0.45466725455724, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.14(1.03), norm=3.2527723398147605, lr=0.057694199201585715
2023-11-27 07:30:09   INFO  epoch: 10/24, acc_iter=68620, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:10:38, time_cost(all): 21:53:51/1 day, 4:28:38, loss=0.454559712197137, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=1.1654333877268335, lr=0.05765410742884513
2023-11-27 07:31:07   INFO  epoch: 10/24, acc_iter=68670, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:35, time_cost(all): 21:54:49/1 day, 4:14:50, loss=0.454452169837034, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=3.503649223061985, lr=0.05761401565610454
2023-11-27 07:32:05   INFO  epoch: 10/24, acc_iter=68720, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:40, time_cost(all): 21:55:47/1 day, 6:09:41, loss=0.454344627476932, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=2.154861627973418, lr=0.05757392388336395
2023-11-27 07:33:03   INFO  epoch: 10/24, acc_iter=68770, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:49, time_cost(all): 21:56:45/1 day, 6:00:37, loss=0.454237085116829, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=3.890562689874942, lr=0.057533832110623365
2023-11-27 07:34:00   INFO  epoch: 10/24, acc_iter=68820, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:40, time_cost(all): 21:57:42/1 day, 5:11:33, loss=0.454129542756726, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=1.452509539081177, lr=0.05749374033788277
2023-11-27 07:34:58   INFO  epoch: 10/24, acc_iter=68870, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:12:14, time_cost(all): 21:58:40/1 day, 6:06:30, loss=0.454022000396624, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.6669436570313465, lr=0.05745364856514219
2023-11-27 07:35:56   INFO  epoch: 10/24, acc_iter=68920, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:05:49, time_cost(all): 21:59:38/1 day, 3:59:12, loss=0.453914458036521, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.910955660311574, lr=0.0574135567924016
2023-11-27 07:36:54   INFO  epoch: 10/24, acc_iter=68970, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:04:05, time_cost(all): 22:00:36/1 day, 3:26:12, loss=0.453806915676418, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=4.559027711720934, lr=0.057373465019661014
2023-11-27 07:37:51   INFO  epoch: 10/24, acc_iter=69020, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:02:59, time_cost(all): 22:01:33/1 day, 5:07:51, loss=0.453699373316316, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=1.2652267843987954, lr=0.05733337324692042
2023-11-27 07:38:49   INFO  epoch: 10/24, acc_iter=69070, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:03:01, time_cost(all): 22:02:31/1 day, 4:32:00, loss=0.453591830956213, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=4.90256540323005, lr=0.057293281474179836
2023-11-27 07:39:47   INFO  epoch: 10/24, acc_iter=69120, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:04:48, time_cost(all): 22:03:29/1 day, 4:26:52, loss=0.45348428859611, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=4.167732093559999, lr=0.05725318970143925
2023-11-27 07:40:45   INFO  epoch: 10/24, acc_iter=69170, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:03:16, time_cost(all): 22:04:27/1 day, 5:46:26, loss=0.453376746236008, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=3.2689766757553236, lr=0.05721309792869866
2023-11-27 07:41:42   INFO  epoch: 10/24, acc_iter=69220, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:27, time_cost(all): 22:05:24/1 day, 4:07:03, loss=0.453269203875905, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.105066516050511, lr=0.05717300615595807
2023-11-27 07:42:40   INFO  epoch: 10/24, acc_iter=69270, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:32, time_cost(all): 22:06:22/1 day, 3:28:42, loss=0.453161661515802, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=3.1703002768505693, lr=0.05713291438321748
2023-11-27 07:43:38   INFO  epoch: 10/24, acc_iter=69320, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:02:34, time_cost(all): 22:07:20/1 day, 5:13:07, loss=0.4530541191557, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.629510483352836, lr=0.0570928226104769
2023-11-27 07:44:36   INFO  epoch: 10/24, acc_iter=69370, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:56:54, time_cost(all): 22:08:18/1 day, 3:25:30, loss=0.452946576795597, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=3.8241637529223187, lr=0.05705273083773631
2023-11-27 07:45:33   INFO  epoch: 10/24, acc_iter=69420, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:00:39, time_cost(all): 22:09:15/1 day, 4:14:47, loss=0.452839034435494, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=2.3836609326577847, lr=0.05701263906499572
2023-11-27 07:46:31   INFO  epoch: 10/24, acc_iter=69470, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:20, time_cost(all): 22:10:13/1 day, 4:58:53, loss=0.452731492075392, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=4.363453666509294, lr=0.05697254729225513
2023-11-27 07:47:29   INFO  epoch: 10/24, acc_iter=69520, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:22, time_cost(all): 22:11:11/1 day, 3:53:54, loss=0.452623949715289, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.388336807418722, lr=0.05693245551951455
2023-11-27 07:48:27   INFO  epoch: 10/24, acc_iter=69570, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:11, time_cost(all): 22:12:09/1 day, 4:56:16, loss=0.452516407355186, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.491665272227897, lr=0.056892363746773957
2023-11-27 07:49:24   INFO  epoch: 10/24, acc_iter=69620, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:51:59, time_cost(all): 22:13:06/1 day, 3:18:18, loss=0.452408864995083, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.443843871345116, lr=0.05685227197403337
2023-11-27 07:50:22   INFO  epoch: 10/24, acc_iter=69670, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:55:52, time_cost(all): 22:14:04/1 day, 3:22:53, loss=0.452301322634981, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=4.305632363638571, lr=0.05681218020129278
2023-11-27 07:51:20   INFO  epoch: 10/24, acc_iter=69720, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:54:44, time_cost(all): 22:15:02/1 day, 4:35:37, loss=0.452193780274878, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.0379736932929835, lr=0.05677208842855219
2023-11-27 07:52:18   INFO  epoch: 10/24, acc_iter=69770, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:51:42, time_cost(all): 22:16:00/1 day, 3:34:42, loss=0.452086237914775, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=0.8448495775245034, lr=0.056731996655811606
2023-11-27 07:53:15   INFO  epoch: 10/24, acc_iter=69820, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:21, time_cost(all): 22:16:57/1 day, 5:15:20, loss=0.451978695554673, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.246315141810079, lr=0.05669190488307101
2023-11-27 07:54:13   INFO  epoch: 10/24, acc_iter=69870, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:47:59, time_cost(all): 22:17:55/1 day, 3:18:30, loss=0.45187115319457, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=2.6330220392355663, lr=0.05665181311033043
2023-11-27 07:55:11   INFO  epoch: 10/24, acc_iter=69920, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:50:37, time_cost(all): 22:18:53/1 day, 4:57:01, loss=0.451763610834467, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=2.2421158453452676, lr=0.056611721337589835
2023-11-27 07:56:09   INFO  epoch: 10/24, acc_iter=69970, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:45:46, time_cost(all): 22:19:51/1 day, 2:58:48, loss=0.451656068474365, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=1.83346671060688, lr=0.056571629564849256
2023-11-27 07:57:06   INFO  epoch: 10/24, acc_iter=70020, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:46:23, time_cost(all): 22:20:48/1 day, 4:41:38, loss=0.451548526114262, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=2.5787552354571854, lr=0.05653153779210866
2023-11-27 07:58:04   INFO  epoch: 10/24, acc_iter=70070, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:44:40, time_cost(all): 22:21:46/1 day, 2:59:12, loss=0.451440983754159, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=2.790506239129557, lr=0.05649144601936808
2023-11-27 07:59:02   INFO  epoch: 10/24, acc_iter=70120, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:43:59, time_cost(all): 22:22:44/1 day, 3:49:05, loss=0.451333441394057, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=1.4776467867572236, lr=0.056451354246627485
2023-11-27 08:00:00   INFO  epoch: 10/24, acc_iter=70170, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:45:24, time_cost(all): 22:23:42/1 day, 5:16:03, loss=0.451225899033954, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.285105403033107, lr=0.0564112624738869
2023-11-27 08:00:58   INFO  epoch: 10/24, acc_iter=70220, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:42:27, time_cost(all): 22:24:40/1 day, 3:58:25, loss=0.451118356673851, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=1.736464914064737, lr=0.05637117070114631
2023-11-27 08:01:55   INFO  epoch: 10/24, acc_iter=70270, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:35, time_cost(all): 22:25:37/1 day, 4:24:42, loss=0.451010814313749, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=2.7539308895915005, lr=0.05633107892840572
2023-11-27 08:02:53   INFO  epoch: 10/24, acc_iter=70320, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:24, time_cost(all): 22:26:35/1 day, 4:40:47, loss=0.450903271953646, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.1228446506838115, lr=0.056290987155665134
2023-11-27 08:03:51   INFO  epoch: 10/24, acc_iter=70370, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:03, time_cost(all): 22:27:33/1 day, 4:50:07, loss=0.450795729593543, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=4.002203019172381, lr=0.05625089538292455
2023-11-27 08:04:49   INFO  epoch: 10/24, acc_iter=70420, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:37:25, time_cost(all): 22:28:31/1 day, 4:35:07, loss=0.450688187233441, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.1647147185489048, lr=0.05621080361018396
2023-11-27 08:05:46   INFO  epoch: 10/24, acc_iter=70470, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:13, time_cost(all): 22:29:28/1 day, 3:42:32, loss=0.450580644873338, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=3.8832123103509213, lr=0.05617071183744337
2023-11-27 08:06:44   INFO  epoch: 10/24, acc_iter=70520, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:59, time_cost(all): 22:30:26/1 day, 3:10:32, loss=0.450473102513235, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.38164746989133, lr=0.056130620064702784
2023-11-27 08:07:42   INFO  epoch: 10/24, acc_iter=70570, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:47, time_cost(all): 22:31:24/1 day, 4:36:46, loss=0.450365560153133, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=4.6375457574583985, lr=0.05609052829196219
2023-11-27 08:08:40   INFO  epoch: 10/24, acc_iter=70620, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:17, time_cost(all): 22:32:22/1 day, 3:26:14, loss=0.45025801779303, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=1.0583025606895449, lr=0.056050436519221605
2023-11-27 08:09:37   INFO  epoch: 10/24, acc_iter=70670, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:16, time_cost(all): 22:33:19/1 day, 4:59:38, loss=0.450150475432927, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.9008384751155556, lr=0.05601034474648102
2023-11-27 08:10:35   INFO  epoch: 10/24, acc_iter=70720, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:47, time_cost(all): 22:34:17/1 day, 4:20:12, loss=0.450042933072825, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.4870546759271424, lr=0.055970252973740434
2023-11-27 08:11:33   INFO  epoch: 10/24, acc_iter=70770, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:10, time_cost(all): 22:35:15/1 day, 5:03:54, loss=0.449935390712722, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.991137821806748, lr=0.05593016120099984
2023-11-27 08:12:31   INFO  epoch: 10/24, acc_iter=70820, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:17, time_cost(all): 22:36:13/1 day, 4:59:34, loss=0.449827848352619, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.788574483593457, lr=0.055890069428259255
2023-11-27 08:13:28   INFO  epoch: 10/24, acc_iter=70870, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:27, time_cost(all): 22:37:10/1 day, 4:10:23, loss=0.449720305992517, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.737809875094687, lr=0.05584997765551867
2023-11-27 08:14:26   INFO  epoch: 10/24, acc_iter=70920, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:28:25, time_cost(all): 22:38:08/1 day, 2:54:55, loss=0.449612763632414, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=4.608581850490278, lr=0.055809885882778076
2023-11-27 08:15:24   INFO  epoch: 10/24, acc_iter=70970, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:56, time_cost(all): 22:39:06/1 day, 3:17:46, loss=0.449505221272311, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=0.7430081327047858, lr=0.05576979411003749
2023-11-27 08:16:22   INFO  epoch: 10/24, acc_iter=71020, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:31, time_cost(all): 22:40:04/1 day, 3:50:28, loss=0.449397678912208, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.748263011028823, lr=0.0557297023372969
2023-11-27 08:17:19   INFO  epoch: 10/24, acc_iter=71070, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:44, time_cost(all): 22:41:01/1 day, 3:03:38, loss=0.449290136552106, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=3.821116872955722, lr=0.05568961056455631
2023-11-27 08:18:17   INFO  epoch: 10/24, acc_iter=71120, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:24:44, time_cost(all): 22:41:59/1 day, 3:10:09, loss=0.449182594192003, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=1.9174837800641333, lr=0.055649518791815726
2023-11-27 08:19:15   INFO  epoch: 10/24, acc_iter=71170, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:23:43, time_cost(all): 22:42:57/1 day, 5:13:34, loss=0.4490750518319, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.912771874481578, lr=0.05560942701907514
2023-11-27 08:20:13   INFO  epoch: 10/24, acc_iter=71220, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:22:59, time_cost(all): 22:43:55/1 day, 4:29:18, loss=0.448967509471798, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=1.9553019426200609, lr=0.05556933524633455
2023-11-27 08:21:10   INFO  epoch: 10/24, acc_iter=71270, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:21:47, time_cost(all): 22:44:52/1 day, 5:10:12, loss=0.448859967111695, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=2.7094803330128996, lr=0.05552924347359396
2023-11-27 08:22:08   INFO  epoch: 10/24, acc_iter=71320, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:08, time_cost(all): 22:45:50/1 day, 5:03:43, loss=0.448752424751592, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=4.358694855844941, lr=0.055489151700853376
2023-11-27 08:23:06   INFO  epoch: 10/24, acc_iter=71370, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:09, time_cost(all): 22:46:48/1 day, 3:58:24, loss=0.44864488239149, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.4648746312125125, lr=0.05544905992811278
2023-11-27 08:24:04   INFO  epoch: 10/24, acc_iter=71420, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:37, time_cost(all): 22:47:46/1 day, 5:00:19, loss=0.448537340031387, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=4.040543450476052, lr=0.0554089681553722
2023-11-27 08:25:01   INFO  epoch: 10/24, acc_iter=71470, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:24, time_cost(all): 22:48:43/1 day, 3:41:40, loss=0.448429797671284, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=0.8627823737775775, lr=0.05536887638263161
2023-11-27 08:25:59   INFO  epoch: 10/24, acc_iter=71520, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:08, time_cost(all): 22:49:41/1 day, 4:00:04, loss=0.448322255311182, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=2.504252323087544, lr=0.05532878460989102
2023-11-27 08:26:57   INFO  epoch: 10/24, acc_iter=71570, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:16, time_cost(all): 22:50:39/1 day, 4:34:49, loss=0.448214712951079, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.1875674845315425, lr=0.05528869283715043
2023-11-27 08:27:55   INFO  epoch: 10/24, acc_iter=71620, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:50, time_cost(all): 22:51:37/1 day, 4:57:45, loss=0.448107170590976, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.08215872464562, lr=0.05524860106440985
2023-11-27 08:28:52   INFO  epoch: 10/24, acc_iter=71670, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:53, time_cost(all): 22:52:34/1 day, 2:29:16, loss=0.447999628230874, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=4.045429255127127, lr=0.055208509291669254
2023-11-27 08:29:50   INFO  epoch: 10/24, acc_iter=71720, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:29, time_cost(all): 22:53:32/1 day, 5:02:27, loss=0.447892085870771, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=0.9491957874760195, lr=0.05516841751892867
2023-11-27 08:30:48   INFO  epoch: 10/24, acc_iter=71770, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:16, time_cost(all): 22:54:30/1 day, 3:18:21, loss=0.447784543510668, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=0.7965982822884861, lr=0.05512832574618808
2023-11-27 08:31:46   INFO  epoch: 10/24, acc_iter=71820, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:34, time_cost(all): 22:55:28/1 day, 3:18:43, loss=0.447677001150566, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.452333001173935, lr=0.0550882339734475
2023-11-27 08:32:43   INFO  epoch: 10/24, acc_iter=71870, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:47, time_cost(all): 22:56:25/1 day, 4:04:15, loss=0.447569458790463, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=1.3399826147757257, lr=0.055048142200706904
2023-11-27 08:33:41   INFO  epoch: 10/24, acc_iter=71920, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:49, time_cost(all): 22:57:23/1 day, 4:51:08, loss=0.44746191643036, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=2.3753983128531857, lr=0.05500805042796632
2023-11-27 08:34:39   INFO  epoch: 10/24, acc_iter=71970, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:24, time_cost(all): 22:58:21/1 day, 4:54:35, loss=0.447354374070258, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=2.1220191946285434, lr=0.054967958655225725
2023-11-27 08:35:37   INFO  epoch: 10/24, acc_iter=72020, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:31, time_cost(all): 22:59:19/1 day, 2:26:50, loss=0.447246831710155, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=1.7557998558298322, lr=0.05492786688248514
2023-11-27 08:36:34   INFO  epoch: 10/24, acc_iter=72070, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:09, time_cost(all): 23:00:16/1 day, 4:55:40, loss=0.447139289350052, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.976112961858315, lr=0.05488777510974455
2023-11-27 08:37:32   INFO  epoch: 10/24, acc_iter=72120, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:48, time_cost(all): 23:01:14/1 day, 3:14:12, loss=0.44703174698995, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=0.548365576956795, lr=0.05484768333700397
2023-11-27 08:38:30   INFO  epoch: 10/24, acc_iter=72170, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:38, time_cost(all): 23:02:12/1 day, 2:23:49, loss=0.446924204629847, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=0.9766527388838099, lr=0.05480759156426338
2023-11-27 08:39:28   INFO  epoch: 10/24, acc_iter=72220, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:21, time_cost(all): 23:03:10/1 day, 2:26:02, loss=0.446816662269744, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=0.5744847877959984, lr=0.05476749979152279
2023-11-27 08:40:25   INFO  epoch: 10/24, acc_iter=72270, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:37, time_cost(all): 23:04:07/1 day, 3:40:01, loss=0.446709119909642, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=4.484590391914857, lr=0.0547274080187822
2023-11-27 08:41:23   INFO  epoch: 10/24, acc_iter=72320, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:37, time_cost(all): 23:05:05/1 day, 4:51:17, loss=0.446601577549539, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=3.3139125014644355, lr=0.05468731624604161
2023-11-27 08:42:21   INFO  epoch: 10/24, acc_iter=72370, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:43, time_cost(all): 23:06:03/1 day, 2:56:07, loss=0.446494035189436, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.1369648454584294, lr=0.054647224473301025
2023-11-27 08:43:19   INFO  epoch: 10/24, acc_iter=72420, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 23:07:01/1 day, 3:38:17, loss=0.446386492829334, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.8149281865773927, lr=0.05460713270056043
2023-11-27 08:44:16   INFO  epoch: 11/24, acc_iter=72507, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:00:21, time_cost(all): 23:07:58/1 day, 3:51:59, loss=0.446199369122755, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.8743117467194326, lr=0.05453737301599181
2023-11-27 08:45:14   INFO  epoch: 11/24, acc_iter=72557, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:09:44, time_cost(all): 23:08:56/1 day, 3:46:57, loss=0.446091826762652, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=2.47272618065461, lr=0.05449728124325122
2023-11-27 08:46:12   INFO  epoch: 11/24, acc_iter=72607, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:05:44, time_cost(all): 23:09:54/1 day, 2:51:27, loss=0.445984284402549, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=4.269332016178945, lr=0.05445718947051064
2023-11-27 08:47:10   INFO  epoch: 11/24, acc_iter=72657, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:06:59, time_cost(all): 23:10:52/1 day, 3:35:59, loss=0.445876742042447, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=2.364075514692561, lr=0.054417097697770055
2023-11-27 08:48:07   INFO  epoch: 11/24, acc_iter=72707, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:58:55, time_cost(all): 23:11:49/1 day, 3:06:37, loss=0.445769199682344, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.656717645562966, lr=0.05437700592502946
2023-11-27 08:49:05   INFO  epoch: 11/24, acc_iter=72757, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:16, time_cost(all): 23:12:47/1 day, 4:30:32, loss=0.445661657322241, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=3.7144146452006295, lr=0.054336914152288876
2023-11-27 08:50:03   INFO  epoch: 11/24, acc_iter=72807, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:53, time_cost(all): 23:13:45/1 day, 4:10:16, loss=0.445554114962139, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=2.7579474442158896, lr=0.05429682237954828
2023-11-27 08:51:01   INFO  epoch: 11/24, acc_iter=72857, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:56:35, time_cost(all): 23:14:43/1 day, 2:33:05, loss=0.445446572602036, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=0.8319375284952455, lr=0.0542567306068077
2023-11-27 08:51:58   INFO  epoch: 11/24, acc_iter=72907, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:56:41, time_cost(all): 23:15:40/1 day, 2:16:08, loss=0.445339030241933, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.9927201874049136, lr=0.054216638834067105
2023-11-27 08:52:56   INFO  epoch: 11/24, acc_iter=72957, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:54:57, time_cost(all): 23:16:38/1 day, 2:06:19, loss=0.445231487881831, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=3.745465100365959, lr=0.05417654706132652
2023-11-27 08:53:54   INFO  epoch: 11/24, acc_iter=73007, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/2:00:19, time_cost(all): 23:17:36/1 day, 3:04:52, loss=0.445123945521728, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=2.3316962619201327, lr=0.05413645528858594
2023-11-27 08:54:52   INFO  epoch: 11/24, acc_iter=73057, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:54:26, time_cost(all): 23:18:34/1 day, 4:42:59, loss=0.445016403161625, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=3.1533242086904636, lr=0.05409636351584535
2023-11-27 08:55:49   INFO  epoch: 11/24, acc_iter=73107, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:55:07, time_cost(all): 23:19:31/1 day, 3:32:41, loss=0.444908860801523, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.7760737604020793, lr=0.05405627174310476
2023-11-27 08:56:47   INFO  epoch: 11/24, acc_iter=73157, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:55:10, time_cost(all): 23:20:29/1 day, 2:27:22, loss=0.44480131844142, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=0.9684594850183121, lr=0.05401617997036417
2023-11-27 08:57:45   INFO  epoch: 11/24, acc_iter=73207, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:49:44, time_cost(all): 23:21:27/1 day, 4:19:09, loss=0.444693776081317, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.5419321921121716, lr=0.05397608819762358
2023-11-27 08:58:43   INFO  epoch: 11/24, acc_iter=73257, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:54:00, time_cost(all): 23:22:25/1 day, 3:20:58, loss=0.444586233721215, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.480501906922741, lr=0.05393599642488299
2023-11-27 08:59:40   INFO  epoch: 11/24, acc_iter=73307, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:53:15, time_cost(all): 23:23:22/1 day, 2:48:18, loss=0.444478691361112, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.8278813469661537, lr=0.053895904652142404
2023-11-27 09:00:38   INFO  epoch: 11/24, acc_iter=73357, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:54:25, time_cost(all): 23:24:20/1 day, 4:01:58, loss=0.444371149001009, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.6063370028564816, lr=0.05385581287940181
2023-11-27 09:01:36   INFO  epoch: 11/24, acc_iter=73407, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:48:55, time_cost(all): 23:25:18/1 day, 3:56:00, loss=0.444263606640907, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=3.2069897771916795, lr=0.053815721106661225
2023-11-27 09:02:34   INFO  epoch: 11/24, acc_iter=73457, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:42:46, time_cost(all): 23:26:16/1 day, 2:58:09, loss=0.444156064280804, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.7124867398477925, lr=0.053775629333920646
2023-11-27 09:03:31   INFO  epoch: 11/24, acc_iter=73507, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:42:10, time_cost(all): 23:27:13/1 day, 3:45:18, loss=0.444048521920701, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=0.8931509970255478, lr=0.053735537561180054
2023-11-27 09:04:29   INFO  epoch: 11/24, acc_iter=73557, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:45:58, time_cost(all): 23:28:11/1 day, 2:00:55, loss=0.443940979560599, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=0.6455720173423346, lr=0.05369544578843947
2023-11-27 09:05:27   INFO  epoch: 11/24, acc_iter=73607, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:49:03, time_cost(all): 23:29:09/1 day, 2:22:55, loss=0.443833437200496, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.15(1.03), norm=1.987783954078741, lr=0.053655354015698875
2023-11-27 09:06:25   INFO  epoch: 11/24, acc_iter=73657, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:42:11, time_cost(all): 23:30:07/1 day, 3:18:05, loss=0.443725894840393, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=0.5080174207005406, lr=0.05361526224295829
2023-11-27 09:07:22   INFO  epoch: 11/24, acc_iter=73707, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:37:39, time_cost(all): 23:31:04/1 day, 2:37:55, loss=0.443618352480291, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=3.293429473920519, lr=0.053575170470217696
2023-11-27 09:08:20   INFO  epoch: 11/24, acc_iter=73757, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:46:00, time_cost(all): 23:32:02/1 day, 4:30:41, loss=0.443510810120188, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.9525196440494623, lr=0.05353507869747711
2023-11-27 09:09:18   INFO  epoch: 11/24, acc_iter=73807, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:44:13, time_cost(all): 23:33:00/1 day, 3:25:53, loss=0.443403267760085, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=4.0199482188792715, lr=0.05349498692473652
2023-11-27 09:10:16   INFO  epoch: 11/24, acc_iter=73857, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:42:45, time_cost(all): 23:33:58/1 day, 3:33:48, loss=0.443295725399983, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=3.410013345052107, lr=0.05345489515199593
2023-11-27 09:11:13   INFO  epoch: 11/24, acc_iter=73907, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:40:55, time_cost(all): 23:34:55/1 day, 3:48:46, loss=0.44318818303988, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.4324180927483114, lr=0.05341480337925535
2023-11-27 09:12:11   INFO  epoch: 11/24, acc_iter=73957, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:42:08, time_cost(all): 23:35:53/1 day, 3:23:27, loss=0.443080640679777, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=2.683341283148462, lr=0.05337471160651476
2023-11-27 09:13:09   INFO  epoch: 11/24, acc_iter=74007, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:38:47, time_cost(all): 23:36:51/1 day, 2:25:53, loss=0.442973098319674, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=2.3493106385386113, lr=0.053334619833774174
2023-11-27 09:14:07   INFO  epoch: 11/24, acc_iter=74057, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:31:13, time_cost(all): 23:37:49/1 day, 3:39:35, loss=0.442865555959572, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=3.3572993255564088, lr=0.05329452806103358
2023-11-27 09:15:04   INFO  epoch: 11/24, acc_iter=74107, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:34:59, time_cost(all): 23:38:46/1 day, 3:52:14, loss=0.442758013599469, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=0.7447337700508665, lr=0.053254436288292996
2023-11-27 09:16:02   INFO  epoch: 11/24, acc_iter=74157, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:37:17, time_cost(all): 23:39:44/1 day, 3:58:01, loss=0.442650471239366, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=3.195397172166797, lr=0.0532143445155524
2023-11-27 09:17:00   INFO  epoch: 11/24, acc_iter=74207, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:28:56, time_cost(all): 23:40:42/1 day, 3:49:27, loss=0.442542928879264, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.297572117596833, lr=0.05317425274281182
2023-11-27 09:17:58   INFO  epoch: 11/24, acc_iter=74257, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:28:43, time_cost(all): 23:41:40/1 day, 2:12:28, loss=0.442435386519161, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=3.3681886200721647, lr=0.05313416097007123
2023-11-27 09:18:55   INFO  epoch: 11/24, acc_iter=74307, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:27:22, time_cost(all): 23:42:37/1 day, 3:37:51, loss=0.442327844159058, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.8844712712150864, lr=0.05309406919733064
2023-11-27 09:19:53   INFO  epoch: 11/24, acc_iter=74357, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:33:10, time_cost(all): 23:43:35/1 day, 2:11:47, loss=0.442220301798956, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=0.5493713198740138, lr=0.05305397742459006
2023-11-27 09:20:51   INFO  epoch: 11/24, acc_iter=74407, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:02, time_cost(all): 23:44:33/1 day, 1:38:30, loss=0.442112759438853, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=4.392868600015048, lr=0.053013885651849474
2023-11-27 09:21:49   INFO  epoch: 11/24, acc_iter=74457, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:25:53, time_cost(all): 23:45:31/1 day, 3:36:54, loss=0.44200521707875, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=4.180333920323489, lr=0.05297379387910888
2023-11-27 09:22:46   INFO  epoch: 11/24, acc_iter=74507, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:30:10, time_cost(all): 23:46:28/1 day, 2:11:01, loss=0.441897674718648, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=3.218486795711959, lr=0.052933702106368295
2023-11-27 09:23:44   INFO  epoch: 11/24, acc_iter=74557, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:51, time_cost(all): 23:47:26/1 day, 1:57:12, loss=0.441790132358545, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=2.122742731077331, lr=0.0528936103336277
2023-11-27 09:24:42   INFO  epoch: 11/24, acc_iter=74607, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:23:27, time_cost(all): 23:48:24/1 day, 2:38:23, loss=0.441682589998442, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=3.0332314104698215, lr=0.05285351856088712
2023-11-27 09:25:40   INFO  epoch: 11/24, acc_iter=74657, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:25:55, time_cost(all): 23:49:22/1 day, 3:51:28, loss=0.44157504763834, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.138944597800039, lr=0.052813426788146524
2023-11-27 09:26:37   INFO  epoch: 11/24, acc_iter=74707, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:26:40, time_cost(all): 23:50:19/1 day, 4:05:43, loss=0.441467505278237, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.047864540185137, lr=0.05277333501540594
2023-11-27 09:27:35   INFO  epoch: 11/24, acc_iter=74757, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:46, time_cost(all): 23:51:17/1 day, 1:38:06, loss=0.441359962918134, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=2.837136015513857, lr=0.052733243242665345
2023-11-27 09:28:33   INFO  epoch: 11/24, acc_iter=74807, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:18:50, time_cost(all): 23:52:15/1 day, 2:10:15, loss=0.441252420558032, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=3.9954700572301545, lr=0.052693151469924766
2023-11-27 09:29:31   INFO  epoch: 11/24, acc_iter=74857, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:24:34, time_cost(all): 23:53:13/1 day, 3:50:52, loss=0.441144878197929, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=4.287991998630182, lr=0.05265305969718418
2023-11-27 09:30:28   INFO  epoch: 11/24, acc_iter=74907, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:20:04, time_cost(all): 23:54:10/1 day, 3:08:12, loss=0.441037335837826, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=4.520753509795171, lr=0.05261296792444359
2023-11-27 09:31:26   INFO  epoch: 11/24, acc_iter=74957, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:18, time_cost(all): 23:55:08/1 day, 3:27:53, loss=0.440929793477724, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=4.904344544373417, lr=0.052572876151703
2023-11-27 09:32:24   INFO  epoch: 11/24, acc_iter=75007, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:14:38, time_cost(all): 23:56:06/1 day, 2:25:56, loss=0.440822251117621, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.181344334289552, lr=0.05253278437896241
2023-11-27 09:33:22   INFO  epoch: 11/24, acc_iter=75057, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:17:13, time_cost(all): 23:57:04/1 day, 3:40:01, loss=0.440714708757518, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=2.5143640938071306, lr=0.05249269260622182
2023-11-27 09:34:19   INFO  epoch: 11/24, acc_iter=75107, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:19:06, time_cost(all): 23:58:01/1 day, 3:17:33, loss=0.440607166397416, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=2.0798831381192717, lr=0.05245260083348123
2023-11-27 09:35:17   INFO  epoch: 11/24, acc_iter=75157, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:12:05, time_cost(all): 23:58:59/1 day, 2:32:11, loss=0.440499624037313, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=1.1852803660324325, lr=0.052412509060740645
2023-11-27 09:36:15   INFO  epoch: 11/24, acc_iter=75207, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:13:31, time_cost(all): 23:59:57/1 day, 3:52:14, loss=0.44039208167721, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=2.8462347706334072, lr=0.05237241728800005
2023-11-27 09:37:13   INFO  epoch: 11/24, acc_iter=75257, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:11:19, time_cost(all): 1 day, 0:00:55/1 day, 1:42:27, loss=0.440284539317107, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.5292455620628287, lr=0.05233232551525947
2023-11-27 09:38:10   INFO  epoch: 11/24, acc_iter=75307, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:30, time_cost(all): 1 day, 0:01:52/1 day, 1:23:12, loss=0.440176996957005, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=3.175846468184935, lr=0.05229223374251889
2023-11-27 09:39:08   INFO  epoch: 11/24, acc_iter=75357, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:19, time_cost(all): 1 day, 0:02:50/1 day, 3:03:24, loss=0.440069454596902, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=3.4599430084556224, lr=0.052252141969778294
2023-11-27 09:40:06   INFO  epoch: 11/24, acc_iter=75407, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:13:26, time_cost(all): 1 day, 0:03:48/1 day, 3:56:55, loss=0.439961912236799, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=1.6736857276454107, lr=0.05221205019703771
2023-11-27 09:41:04   INFO  epoch: 11/24, acc_iter=75457, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:09:02, time_cost(all): 1 day, 0:04:46/1 day, 1:49:18, loss=0.439854369876697, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.67186239771345, lr=0.052171958424297116
2023-11-27 09:42:01   INFO  epoch: 11/24, acc_iter=75507, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:05:01, time_cost(all): 1 day, 0:05:43/1 day, 3:12:32, loss=0.439746827516594, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=0.8942746065437904, lr=0.05213186665155653
2023-11-27 09:42:59   INFO  epoch: 11/24, acc_iter=75557, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:07:24, time_cost(all): 1 day, 0:06:41/1 day, 1:48:06, loss=0.439639285156491, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=4.4712507792947465, lr=0.05209177487881594
2023-11-27 09:43:57   INFO  epoch: 11/24, acc_iter=75607, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:00, time_cost(all): 1 day, 0:07:39/1 day, 1:58:31, loss=0.439531742796389, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=0.5235924658746236, lr=0.05205168310607535
2023-11-27 09:44:55   INFO  epoch: 11/24, acc_iter=75657, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:52, time_cost(all): 1 day, 0:08:37/1 day, 3:27:32, loss=0.439424200436286, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=1.9374512421491126, lr=0.05201159133333476
2023-11-27 09:45:53   INFO  epoch: 11/24, acc_iter=75707, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:06:50, time_cost(all): 1 day, 0:09:35/1 day, 1:34:16, loss=0.439316658076183, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=3.4830548390694336, lr=0.05197149956059418
2023-11-27 09:46:50   INFO  epoch: 11/24, acc_iter=75757, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:02:26, time_cost(all): 1 day, 0:10:32/1 day, 1:16:57, loss=0.439209115716081, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=3.388064994601989, lr=0.051931407787853594
2023-11-27 09:47:48   INFO  epoch: 11/24, acc_iter=75807, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:08, time_cost(all): 1 day, 0:11:30/1 day, 2:46:06, loss=0.439101573355978, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=2.8712971279319293, lr=0.051891316015113
2023-11-27 09:48:46   INFO  epoch: 11/24, acc_iter=75857, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:03:16, time_cost(all): 1 day, 0:12:28/1 day, 1:50:22, loss=0.438994030995875, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=4.795486790468843, lr=0.051851224242372415
2023-11-27 09:49:44   INFO  epoch: 11/24, acc_iter=75907, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:02:51, time_cost(all): 1 day, 0:13:26/1 day, 1:55:32, loss=0.438886488635773, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=2.1673815004869232, lr=0.05181113246963182
2023-11-27 09:50:41   INFO  epoch: 11/24, acc_iter=75957, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:00:54, time_cost(all): 1 day, 0:14:23/1 day, 2:36:40, loss=0.43877894627567, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.9457928243268972, lr=0.051771040696891236
2023-11-27 09:51:39   INFO  epoch: 11/24, acc_iter=76007, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:56:13, time_cost(all): 1 day, 0:15:21/1 day, 1:15:29, loss=0.438671403915567, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=4.013474620323108, lr=0.051730948924150644
2023-11-27 09:52:37   INFO  epoch: 11/24, acc_iter=76057, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:47, time_cost(all): 1 day, 0:16:19/1 day, 2:43:23, loss=0.438563861555465, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=4.125462948173263, lr=0.05169085715141006
2023-11-27 09:53:35   INFO  epoch: 11/24, acc_iter=76107, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:54:18, time_cost(all): 1 day, 0:17:17/1 day, 2:20:00, loss=0.438456319195362, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=4.746550279113773, lr=0.05165076537866947
2023-11-27 09:54:32   INFO  epoch: 11/24, acc_iter=76157, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:36, time_cost(all): 1 day, 0:18:14/1 day, 1:24:53, loss=0.438348776835259, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=2.7401355443808444, lr=0.051610673605928886
2023-11-27 09:55:30   INFO  epoch: 11/24, acc_iter=76207, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:55:52, time_cost(all): 1 day, 0:19:12/1 day, 1:50:56, loss=0.438241234475157, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.9288177546679433, lr=0.0515705818331883
2023-11-27 09:56:28   INFO  epoch: 11/24, acc_iter=76257, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:54:16, time_cost(all): 1 day, 0:20:10/1 day, 2:47:14, loss=0.438133692115054, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=4.305485423774856, lr=0.051530490060447715
2023-11-27 09:57:26   INFO  epoch: 11/24, acc_iter=76307, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:20, time_cost(all): 1 day, 0:21:08/1 day, 1:52:01, loss=0.438026149754951, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=2.5499564916416215, lr=0.05149039828770712
2023-11-27 09:58:23   INFO  epoch: 11/24, acc_iter=76357, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:59, time_cost(all): 1 day, 0:22:05/1 day, 1:32:50, loss=0.437918607394849, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=1.7470466741448978, lr=0.051450306514966536
2023-11-27 09:59:21   INFO  epoch: 11/24, acc_iter=76407, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:48:33, time_cost(all): 1 day, 0:23:03/1 day, 3:24:13, loss=0.437811065034746, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=1.224793546262736, lr=0.05141021474222594
2023-11-27 10:00:19   INFO  epoch: 11/24, acc_iter=76457, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:49:41, time_cost(all): 1 day, 0:24:01/1 day, 2:44:14, loss=0.437703522674643, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.043056101658509, lr=0.05137012296948536
2023-11-27 10:01:17   INFO  epoch: 11/24, acc_iter=76507, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:55, time_cost(all): 1 day, 0:24:59/1 day, 2:16:20, loss=0.437595980314541, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=0.7721454849853272, lr=0.051330031196744764
2023-11-27 10:02:14   INFO  epoch: 11/24, acc_iter=76557, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:45:31, time_cost(all): 1 day, 0:25:56/1 day, 1:16:27, loss=0.437488437954438, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=4.151937225246245, lr=0.05128993942400418
2023-11-27 10:03:12   INFO  epoch: 11/24, acc_iter=76607, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:27, time_cost(all): 1 day, 0:26:54/1 day, 2:59:09, loss=0.437380895594335, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=1.1608214309749028, lr=0.0512498476512636
2023-11-27 10:04:10   INFO  epoch: 11/24, acc_iter=76657, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:47:42, time_cost(all): 1 day, 0:27:52/1 day, 1:48:47, loss=0.437273353234232, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.976344070764774, lr=0.05120975587852301
2023-11-27 10:05:08   INFO  epoch: 11/24, acc_iter=76707, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:28, time_cost(all): 1 day, 0:28:50/1 day, 2:07:38, loss=0.43716581087413, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=1.7247125080605774, lr=0.05116966410578242
2023-11-27 10:06:05   INFO  epoch: 11/24, acc_iter=76757, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:45:33, time_cost(all): 1 day, 0:29:47/1 day, 2:08:38, loss=0.437058268514027, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=0.7222478979778568, lr=0.05112957233304183
2023-11-27 10:07:03   INFO  epoch: 11/24, acc_iter=76807, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:43:02, time_cost(all): 1 day, 0:30:45/1 day, 1:18:34, loss=0.436950726153924, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.746759247346787, lr=0.05108948056030124
2023-11-27 10:08:01   INFO  epoch: 11/24, acc_iter=76857, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:27, time_cost(all): 1 day, 0:31:43/1 day, 2:36:37, loss=0.436843183793822, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=4.473449204294354, lr=0.05104938878756065
2023-11-27 10:08:59   INFO  epoch: 11/24, acc_iter=76907, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:09, time_cost(all): 1 day, 0:32:41/1 day, 2:55:17, loss=0.436735641433719, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.5751526667346165, lr=0.051009297014820064
2023-11-27 10:09:56   INFO  epoch: 11/24, acc_iter=76957, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:39:04, time_cost(all): 1 day, 0:33:38/1 day, 2:57:09, loss=0.436628099073616, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.1799845110895, lr=0.05096920524207947
2023-11-27 10:10:54   INFO  epoch: 11/24, acc_iter=77007, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:28, time_cost(all): 1 day, 0:34:36/1 day, 1:22:55, loss=0.436520556713514, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.152011593014761, lr=0.050929113469338885
2023-11-27 10:11:52   INFO  epoch: 11/24, acc_iter=77057, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:17, time_cost(all): 1 day, 0:35:34/1 day, 3:03:41, loss=0.436413014353411, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=4.261407261062299, lr=0.050889021696598306
2023-11-27 10:12:50   INFO  epoch: 11/24, acc_iter=77107, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:13, time_cost(all): 1 day, 0:36:32/1 day, 1:19:03, loss=0.436305471993308, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.3129926509700334, lr=0.050848929923857714
2023-11-27 10:13:47   INFO  epoch: 11/24, acc_iter=77157, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:19, time_cost(all): 1 day, 0:37:29/1 day, 2:17:30, loss=0.436197929633206, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.824244178146267, lr=0.05080883815111713
2023-11-27 10:14:45   INFO  epoch: 11/24, acc_iter=77207, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:34, time_cost(all): 1 day, 0:38:27/1 day, 1:09:26, loss=0.436090387273103, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=2.215187148471993, lr=0.050768746378376535
2023-11-27 10:15:43   INFO  epoch: 11/24, acc_iter=77257, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:06, time_cost(all): 1 day, 0:39:25/1 day, 1:32:10, loss=0.435982844913, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=3.0282102845466814, lr=0.05072865460563595
2023-11-27 10:16:41   INFO  epoch: 11/24, acc_iter=77307, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:13, time_cost(all): 1 day, 0:40:23/1 day, 1:25:29, loss=0.435875302552898, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=1.6527827107293132, lr=0.050688562832895356
2023-11-27 10:17:38   INFO  epoch: 11/24, acc_iter=77357, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:06, time_cost(all): 1 day, 0:41:20/1 day, 2:18:26, loss=0.435767760192795, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.7567892797292441, lr=0.05064847106015477
2023-11-27 10:18:36   INFO  epoch: 11/24, acc_iter=77407, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:55, time_cost(all): 1 day, 0:42:18/1 day, 2:28:24, loss=0.435660217832692, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=4.093021428075945, lr=0.05060837928741418
2023-11-27 10:19:34   INFO  epoch: 11/24, acc_iter=77457, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:43, time_cost(all): 1 day, 0:43:16/1 day, 2:03:57, loss=0.43555267547259, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.2104449897136416, lr=0.05056828751467359
2023-11-27 10:20:32   INFO  epoch: 11/24, acc_iter=77507, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:02, time_cost(all): 1 day, 0:44:14/1 day, 2:47:07, loss=0.435445133112487, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.409309323980643, lr=0.05052819574193301
2023-11-27 10:21:29   INFO  epoch: 11/24, acc_iter=77557, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:01, time_cost(all): 1 day, 0:45:11/1 day, 0:47:16, loss=0.435337590752384, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=3.311248212679016, lr=0.05048810396919242
2023-11-27 10:22:27   INFO  epoch: 11/24, acc_iter=77607, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:40, time_cost(all): 1 day, 0:46:09/1 day, 2:40:27, loss=0.435230048392282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.8281757132090997, lr=0.050448012196451834
2023-11-27 10:23:25   INFO  epoch: 11/24, acc_iter=77657, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:12, time_cost(all): 1 day, 0:47:07/1 day, 1:26:16, loss=0.435122506032179, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=4.907075200386688, lr=0.05040792042371124
2023-11-27 10:24:23   INFO  epoch: 11/24, acc_iter=77707, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:41, time_cost(all): 1 day, 0:48:05/1 day, 1:54:45, loss=0.435014963672076, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.4679108876461013, lr=0.050367828650970656
2023-11-27 10:25:20   INFO  epoch: 11/24, acc_iter=77757, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:55, time_cost(all): 1 day, 0:49:02/1 day, 2:10:34, loss=0.434907421311974, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=0.730954177444503, lr=0.05032773687823006
2023-11-27 10:26:18   INFO  epoch: 11/24, acc_iter=77807, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:36, time_cost(all): 1 day, 0:50:00/1 day, 1:20:55, loss=0.434799878951871, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=0.7302168971402088, lr=0.05028764510548948
2023-11-27 10:27:16   INFO  epoch: 11/24, acc_iter=77857, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:21, time_cost(all): 1 day, 0:50:58/1 day, 2:51:16, loss=0.434692336591768, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=1.168903327820256, lr=0.050247553332748884
2023-11-27 10:28:14   INFO  epoch: 11/24, acc_iter=77907, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:46, time_cost(all): 1 day, 0:51:56/1 day, 1:18:54, loss=0.434584794231666, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.7109836168215855, lr=0.0502074615600083
2023-11-27 10:29:11   INFO  epoch: 11/24, acc_iter=77957, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:02, time_cost(all): 1 day, 0:52:53/1 day, 1:29:08, loss=0.434477251871563, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.402170538266634, lr=0.05016736978726772
2023-11-27 10:30:09   INFO  epoch: 11/24, acc_iter=78007, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:35, time_cost(all): 1 day, 0:53:51/1 day, 2:22:15, loss=0.43436970951146, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=1.007352250504725, lr=0.05012727801452713
2023-11-27 10:31:07   INFO  epoch: 11/24, acc_iter=78057, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:54, time_cost(all): 1 day, 0:54:49/1 day, 2:28:06, loss=0.434262167151357, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.4088914375805033, lr=0.05008718624178654
2023-11-27 10:32:05   INFO  epoch: 11/24, acc_iter=78107, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:22, time_cost(all): 1 day, 0:55:47/1 day, 2:27:34, loss=0.434154624791255, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.976114234278167, lr=0.05004709446904595
2023-11-27 10:33:02   INFO  epoch: 11/24, acc_iter=78157, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:47, time_cost(all): 1 day, 0:56:44/1 day, 1:14:06, loss=0.434047082431152, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.772279804573958, lr=0.05000700269630536
2023-11-27 10:34:00   INFO  epoch: 11/24, acc_iter=78207, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:24, time_cost(all): 1 day, 0:57:42/1 day, 1:29:46, loss=0.433939540071049, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=3.3001626782849467, lr=0.04996691092356478
2023-11-27 10:34:58   INFO  epoch: 11/24, acc_iter=78257, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:31, time_cost(all): 1 day, 0:58:40/1 day, 0:37:44, loss=0.433831997710947, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.773318138586265, lr=0.049926819150824184
2023-11-27 10:35:56   INFO  epoch: 11/24, acc_iter=78307, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:31, time_cost(all): 1 day, 0:59:38/1 day, 1:54:31, loss=0.433724455350844, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=3.7957615967565106, lr=0.0498867273780836
2023-11-27 10:36:53   INFO  epoch: 11/24, acc_iter=78357, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:34, time_cost(all): 1 day, 1:00:35/1 day, 0:43:07, loss=0.433616912990741, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=4.6644356698713905, lr=0.049846635605343005
2023-11-27 10:37:51   INFO  epoch: 11/24, acc_iter=78407, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:34, time_cost(all): 1 day, 1:01:33/1 day, 1:13:15, loss=0.433509370630639, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=4.2673868948656395, lr=0.049806543832602426
2023-11-27 10:38:49   INFO  epoch: 11/24, acc_iter=78457, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:20, time_cost(all): 1 day, 1:02:31/1 day, 0:24:40, loss=0.433401828270536, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=3.8401945605448655, lr=0.04976645205986184
2023-11-27 10:39:47   INFO  epoch: 11/24, acc_iter=78507, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:59, time_cost(all): 1 day, 1:03:29/1 day, 0:54:33, loss=0.433294285910433, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=3.2718503682581543, lr=0.04972636028712125
2023-11-27 10:40:44   INFO  epoch: 11/24, acc_iter=78557, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:08, time_cost(all): 1 day, 1:04:26/1 day, 1:17:54, loss=0.433186743550331, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=1.0123448420942007, lr=0.04968626851438066
2023-11-27 10:41:42   INFO  epoch: 11/24, acc_iter=78607, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:26, time_cost(all): 1 day, 1:05:24/1 day, 1:39:29, loss=0.433079201190228, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.6121650999839465, lr=0.04964617674164007
2023-11-27 10:42:40   INFO  epoch: 11/24, acc_iter=78657, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:26, time_cost(all): 1 day, 1:06:22/1 day, 0:52:46, loss=0.432971658830125, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.0477455839910184, lr=0.04960608496889948
2023-11-27 10:43:38   INFO  epoch: 11/24, acc_iter=78707, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:17, time_cost(all): 1 day, 1:07:20/1 day, 1:34:07, loss=0.432864116470023, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=2.312477314867287, lr=0.04956599319615889
2023-11-27 10:44:35   INFO  epoch: 11/24, acc_iter=78757, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:18, time_cost(all): 1 day, 1:08:17/1 day, 2:02:18, loss=0.43275657410992, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.302568477399915, lr=0.049525901423418305
2023-11-27 10:45:33   INFO  epoch: 11/24, acc_iter=78807, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:27, time_cost(all): 1 day, 1:09:15/1 day, 1:50:56, loss=0.432649031749817, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=0.8778872842259424, lr=0.04948580965067771
2023-11-27 10:46:31   INFO  epoch: 11/24, acc_iter=78857, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:42, time_cost(all): 1 day, 1:10:13/1 day, 2:49:25, loss=0.432541489389715, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=0.5611068429638006, lr=0.04944571787793713
2023-11-27 10:47:29   INFO  epoch: 11/24, acc_iter=78907, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:31, time_cost(all): 1 day, 1:11:11/1 day, 1:28:02, loss=0.432433947029612, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.8060991584972417, lr=0.04940562610519655
2023-11-27 10:48:26   INFO  epoch: 11/24, acc_iter=78957, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:41, time_cost(all): 1 day, 1:12:08/1 day, 0:14:47, loss=0.432326404669509, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=1.9308945311556163, lr=0.049365534332455954
2023-11-27 10:49:24   INFO  epoch: 11/24, acc_iter=79007, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 1 day, 1:13:06/1 day, 0:49:44, loss=0.432218862309407, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.3692021137464105, lr=0.04932544255971537
2023-11-27 10:50:22   INFO  epoch: 12/24, acc_iter=79094, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:01:58, time_cost(all): 1 day, 1:14:04/1 day, 0:31:23, loss=0.432031738602828, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=1.1673765225436192, lr=0.04925568287514674
2023-11-27 10:51:20   INFO  epoch: 12/24, acc_iter=79144, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:03:10, time_cost(all): 1 day, 1:15:02/1 day, 1:48:29, loss=0.431924196242725, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=1.9515033132059598, lr=0.049215591102406156
2023-11-27 10:52:17   INFO  epoch: 12/24, acc_iter=79194, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:00:26, time_cost(all): 1 day, 1:15:59/1 day, 0:49:56, loss=0.431816653882623, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=4.947358842812745, lr=0.04917549932966556
2023-11-27 10:53:15   INFO  epoch: 12/24, acc_iter=79244, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:02:37, time_cost(all): 1 day, 1:16:57/1 day, 2:04:16, loss=0.43170911152252, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.951968751007174, lr=0.04913540755692498
2023-11-27 10:54:13   INFO  epoch: 12/24, acc_iter=79294, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:03:37, time_cost(all): 1 day, 1:17:55/1 day, 2:08:12, loss=0.431601569162417, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=2.108720816379536, lr=0.0490953157841844
2023-11-27 10:55:11   INFO  epoch: 12/24, acc_iter=79344, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:06:47, time_cost(all): 1 day, 1:18:53/1 day, 2:02:27, loss=0.431494026802315, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.9956902066700577, lr=0.049055224011443806
2023-11-27 10:56:08   INFO  epoch: 12/24, acc_iter=79394, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:01:47, time_cost(all): 1 day, 1:19:50/1 day, 2:21:01, loss=0.431386484442212, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.9071419974839894, lr=0.04901513223870322
2023-11-27 10:57:06   INFO  epoch: 12/24, acc_iter=79444, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:00:03, time_cost(all): 1 day, 1:20:48/1 day, 0:09:34, loss=0.431278942082109, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.4813277168863015, lr=0.04897504046596263
2023-11-27 10:58:04   INFO  epoch: 12/24, acc_iter=79494, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:56:01, time_cost(all): 1 day, 1:21:46/1 day, 2:31:23, loss=0.431171399722006, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=3.8119012140121264, lr=0.04893494869322204
2023-11-27 10:59:02   INFO  epoch: 12/24, acc_iter=79544, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:52:17, time_cost(all): 1 day, 1:22:44/1 day, 0:33:41, loss=0.431063857361904, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.990299772311377, lr=0.04889485692048145
2023-11-27 10:59:59   INFO  epoch: 12/24, acc_iter=79594, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:56:10, time_cost(all): 1 day, 1:23:41/1 day, 1:26:58, loss=0.430956315001801, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=2.0617611761575656, lr=0.04885476514774086
2023-11-27 11:00:57   INFO  epoch: 12/24, acc_iter=79644, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:52:23, time_cost(all): 1 day, 1:24:39/1 day, 1:38:47, loss=0.430848772641698, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=0.9884330964905753, lr=0.04881467337500027
2023-11-27 11:01:55   INFO  epoch: 12/24, acc_iter=79694, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:55:10, time_cost(all): 1 day, 1:25:37/1 day, 2:05:53, loss=0.430741230281596, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.6006755280667693, lr=0.048774581602259684
2023-11-27 11:02:53   INFO  epoch: 12/24, acc_iter=79744, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:58:45, time_cost(all): 1 day, 1:26:35/1 day, 2:22:13, loss=0.430633687921493, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=1.2076039854611853, lr=0.048734489829519105
2023-11-27 11:03:50   INFO  epoch: 12/24, acc_iter=79794, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:57:18, time_cost(all): 1 day, 1:27:32/1 day, 0:25:30, loss=0.43052614556139, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=4.584184225568835, lr=0.04869439805677851
2023-11-27 11:04:48   INFO  epoch: 12/24, acc_iter=79844, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:49:22, time_cost(all): 1 day, 1:28:30/1 day, 0:20:49, loss=0.430418603201288, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.9071530659749523, lr=0.048654306284037926
2023-11-27 11:05:46   INFO  epoch: 12/24, acc_iter=79894, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:55:11, time_cost(all): 1 day, 1:29:28/1 day, 0:38:57, loss=0.430311060841185, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=4.6549597578508966, lr=0.048614214511297334
2023-11-27 11:06:44   INFO  epoch: 12/24, acc_iter=79944, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:51:56, time_cost(all): 1 day, 1:30:26/1 day, 1:19:58, loss=0.430203518481082, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=4.162372943993949, lr=0.04857412273855675
2023-11-27 11:07:41   INFO  epoch: 12/24, acc_iter=79994, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:51:01, time_cost(all): 1 day, 1:31:23/1 day, 1:24:17, loss=0.43009597612098, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.8682583486398934, lr=0.048534030965816155
2023-11-27 11:08:39   INFO  epoch: 12/24, acc_iter=80044, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:47:43, time_cost(all): 1 day, 1:32:21/1 day, 1:34:03, loss=0.429988433760877, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.5776205071513427, lr=0.04849393919307557
2023-11-27 11:09:37   INFO  epoch: 12/24, acc_iter=80094, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:27, time_cost(all): 1 day, 1:33:19/1 day, 0:09:44, loss=0.429880891400774, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=1.58972535091109, lr=0.048453847420334976
2023-11-27 11:10:35   INFO  epoch: 12/24, acc_iter=80144, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:48:13, time_cost(all): 1 day, 1:34:17/1 day, 1:35:57, loss=0.429773349040672, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=3.6666739005877624, lr=0.04841375564759439
2023-11-27 11:11:32   INFO  epoch: 12/24, acc_iter=80194, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:43:56, time_cost(all): 1 day, 1:35:14/23:57:08, loss=0.429665806680569, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.3078743630455545, lr=0.04837366387485381
2023-11-27 11:12:30   INFO  epoch: 12/24, acc_iter=80244, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:41:27, time_cost(all): 1 day, 1:36:12/1 day, 0:04:53, loss=0.429558264320466, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=2.55302013801868, lr=0.04833357210211322
2023-11-27 11:13:28   INFO  epoch: 12/24, acc_iter=80294, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:47:44, time_cost(all): 1 day, 1:37:10/1 day, 0:42:22, loss=0.429450721960364, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=1.2835720120850604, lr=0.04829348032937263
2023-11-27 11:14:26   INFO  epoch: 12/24, acc_iter=80344, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:45:22, time_cost(all): 1 day, 1:38:08/1 day, 2:17:07, loss=0.429343179600261, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=4.094595629258539, lr=0.04825338855663204
2023-11-27 11:15:23   INFO  epoch: 12/24, acc_iter=80394, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:39:15, time_cost(all): 1 day, 1:39:05/1 day, 2:18:11, loss=0.429235637240158, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.9628194943445398, lr=0.048213296783891454
2023-11-27 11:16:21   INFO  epoch: 12/24, acc_iter=80444, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:43:17, time_cost(all): 1 day, 1:40:03/1 day, 1:14:08, loss=0.429128094880056, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=2.9464104719391115, lr=0.04817320501115086
2023-11-27 11:17:19   INFO  epoch: 12/24, acc_iter=80494, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:39:57, time_cost(all): 1 day, 1:41:01/1 day, 0:32:50, loss=0.429020552519953, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=4.6478463985510965, lr=0.048133113238410276
2023-11-27 11:18:17   INFO  epoch: 12/24, acc_iter=80544, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:41:50, time_cost(all): 1 day, 1:41:59/23:55:25, loss=0.42891301015985, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.6513415910420592, lr=0.04809302146566968
2023-11-27 11:19:14   INFO  epoch: 12/24, acc_iter=80594, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:37:58, time_cost(all): 1 day, 1:42:56/1 day, 0:51:33, loss=0.428805467799748, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=1.891797505782106, lr=0.0480529296929291
2023-11-27 11:20:12   INFO  epoch: 12/24, acc_iter=80644, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:40:41, time_cost(all): 1 day, 1:43:54/1 day, 1:47:46, loss=0.428697925439645, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=1.4845276209565872, lr=0.04801283792018852
2023-11-27 11:21:10   INFO  epoch: 12/24, acc_iter=80694, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:30:39, time_cost(all): 1 day, 1:44:52/1 day, 0:40:18, loss=0.428590383079542, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.01(1.03), norm=1.1506080456340038, lr=0.047972746147447926
2023-11-27 11:22:08   INFO  epoch: 12/24, acc_iter=80744, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:33:39, time_cost(all): 1 day, 1:45:50/23:44:01, loss=0.42848284071944, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=3.2030371891934055, lr=0.04793265437470734
2023-11-27 11:23:05   INFO  epoch: 12/24, acc_iter=80794, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:30:49, time_cost(all): 1 day, 1:46:47/1 day, 0:01:30, loss=0.428375298359337, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=1.4949767375824354, lr=0.04789256260196675
2023-11-27 11:24:03   INFO  epoch: 12/24, acc_iter=80844, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:32:01, time_cost(all): 1 day, 1:47:45/1 day, 1:58:30, loss=0.428267755999234, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=4.212855300873028, lr=0.04785247082922616
2023-11-27 11:25:01   INFO  epoch: 12/24, acc_iter=80894, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:32:15, time_cost(all): 1 day, 1:48:43/1 day, 1:21:01, loss=0.428160213639131, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=2.809322647824401, lr=0.04781237905648557
2023-11-27 11:25:59   INFO  epoch: 12/24, acc_iter=80944, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:29:05, time_cost(all): 1 day, 1:49:41/1 day, 0:35:43, loss=0.428052671279029, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.288953207504206, lr=0.04777228728374498
2023-11-27 11:26:57   INFO  epoch: 12/24, acc_iter=80994, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:31:53, time_cost(all): 1 day, 1:50:39/1 day, 0:22:30, loss=0.427945128918926, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.7855041984665396, lr=0.0477321955110044
2023-11-27 11:27:54   INFO  epoch: 12/24, acc_iter=81044, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:28:39, time_cost(all): 1 day, 1:51:36/1 day, 1:46:01, loss=0.427837586558823, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=1.0184354248880854, lr=0.047692103738263804
2023-11-27 11:28:52   INFO  epoch: 12/24, acc_iter=81094, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:28:59, time_cost(all): 1 day, 1:52:34/1 day, 0:09:08, loss=0.427730044198721, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=4.348146869648367, lr=0.047652011965523225
2023-11-27 11:29:50   INFO  epoch: 12/24, acc_iter=81144, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:24:44, time_cost(all): 1 day, 1:53:32/1 day, 1:43:53, loss=0.427622501838618, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=4.0963057289278755, lr=0.04761192019278264
2023-11-27 11:30:48   INFO  epoch: 12/24, acc_iter=81194, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:27:28, time_cost(all): 1 day, 1:54:30/1 day, 1:43:47, loss=0.427514959478515, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=1.9332887540281491, lr=0.047571828420042046
2023-11-27 11:31:45   INFO  epoch: 12/24, acc_iter=81244, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:23:33, time_cost(all): 1 day, 1:55:27/1 day, 1:37:28, loss=0.427407417118413, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=3.4451638898865484, lr=0.04753173664730146
2023-11-27 11:32:43   INFO  epoch: 12/24, acc_iter=81294, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:21:24, time_cost(all): 1 day, 1:56:25/23:52:27, loss=0.42729987475831, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=1.810243057996561, lr=0.04749164487456087
2023-11-27 11:33:41   INFO  epoch: 12/24, acc_iter=81344, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:20:36, time_cost(all): 1 day, 1:57:23/1 day, 1:47:31, loss=0.427192332398207, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=4.343056287217667, lr=0.04745155310182028
2023-11-27 11:34:39   INFO  epoch: 12/24, acc_iter=81394, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:19:47, time_cost(all): 1 day, 1:58:21/1 day, 0:47:07, loss=0.427084790038105, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=4.72640617654086, lr=0.04741146132907969
2023-11-27 11:35:36   INFO  epoch: 12/24, acc_iter=81444, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:22:55, time_cost(all): 1 day, 1:59:18/1 day, 0:00:39, loss=0.426977247678002, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=3.164118326408957, lr=0.0473713695563391
2023-11-27 11:36:34   INFO  epoch: 12/24, acc_iter=81494, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:16:07, time_cost(all): 1 day, 2:00:16/1 day, 1:40:34, loss=0.426869705317899, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.183824192600664, lr=0.04733127778359851
2023-11-27 11:37:32   INFO  epoch: 12/24, acc_iter=81544, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:16:28, time_cost(all): 1 day, 2:01:14/1 day, 1:54:25, loss=0.426762162957797, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=2.055219333301816, lr=0.04729118601085793
2023-11-27 11:38:30   INFO  epoch: 12/24, acc_iter=81594, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:57, time_cost(all): 1 day, 2:02:12/1 day, 1:28:50, loss=0.426654620597694, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=4.795853951605317, lr=0.047251094238117346
2023-11-27 11:39:27   INFO  epoch: 12/24, acc_iter=81644, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:38, time_cost(all): 1 day, 2:03:09/23:38:31, loss=0.426547078237591, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.961956518022556, lr=0.04721100246537675
2023-11-27 11:40:25   INFO  epoch: 12/24, acc_iter=81694, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:50, time_cost(all): 1 day, 2:04:07/1 day, 0:09:12, loss=0.426439535877489, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=0.935361791702739, lr=0.04717091069263617
2023-11-27 11:41:23   INFO  epoch: 12/24, acc_iter=81744, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:13:18, time_cost(all): 1 day, 2:05:05/23:37:30, loss=0.426331993517386, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=1.2600657349095297, lr=0.047130818919895574
2023-11-27 11:42:21   INFO  epoch: 12/24, acc_iter=81794, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:13:41, time_cost(all): 1 day, 2:06:03/1 day, 1:43:31, loss=0.426224451157283, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=2.8998922315883613, lr=0.04709072714715499
2023-11-27 11:43:18   INFO  epoch: 12/24, acc_iter=81844, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:11:41, time_cost(all): 1 day, 2:07:00/23:52:34, loss=0.426116908797181, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.40790088602681, lr=0.047050635374414396
2023-11-27 11:44:16   INFO  epoch: 12/24, acc_iter=81894, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:10:11, time_cost(all): 1 day, 2:07:58/1 day, 0:19:16, loss=0.426009366437078, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.9368825539380614, lr=0.04701054360167381
2023-11-27 11:45:14   INFO  epoch: 12/24, acc_iter=81944, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:08:46, time_cost(all): 1 day, 2:08:56/1 day, 1:18:28, loss=0.425901824076975, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=3.8329245389366733, lr=0.04697045182893322
2023-11-27 11:46:12   INFO  epoch: 12/24, acc_iter=81994, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:55, time_cost(all): 1 day, 2:09:54/1 day, 0:42:42, loss=0.425794281716873, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=2.381291681913051, lr=0.04693036005619264
2023-11-27 11:47:09   INFO  epoch: 12/24, acc_iter=82044, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:05:54, time_cost(all): 1 day, 2:10:51/1 day, 1:02:45, loss=0.42568673935677, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.863306410501985, lr=0.04689026828345205
2023-11-27 11:48:07   INFO  epoch: 12/24, acc_iter=82094, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:10:27, time_cost(all): 1 day, 2:11:49/1 day, 0:52:46, loss=0.425579196996667, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=4.956743594173101, lr=0.04685017651071146
2023-11-27 11:49:05   INFO  epoch: 12/24, acc_iter=82144, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:06:00, time_cost(all): 1 day, 2:12:47/1 day, 1:32:24, loss=0.425471654636565, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=4.262859408612084, lr=0.046810084737970874
2023-11-27 11:50:03   INFO  epoch: 12/24, acc_iter=82194, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:05:51, time_cost(all): 1 day, 2:13:45/23:17:10, loss=0.425364112276462, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.8777910906857767, lr=0.04676999296523028
2023-11-27 11:51:00   INFO  epoch: 12/24, acc_iter=82244, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:13, time_cost(all): 1 day, 2:14:42/1 day, 1:02:36, loss=0.425256569916359, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.183576561194091, lr=0.046729901192489695
2023-11-27 11:51:58   INFO  epoch: 12/24, acc_iter=82294, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:03:28, time_cost(all): 1 day, 2:15:40/1 day, 0:34:40, loss=0.425149027556256, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=4.999640832790029, lr=0.0466898094197491
2023-11-27 11:52:56   INFO  epoch: 12/24, acc_iter=82344, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:11, time_cost(all): 1 day, 2:16:38/1 day, 0:02:39, loss=0.425041485196154, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=2.87799048704165, lr=0.046649717647008516
2023-11-27 11:53:54   INFO  epoch: 12/24, acc_iter=82394, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:17, time_cost(all): 1 day, 2:17:36/1 day, 1:38:19, loss=0.424933942836051, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=3.9038225262484167, lr=0.046609625874267924
2023-11-27 11:54:51   INFO  epoch: 12/24, acc_iter=82444, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:03:03, time_cost(all): 1 day, 2:18:33/1 day, 1:23:28, loss=0.424826400475948, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=1.9997772362028696, lr=0.046569534101527345
2023-11-27 11:55:49   INFO  epoch: 12/24, acc_iter=82494, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:39, time_cost(all): 1 day, 2:19:31/1 day, 0:18:40, loss=0.424718858115846, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=2.6610935093471335, lr=0.04652944232878676
2023-11-27 11:56:47   INFO  epoch: 12/24, acc_iter=82544, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:56:53, time_cost(all): 1 day, 2:20:29/1 day, 0:49:03, loss=0.424611315755743, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=1.0383454239515975, lr=0.046489350556046166
2023-11-27 11:57:45   INFO  epoch: 12/24, acc_iter=82594, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:55:49, time_cost(all): 1 day, 2:21:27/23:27:33, loss=0.42450377339564, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=3.5558050116612767, lr=0.04644925878330558
2023-11-27 11:58:42   INFO  epoch: 12/24, acc_iter=82644, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:59:11, time_cost(all): 1 day, 2:22:24/1 day, 0:03:05, loss=0.424396231035538, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.0109861128263815, lr=0.04640916701056499
2023-11-27 11:59:40   INFO  epoch: 12/24, acc_iter=82694, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:04, time_cost(all): 1 day, 2:23:22/23:51:11, loss=0.424288688675435, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=3.7828958955976475, lr=0.0463690752378244
2023-11-27 12:00:38   INFO  epoch: 12/24, acc_iter=82744, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:33, time_cost(all): 1 day, 2:24:20/23:27:03, loss=0.424181146315332, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=1.1244242955213481, lr=0.04632898346508381
2023-11-27 12:01:36   INFO  epoch: 12/24, acc_iter=82794, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:53:24, time_cost(all): 1 day, 2:25:18/1 day, 0:09:59, loss=0.42407360395523, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=2.886971905070066, lr=0.04628889169234322
2023-11-27 12:02:33   INFO  epoch: 12/24, acc_iter=82844, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:54:49, time_cost(all): 1 day, 2:26:15/1 day, 0:03:00, loss=0.423966061595127, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=1.0278373993320378, lr=0.04624879991960263
2023-11-27 12:03:31   INFO  epoch: 12/24, acc_iter=82894, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:52:36, time_cost(all): 1 day, 2:27:13/1 day, 0:13:37, loss=0.423858519235024, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=0.7342206828754129, lr=0.04620870814686205
2023-11-27 12:04:29   INFO  epoch: 12/24, acc_iter=82944, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:54:11, time_cost(all): 1 day, 2:28:11/1 day, 0:35:16, loss=0.423750976874922, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=4.59410002437838, lr=0.046168616374121466
2023-11-27 12:05:27   INFO  epoch: 12/24, acc_iter=82994, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:22, time_cost(all): 1 day, 2:29:09/23:06:19, loss=0.423643434514819, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=4.0574976261491535, lr=0.04612852460138087
2023-11-27 12:06:24   INFO  epoch: 12/24, acc_iter=83044, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:49:18, time_cost(all): 1 day, 2:30:06/1 day, 1:14:38, loss=0.423535892154716, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.5468311396869625, lr=0.04608843282864029
2023-11-27 12:07:22   INFO  epoch: 12/24, acc_iter=83094, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:48:10, time_cost(all): 1 day, 2:31:04/23:48:38, loss=0.423428349794614, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=2.583585708287815, lr=0.0460483410558997
2023-11-27 12:08:20   INFO  epoch: 12/24, acc_iter=83144, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:45:34, time_cost(all): 1 day, 2:32:02/1 day, 1:22:21, loss=0.423320807434511, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=0.5638502051096029, lr=0.04600824928315911
2023-11-27 12:09:18   INFO  epoch: 12/24, acc_iter=83194, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:44:55, time_cost(all): 1 day, 2:33:00/23:53:01, loss=0.423213265074408, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=4.91253853027071, lr=0.04596815751041852
2023-11-27 12:10:15   INFO  epoch: 12/24, acc_iter=83244, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:36, time_cost(all): 1 day, 2:33:57/1 day, 0:31:18, loss=0.423105722714306, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.8017872425737933, lr=0.04592806573767793
2023-11-27 12:11:13   INFO  epoch: 12/24, acc_iter=83294, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:44:59, time_cost(all): 1 day, 2:34:55/1 day, 1:20:15, loss=0.422998180354203, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=0.5910619081310122, lr=0.045887973964937344
2023-11-27 12:12:11   INFO  epoch: 12/24, acc_iter=83344, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:45:45, time_cost(all): 1 day, 2:35:53/1 day, 1:00:16, loss=0.4228906379941, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.618939147899392, lr=0.045847882192196765
2023-11-27 12:13:09   INFO  epoch: 12/24, acc_iter=83394, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:28, time_cost(all): 1 day, 2:36:51/23:12:04, loss=0.422783095633998, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=0.6825563364644396, lr=0.04580779041945617
2023-11-27 12:14:06   INFO  epoch: 12/24, acc_iter=83444, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:41:31, time_cost(all): 1 day, 2:37:48/1 day, 0:37:30, loss=0.422675553273895, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=2.2559511024414065, lr=0.045767698646715586
2023-11-27 12:15:04   INFO  epoch: 12/24, acc_iter=83494, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:11, time_cost(all): 1 day, 2:38:46/1 day, 0:39:07, loss=0.422568010913792, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.5841452726220044, lr=0.045727606873974994
2023-11-27 12:16:02   INFO  epoch: 12/24, acc_iter=83544, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:42:01, time_cost(all): 1 day, 2:39:44/23:57:55, loss=0.422460468553689, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.7974489898274735, lr=0.04568751510123441
2023-11-27 12:17:00   INFO  epoch: 12/24, acc_iter=83594, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:14, time_cost(all): 1 day, 2:40:42/1 day, 0:34:57, loss=0.422352926193587, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=1.0468748624552355, lr=0.045647423328493815
2023-11-27 12:17:57   INFO  epoch: 12/24, acc_iter=83644, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:10, time_cost(all): 1 day, 2:41:39/23:25:33, loss=0.422245383833484, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=3.273456908638669, lr=0.04560733155575323
2023-11-27 12:18:55   INFO  epoch: 12/24, acc_iter=83694, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:25, time_cost(all): 1 day, 2:42:37/23:36:49, loss=0.422137841473381, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=1.2394059089345726, lr=0.045567239783012636
2023-11-27 12:19:53   INFO  epoch: 12/24, acc_iter=83744, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:51, time_cost(all): 1 day, 2:43:35/1 day, 0:22:37, loss=0.422030299113279, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=0.5090148153702226, lr=0.04552714801027205
2023-11-27 12:20:51   INFO  epoch: 12/24, acc_iter=83794, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:49, time_cost(all): 1 day, 2:44:33/23:27:36, loss=0.421922756753176, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=1.864069142216829, lr=0.04548705623753147
2023-11-27 12:21:48   INFO  epoch: 12/24, acc_iter=83844, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:21, time_cost(all): 1 day, 2:45:30/23:12:21, loss=0.421815214393073, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=4.017302486812395, lr=0.04544696446479088
2023-11-27 12:22:46   INFO  epoch: 12/24, acc_iter=83894, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:08, time_cost(all): 1 day, 2:46:28/23:57:32, loss=0.421707672032971, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=1.881970330503917, lr=0.04540687269205029
2023-11-27 12:23:44   INFO  epoch: 12/24, acc_iter=83944, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:17, time_cost(all): 1 day, 2:47:26/23:20:00, loss=0.421600129672868, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=1.1894180596297357, lr=0.0453667809193097
2023-11-27 12:24:42   INFO  epoch: 12/24, acc_iter=83994, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:03, time_cost(all): 1 day, 2:48:24/1 day, 0:49:24, loss=0.421492587312765, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=1.1086749360091472, lr=0.045326689146569114
2023-11-27 12:25:39   INFO  epoch: 12/24, acc_iter=84044, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:54, time_cost(all): 1 day, 2:49:21/1 day, 0:05:50, loss=0.421385044952663, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=2.1775483262458475, lr=0.04528659737382852
2023-11-27 12:26:37   INFO  epoch: 12/24, acc_iter=84094, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:34, time_cost(all): 1 day, 2:50:19/23:29:03, loss=0.42127750259256, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=1.2298854731340545, lr=0.045246505601087936
2023-11-27 12:27:35   INFO  epoch: 12/24, acc_iter=84144, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:35, time_cost(all): 1 day, 2:51:17/23:08:55, loss=0.421169960232457, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=4.445396372430444, lr=0.04520641382834734
2023-11-27 12:28:33   INFO  epoch: 12/24, acc_iter=84194, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:45, time_cost(all): 1 day, 2:52:15/1 day, 0:57:37, loss=0.421062417872355, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.833385052566861, lr=0.04516632205560676
2023-11-27 12:29:30   INFO  epoch: 12/24, acc_iter=84244, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:12, time_cost(all): 1 day, 2:53:12/23:19:41, loss=0.420954875512252, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.389511521149146, lr=0.04512623028286618
2023-11-27 12:30:28   INFO  epoch: 12/24, acc_iter=84294, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:27, time_cost(all): 1 day, 2:54:10/1 day, 0:08:58, loss=0.420847333152149, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=4.627957226820578, lr=0.045086138510125585
2023-11-27 12:31:26   INFO  epoch: 12/24, acc_iter=84344, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:49, time_cost(all): 1 day, 2:55:08/1 day, 0:05:58, loss=0.420739790792047, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=4.250970622639386, lr=0.045046046737385
2023-11-27 12:32:24   INFO  epoch: 12/24, acc_iter=84394, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:15, time_cost(all): 1 day, 2:56:06/23:16:44, loss=0.420632248431944, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.4010000283095516, lr=0.04500595496464441
2023-11-27 12:33:21   INFO  epoch: 12/24, acc_iter=84444, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:18, time_cost(all): 1 day, 2:57:03/22:41:45, loss=0.420524706071841, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.3134787283035743, lr=0.04496586319190382
2023-11-27 12:34:19   INFO  epoch: 12/24, acc_iter=84494, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:48, time_cost(all): 1 day, 2:58:01/22:44:28, loss=0.420417163711739, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=4.314885688331265, lr=0.04492577141916323
2023-11-27 12:35:17   INFO  epoch: 12/24, acc_iter=84544, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:55, time_cost(all): 1 day, 2:58:59/1 day, 0:46:49, loss=0.420309621351636, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=1.0981442818700753, lr=0.04488567964642264
2023-11-27 12:36:15   INFO  epoch: 12/24, acc_iter=84594, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:34, time_cost(all): 1 day, 2:59:57/23:40:34, loss=0.420202078991533, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.8794182072716232, lr=0.04484558787368205
2023-11-27 12:37:12   INFO  epoch: 12/24, acc_iter=84644, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:47, time_cost(all): 1 day, 3:00:54/23:31:59, loss=0.420094536631431, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=1.8432097846228341, lr=0.044805496100941464
2023-11-27 12:38:10   INFO  epoch: 12/24, acc_iter=84694, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:44, time_cost(all): 1 day, 3:01:52/23:17:05, loss=0.419986994271328, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=0.6227249458238675, lr=0.044765404328200885
2023-11-27 12:39:08   INFO  epoch: 12/24, acc_iter=84744, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:47, time_cost(all): 1 day, 3:02:50/23:52:37, loss=0.419879451911225, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.8514175420127725, lr=0.04472531255546029
2023-11-27 12:40:06   INFO  epoch: 12/24, acc_iter=84794, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:19, time_cost(all): 1 day, 3:03:48/23:09:05, loss=0.419771909551123, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.639070931695435, lr=0.044685220782719706
2023-11-27 12:41:03   INFO  epoch: 12/24, acc_iter=84844, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:51, time_cost(all): 1 day, 3:04:45/23:01:23, loss=0.41966436719102, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=1.5163572141618387, lr=0.04464512900997911
2023-11-27 12:42:01   INFO  epoch: 12/24, acc_iter=84894, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:17, time_cost(all): 1 day, 3:05:43/1 day, 0:48:20, loss=0.419556824830917, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=1.1248367044543148, lr=0.04460503723723853
2023-11-27 12:42:59   INFO  epoch: 12/24, acc_iter=84944, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:36, time_cost(all): 1 day, 3:06:41/23:42:22, loss=0.419449282470814, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=4.533089424285176, lr=0.04456494546449794
2023-11-27 12:43:57   INFO  epoch: 12/24, acc_iter=84994, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:54, time_cost(all): 1 day, 3:07:39/23:04:01, loss=0.419341740110712, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=4.941035770424565, lr=0.04452485369175735
2023-11-27 12:44:54   INFO  epoch: 12/24, acc_iter=85044, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:47, time_cost(all): 1 day, 3:08:36/23:19:05, loss=0.419234197750609, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=4.077566460432677, lr=0.04448476191901676
2023-11-27 12:45:52   INFO  epoch: 12/24, acc_iter=85094, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:58, time_cost(all): 1 day, 3:09:34/23:30:52, loss=0.419126655390506, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=4.259874483016908, lr=0.044444670146276184
2023-11-27 12:46:50   INFO  epoch: 12/24, acc_iter=85144, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:44, time_cost(all): 1 day, 3:10:32/23:57:51, loss=0.419019113030404, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=2.159972521397136, lr=0.04440457837353559
2023-11-27 12:47:48   INFO  epoch: 12/24, acc_iter=85194, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:13, time_cost(all): 1 day, 3:11:30/23:32:05, loss=0.418911570670301, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=4.461340597532713, lr=0.044364486600795006
2023-11-27 12:48:45   INFO  epoch: 12/24, acc_iter=85244, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:18, time_cost(all): 1 day, 3:12:27/1 day, 0:37:31, loss=0.418804028310198, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=1.6154868344481441, lr=0.04432439482805441
2023-11-27 12:49:43   INFO  epoch: 12/24, acc_iter=85294, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:40, time_cost(all): 1 day, 3:13:25/23:33:58, loss=0.418696485950096, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=0.9010779006864582, lr=0.04428430305531383
2023-11-27 12:50:41   INFO  epoch: 12/24, acc_iter=85344, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:21, time_cost(all): 1 day, 3:14:23/23:15:56, loss=0.418588943589993, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=2.1027661042597976, lr=0.044244211282573234
2023-11-27 12:51:39   INFO  epoch: 12/24, acc_iter=85394, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:31, time_cost(all): 1 day, 3:15:21/1 day, 0:38:21, loss=0.41848140122989, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=4.084635091982538, lr=0.04420411950983265
2023-11-27 12:52:36   INFO  epoch: 12/24, acc_iter=85444, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:30, time_cost(all): 1 day, 3:16:18/22:38:13, loss=0.418373858869788, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=4.928071796557584, lr=0.044164027737092056
2023-11-27 12:53:34   INFO  epoch: 12/24, acc_iter=85494, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:37, time_cost(all): 1 day, 3:17:16/1 day, 0:32:48, loss=0.418266316509685, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=2.9188863136800456, lr=0.04412393596435147
2023-11-27 12:54:32   INFO  epoch: 12/24, acc_iter=85544, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:37, time_cost(all): 1 day, 3:18:14/1 day, 0:15:56, loss=0.418158774149582, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=4.686028935653744, lr=0.04408384419161089
2023-11-27 12:55:30   INFO  epoch: 12/24, acc_iter=85594, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:42, time_cost(all): 1 day, 3:19:12/22:15:31, loss=0.41805123178948, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=1.5301980774699133, lr=0.0440437524188703
2023-11-27 12:56:27   INFO  epoch: 13/24, acc_iter=85681, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:08:01, time_cost(all): 1 day, 3:20:09/22:59:50, loss=0.417864108082901, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=3.729431288552185, lr=0.04397399273430167
2023-11-27 12:57:25   INFO  epoch: 13/24, acc_iter=85731, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:09:40, time_cost(all): 1 day, 3:21:07/23:44:34, loss=0.417756565722798, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=2.908348290253922, lr=0.043933900961561086
2023-11-27 12:58:23   INFO  epoch: 13/24, acc_iter=85781, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/1:59:16, time_cost(all): 1 day, 3:22:05/23:57:17, loss=0.417649023362696, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.239175642604868, lr=0.04389380918882049
2023-11-27 12:59:21   INFO  epoch: 13/24, acc_iter=85831, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:06:57, time_cost(all): 1 day, 3:23:03/23:09:02, loss=0.417541481002593, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=3.8833169073366878, lr=0.04385371741607991
2023-11-27 13:00:18   INFO  epoch: 13/24, acc_iter=85881, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:56:27, time_cost(all): 1 day, 3:24:00/23:56:39, loss=0.41743393864249, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=1.851962111865919, lr=0.04381362564333932
2023-11-27 13:01:16   INFO  epoch: 13/24, acc_iter=85931, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:04:19, time_cost(all): 1 day, 3:24:58/23:02:51, loss=0.417326396282388, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=2.571919898062989, lr=0.04377353387059873
2023-11-27 13:02:14   INFO  epoch: 13/24, acc_iter=85981, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:55:53, time_cost(all): 1 day, 3:25:56/23:11:22, loss=0.417218853922285, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=2.4867603983992232, lr=0.04373344209785814
2023-11-27 13:03:12   INFO  epoch: 13/24, acc_iter=86031, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:56:21, time_cost(all): 1 day, 3:26:54/22:12:20, loss=0.417111311562182, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.628346683804598, lr=0.043693350325117564
2023-11-27 13:04:09   INFO  epoch: 13/24, acc_iter=86081, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:59:01, time_cost(all): 1 day, 3:27:51/1 day, 0:14:34, loss=0.41700376920208, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.0032245985171868, lr=0.04365325855237697
2023-11-27 13:05:07   INFO  epoch: 13/24, acc_iter=86131, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:52:53, time_cost(all): 1 day, 3:28:49/22:33:53, loss=0.416896226841977, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=4.491163928296528, lr=0.043613166779636385
2023-11-27 13:06:05   INFO  epoch: 13/24, acc_iter=86181, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:50:49, time_cost(all): 1 day, 3:29:47/1 day, 0:19:59, loss=0.416788684481874, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=1.0912047809814385, lr=0.04357307500689579
2023-11-27 13:07:03   INFO  epoch: 13/24, acc_iter=86231, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:58:48, time_cost(all): 1 day, 3:30:45/23:09:22, loss=0.416681142121772, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=3.2766908215348587, lr=0.043532983234155206
2023-11-27 13:08:00   INFO  epoch: 13/24, acc_iter=86281, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:48:40, time_cost(all): 1 day, 3:31:42/23:21:25, loss=0.416573599761669, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.9997075062142784, lr=0.043492891461414614
2023-11-27 13:08:58   INFO  epoch: 13/24, acc_iter=86331, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:49:46, time_cost(all): 1 day, 3:32:40/22:37:48, loss=0.416466057401566, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.649160434956964, lr=0.04345279968867403
2023-11-27 13:09:56   INFO  epoch: 13/24, acc_iter=86381, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:52:51, time_cost(all): 1 day, 3:33:38/23:41:07, loss=0.416358515041463, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=3.517022714337753, lr=0.043412707915933435
2023-11-27 13:10:54   INFO  epoch: 13/24, acc_iter=86431, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:46:42, time_cost(all): 1 day, 3:34:36/23:42:31, loss=0.416250972681361, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.783309846991714, lr=0.04337261614319285
2023-11-27 13:11:52   INFO  epoch: 13/24, acc_iter=86481, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:47:10, time_cost(all): 1 day, 3:35:34/1 day, 0:15:23, loss=0.416143430321258, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=3.0563826398517127, lr=0.04333252437045227
2023-11-27 13:12:49   INFO  epoch: 13/24, acc_iter=86531, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:50:02, time_cost(all): 1 day, 3:36:31/23:58:48, loss=0.416035887961155, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=1.9304622226026982, lr=0.04329243259771168
2023-11-27 13:13:47   INFO  epoch: 13/24, acc_iter=86581, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:46:46, time_cost(all): 1 day, 3:37:29/23:08:41, loss=0.415928345601053, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.110155271991145, lr=0.04325234082497109
2023-11-27 13:14:45   INFO  epoch: 13/24, acc_iter=86631, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:49:41, time_cost(all): 1 day, 3:38:27/22:44:41, loss=0.41582080324095, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.385711767181101, lr=0.0432122490522305
2023-11-27 13:15:43   INFO  epoch: 13/24, acc_iter=86681, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:01, time_cost(all): 1 day, 3:39:25/22:56:43, loss=0.415713260880847, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=3.5861644965023056, lr=0.04317215727948991
2023-11-27 13:16:40   INFO  epoch: 13/24, acc_iter=86731, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:49:29, time_cost(all): 1 day, 3:40:22/23:30:15, loss=0.415605718520745, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.882299047890772, lr=0.04313206550674932
2023-11-27 13:17:38   INFO  epoch: 13/24, acc_iter=86781, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:42:20, time_cost(all): 1 day, 3:41:20/22:40:22, loss=0.415498176160642, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.18(1.03), norm=4.844808302929142, lr=0.043091973734008734
2023-11-27 13:18:36   INFO  epoch: 13/24, acc_iter=86831, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:43:28, time_cost(all): 1 day, 3:42:18/22:21:43, loss=0.415390633800539, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=1.2565830531543627, lr=0.04305188196126814
2023-11-27 13:19:34   INFO  epoch: 13/24, acc_iter=86881, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:44:35, time_cost(all): 1 day, 3:43:16/23:22:14, loss=0.415283091440437, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=4.485391941234722, lr=0.043011790188527556
2023-11-27 13:20:31   INFO  epoch: 13/24, acc_iter=86931, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:46:31, time_cost(all): 1 day, 3:44:13/23:42:25, loss=0.415175549080334, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.442675359220089, lr=0.04297169841578698
2023-11-27 13:21:29   INFO  epoch: 13/24, acc_iter=86981, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:40:10, time_cost(all): 1 day, 3:45:11/22:13:40, loss=0.415068006720231, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=4.958155641074244, lr=0.042931606643046384
2023-11-27 13:22:27   INFO  epoch: 13/24, acc_iter=87031, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:36:58, time_cost(all): 1 day, 3:46:09/1 day, 0:05:53, loss=0.414960464360129, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=3.2080797383604427, lr=0.0428915148703058
2023-11-27 13:23:25   INFO  epoch: 13/24, acc_iter=87081, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:43:14, time_cost(all): 1 day, 3:47:07/21:51:22, loss=0.414852922000026, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=3.835006234913654, lr=0.042851423097565206
2023-11-27 13:24:22   INFO  epoch: 13/24, acc_iter=87131, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:41:32, time_cost(all): 1 day, 3:48:04/23:49:45, loss=0.414745379639923, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=4.235396930136287, lr=0.04281133132482462
2023-11-27 13:25:20   INFO  epoch: 13/24, acc_iter=87181, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:41:28, time_cost(all): 1 day, 3:49:02/22:50:17, loss=0.414637837279821, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=3.313949646278845, lr=0.04277123955208403
2023-11-27 13:26:18   INFO  epoch: 13/24, acc_iter=87231, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:37:21, time_cost(all): 1 day, 3:50:00/21:55:26, loss=0.414530294919718, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.838456117436456, lr=0.04273114777934344
2023-11-27 13:27:16   INFO  epoch: 13/24, acc_iter=87281, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:35:58, time_cost(all): 1 day, 3:50:58/23:07:13, loss=0.414422752559615, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=4.669334924481023, lr=0.04269105600660285
2023-11-27 13:28:13   INFO  epoch: 13/24, acc_iter=87331, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:42, time_cost(all): 1 day, 3:51:55/23:13:23, loss=0.414315210199513, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=1.2774372350603609, lr=0.04265096423386226
2023-11-27 13:29:11   INFO  epoch: 13/24, acc_iter=87381, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:34:52, time_cost(all): 1 day, 3:52:53/23:50:32, loss=0.41420766783941, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=3.4842162309565876, lr=0.042610872461121684
2023-11-27 13:30:09   INFO  epoch: 13/24, acc_iter=87431, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:34:10, time_cost(all): 1 day, 3:53:51/22:41:20, loss=0.414100125479307, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=2.0055527538806883, lr=0.04257078068838109
2023-11-27 13:31:07   INFO  epoch: 13/24, acc_iter=87481, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:33:45, time_cost(all): 1 day, 3:54:49/22:30:29, loss=0.413992583119205, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=3.068585116089505, lr=0.042530688915640505
2023-11-27 13:32:04   INFO  epoch: 13/24, acc_iter=87531, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:34:19, time_cost(all): 1 day, 3:55:46/22:24:20, loss=0.413885040759102, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=3.1815966823544874, lr=0.04249059714289991
2023-11-27 13:33:02   INFO  epoch: 13/24, acc_iter=87581, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:33:07, time_cost(all): 1 day, 3:56:44/23:15:08, loss=0.413777498398999, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=2.7036722477531803, lr=0.042450505370159326
2023-11-27 13:34:00   INFO  epoch: 13/24, acc_iter=87631, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:28:49, time_cost(all): 1 day, 3:57:42/22:44:33, loss=0.413669956038897, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.9096504622607255, lr=0.042410413597418734
2023-11-27 13:34:58   INFO  epoch: 13/24, acc_iter=87681, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:31:05, time_cost(all): 1 day, 3:58:40/23:10:56, loss=0.413562413678794, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.3036442595735465, lr=0.04237032182467815
2023-11-27 13:35:55   INFO  epoch: 13/24, acc_iter=87731, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:25:00, time_cost(all): 1 day, 3:59:37/22:19:15, loss=0.413454871318691, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=4.332560883480968, lr=0.04233023005193756
2023-11-27 13:36:53   INFO  epoch: 13/24, acc_iter=87781, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:21:49, time_cost(all): 1 day, 4:00:35/23:12:22, loss=0.413347328958589, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=2.289314739903287, lr=0.04229013827919697
2023-11-27 13:37:51   INFO  epoch: 13/24, acc_iter=87831, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:24:42, time_cost(all): 1 day, 4:01:33/22:20:47, loss=0.413239786598486, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=1.3295660582211308, lr=0.04225004650645639
2023-11-27 13:38:49   INFO  epoch: 13/24, acc_iter=87881, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:26:10, time_cost(all): 1 day, 4:02:31/22:08:04, loss=0.413132244238383, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=3.3626893032037213, lr=0.042209954733715804
2023-11-27 13:39:46   INFO  epoch: 13/24, acc_iter=87931, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:20:06, time_cost(all): 1 day, 4:03:28/22:10:27, loss=0.41302470187828, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=1.978725595905658, lr=0.04216986296097521
2023-11-27 13:40:44   INFO  epoch: 13/24, acc_iter=87981, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:20:33, time_cost(all): 1 day, 4:04:26/21:41:12, loss=0.412917159518178, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=2.566072422767466, lr=0.042129771188234626
2023-11-27 13:41:42   INFO  epoch: 13/24, acc_iter=88031, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:24:36, time_cost(all): 1 day, 4:05:24/21:30:45, loss=0.412809617158075, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=0.8917292492572844, lr=0.04208967941549403
2023-11-27 13:42:40   INFO  epoch: 13/24, acc_iter=88081, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:20:38, time_cost(all): 1 day, 4:06:22/23:10:27, loss=0.412702074797972, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=2.395551921521543, lr=0.04204958764275345
2023-11-27 13:43:37   INFO  epoch: 13/24, acc_iter=88131, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:17:53, time_cost(all): 1 day, 4:07:19/23:23:03, loss=0.41259453243787, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=3.1181509186240612, lr=0.042009495870012854
2023-11-27 13:44:35   INFO  epoch: 13/24, acc_iter=88181, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:20:05, time_cost(all): 1 day, 4:08:17/22:43:24, loss=0.412486990077767, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=0.6383836049473709, lr=0.04196940409727227
2023-11-27 13:45:33   INFO  epoch: 13/24, acc_iter=88231, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:16:24, time_cost(all): 1 day, 4:09:15/21:58:05, loss=0.412379447717664, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=4.512878493351685, lr=0.041929312324531676
2023-11-27 13:46:31   INFO  epoch: 13/24, acc_iter=88281, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:13:05, time_cost(all): 1 day, 4:10:13/23:15:30, loss=0.412271905357562, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=3.448276069609732, lr=0.0418892205517911
2023-11-27 13:47:28   INFO  epoch: 13/24, acc_iter=88331, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:13:29, time_cost(all): 1 day, 4:11:10/21:47:51, loss=0.412164362997459, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=1.2920677370762053, lr=0.04184912877905051
2023-11-27 13:48:26   INFO  epoch: 13/24, acc_iter=88381, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:17:24, time_cost(all): 1 day, 4:12:08/23:25:53, loss=0.412056820637356, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.8973264012681996, lr=0.04180903700630992
2023-11-27 13:49:24   INFO  epoch: 13/24, acc_iter=88431, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:30, time_cost(all): 1 day, 4:13:06/22:52:51, loss=0.411949278277254, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=3.366067237920833, lr=0.04176894523356933
2023-11-27 13:50:22   INFO  epoch: 13/24, acc_iter=88481, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:12:14, time_cost(all): 1 day, 4:14:04/22:59:27, loss=0.411841735917151, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.7484865706834305, lr=0.04172885346082874
2023-11-27 13:51:19   INFO  epoch: 13/24, acc_iter=88531, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:11:04, time_cost(all): 1 day, 4:15:01/23:31:48, loss=0.411734193557048, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=1.6421113171575197, lr=0.041688761688088154
2023-11-27 13:52:17   INFO  epoch: 13/24, acc_iter=88581, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:54, time_cost(all): 1 day, 4:15:59/21:31:28, loss=0.411626651196946, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=3.6130471691593713, lr=0.04164866991534756
2023-11-27 13:53:15   INFO  epoch: 13/24, acc_iter=88631, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:08:35, time_cost(all): 1 day, 4:16:57/23:09:41, loss=0.411519108836843, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.84(1.03), norm=3.6747880916207487, lr=0.041608578142606975
2023-11-27 13:54:13   INFO  epoch: 13/24, acc_iter=88681, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:06:33, time_cost(all): 1 day, 4:17:55/22:36:09, loss=0.41141156647674, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.1233593689262173, lr=0.04156848636986638
2023-11-27 13:55:10   INFO  epoch: 13/24, acc_iter=88731, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:04:18, time_cost(all): 1 day, 4:18:52/22:47:03, loss=0.411304024116638, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.175971325142707, lr=0.0415283945971258
2023-11-27 13:56:08   INFO  epoch: 13/24, acc_iter=88781, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:06:37, time_cost(all): 1 day, 4:19:50/23:03:13, loss=0.411196481756535, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=4.9091489816879195, lr=0.04148830282438522
2023-11-27 13:57:06   INFO  epoch: 13/24, acc_iter=88831, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:03:42, time_cost(all): 1 day, 4:20:48/21:54:34, loss=0.411088939396432, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.83(1.03), norm=0.6083515954304042, lr=0.041448211051644625
2023-11-27 13:58:04   INFO  epoch: 13/24, acc_iter=88881, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:01:28, time_cost(all): 1 day, 4:21:46/23:25:57, loss=0.41098139703633, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.2651346028112576, lr=0.04140811927890404
2023-11-27 13:59:01   INFO  epoch: 13/24, acc_iter=88931, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:28, time_cost(all): 1 day, 4:22:43/23:22:32, loss=0.410873854676227, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.5936619545961854, lr=0.041368027506163446
2023-11-27 13:59:59   INFO  epoch: 13/24, acc_iter=88981, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:34, time_cost(all): 1 day, 4:23:41/22:54:55, loss=0.410766312316124, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=4.931017709726318, lr=0.04132793573342286
2023-11-27 14:00:57   INFO  epoch: 13/24, acc_iter=89031, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:46, time_cost(all): 1 day, 4:24:39/23:04:43, loss=0.410658769956022, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=4.829217940654505, lr=0.04128784396068227
2023-11-27 14:01:55   INFO  epoch: 13/24, acc_iter=89081, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:01:31, time_cost(all): 1 day, 4:25:37/22:28:55, loss=0.410551227595919, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=3.617444122214389, lr=0.04124775218794168
2023-11-27 14:02:52   INFO  epoch: 13/24, acc_iter=89131, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:57:47, time_cost(all): 1 day, 4:26:34/22:46:50, loss=0.410443685235816, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.361952738621415, lr=0.04120766041520109
2023-11-27 14:03:50   INFO  epoch: 13/24, acc_iter=89181, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:57:59, time_cost(all): 1 day, 4:27:32/21:49:03, loss=0.410336142875714, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.311084703429008, lr=0.04116756864246051
2023-11-27 14:04:48   INFO  epoch: 13/24, acc_iter=89231, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:25, time_cost(all): 1 day, 4:28:30/21:24:09, loss=0.410228600515611, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=0.6200897055617012, lr=0.041127476869719924
2023-11-27 14:05:46   INFO  epoch: 13/24, acc_iter=89281, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:54:12, time_cost(all): 1 day, 4:29:28/23:09:59, loss=0.410121058155508, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=1.9053162150022087, lr=0.04108738509697933
2023-11-27 14:06:43   INFO  epoch: 13/24, acc_iter=89331, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:59, time_cost(all): 1 day, 4:30:25/23:05:43, loss=0.410013515795405, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=1.1642314125741071, lr=0.041047293324238746
2023-11-27 14:07:41   INFO  epoch: 13/24, acc_iter=89381, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:54:33, time_cost(all): 1 day, 4:31:23/21:35:02, loss=0.409905973435303, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=1.920079244932036, lr=0.04100720155149815
2023-11-27 14:08:39   INFO  epoch: 13/24, acc_iter=89431, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:43, time_cost(all): 1 day, 4:32:21/22:18:05, loss=0.4097984310752, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.950410308056096, lr=0.04096710977875757
2023-11-27 14:09:37   INFO  epoch: 13/24, acc_iter=89481, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:55:14, time_cost(all): 1 day, 4:33:19/21:43:16, loss=0.409690888715097, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.0349354113942644, lr=0.040927018006016974
2023-11-27 14:10:34   INFO  epoch: 13/24, acc_iter=89531, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:21, time_cost(all): 1 day, 4:34:16/23:09:21, loss=0.409583346354995, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.1708794820054922, lr=0.04088692623327639
2023-11-27 14:11:32   INFO  epoch: 13/24, acc_iter=89581, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:45, time_cost(all): 1 day, 4:35:14/22:57:18, loss=0.409475803994892, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=2.688246908225799, lr=0.040846834460535796
2023-11-27 14:12:30   INFO  epoch: 13/24, acc_iter=89631, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:48:55, time_cost(all): 1 day, 4:36:12/21:09:07, loss=0.409368261634789, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=0.6963593392789865, lr=0.04080674268779522
2023-11-27 14:13:28   INFO  epoch: 13/24, acc_iter=89681, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:47:17, time_cost(all): 1 day, 4:37:10/21:18:20, loss=0.409260719274687, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=4.45011026433566, lr=0.04076665091505463
2023-11-27 14:14:25   INFO  epoch: 13/24, acc_iter=89731, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:14, time_cost(all): 1 day, 4:38:07/21:22:38, loss=0.409153176914584, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=1.5515657425682083, lr=0.04072655914231404
2023-11-27 14:15:23   INFO  epoch: 13/24, acc_iter=89781, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:45:45, time_cost(all): 1 day, 4:39:05/21:57:34, loss=0.409045634554481, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=0.5893561112327361, lr=0.04068646736957345
2023-11-27 14:16:21   INFO  epoch: 13/24, acc_iter=89831, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:48, time_cost(all): 1 day, 4:40:03/23:05:53, loss=0.408938092194379, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=1.945221022648457, lr=0.040646375596832866
2023-11-27 14:17:19   INFO  epoch: 13/24, acc_iter=89881, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:36, time_cost(all): 1 day, 4:41:01/21:28:51, loss=0.408830549834276, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=2.718632087369273, lr=0.040606283824092274
2023-11-27 14:18:16   INFO  epoch: 13/24, acc_iter=89931, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:42:40, time_cost(all): 1 day, 4:41:58/21:33:59, loss=0.408723007474173, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=4.396037796728854, lr=0.04056619205135169
2023-11-27 14:19:14   INFO  epoch: 13/24, acc_iter=89981, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:42:25, time_cost(all): 1 day, 4:42:56/22:01:04, loss=0.408615465114071, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.317142790279403, lr=0.040526100278611095
2023-11-27 14:20:12   INFO  epoch: 13/24, acc_iter=90031, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:50, time_cost(all): 1 day, 4:43:54/21:43:20, loss=0.408507922753968, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=2.654375539931244, lr=0.04048600850587051
2023-11-27 14:21:10   INFO  epoch: 13/24, acc_iter=90081, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:17, time_cost(all): 1 day, 4:44:52/23:03:12, loss=0.408400380393865, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=0.52650835400635, lr=0.04044591673312993
2023-11-27 14:22:07   INFO  epoch: 13/24, acc_iter=90131, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:42:03, time_cost(all): 1 day, 4:45:49/21:39:16, loss=0.408292838033763, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=4.563777801926475, lr=0.04040582496038934
2023-11-27 14:23:05   INFO  epoch: 13/24, acc_iter=90181, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:50, time_cost(all): 1 day, 4:46:47/21:14:56, loss=0.40818529567366, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=3.36889052291825, lr=0.04036573318764875
2023-11-27 14:24:03   INFO  epoch: 13/24, acc_iter=90231, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:50, time_cost(all): 1 day, 4:47:45/21:48:56, loss=0.408077753313557, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=1.206220440440289, lr=0.04032564141490816
2023-11-27 14:25:01   INFO  epoch: 13/24, acc_iter=90281, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:49, time_cost(all): 1 day, 4:48:43/21:32:01, loss=0.407970210953455, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=3.1655089691225085, lr=0.04028554964216757
2023-11-27 14:25:58   INFO  epoch: 13/24, acc_iter=90331, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:18, time_cost(all): 1 day, 4:49:40/22:05:42, loss=0.407862668593352, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=4.990320900153853, lr=0.04024545786942698
2023-11-27 14:26:56   INFO  epoch: 13/24, acc_iter=90381, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:27, time_cost(all): 1 day, 4:50:38/21:45:37, loss=0.407755126233249, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=2.687966389894904, lr=0.040205366096686394
2023-11-27 14:27:54   INFO  epoch: 13/24, acc_iter=90431, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:19, time_cost(all): 1 day, 4:51:36/22:22:20, loss=0.407647583873146, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=4.445675997273003, lr=0.0401652743239458
2023-11-27 14:28:52   INFO  epoch: 13/24, acc_iter=90481, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:40, time_cost(all): 1 day, 4:52:34/21:15:57, loss=0.407540041513044, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=0.8192522513671434, lr=0.040125182551205216
2023-11-27 14:29:49   INFO  epoch: 13/24, acc_iter=90531, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:33:08, time_cost(all): 1 day, 4:53:31/22:18:00, loss=0.407432499152941, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=3.852616192597109, lr=0.04008509077846464
2023-11-27 14:30:47   INFO  epoch: 13/24, acc_iter=90581, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:41, time_cost(all): 1 day, 4:54:29/22:24:49, loss=0.407324956792838, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.567717628055642, lr=0.040044999005724044
2023-11-27 14:31:45   INFO  epoch: 13/24, acc_iter=90631, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:38, time_cost(all): 1 day, 4:55:27/21:12:36, loss=0.407217414432736, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.83(1.03), norm=2.81919617387217, lr=0.04000490723298346
2023-11-27 14:32:43   INFO  epoch: 13/24, acc_iter=90681, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:08, time_cost(all): 1 day, 4:56:25/22:15:38, loss=0.407109872072633, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.191092636040857, lr=0.039964815460242865
2023-11-27 14:33:40   INFO  epoch: 13/24, acc_iter=90731, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:15, time_cost(all): 1 day, 4:57:22/22:13:16, loss=0.40700232971253, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=2.4088918997201203, lr=0.03992472368750228
2023-11-27 14:34:38   INFO  epoch: 13/24, acc_iter=90781, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:09, time_cost(all): 1 day, 4:58:20/21:13:44, loss=0.406894787352428, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=2.761655695945581, lr=0.03988463191476169
2023-11-27 14:35:36   INFO  epoch: 13/24, acc_iter=90831, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:54, time_cost(all): 1 day, 4:59:18/22:09:58, loss=0.406787244992325, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=3.8013863381261412, lr=0.0398445401420211
2023-11-27 14:36:34   INFO  epoch: 13/24, acc_iter=90881, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:09, time_cost(all): 1 day, 5:00:16/22:30:21, loss=0.406679702632222, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=2.021242871147475, lr=0.03980444836928051
2023-11-27 14:37:31   INFO  epoch: 13/24, acc_iter=90931, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:46, time_cost(all): 1 day, 5:01:13/22:31:50, loss=0.40657216027212, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.907993127359509, lr=0.03976435659653992
2023-11-27 14:38:29   INFO  epoch: 13/24, acc_iter=90981, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:12, time_cost(all): 1 day, 5:02:11/21:39:25, loss=0.406464617912017, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.98292581905078, lr=0.03972426482379934
2023-11-27 14:39:27   INFO  epoch: 13/24, acc_iter=91031, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:15, time_cost(all): 1 day, 5:03:09/20:43:16, loss=0.406357075551914, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.577196655084348, lr=0.03968417305105875
2023-11-27 14:40:25   INFO  epoch: 13/24, acc_iter=91081, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:02, time_cost(all): 1 day, 5:04:07/21:02:48, loss=0.406249533191812, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=0.7978337545168072, lr=0.039644081278318165
2023-11-27 14:41:22   INFO  epoch: 13/24, acc_iter=91131, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:04, time_cost(all): 1 day, 5:05:04/21:03:27, loss=0.406141990831709, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=2.676403160615001, lr=0.03960398950557757
2023-11-27 14:42:20   INFO  epoch: 13/24, acc_iter=91181, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:14, time_cost(all): 1 day, 5:06:02/22:19:55, loss=0.406034448471606, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=0.7988394448296736, lr=0.039563897732836986
2023-11-27 14:43:18   INFO  epoch: 13/24, acc_iter=91231, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:14, time_cost(all): 1 day, 5:07:00/22:40:09, loss=0.405926906111504, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=1.3894304833983613, lr=0.03952380596009639
2023-11-27 14:44:16   INFO  epoch: 13/24, acc_iter=91281, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:44, time_cost(all): 1 day, 5:07:58/21:30:58, loss=0.405819363751401, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=2.4894301690694176, lr=0.03948371418735581
2023-11-27 14:45:13   INFO  epoch: 13/24, acc_iter=91331, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:43, time_cost(all): 1 day, 5:08:55/20:45:05, loss=0.405711821391298, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.0071808202449049, lr=0.039443622414615215
2023-11-27 14:46:11   INFO  epoch: 13/24, acc_iter=91381, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:19, time_cost(all): 1 day, 5:09:53/21:04:27, loss=0.405604279031196, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.9833895872302647, lr=0.039403530641874636
2023-11-27 14:47:09   INFO  epoch: 13/24, acc_iter=91431, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:41, time_cost(all): 1 day, 5:10:51/20:32:19, loss=0.405496736671093, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=2.809956730421564, lr=0.03936343886913405
2023-11-27 14:48:07   INFO  epoch: 13/24, acc_iter=91481, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:02, time_cost(all): 1 day, 5:11:49/22:17:18, loss=0.40538919431099, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=3.2913151235352367, lr=0.03932334709639346
2023-11-27 14:49:04   INFO  epoch: 13/24, acc_iter=91531, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:21, time_cost(all): 1 day, 5:12:46/22:21:19, loss=0.405281651950888, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=2.362068124403782, lr=0.03928325532365287
2023-11-27 14:50:02   INFO  epoch: 13/24, acc_iter=91581, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:29, time_cost(all): 1 day, 5:13:44/20:39:46, loss=0.405174109590785, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=3.46497938857883, lr=0.03924316355091228
2023-11-27 14:51:00   INFO  epoch: 13/24, acc_iter=91631, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:37, time_cost(all): 1 day, 5:14:42/21:14:09, loss=0.405066567230682, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=1.900304369383432, lr=0.03920307177817169
2023-11-27 14:51:58   INFO  epoch: 13/24, acc_iter=91681, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:21, time_cost(all): 1 day, 5:15:40/21:13:24, loss=0.40495902487058, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.4215128642808839, lr=0.0391629800054311
2023-11-27 14:52:55   INFO  epoch: 13/24, acc_iter=91731, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:36, time_cost(all): 1 day, 5:16:37/21:32:05, loss=0.404851482510477, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.23(1.03), norm=3.1044217272247345, lr=0.039122888232690514
2023-11-27 14:53:53   INFO  epoch: 13/24, acc_iter=91781, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:38, time_cost(all): 1 day, 5:17:35/21:08:36, loss=0.404743940150374, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.5655356565414893, lr=0.03908279645994993
2023-11-27 14:54:51   INFO  epoch: 13/24, acc_iter=91831, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:46, time_cost(all): 1 day, 5:18:33/21:12:09, loss=0.404636397790271, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.0055770073815635, lr=0.03904270468720934
2023-11-27 14:55:49   INFO  epoch: 13/24, acc_iter=91881, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:33, time_cost(all): 1 day, 5:19:31/22:19:01, loss=0.404528855430169, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.9018832533709666, lr=0.03900261291446876
2023-11-27 14:56:47   INFO  epoch: 13/24, acc_iter=91931, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:32, time_cost(all): 1 day, 5:20:29/21:29:25, loss=0.404421313070066, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=4.204700693031034, lr=0.03896252114172817
2023-11-27 14:57:44   INFO  epoch: 13/24, acc_iter=91981, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:42, time_cost(all): 1 day, 5:21:26/20:20:17, loss=0.404313770709963, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=3.193320338979159, lr=0.03892242936898758
2023-11-27 14:58:42   INFO  epoch: 13/24, acc_iter=92031, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:36, time_cost(all): 1 day, 5:22:24/21:06:13, loss=0.404206228349861, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=4.952982115883986, lr=0.03888233759624699
2023-11-27 14:59:40   INFO  epoch: 13/24, acc_iter=92081, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:42, time_cost(all): 1 day, 5:23:22/21:02:14, loss=0.404098685989758, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.7426159399147751, lr=0.0388422458235064
2023-11-27 15:00:38   INFO  epoch: 13/24, acc_iter=92131, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:36, time_cost(all): 1 day, 5:24:20/22:06:33, loss=0.403991143629655, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=0.5097280499216845, lr=0.038802154050765814
2023-11-27 15:01:35   INFO  epoch: 13/24, acc_iter=92181, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:42, time_cost(all): 1 day, 5:25:17/21:56:48, loss=0.403883601269553, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=3.623029577157185, lr=0.03876206227802522
2023-11-27 15:02:33   INFO  epoch: 14/24, acc_iter=92268, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:07:36, time_cost(all): 1 day, 5:26:15/21:16:22, loss=0.403696477562974, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=2.5441391386417935, lr=0.038692302593456594
2023-11-27 15:03:31   INFO  epoch: 14/24, acc_iter=92318, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:05:21, time_cost(all): 1 day, 5:27:13/21:24:27, loss=0.403588935202871, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=1.8999314070575746, lr=0.038652210820716015
2023-11-27 15:04:29   INFO  epoch: 14/24, acc_iter=92368, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:01:27, time_cost(all): 1 day, 5:28:11/22:12:17, loss=0.403481392842769, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=4.83838426456695, lr=0.03861211904797543
2023-11-27 15:05:26   INFO  epoch: 14/24, acc_iter=92418, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/1:59:30, time_cost(all): 1 day, 5:29:08/20:34:46, loss=0.403373850482666, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=4.647235080588744, lr=0.03857202727523484
2023-11-27 15:06:24   INFO  epoch: 14/24, acc_iter=92468, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:07:13, time_cost(all): 1 day, 5:30:06/21:42:40, loss=0.403266308122563, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.9548239504443625, lr=0.03853193550249425
2023-11-27 15:07:22   INFO  epoch: 14/24, acc_iter=92518, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:00:56, time_cost(all): 1 day, 5:31:04/20:27:25, loss=0.403158765762461, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.859830729644499, lr=0.03849184372975366
2023-11-27 15:08:20   INFO  epoch: 14/24, acc_iter=92568, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:57:01, time_cost(all): 1 day, 5:32:02/20:45:21, loss=0.403051223402358, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=4.415723266687486, lr=0.03845175195701307
2023-11-27 15:09:17   INFO  epoch: 14/24, acc_iter=92618, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:04:17, time_cost(all): 1 day, 5:32:59/21:47:50, loss=0.402943681042255, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.9928590468494543, lr=0.038411660184272486
2023-11-27 15:10:15   INFO  epoch: 14/24, acc_iter=92668, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:56:07, time_cost(all): 1 day, 5:33:57/20:23:08, loss=0.402836138682153, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=3.0489509077592998, lr=0.038371568411531894
2023-11-27 15:11:13   INFO  epoch: 14/24, acc_iter=92718, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:02:42, time_cost(all): 1 day, 5:34:55/21:41:39, loss=0.40272859632205, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=4.593641684184187, lr=0.03833147663879131
2023-11-27 15:12:11   INFO  epoch: 14/24, acc_iter=92768, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:54:57, time_cost(all): 1 day, 5:35:53/20:36:52, loss=0.402621053961947, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=3.7968672054511026, lr=0.03829138486605073
2023-11-27 15:13:08   INFO  epoch: 14/24, acc_iter=92818, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:59:17, time_cost(all): 1 day, 5:36:50/20:51:44, loss=0.402513511601845, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=1.6387849854762522, lr=0.038251293093310136
2023-11-27 15:14:06   INFO  epoch: 14/24, acc_iter=92868, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:58:44, time_cost(all): 1 day, 5:37:48/20:10:34, loss=0.402405969241742, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=2.2224583956108885, lr=0.03821120132056955
2023-11-27 15:15:04   INFO  epoch: 14/24, acc_iter=92918, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:55:07, time_cost(all): 1 day, 5:38:46/21:14:17, loss=0.402298426881639, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.6946864546446322, lr=0.03817110954782896
2023-11-27 15:16:02   INFO  epoch: 14/24, acc_iter=92968, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:56:13, time_cost(all): 1 day, 5:39:44/20:01:23, loss=0.402190884521537, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=4.052986041410611, lr=0.03813101777508837
2023-11-27 15:16:59   INFO  epoch: 14/24, acc_iter=93018, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:51:20, time_cost(all): 1 day, 5:40:41/20:51:22, loss=0.402083342161434, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=1.9474025838837683, lr=0.03809092600234778
2023-11-27 15:17:57   INFO  epoch: 14/24, acc_iter=93068, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:51:12, time_cost(all): 1 day, 5:41:39/21:43:51, loss=0.401975799801331, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=3.1889499217657917, lr=0.03805083422960719
2023-11-27 15:18:55   INFO  epoch: 14/24, acc_iter=93118, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:44:57, time_cost(all): 1 day, 5:42:37/21:00:21, loss=0.401868257441229, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=4.91888608606724, lr=0.0380107424568666
2023-11-27 15:19:53   INFO  epoch: 14/24, acc_iter=93168, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:52:13, time_cost(all): 1 day, 5:43:35/20:56:41, loss=0.401760715081126, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=1.9414310182279015, lr=0.037970650684126014
2023-11-27 15:20:50   INFO  epoch: 14/24, acc_iter=93218, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:43:45, time_cost(all): 1 day, 5:44:32/20:01:23, loss=0.401653172721023, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.7713461977058054, lr=0.037930558911385436
2023-11-27 15:21:48   INFO  epoch: 14/24, acc_iter=93268, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:50:58, time_cost(all): 1 day, 5:45:30/20:06:00, loss=0.401545630360921, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=3.420116900481212, lr=0.03789046713864484
2023-11-27 15:22:46   INFO  epoch: 14/24, acc_iter=93318, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:48:48, time_cost(all): 1 day, 5:46:28/21:14:11, loss=0.401438088000818, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=2.9827710681005293, lr=0.03785037536590426
2023-11-27 15:23:44   INFO  epoch: 14/24, acc_iter=93368, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:41:23, time_cost(all): 1 day, 5:47:26/21:11:01, loss=0.401330545640715, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=2.84009479701587, lr=0.037810283593163664
2023-11-27 15:24:41   INFO  epoch: 14/24, acc_iter=93418, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:41:42, time_cost(all): 1 day, 5:48:23/20:11:02, loss=0.401223003280613, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=0.8947716191715341, lr=0.03777019182042308
2023-11-27 15:25:39   INFO  epoch: 14/24, acc_iter=93468, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:38:15, time_cost(all): 1 day, 5:49:21/21:52:43, loss=0.40111546092051, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=4.170778199901303, lr=0.037730100047682485
2023-11-27 15:26:37   INFO  epoch: 14/24, acc_iter=93518, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:45:53, time_cost(all): 1 day, 5:50:19/20:57:48, loss=0.401007918560407, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=4.963020317473176, lr=0.0376900082749419
2023-11-27 15:27:35   INFO  epoch: 14/24, acc_iter=93568, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:38:04, time_cost(all): 1 day, 5:51:17/20:56:37, loss=0.400900376200304, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.360682059769835, lr=0.03764991650220131
2023-11-27 15:28:32   INFO  epoch: 14/24, acc_iter=93618, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:41:41, time_cost(all): 1 day, 5:52:14/21:49:13, loss=0.400792833840202, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=3.1992928933656843, lr=0.03760982472946072
2023-11-27 15:29:30   INFO  epoch: 14/24, acc_iter=93668, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:38:48, time_cost(all): 1 day, 5:53:12/21:13:07, loss=0.400685291480099, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=2.9475292676585756, lr=0.03756973295672014
2023-11-27 15:30:28   INFO  epoch: 14/24, acc_iter=93718, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:34:17, time_cost(all): 1 day, 5:54:10/20:27:29, loss=0.400577749119996, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=3.453312633447972, lr=0.03752964118397955
2023-11-27 15:31:26   INFO  epoch: 14/24, acc_iter=93768, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:39:12, time_cost(all): 1 day, 5:55:08/21:15:26, loss=0.400470206759894, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.7653670836546371, lr=0.03748954941123896
2023-11-27 15:32:23   INFO  epoch: 14/24, acc_iter=93818, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:34:47, time_cost(all): 1 day, 5:56:05/21:03:55, loss=0.400362664399791, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.350212050557054, lr=0.03744945763849837
2023-11-27 15:33:21   INFO  epoch: 14/24, acc_iter=93868, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:33:32, time_cost(all): 1 day, 5:57:03/20:53:41, loss=0.400255122039688, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=0.7263055006643477, lr=0.037409365865757785
2023-11-27 15:34:19   INFO  epoch: 14/24, acc_iter=93918, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:40, time_cost(all): 1 day, 5:58:01/21:34:42, loss=0.400147579679586, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.6796890954843793, lr=0.0373692740930172
2023-11-27 15:35:17   INFO  epoch: 14/24, acc_iter=93968, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:28:45, time_cost(all): 1 day, 5:58:59/21:33:36, loss=0.400040037319483, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=3.706924198555541, lr=0.0373291823202766
2023-11-27 15:36:14   INFO  epoch: 14/24, acc_iter=94018, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:28:22, time_cost(all): 1 day, 5:59:56/20:01:05, loss=0.39993249495938, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=3.029566281275724, lr=0.037289090547536013
2023-11-27 15:37:12   INFO  epoch: 14/24, acc_iter=94068, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:28:08, time_cost(all): 1 day, 6:00:54/20:29:43, loss=0.399824952599278, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=4.952042700812721, lr=0.03724899877479543
2023-11-27 15:38:10   INFO  epoch: 14/24, acc_iter=94118, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:26:07, time_cost(all): 1 day, 6:01:52/20:42:46, loss=0.399717410239175, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.1027611312486891, lr=0.03720890700205484
2023-11-27 15:39:08   INFO  epoch: 14/24, acc_iter=94168, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:33:39, time_cost(all): 1 day, 6:02:50/19:54:25, loss=0.399609867879072, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=4.656216568194448, lr=0.037168815229314256
2023-11-27 15:40:05   INFO  epoch: 14/24, acc_iter=94218, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:27:35, time_cost(all): 1 day, 6:03:47/20:31:00, loss=0.39950232551897, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=4.8711999491065985, lr=0.03712872345657367
2023-11-27 15:41:03   INFO  epoch: 14/24, acc_iter=94268, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:29:22, time_cost(all): 1 day, 6:04:45/21:17:35, loss=0.399394783158867, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=2.513764132573715, lr=0.037088631683833084
2023-11-27 15:42:01   INFO  epoch: 14/24, acc_iter=94318, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:12, time_cost(all): 1 day, 6:05:43/20:23:24, loss=0.399287240798764, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.4450685266987362, lr=0.037048539911092485
2023-11-27 15:42:59   INFO  epoch: 14/24, acc_iter=94368, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:28:14, time_cost(all): 1 day, 6:06:41/20:40:01, loss=0.399179698438662, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.526658422198087, lr=0.0370084481383519
2023-11-27 15:43:56   INFO  epoch: 14/24, acc_iter=94418, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:20:28, time_cost(all): 1 day, 6:07:38/19:44:37, loss=0.399072156078559, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.7280907937180787, lr=0.03696835636561131
2023-11-27 15:44:54   INFO  epoch: 14/24, acc_iter=94468, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:26:52, time_cost(all): 1 day, 6:08:36/21:18:06, loss=0.398964613718456, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=4.20470112418978, lr=0.03692826459287073
2023-11-27 15:45:52   INFO  epoch: 14/24, acc_iter=94518, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:22:48, time_cost(all): 1 day, 6:09:34/21:25:11, loss=0.398857071358354, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.203502780770699, lr=0.03688817282013014
2023-11-27 15:46:50   INFO  epoch: 14/24, acc_iter=94568, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:24:49, time_cost(all): 1 day, 6:10:32/20:03:17, loss=0.398749528998251, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.358921627325482, lr=0.036848081047389555
2023-11-27 15:47:47   INFO  epoch: 14/24, acc_iter=94618, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:21:52, time_cost(all): 1 day, 6:11:29/20:39:36, loss=0.398641986638148, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.716765572502764, lr=0.03680798927464897
2023-11-27 15:48:45   INFO  epoch: 14/24, acc_iter=94668, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:20:25, time_cost(all): 1 day, 6:12:27/19:30:16, loss=0.398534444278046, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=4.657821468138528, lr=0.036767897501908384
2023-11-27 15:49:43   INFO  epoch: 14/24, acc_iter=94718, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:15:05, time_cost(all): 1 day, 6:13:25/19:33:23, loss=0.398426901917943, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=3.863600719224327, lr=0.036727805729167784
2023-11-27 15:50:41   INFO  epoch: 14/24, acc_iter=94768, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:20:22, time_cost(all): 1 day, 6:14:23/20:58:55, loss=0.39831935955784, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.774724952458483, lr=0.0366877139564272
2023-11-27 15:51:38   INFO  epoch: 14/24, acc_iter=94818, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:16, time_cost(all): 1 day, 6:15:20/20:55:53, loss=0.398211817197737, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=2.3778971637546054, lr=0.03664762218368661
2023-11-27 15:52:36   INFO  epoch: 14/24, acc_iter=94868, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:30, time_cost(all): 1 day, 6:16:18/20:25:28, loss=0.398104274837635, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=2.0224798856290582, lr=0.036607530410946026
2023-11-27 15:53:34   INFO  epoch: 14/24, acc_iter=94918, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:11:59, time_cost(all): 1 day, 6:17:16/20:40:27, loss=0.397996732477532, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=0.6095810561584887, lr=0.03656743863820543
2023-11-27 15:54:32   INFO  epoch: 14/24, acc_iter=94968, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:12:13, time_cost(all): 1 day, 6:18:14/21:13:07, loss=0.397889190117429, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=3.5440862670810835, lr=0.03652734686546484
2023-11-27 15:55:29   INFO  epoch: 14/24, acc_iter=95018, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:14:06, time_cost(all): 1 day, 6:19:11/19:51:21, loss=0.397781647757327, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.362844608755823, lr=0.03648725509272427
2023-11-27 15:56:27   INFO  epoch: 14/24, acc_iter=95068, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:09:55, time_cost(all): 1 day, 6:20:09/20:24:43, loss=0.397674105397224, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=4.670149461117132, lr=0.03644716331998367
2023-11-27 15:57:25   INFO  epoch: 14/24, acc_iter=95118, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:09:43, time_cost(all): 1 day, 6:21:07/21:00:54, loss=0.397566563037121, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=2.734406393348761, lr=0.03640707154724308
2023-11-27 15:58:23   INFO  epoch: 14/24, acc_iter=95168, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:57, time_cost(all): 1 day, 6:22:05/20:03:38, loss=0.397459020677019, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=0.799166073084665, lr=0.0363669797745025
2023-11-27 15:59:20   INFO  epoch: 14/24, acc_iter=95218, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:17, time_cost(all): 1 day, 6:23:02/21:02:12, loss=0.397351478316916, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=2.530180990264162, lr=0.03632688800176191
2023-11-27 16:00:18   INFO  epoch: 14/24, acc_iter=95268, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:11:28, time_cost(all): 1 day, 6:24:00/19:29:36, loss=0.397243935956813, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=3.618430332699365, lr=0.03628679622902131
2023-11-27 16:01:16   INFO  epoch: 14/24, acc_iter=95318, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:10:12, time_cost(all): 1 day, 6:24:58/20:29:58, loss=0.397136393596711, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.777612340316247, lr=0.036246704456280726
2023-11-27 16:02:14   INFO  epoch: 14/24, acc_iter=95368, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:04:24, time_cost(all): 1 day, 6:25:56/20:20:59, loss=0.397028851236608, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=1.3444127815286433, lr=0.03620661268354014
2023-11-27 16:03:11   INFO  epoch: 14/24, acc_iter=95418, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:02:29, time_cost(all): 1 day, 6:26:53/19:28:09, loss=0.396921308876505, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.5475176608000698, lr=0.036166520910799554
2023-11-27 16:04:09   INFO  epoch: 14/24, acc_iter=95468, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:02:46, time_cost(all): 1 day, 6:27:51/19:18:03, loss=0.396813766516403, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=1.0148766784914816, lr=0.03612642913805897
2023-11-27 16:05:07   INFO  epoch: 14/24, acc_iter=95518, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:04:00, time_cost(all): 1 day, 6:28:49/21:08:44, loss=0.3967062241563, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=2.8925248192243433, lr=0.03608633736531838
2023-11-27 16:06:05   INFO  epoch: 14/24, acc_iter=95568, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:03:41, time_cost(all): 1 day, 6:29:47/19:50:40, loss=0.396598681796197, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.0730379328788728, lr=0.0360462455925778
2023-11-27 16:07:02   INFO  epoch: 14/24, acc_iter=95618, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:03:51, time_cost(all): 1 day, 6:30:44/19:45:44, loss=0.396491139436095, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.4703691861942088, lr=0.0360061538198372
2023-11-27 16:08:00   INFO  epoch: 14/24, acc_iter=95668, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:16, time_cost(all): 1 day, 6:31:42/19:37:06, loss=0.396383597075992, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=2.492093429699513, lr=0.03596606204709661
2023-11-27 16:08:58   INFO  epoch: 14/24, acc_iter=95718, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:00, time_cost(all): 1 day, 6:32:40/20:33:21, loss=0.396276054715889, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=4.50415685403691, lr=0.035925970274356026
2023-11-27 16:09:56   INFO  epoch: 14/24, acc_iter=95768, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:57:00, time_cost(all): 1 day, 6:33:38/21:05:22, loss=0.396168512355787, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=3.125897318522374, lr=0.03588587850161544
2023-11-27 16:10:53   INFO  epoch: 14/24, acc_iter=95818, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:59:31, time_cost(all): 1 day, 6:34:35/21:06:09, loss=0.396060969995684, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.4282655065853964, lr=0.03584578672887484
2023-11-27 16:11:51   INFO  epoch: 14/24, acc_iter=95868, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:55:17, time_cost(all): 1 day, 6:35:33/20:29:13, loss=0.395953427635581, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.0601028542470976, lr=0.035805694956134254
2023-11-27 16:12:49   INFO  epoch: 14/24, acc_iter=95918, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:09, time_cost(all): 1 day, 6:36:31/21:06:43, loss=0.395845885275479, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.293823185127993, lr=0.03576560318339368
2023-11-27 16:13:47   INFO  epoch: 14/24, acc_iter=95968, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:52:49, time_cost(all): 1 day, 6:37:29/20:25:51, loss=0.395738342915376, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=4.8572778750775125, lr=0.03572551141065308
2023-11-27 16:14:44   INFO  epoch: 14/24, acc_iter=96018, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:37, time_cost(all): 1 day, 6:38:26/19:18:46, loss=0.395630800555273, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.5000808960593142, lr=0.0356854196379125
2023-11-27 16:15:42   INFO  epoch: 14/24, acc_iter=96068, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:53:41, time_cost(all): 1 day, 6:39:24/19:59:27, loss=0.39552325819517, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=1.9538014234987262, lr=0.03564532786517191
2023-11-27 16:16:40   INFO  epoch: 14/24, acc_iter=96118, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:51:33, time_cost(all): 1 day, 6:40:22/20:14:12, loss=0.395415715835068, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=3.077650176234219, lr=0.035605236092431325
2023-11-27 16:17:38   INFO  epoch: 14/24, acc_iter=96168, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:04, time_cost(all): 1 day, 6:41:20/19:15:31, loss=0.395308173474965, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=4.165369872713731, lr=0.035565144319690725
2023-11-27 16:18:35   INFO  epoch: 14/24, acc_iter=96218, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:48:12, time_cost(all): 1 day, 6:42:17/19:36:43, loss=0.395200631114862, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.9089738048405334, lr=0.03552505254695014
2023-11-27 16:19:33   INFO  epoch: 14/24, acc_iter=96268, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:30, time_cost(all): 1 day, 6:43:15/20:42:39, loss=0.39509308875476, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.2347962688606862, lr=0.035484960774209554
2023-11-27 16:20:31   INFO  epoch: 14/24, acc_iter=96318, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:46:38, time_cost(all): 1 day, 6:44:13/20:13:10, loss=0.394985546394657, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=3.643581585207484, lr=0.03544486900146897
2023-11-27 16:21:29   INFO  epoch: 14/24, acc_iter=96368, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:48:50, time_cost(all): 1 day, 6:45:11/20:47:58, loss=0.394878004034554, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=0.5027145453182991, lr=0.03540477722872838
2023-11-27 16:22:26   INFO  epoch: 14/24, acc_iter=96418, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:47:07, time_cost(all): 1 day, 6:46:08/19:43:22, loss=0.394770461674452, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.2060012009348555, lr=0.035364685455987796
2023-11-27 16:23:24   INFO  epoch: 14/24, acc_iter=96468, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:42:45, time_cost(all): 1 day, 6:47:06/19:48:15, loss=0.394662919314349, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=3.2397646888009675, lr=0.03532459368324721
2023-11-27 16:24:22   INFO  epoch: 14/24, acc_iter=96518, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:48, time_cost(all): 1 day, 6:48:04/19:37:22, loss=0.394555376954246, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=4.658874818012879, lr=0.03528450191050661
2023-11-27 16:25:20   INFO  epoch: 14/24, acc_iter=96568, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:44:05, time_cost(all): 1 day, 6:49:02/20:02:38, loss=0.394447834594144, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.8129989239812325, lr=0.035244410137766025
2023-11-27 16:26:17   INFO  epoch: 14/24, acc_iter=96618, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:44:01, time_cost(all): 1 day, 6:49:59/19:56:41, loss=0.394340292234041, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=4.053311034601203, lr=0.03520431836502544
2023-11-27 16:27:15   INFO  epoch: 14/24, acc_iter=96668, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:43:01, time_cost(all): 1 day, 6:50:57/19:40:00, loss=0.394232749873938, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=4.534985775929425, lr=0.03516422659228485
2023-11-27 16:28:13   INFO  epoch: 14/24, acc_iter=96718, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:39:04, time_cost(all): 1 day, 6:51:55/19:53:40, loss=0.394125207513836, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=4.775992350463321, lr=0.03512413481954427
2023-11-27 16:29:11   INFO  epoch: 14/24, acc_iter=96768, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:39:30, time_cost(all): 1 day, 6:52:53/19:56:51, loss=0.394017665153733, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=2.427866641371062, lr=0.03508404304680367
2023-11-27 16:30:08   INFO  epoch: 14/24, acc_iter=96818, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:28, time_cost(all): 1 day, 6:53:50/19:40:30, loss=0.39391012279363, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=1.6339622405829604, lr=0.035043951274063095
2023-11-27 16:31:06   INFO  epoch: 14/24, acc_iter=96868, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:29, time_cost(all): 1 day, 6:54:48/19:08:01, loss=0.393802580433528, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=4.186008131386952, lr=0.03500385950132251
2023-11-27 16:32:04   INFO  epoch: 14/24, acc_iter=96918, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:51, time_cost(all): 1 day, 6:55:46/20:31:37, loss=0.393695038073425, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=2.009241384148141, lr=0.03496376772858191
2023-11-27 16:33:02   INFO  epoch: 14/24, acc_iter=96968, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:54, time_cost(all): 1 day, 6:56:44/20:43:54, loss=0.393587495713322, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=0.5488648240398035, lr=0.034923675955841324
2023-11-27 16:33:59   INFO  epoch: 14/24, acc_iter=97018, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:59, time_cost(all): 1 day, 6:57:41/20:22:57, loss=0.39347995335322, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.821167608537236, lr=0.03488358418310074
2023-11-27 16:34:57   INFO  epoch: 14/24, acc_iter=97068, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:33:37, time_cost(all): 1 day, 6:58:39/19:11:27, loss=0.393372410993117, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.69322039458212, lr=0.03484349241036015
2023-11-27 16:35:55   INFO  epoch: 14/24, acc_iter=97118, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:17, time_cost(all): 1 day, 6:59:37/19:53:30, loss=0.393264868633014, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=0.7332343972076756, lr=0.03480340063761955
2023-11-27 16:36:53   INFO  epoch: 14/24, acc_iter=97168, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:48, time_cost(all): 1 day, 7:00:35/20:41:24, loss=0.393157326272912, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.635639681442654, lr=0.03476330886487897
2023-11-27 16:37:51   INFO  epoch: 14/24, acc_iter=97218, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:29:14, time_cost(all): 1 day, 7:01:33/20:29:26, loss=0.393049783912809, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=2.663604253761517, lr=0.034723217092138395
2023-11-27 16:38:48   INFO  epoch: 14/24, acc_iter=97268, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:11, time_cost(all): 1 day, 7:02:30/18:50:03, loss=0.392942241552706, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=1.7899623603408708, lr=0.034683125319397795
2023-11-27 16:39:46   INFO  epoch: 14/24, acc_iter=97318, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:32, time_cost(all): 1 day, 7:03:28/19:45:33, loss=0.392834699192603, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=3.823497340900408, lr=0.03464303354665721
2023-11-27 16:40:44   INFO  epoch: 14/24, acc_iter=97368, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:49, time_cost(all): 1 day, 7:04:26/19:18:17, loss=0.392727156832501, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=0.5244789567590858, lr=0.03460294177391662
2023-11-27 16:41:42   INFO  epoch: 14/24, acc_iter=97418, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:22, time_cost(all): 1 day, 7:05:24/18:58:23, loss=0.392619614472398, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.8003094564089863, lr=0.03456285000117604
2023-11-27 16:42:39   INFO  epoch: 14/24, acc_iter=97468, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:24:57, time_cost(all): 1 day, 7:06:21/19:23:01, loss=0.392512072112295, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=3.685562629248283, lr=0.03452275822843544
2023-11-27 16:43:37   INFO  epoch: 14/24, acc_iter=97518, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:28, time_cost(all): 1 day, 7:07:19/18:59:42, loss=0.392404529752193, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=0.8467383112106626, lr=0.03448266645569485
2023-11-27 16:44:35   INFO  epoch: 14/24, acc_iter=97568, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:31, time_cost(all): 1 day, 7:08:17/20:31:31, loss=0.39229698739209, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=2.5262514525628825, lr=0.034442574682954266
2023-11-27 16:45:33   INFO  epoch: 14/24, acc_iter=97618, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:11, time_cost(all): 1 day, 7:09:15/18:36:41, loss=0.392189445031987, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.681579017482243, lr=0.03440248291021368
2023-11-27 16:46:30   INFO  epoch: 14/24, acc_iter=97668, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:05, time_cost(all): 1 day, 7:10:12/20:18:54, loss=0.392081902671885, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=4.5289269433339925, lr=0.034362391137473094
2023-11-27 16:47:28   INFO  epoch: 14/24, acc_iter=97718, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:45, time_cost(all): 1 day, 7:11:10/19:04:41, loss=0.391974360311782, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.561913362463015, lr=0.03432229936473251
2023-11-27 16:48:26   INFO  epoch: 14/24, acc_iter=97768, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:53, time_cost(all): 1 day, 7:12:08/19:48:17, loss=0.391866817951679, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=1.296304324661198, lr=0.03428220759199192
2023-11-27 16:49:24   INFO  epoch: 14/24, acc_iter=97818, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:47, time_cost(all): 1 day, 7:13:06/20:07:01, loss=0.391759275591577, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=4.778681406937502, lr=0.03424211581925132
2023-11-27 16:50:21   INFO  epoch: 14/24, acc_iter=97868, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:14, time_cost(all): 1 day, 7:14:03/18:58:22, loss=0.391651733231474, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=3.0580243173774573, lr=0.03420202404651074
2023-11-27 16:51:19   INFO  epoch: 14/24, acc_iter=97918, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:19, time_cost(all): 1 day, 7:15:01/18:46:06, loss=0.391544190871371, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=1.0354701826178936, lr=0.03416193227377015
2023-11-27 16:52:17   INFO  epoch: 14/24, acc_iter=97968, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:56, time_cost(all): 1 day, 7:15:59/18:50:03, loss=0.391436648511269, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=0.6353150972470996, lr=0.034121840501029566
2023-11-27 16:53:15   INFO  epoch: 14/24, acc_iter=98018, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:20, time_cost(all): 1 day, 7:16:57/19:16:11, loss=0.391329106151166, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=2.375284332724264, lr=0.034081748728288966
2023-11-27 16:54:12   INFO  epoch: 14/24, acc_iter=98068, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:30, time_cost(all): 1 day, 7:17:54/19:12:04, loss=0.391221563791063, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.3474198280024243, lr=0.03404165695554838
2023-11-27 16:55:10   INFO  epoch: 14/24, acc_iter=98118, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:12:54, time_cost(all): 1 day, 7:18:52/19:55:39, loss=0.391114021430961, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=0.8486898201506052, lr=0.03400156518280781
2023-11-27 16:56:08   INFO  epoch: 14/24, acc_iter=98168, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:31, time_cost(all): 1 day, 7:19:50/20:00:41, loss=0.391006479070858, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=1.4008795782796664, lr=0.03396147341006721
2023-11-27 16:57:06   INFO  epoch: 14/24, acc_iter=98218, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:49, time_cost(all): 1 day, 7:20:48/18:58:04, loss=0.390898936710755, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=1.073034451622753, lr=0.03392138163732662
2023-11-27 16:58:03   INFO  epoch: 14/24, acc_iter=98268, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:15, time_cost(all): 1 day, 7:21:45/20:13:37, loss=0.390791394350653, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.215328923414871, lr=0.03388128986458604
2023-11-27 16:59:01   INFO  epoch: 14/24, acc_iter=98318, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:18, time_cost(all): 1 day, 7:22:43/19:30:59, loss=0.39068385199055, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.8177551058751447, lr=0.03384119809184545
2023-11-27 16:59:59   INFO  epoch: 14/24, acc_iter=98368, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:03, time_cost(all): 1 day, 7:23:41/19:01:33, loss=0.390576309630447, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=2.908979944667411, lr=0.03380110631910485
2023-11-27 17:00:57   INFO  epoch: 14/24, acc_iter=98418, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:22, time_cost(all): 1 day, 7:24:39/18:21:06, loss=0.390468767270345, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=4.290534389810167, lr=0.033761014546364265
2023-11-27 17:01:54   INFO  epoch: 14/24, acc_iter=98468, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:24, time_cost(all): 1 day, 7:25:36/19:32:26, loss=0.390361224910242, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.880841447104499, lr=0.03372092277362368
2023-11-27 17:02:52   INFO  epoch: 14/24, acc_iter=98518, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:37, time_cost(all): 1 day, 7:26:34/19:25:14, loss=0.390253682550139, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=1.7995684006423112, lr=0.033680831000883094
2023-11-27 17:03:50   INFO  epoch: 14/24, acc_iter=98568, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:27, time_cost(all): 1 day, 7:27:32/20:10:34, loss=0.390146140190037, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=1.929254022974813, lr=0.03364073922814251
2023-11-27 17:04:48   INFO  epoch: 14/24, acc_iter=98618, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:25, time_cost(all): 1 day, 7:28:30/19:40:03, loss=0.390038597829934, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=4.347351113426889, lr=0.03360064745540192
2023-11-27 17:05:45   INFO  epoch: 14/24, acc_iter=98668, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:35, time_cost(all): 1 day, 7:29:27/18:30:39, loss=0.389931055469831, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.2898709577382577, lr=0.033560555682661336
2023-11-27 17:06:43   INFO  epoch: 14/24, acc_iter=98718, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:40, time_cost(all): 1 day, 7:30:25/20:09:41, loss=0.389823513109729, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=1.5622025540445565, lr=0.03352046390992075
2023-11-27 17:07:41   INFO  epoch: 14/24, acc_iter=98768, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 1 day, 7:31:23/19:53:01, loss=0.389715970749626, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.3319521479541896, lr=0.03348037213718015
2023-11-27 17:08:39   INFO  epoch: 15/24, acc_iter=98855, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:10:42, time_cost(all): 1 day, 7:32:21/18:55:20, loss=0.389528847043047, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=4.642386957724271, lr=0.033410612452611524
2023-11-27 17:09:36   INFO  epoch: 15/24, acc_iter=98905, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:08:19, time_cost(all): 1 day, 7:33:18/20:02:46, loss=0.389421304682945, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=3.2691973094331597, lr=0.03337052067987094
2023-11-27 17:10:34   INFO  epoch: 15/24, acc_iter=98955, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:08:02, time_cost(all): 1 day, 7:34:16/20:05:55, loss=0.389313762322842, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=2.7706127800563625, lr=0.03333042890713035
2023-11-27 17:11:32   INFO  epoch: 15/24, acc_iter=99005, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:03:56, time_cost(all): 1 day, 7:35:14/19:14:27, loss=0.389206219962739, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.6328018680045786, lr=0.033290337134389766
2023-11-27 17:12:30   INFO  epoch: 15/24, acc_iter=99055, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:07:52, time_cost(all): 1 day, 7:36:12/19:56:38, loss=0.389098677602637, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=0.6286099595120047, lr=0.03325024536164918
2023-11-27 17:13:27   INFO  epoch: 15/24, acc_iter=99105, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:13, time_cost(all): 1 day, 7:37:09/18:54:58, loss=0.388991135242534, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.9252661364691925, lr=0.033210153588908595
2023-11-27 17:14:25   INFO  epoch: 15/24, acc_iter=99155, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:36, time_cost(all): 1 day, 7:38:07/18:18:22, loss=0.388883592882431, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=0.7920072606793498, lr=0.03317006181616801
2023-11-27 17:15:23   INFO  epoch: 15/24, acc_iter=99205, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:04:37, time_cost(all): 1 day, 7:39:05/18:14:54, loss=0.388776050522328, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=1.8536238976762387, lr=0.03312997004342741
2023-11-27 17:16:21   INFO  epoch: 15/24, acc_iter=99255, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:55:05, time_cost(all): 1 day, 7:40:03/18:35:25, loss=0.388668508162226, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=1.8087173047276157, lr=0.03308987827068682
2023-11-27 17:17:18   INFO  epoch: 15/24, acc_iter=99305, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:54:26, time_cost(all): 1 day, 7:41:00/19:57:13, loss=0.388560965802123, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=0.7054311167002162, lr=0.03304978649794624
2023-11-27 17:18:16   INFO  epoch: 15/24, acc_iter=99355, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:56:31, time_cost(all): 1 day, 7:41:58/19:18:02, loss=0.38845342344202, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.05215026103734, lr=0.03300969472520565
2023-11-27 17:19:14   INFO  epoch: 15/24, acc_iter=99405, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:53:02, time_cost(all): 1 day, 7:42:56/19:51:58, loss=0.388345881081918, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.6369935747055466, lr=0.032969602952465066
2023-11-27 17:20:12   INFO  epoch: 15/24, acc_iter=99455, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:49:14, time_cost(all): 1 day, 7:43:54/19:48:53, loss=0.388238338721815, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=0.6371699570164941, lr=0.032929511179724466
2023-11-27 17:21:09   INFO  epoch: 15/24, acc_iter=99505, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:52:40, time_cost(all): 1 day, 7:44:51/19:06:08, loss=0.388130796361712, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.8043233353997183, lr=0.032889419406983894
2023-11-27 17:22:07   INFO  epoch: 15/24, acc_iter=99555, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:57:23, time_cost(all): 1 day, 7:45:49/19:37:42, loss=0.38802325400161, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=2.8775666878980424, lr=0.03284932763424331
2023-11-27 17:23:05   INFO  epoch: 15/24, acc_iter=99605, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:55:42, time_cost(all): 1 day, 7:46:47/19:36:26, loss=0.387915711641507, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=0.9490751109370734, lr=0.03280923586150271
2023-11-27 17:24:03   INFO  epoch: 15/24, acc_iter=99655, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:46:19, time_cost(all): 1 day, 7:47:45/19:15:42, loss=0.387808169281404, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=4.534313354425536, lr=0.03276914408876212
2023-11-27 17:25:00   INFO  epoch: 15/24, acc_iter=99705, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:49:29, time_cost(all): 1 day, 7:48:42/18:46:19, loss=0.387700626921302, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=3.772389805299779, lr=0.03272905231602154
2023-11-27 17:25:58   INFO  epoch: 15/24, acc_iter=99755, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:43:31, time_cost(all): 1 day, 7:49:40/19:33:00, loss=0.387593084561199, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=4.099786197479554, lr=0.03268896054328095
2023-11-27 17:26:56   INFO  epoch: 15/24, acc_iter=99805, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:45:28, time_cost(all): 1 day, 7:50:38/18:40:30, loss=0.387485542201096, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=2.9487126404785133, lr=0.03264886877054035
2023-11-27 17:27:54   INFO  epoch: 15/24, acc_iter=99855, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:49:55, time_cost(all): 1 day, 7:51:36/19:41:07, loss=0.387377999840994, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=0.7428691768436777, lr=0.032608776997799765
2023-11-27 17:28:51   INFO  epoch: 15/24, acc_iter=99905, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:49:10, time_cost(all): 1 day, 7:52:33/18:45:09, loss=0.387270457480891, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.196288701664538, lr=0.03256868522505918
2023-11-27 17:29:49   INFO  epoch: 15/24, acc_iter=99955, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:48:55, time_cost(all): 1 day, 7:53:31/19:02:28, loss=0.387162915120788, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=2.303334745287158, lr=0.032528593452318594
2023-11-27 17:30:47   INFO  epoch: 15/24, acc_iter=100005, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:48:36, time_cost(all): 1 day, 7:54:29/18:35:01, loss=0.387055372760686, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=4.590531462636304, lr=0.03248850167957801
2023-11-27 17:31:45   INFO  epoch: 15/24, acc_iter=100055, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:47:27, time_cost(all): 1 day, 7:55:27/19:18:42, loss=0.386947830400583, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.9268777501543386, lr=0.03244840990683742
2023-11-27 17:32:42   INFO  epoch: 15/24, acc_iter=100105, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:43:49, time_cost(all): 1 day, 7:56:24/18:54:10, loss=0.38684028804048, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.0664320106170977, lr=0.032408318134096836
2023-11-27 17:33:40   INFO  epoch: 15/24, acc_iter=100155, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:41:11, time_cost(all): 1 day, 7:57:22/18:16:52, loss=0.386732745680378, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=4.456677117573895, lr=0.03236822636135624
2023-11-27 17:34:38   INFO  epoch: 15/24, acc_iter=100205, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:38:59, time_cost(all): 1 day, 7:58:20/18:20:32, loss=0.386625203320275, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=1.2141486267230537, lr=0.03232813458861565
2023-11-27 17:35:36   INFO  epoch: 15/24, acc_iter=100255, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:40:34, time_cost(all): 1 day, 7:59:18/18:55:27, loss=0.386517660960172, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=2.9076528388692293, lr=0.032288042815875065
2023-11-27 17:36:33   INFO  epoch: 15/24, acc_iter=100305, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:38:11, time_cost(all): 1 day, 8:00:15/18:04:48, loss=0.386410118600069, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.2(1.03), norm=1.715768700893823, lr=0.03224795104313448
2023-11-27 17:37:31   INFO  epoch: 15/24, acc_iter=100355, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:34:08, time_cost(all): 1 day, 8:01:13/18:45:55, loss=0.386302576239967, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=2.451006187676635, lr=0.03220785927039388
2023-11-27 17:38:29   INFO  epoch: 15/24, acc_iter=100405, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:39:54, time_cost(all): 1 day, 8:02:11/18:18:57, loss=0.386195033879864, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=3.1768086438040704, lr=0.03216776749765331
2023-11-27 17:39:27   INFO  epoch: 15/24, acc_iter=100455, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:37:18, time_cost(all): 1 day, 8:03:09/19:10:05, loss=0.386087491519761, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.6341143839730132, lr=0.03212767572491272
2023-11-27 17:40:24   INFO  epoch: 15/24, acc_iter=100505, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:29:38, time_cost(all): 1 day, 8:04:06/18:42:13, loss=0.385979949159659, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.97408401718301, lr=0.03208758395217212
2023-11-27 17:41:22   INFO  epoch: 15/24, acc_iter=100555, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:32:20, time_cost(all): 1 day, 8:05:04/19:26:03, loss=0.385872406799556, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.6682531828246567, lr=0.032047492179431536
2023-11-27 17:42:20   INFO  epoch: 15/24, acc_iter=100605, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:35:22, time_cost(all): 1 day, 8:06:02/19:16:23, loss=0.385764864439453, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=1.219957521149637, lr=0.03200740040669095
2023-11-27 17:43:18   INFO  epoch: 15/24, acc_iter=100655, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:30:42, time_cost(all): 1 day, 8:07:00/18:07:15, loss=0.385657322079351, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.6319623797180807, lr=0.031967308633950364
2023-11-27 17:44:15   INFO  epoch: 15/24, acc_iter=100705, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:27:04, time_cost(all): 1 day, 8:07:57/18:46:04, loss=0.385549779719248, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=0.7114895329172426, lr=0.031927216861209765
2023-11-27 17:45:13   INFO  epoch: 15/24, acc_iter=100755, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:25:52, time_cost(all): 1 day, 8:08:55/18:38:20, loss=0.385442237359145, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=4.10429303845856, lr=0.03188712508846918
2023-11-27 17:46:11   INFO  epoch: 15/24, acc_iter=100805, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:30:14, time_cost(all): 1 day, 8:09:53/17:46:38, loss=0.385334694999043, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.317822941096874, lr=0.03184703331572859
2023-11-27 17:47:09   INFO  epoch: 15/24, acc_iter=100855, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:28:21, time_cost(all): 1 day, 8:10:51/18:08:11, loss=0.38522715263894, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=4.316527748986994, lr=0.03180694154298801
2023-11-27 17:48:06   INFO  epoch: 15/24, acc_iter=100905, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:22:43, time_cost(all): 1 day, 8:11:48/18:15:03, loss=0.385119610278837, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.8031953772936007, lr=0.03176684977024742
2023-11-27 17:49:04   INFO  epoch: 15/24, acc_iter=100955, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:25:27, time_cost(all): 1 day, 8:12:46/19:16:31, loss=0.385012067918735, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=1.7870301500919548, lr=0.031726757997506835
2023-11-27 17:50:02   INFO  epoch: 15/24, acc_iter=101005, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:20:29, time_cost(all): 1 day, 8:13:44/17:48:07, loss=0.384904525558632, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.7291140116292585, lr=0.03168666622476625
2023-11-27 17:51:00   INFO  epoch: 15/24, acc_iter=101055, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:21:33, time_cost(all): 1 day, 8:14:42/17:59:40, loss=0.384796983198529, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=2.2579901260057618, lr=0.03164657445202565
2023-11-27 17:51:57   INFO  epoch: 15/24, acc_iter=101105, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:25:28, time_cost(all): 1 day, 8:15:39/17:50:51, loss=0.384689440838427, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=0.6064817713341205, lr=0.031606482679285064
2023-11-27 17:52:55   INFO  epoch: 15/24, acc_iter=101155, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:22:18, time_cost(all): 1 day, 8:16:37/18:00:33, loss=0.384581898478324, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.019736196333021, lr=0.03156639090654448
2023-11-27 17:53:53   INFO  epoch: 15/24, acc_iter=101205, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:18:39, time_cost(all): 1 day, 8:17:35/18:51:47, loss=0.384474356118221, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=2.194228234120198, lr=0.03152629913380389
2023-11-27 17:54:51   INFO  epoch: 15/24, acc_iter=101255, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:22:16, time_cost(all): 1 day, 8:18:33/17:55:41, loss=0.384366813758119, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=1.7960406367183084, lr=0.031486207361063306
2023-11-27 17:55:48   INFO  epoch: 15/24, acc_iter=101305, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:14:55, time_cost(all): 1 day, 8:19:30/17:49:36, loss=0.384259271398016, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=3.4721780491625, lr=0.03144611558832272
2023-11-27 17:56:46   INFO  epoch: 15/24, acc_iter=101355, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:20:24, time_cost(all): 1 day, 8:20:28/18:04:13, loss=0.384151729037913, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.603884971026358, lr=0.031406023815582135
2023-11-27 17:57:44   INFO  epoch: 15/24, acc_iter=101405, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:33, time_cost(all): 1 day, 8:21:26/17:38:04, loss=0.384044186677811, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=0.7511666120378264, lr=0.03136593204284155
2023-11-27 17:58:42   INFO  epoch: 15/24, acc_iter=101455, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:12:14, time_cost(all): 1 day, 8:22:24/17:53:43, loss=0.383936644317708, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=2.414241341767994, lr=0.03132584027010095
2023-11-27 17:59:39   INFO  epoch: 15/24, acc_iter=101505, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:18:03, time_cost(all): 1 day, 8:23:21/18:13:37, loss=0.383829101957605, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=2.252341322732395, lr=0.03128574849736036
2023-11-27 18:00:37   INFO  epoch: 15/24, acc_iter=101555, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:11:57, time_cost(all): 1 day, 8:24:19/18:33:15, loss=0.383721559597503, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.0825673467089167, lr=0.031245656724619778
2023-11-27 18:01:35   INFO  epoch: 15/24, acc_iter=101605, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:11:58, time_cost(all): 1 day, 8:25:17/17:47:09, loss=0.3836140172374, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=0.9660159881131827, lr=0.03120556495187919
2023-11-27 18:02:33   INFO  epoch: 15/24, acc_iter=101655, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:12:41, time_cost(all): 1 day, 8:26:15/18:10:48, loss=0.383506474877297, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=0.542543551301889, lr=0.031165473179138592
2023-11-27 18:03:30   INFO  epoch: 15/24, acc_iter=101705, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:20, time_cost(all): 1 day, 8:27:12/17:46:56, loss=0.383398932517194, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=1.1556353211869337, lr=0.031125381406398006
2023-11-27 18:04:28   INFO  epoch: 15/24, acc_iter=101755, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:07:45, time_cost(all): 1 day, 8:28:10/18:28:40, loss=0.383291390157092, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.4595734063043744, lr=0.031085289633657434
2023-11-27 18:05:26   INFO  epoch: 15/24, acc_iter=101805, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:56, time_cost(all): 1 day, 8:29:08/17:33:00, loss=0.383183847796989, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.982381698042602, lr=0.031045197860916834
2023-11-27 18:06:24   INFO  epoch: 15/24, acc_iter=101855, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:09:07, time_cost(all): 1 day, 8:30:06/17:35:59, loss=0.383076305436886, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=2.9596879403229486, lr=0.03100510608817625
2023-11-27 18:07:21   INFO  epoch: 15/24, acc_iter=101905, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:06:07, time_cost(all): 1 day, 8:31:03/17:47:52, loss=0.382968763076784, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.8885307359480468, lr=0.030965014315435663
2023-11-27 18:08:19   INFO  epoch: 15/24, acc_iter=101955, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:03:41, time_cost(all): 1 day, 8:32:01/17:36:59, loss=0.382861220716681, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=2.1906844379142143, lr=0.030924922542695077
2023-11-27 18:09:17   INFO  epoch: 15/24, acc_iter=102005, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:06:01, time_cost(all): 1 day, 8:32:59/17:46:27, loss=0.382753678356578, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.6072609256010373, lr=0.030884830769954477
2023-11-27 18:10:15   INFO  epoch: 15/24, acc_iter=102055, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:03:50, time_cost(all): 1 day, 8:33:57/17:21:58, loss=0.382646135996476, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.5686205841659886, lr=0.03084473899721389
2023-11-27 18:11:12   INFO  epoch: 15/24, acc_iter=102105, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:06:09, time_cost(all): 1 day, 8:34:54/18:30:51, loss=0.382538593636373, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=3.1007682595488677, lr=0.030804647224473306
2023-11-27 18:12:10   INFO  epoch: 15/24, acc_iter=102155, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:02:52, time_cost(all): 1 day, 8:35:52/18:26:13, loss=0.38243105127627, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=4.328265929567255, lr=0.03076455545173272
2023-11-27 18:13:08   INFO  epoch: 15/24, acc_iter=102205, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:58:42, time_cost(all): 1 day, 8:36:50/17:50:04, loss=0.382323508916168, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.3247400974195358, lr=0.030724463678992134
2023-11-27 18:14:06   INFO  epoch: 15/24, acc_iter=102255, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:58:38, time_cost(all): 1 day, 8:37:48/18:14:07, loss=0.382215966556065, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=4.16151090367509, lr=0.030684371906251548
2023-11-27 18:15:03   INFO  epoch: 15/24, acc_iter=102305, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:01:56, time_cost(all): 1 day, 8:38:45/17:16:58, loss=0.382108424195962, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.408801824588851, lr=0.030644280133510962
2023-11-27 18:16:01   INFO  epoch: 15/24, acc_iter=102355, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:56:20, time_cost(all): 1 day, 8:39:43/17:32:01, loss=0.38200088183586, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=2.010313216153552, lr=0.030604188360770362
2023-11-27 18:16:59   INFO  epoch: 15/24, acc_iter=102405, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:14, time_cost(all): 1 day, 8:40:41/17:31:52, loss=0.381893339475757, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=4.190449999379624, lr=0.030564096588029777
2023-11-27 18:17:57   INFO  epoch: 15/24, acc_iter=102455, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:56:33, time_cost(all): 1 day, 8:41:39/18:39:25, loss=0.381785797115654, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=1.5242866051307236, lr=0.03052400481528919
2023-11-27 18:18:54   INFO  epoch: 15/24, acc_iter=102505, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:03, time_cost(all): 1 day, 8:42:36/18:38:46, loss=0.381678254755552, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.337077155968448, lr=0.030483913042548605
2023-11-27 18:19:52   INFO  epoch: 15/24, acc_iter=102555, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:07, time_cost(all): 1 day, 8:43:34/17:32:15, loss=0.381570712395449, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.1788249249104883, lr=0.030443821269808005
2023-11-27 18:20:50   INFO  epoch: 15/24, acc_iter=102605, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:05, time_cost(all): 1 day, 8:44:32/18:25:56, loss=0.381463170035346, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=2.504577429741567, lr=0.03040372949706742
2023-11-27 18:21:48   INFO  epoch: 15/24, acc_iter=102655, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:52:46, time_cost(all): 1 day, 8:45:30/18:22:44, loss=0.381355627675244, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.792524831099784, lr=0.030363637724326847
2023-11-27 18:22:46   INFO  epoch: 15/24, acc_iter=102705, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:54:12, time_cost(all): 1 day, 8:46:28/18:45:58, loss=0.381248085315141, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.1042910246798514, lr=0.030323545951586248
2023-11-27 18:23:43   INFO  epoch: 15/24, acc_iter=102755, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:25, time_cost(all): 1 day, 8:47:25/17:18:33, loss=0.381140542955038, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=3.0580708639525556, lr=0.030283454178845662
2023-11-27 18:24:41   INFO  epoch: 15/24, acc_iter=102805, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:50:28, time_cost(all): 1 day, 8:48:23/17:32:34, loss=0.381033000594936, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.331554822751764, lr=0.030243362406105076
2023-11-27 18:25:39   INFO  epoch: 15/24, acc_iter=102855, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:48:01, time_cost(all): 1 day, 8:49:21/17:26:32, loss=0.380925458234833, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.0820554259787953, lr=0.03020327063336449
2023-11-27 18:26:37   INFO  epoch: 15/24, acc_iter=102905, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:40, time_cost(all): 1 day, 8:50:19/17:34:20, loss=0.38081791587473, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.374674607527189, lr=0.03016317886062389
2023-11-27 18:27:34   INFO  epoch: 15/24, acc_iter=102955, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:01, time_cost(all): 1 day, 8:51:16/18:14:04, loss=0.380710373514627, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=0.5551542270205981, lr=0.030123087087883305
2023-11-27 18:28:32   INFO  epoch: 15/24, acc_iter=103005, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:45:08, time_cost(all): 1 day, 8:52:14/16:58:00, loss=0.380602831154525, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=4.4355220599604595, lr=0.03008299531514272
2023-11-27 18:29:30   INFO  epoch: 15/24, acc_iter=103055, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:43:08, time_cost(all): 1 day, 8:53:12/18:13:16, loss=0.380495288794422, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=2.908521303233885, lr=0.030042903542402133
2023-11-27 18:30:28   INFO  epoch: 15/24, acc_iter=103105, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:07, time_cost(all): 1 day, 8:54:10/18:12:14, loss=0.380387746434319, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=3.661149146154142, lr=0.030002811769661547
2023-11-27 18:31:25   INFO  epoch: 15/24, acc_iter=103155, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:19, time_cost(all): 1 day, 8:55:07/18:39:49, loss=0.380280204074217, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=2.0594180132245183, lr=0.02996271999692096
2023-11-27 18:32:23   INFO  epoch: 15/24, acc_iter=103205, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:12, time_cost(all): 1 day, 8:56:05/18:12:31, loss=0.380172661714114, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=2.2845648507378202, lr=0.029922628224180375
2023-11-27 18:33:21   INFO  epoch: 15/24, acc_iter=103255, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:43:07, time_cost(all): 1 day, 8:57:03/18:25:59, loss=0.380065119354011, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=3.5856968567648115, lr=0.029882536451439776
2023-11-27 18:34:19   INFO  epoch: 15/24, acc_iter=103305, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:17, time_cost(all): 1 day, 8:58:01/18:31:58, loss=0.379957576993909, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=3.1158938600209307, lr=0.02984244467869919
2023-11-27 18:35:16   INFO  epoch: 15/24, acc_iter=103355, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:04, time_cost(all): 1 day, 8:58:58/17:19:28, loss=0.379850034633806, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=0.6004346762882121, lr=0.029802352905958604
2023-11-27 18:36:14   INFO  epoch: 15/24, acc_iter=103405, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:00, time_cost(all): 1 day, 8:59:56/18:05:30, loss=0.379742492273703, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.0339811762800455, lr=0.029762261133218018
2023-11-27 18:37:12   INFO  epoch: 15/24, acc_iter=103455, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:39:05, time_cost(all): 1 day, 9:00:54/18:35:01, loss=0.379634949913601, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.619306489351439, lr=0.029722169360477432
2023-11-27 18:38:10   INFO  epoch: 15/24, acc_iter=103505, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:36:12, time_cost(all): 1 day, 9:01:52/17:21:19, loss=0.379527407553498, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=4.686394190392182, lr=0.029682077587736846
2023-11-27 18:39:07   INFO  epoch: 15/24, acc_iter=103555, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:28, time_cost(all): 1 day, 9:02:49/17:13:06, loss=0.379419865193395, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.7381690147288777, lr=0.02964198581499626
2023-11-27 18:40:05   INFO  epoch: 15/24, acc_iter=103605, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:19, time_cost(all): 1 day, 9:03:47/17:22:54, loss=0.379312322833293, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=2.738858759339182, lr=0.029601894042255675
2023-11-27 18:41:03   INFO  epoch: 15/24, acc_iter=103655, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:31:49, time_cost(all): 1 day, 9:04:45/17:40:58, loss=0.37920478047319, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=4.160993542364074, lr=0.029561802269515075
2023-11-27 18:42:01   INFO  epoch: 15/24, acc_iter=103705, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:42, time_cost(all): 1 day, 9:05:43/18:22:39, loss=0.379097238113087, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.2029976204204136, lr=0.02952171049677449
2023-11-27 18:42:58   INFO  epoch: 15/24, acc_iter=103755, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:45, time_cost(all): 1 day, 9:06:40/17:49:56, loss=0.378989695752985, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=1.9441494156379717, lr=0.029481618724033903
2023-11-27 18:43:56   INFO  epoch: 15/24, acc_iter=103805, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:25, time_cost(all): 1 day, 9:07:38/18:11:23, loss=0.378882153392882, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=0.9883852660149092, lr=0.029441526951293318
2023-11-27 18:44:54   INFO  epoch: 15/24, acc_iter=103855, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:28:08, time_cost(all): 1 day, 9:08:36/16:45:20, loss=0.378774611032779, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=3.263652767362175, lr=0.029401435178552718
2023-11-27 18:45:52   INFO  epoch: 15/24, acc_iter=103905, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:02, time_cost(all): 1 day, 9:09:34/16:55:02, loss=0.378667068672677, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=0.6035814416320093, lr=0.029361343405812132
2023-11-27 18:46:49   INFO  epoch: 15/24, acc_iter=103955, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:00, time_cost(all): 1 day, 9:10:31/17:34:08, loss=0.378559526312574, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=0.8774107872219137, lr=0.02932125163307156
2023-11-27 18:47:47   INFO  epoch: 15/24, acc_iter=104005, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:22, time_cost(all): 1 day, 9:11:29/17:57:00, loss=0.378451983952471, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=1.9852147170822585, lr=0.02928115986033096
2023-11-27 18:48:45   INFO  epoch: 15/24, acc_iter=104055, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:44, time_cost(all): 1 day, 9:12:27/18:00:15, loss=0.378344441592369, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=1.05659495935752, lr=0.029241068087590374
2023-11-27 18:49:43   INFO  epoch: 15/24, acc_iter=104105, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:28, time_cost(all): 1 day, 9:13:25/17:10:19, loss=0.378236899232266, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=2.405034169417152, lr=0.02920097631484979
2023-11-27 18:50:40   INFO  epoch: 15/24, acc_iter=104155, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:42, time_cost(all): 1 day, 9:14:22/16:53:18, loss=0.378129356872163, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=1.485993710424741, lr=0.029160884542109203
2023-11-27 18:51:38   INFO  epoch: 15/24, acc_iter=104205, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:53, time_cost(all): 1 day, 9:15:20/16:43:48, loss=0.378021814512061, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=0.7376516622367089, lr=0.029120792769368603
2023-11-27 18:52:36   INFO  epoch: 15/24, acc_iter=104255, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:43, time_cost(all): 1 day, 9:16:18/17:11:22, loss=0.377914272151958, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=4.4053382807993575, lr=0.029080700996628017
2023-11-27 18:53:34   INFO  epoch: 15/24, acc_iter=104305, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:47, time_cost(all): 1 day, 9:17:16/16:39:48, loss=0.377806729791855, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=3.700253524042619, lr=0.02904060922388743
2023-11-27 18:54:31   INFO  epoch: 15/24, acc_iter=104355, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:39, time_cost(all): 1 day, 9:18:13/17:39:18, loss=0.377699187431753, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=1.7304263576406687, lr=0.029000517451146846
2023-11-27 18:55:29   INFO  epoch: 15/24, acc_iter=104405, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:57, time_cost(all): 1 day, 9:19:11/17:26:26, loss=0.37759164507165, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=4.3568834959227996, lr=0.02896042567840626
2023-11-27 18:56:27   INFO  epoch: 15/24, acc_iter=104455, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:28, time_cost(all): 1 day, 9:20:09/16:56:32, loss=0.377484102711547, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.1887712798226087, lr=0.028920333905665674
2023-11-27 18:57:25   INFO  epoch: 15/24, acc_iter=104505, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:11, time_cost(all): 1 day, 9:21:07/17:56:56, loss=0.377376560351445, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=4.513996774378004, lr=0.028880242132925088
2023-11-27 18:58:22   INFO  epoch: 15/24, acc_iter=104555, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:55, time_cost(all): 1 day, 9:22:04/17:21:20, loss=0.377269017991342, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.919275768421267, lr=0.02884015036018449
2023-11-27 18:59:20   INFO  epoch: 15/24, acc_iter=104605, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:34, time_cost(all): 1 day, 9:23:02/17:12:30, loss=0.377161475631239, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=1.5994843761765627, lr=0.028800058587443902
2023-11-27 19:00:18   INFO  epoch: 15/24, acc_iter=104655, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:49, time_cost(all): 1 day, 9:24:00/16:57:27, loss=0.377053933271136, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.9172696360602868, lr=0.028759966814703317
2023-11-27 19:01:16   INFO  epoch: 15/24, acc_iter=104705, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:48, time_cost(all): 1 day, 9:24:58/18:06:51, loss=0.376946390911034, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=0.729907720188273, lr=0.02871987504196273
2023-11-27 19:02:13   INFO  epoch: 15/24, acc_iter=104755, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:11, time_cost(all): 1 day, 9:25:55/17:18:47, loss=0.376838848550931, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=3.5075855192580656, lr=0.02867978326922213
2023-11-27 19:03:11   INFO  epoch: 15/24, acc_iter=104805, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:18, time_cost(all): 1 day, 9:26:53/17:17:51, loss=0.376731306190828, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.824226193848296, lr=0.028639691496481545
2023-11-27 19:04:09   INFO  epoch: 15/24, acc_iter=104855, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:47, time_cost(all): 1 day, 9:27:51/16:48:19, loss=0.376623763830726, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=2.586266521086613, lr=0.028599599723740973
2023-11-27 19:05:07   INFO  epoch: 15/24, acc_iter=104905, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:01, time_cost(all): 1 day, 9:28:49/16:54:11, loss=0.376516221470623, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=3.0114544540043076, lr=0.028559507951000374
2023-11-27 19:06:04   INFO  epoch: 15/24, acc_iter=104955, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:06, time_cost(all): 1 day, 9:29:46/16:34:29, loss=0.37640867911052, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.977827675956754, lr=0.028519416178259788
2023-11-27 19:07:02   INFO  epoch: 15/24, acc_iter=105005, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:37, time_cost(all): 1 day, 9:30:44/17:52:39, loss=0.376301136750418, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=3.9608568333373744, lr=0.028479324405519202
2023-11-27 19:08:00   INFO  epoch: 15/24, acc_iter=105055, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:41, time_cost(all): 1 day, 9:31:42/16:23:20, loss=0.376193594390315, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=3.0843328389766786, lr=0.028439232632778616
2023-11-27 19:08:58   INFO  epoch: 15/24, acc_iter=105105, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:17, time_cost(all): 1 day, 9:32:40/16:19:46, loss=0.376086052030212, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.714229238422878, lr=0.028399140860038016
2023-11-27 19:09:55   INFO  epoch: 15/24, acc_iter=105155, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:29, time_cost(all): 1 day, 9:33:37/16:58:53, loss=0.37597850967011, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=1.2174210197875226, lr=0.02835904908729743
2023-11-27 19:10:53   INFO  epoch: 15/24, acc_iter=105205, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:38, time_cost(all): 1 day, 9:34:35/17:32:20, loss=0.375870967310007, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.900742451674873, lr=0.028318957314556845
2023-11-27 19:11:51   INFO  epoch: 15/24, acc_iter=105255, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:33, time_cost(all): 1 day, 9:35:33/17:50:23, loss=0.375763424949904, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.6591099367253213, lr=0.02827886554181626
2023-11-27 19:12:49   INFO  epoch: 15/24, acc_iter=105305, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:42, time_cost(all): 1 day, 9:36:31/17:47:03, loss=0.375655882589802, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.227862924091403, lr=0.028238773769075673
2023-11-27 19:13:46   INFO  epoch: 15/24, acc_iter=105355, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 1 day, 9:37:28/17:53:13, loss=0.375548340229699, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.938016012266958, lr=0.028198681996335087
2023-11-27 19:14:44   INFO  epoch: 16/24, acc_iter=105442, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:08:59, time_cost(all): 1 day, 9:38:26/17:42:22, loss=0.37536121652312, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=3.4187508407459957, lr=0.02812892231176646
2023-11-27 19:15:42   INFO  epoch: 16/24, acc_iter=105492, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/1:58:42, time_cost(all): 1 day, 9:39:24/16:17:55, loss=0.375253674163018, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=0.9436003741790562, lr=0.028088830539025875
2023-11-27 19:16:40   INFO  epoch: 16/24, acc_iter=105542, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:09:04, time_cost(all): 1 day, 9:40:22/17:47:47, loss=0.375146131802915, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=3.3534282788140883, lr=0.02804873876628529
2023-11-27 19:17:37   INFO  epoch: 16/24, acc_iter=105592, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:03:45, time_cost(all): 1 day, 9:41:19/17:15:54, loss=0.375038589442812, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.5685180228881739, lr=0.02800864699354469
2023-11-27 19:18:35   INFO  epoch: 16/24, acc_iter=105642, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:57:01, time_cost(all): 1 day, 9:42:17/16:35:15, loss=0.37493104708271, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.814985502067421, lr=0.027968555220804103
2023-11-27 19:19:33   INFO  epoch: 16/24, acc_iter=105692, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:57:01, time_cost(all): 1 day, 9:43:15/16:58:39, loss=0.374823504722607, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=3.8046135127720606, lr=0.027928463448063517
2023-11-27 19:20:31   INFO  epoch: 16/24, acc_iter=105742, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:55:42, time_cost(all): 1 day, 9:44:13/16:46:33, loss=0.374715962362504, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=0.6995345492519893, lr=0.02788837167532293
2023-11-27 19:21:28   INFO  epoch: 16/24, acc_iter=105792, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:04:32, time_cost(all): 1 day, 9:45:10/17:15:23, loss=0.374608420002402, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.1531638946433134, lr=0.027848279902582346
2023-11-27 19:22:26   INFO  epoch: 16/24, acc_iter=105842, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:52:18, time_cost(all): 1 day, 9:46:08/16:57:59, loss=0.374500877642299, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.1390193574711915, lr=0.02780818812984176
2023-11-27 19:23:24   INFO  epoch: 16/24, acc_iter=105892, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:02:39, time_cost(all): 1 day, 9:47:06/16:34:55, loss=0.374393335282196, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.874871753952391, lr=0.027768096357101174
2023-11-27 19:24:22   INFO  epoch: 16/24, acc_iter=105942, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:53:23, time_cost(all): 1 day, 9:48:04/16:29:29, loss=0.374285792922094, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=0.6117206678152286, lr=0.027728004584360574
2023-11-27 19:25:19   INFO  epoch: 16/24, acc_iter=105992, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:52:22, time_cost(all): 1 day, 9:49:01/17:44:56, loss=0.374178250561991, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=1.02373653163652, lr=0.02768791281161999
2023-11-27 19:26:17   INFO  epoch: 16/24, acc_iter=106042, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:57:22, time_cost(all): 1 day, 9:49:59/17:38:52, loss=0.374070708201888, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=3.852484749966895, lr=0.027647821038879403
2023-11-27 19:27:15   INFO  epoch: 16/24, acc_iter=106092, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:54:34, time_cost(all): 1 day, 9:50:57/17:13:53, loss=0.373963165841785, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=3.2700990084828097, lr=0.027607729266138817
2023-11-27 19:28:13   INFO  epoch: 16/24, acc_iter=106142, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:55:54, time_cost(all): 1 day, 9:51:55/17:00:04, loss=0.373855623481683, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=2.1447230049355595, lr=0.02756763749339823
2023-11-27 19:29:10   INFO  epoch: 16/24, acc_iter=106192, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:46:53, time_cost(all): 1 day, 9:52:52/17:31:39, loss=0.37374808112158, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=4.715039871271232, lr=0.02752754572065763
2023-11-27 19:30:08   INFO  epoch: 16/24, acc_iter=106242, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:50:12, time_cost(all): 1 day, 9:53:50/17:05:57, loss=0.373640538761477, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=3.8538699385258477, lr=0.02748745394791706
2023-11-27 19:31:06   INFO  epoch: 16/24, acc_iter=106292, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:52:47, time_cost(all): 1 day, 9:54:48/17:19:33, loss=0.373532996401375, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=1.6718337546536046, lr=0.027447362175176473
2023-11-27 19:32:04   INFO  epoch: 16/24, acc_iter=106342, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:53:48, time_cost(all): 1 day, 9:55:46/16:27:35, loss=0.373425454041272, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.892826169978844, lr=0.027407270402435874
2023-11-27 19:33:01   INFO  epoch: 16/24, acc_iter=106392, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:47:57, time_cost(all): 1 day, 9:56:43/16:53:14, loss=0.373317911681169, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=4.42688605546519, lr=0.027367178629695288
2023-11-27 19:33:59   INFO  epoch: 16/24, acc_iter=106442, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:14, time_cost(all): 1 day, 9:57:41/17:22:42, loss=0.373210369321067, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.194994375120121, lr=0.027327086856954702
2023-11-27 19:34:57   INFO  epoch: 16/24, acc_iter=106492, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:50:30, time_cost(all): 1 day, 9:58:39/16:27:25, loss=0.373102826960964, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=1.403049385885012, lr=0.027286995084214116
2023-11-27 19:35:55   INFO  epoch: 16/24, acc_iter=106542, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:49:37, time_cost(all): 1 day, 9:59:37/15:57:51, loss=0.372995284600861, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=0.9614180993674015, lr=0.027246903311473517
2023-11-27 19:36:52   INFO  epoch: 16/24, acc_iter=106592, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:40:23, time_cost(all): 1 day, 10:00:34/16:16:09, loss=0.372887742240759, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=4.73148837788422, lr=0.02720681153873293
2023-11-27 19:37:50   INFO  epoch: 16/24, acc_iter=106642, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:44:34, time_cost(all): 1 day, 10:01:32/17:19:12, loss=0.372780199880656, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=3.046386264401264, lr=0.027166719765992345
2023-11-27 19:38:48   INFO  epoch: 16/24, acc_iter=106692, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:46:51, time_cost(all): 1 day, 10:02:30/16:05:57, loss=0.372672657520553, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=0.8538596780913004, lr=0.02712662799325176
2023-11-27 19:39:46   INFO  epoch: 16/24, acc_iter=106742, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:44:29, time_cost(all): 1 day, 10:03:28/16:05:38, loss=0.372565115160451, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.250435315576686, lr=0.027086536220511173
2023-11-27 19:40:43   INFO  epoch: 16/24, acc_iter=106792, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:43:13, time_cost(all): 1 day, 10:04:25/16:52:05, loss=0.372457572800348, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=4.459784027986877, lr=0.027046444447770587
2023-11-27 19:41:41   INFO  epoch: 16/24, acc_iter=106842, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:34:26, time_cost(all): 1 day, 10:05:23/17:22:43, loss=0.372350030440245, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=3.552902824870929, lr=0.02700635267503
2023-11-27 19:42:39   INFO  epoch: 16/24, acc_iter=106892, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:39:18, time_cost(all): 1 day, 10:06:21/15:58:14, loss=0.372242488080143, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.0991366491180583, lr=0.026966260902289402
2023-11-27 19:43:37   INFO  epoch: 16/24, acc_iter=106942, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:40:24, time_cost(all): 1 day, 10:07:19/16:04:55, loss=0.37213494572004, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.0242172289205618, lr=0.026926169129548816
2023-11-27 19:44:34   INFO  epoch: 16/24, acc_iter=106992, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:33:57, time_cost(all): 1 day, 10:08:16/16:28:12, loss=0.372027403359937, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=1.5649974010235894, lr=0.02688607735680823
2023-11-27 19:45:32   INFO  epoch: 16/24, acc_iter=107042, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:39:29, time_cost(all): 1 day, 10:09:14/17:11:11, loss=0.371919860999835, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=4.360531854904141, lr=0.026845985584067644
2023-11-27 19:46:30   INFO  epoch: 16/24, acc_iter=107092, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:29:27, time_cost(all): 1 day, 10:10:12/16:38:31, loss=0.371812318639732, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.075338563440221, lr=0.026805893811327045
2023-11-27 19:47:28   INFO  epoch: 16/24, acc_iter=107142, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:31:47, time_cost(all): 1 day, 10:11:10/16:28:21, loss=0.371704776279629, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=4.5583168901459725, lr=0.026765802038586473
2023-11-27 19:48:25   INFO  epoch: 16/24, acc_iter=107192, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:35:11, time_cost(all): 1 day, 10:12:07/16:21:31, loss=0.371597233919526, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=3.595243738478539, lr=0.026725710265845887
2023-11-27 19:49:23   INFO  epoch: 16/24, acc_iter=107242, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:34:05, time_cost(all): 1 day, 10:13:05/17:10:24, loss=0.371489691559424, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=2.230255976295524, lr=0.026685618493105287
2023-11-27 19:50:21   INFO  epoch: 16/24, acc_iter=107292, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:32:26, time_cost(all): 1 day, 10:14:03/15:48:39, loss=0.371382149199321, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=4.81020029254002, lr=0.0266455267203647
2023-11-27 19:51:19   INFO  epoch: 16/24, acc_iter=107342, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:25:35, time_cost(all): 1 day, 10:15:01/16:52:27, loss=0.371274606839218, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=3.534089794244927, lr=0.026605434947624115
2023-11-27 19:52:16   INFO  epoch: 16/24, acc_iter=107392, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:24:05, time_cost(all): 1 day, 10:15:58/16:56:43, loss=0.371167064479116, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=4.213585575789269, lr=0.02656534317488353
2023-11-27 19:53:14   INFO  epoch: 16/24, acc_iter=107442, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:26:40, time_cost(all): 1 day, 10:16:56/16:37:56, loss=0.371059522119013, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.178555278739223, lr=0.02652525140214293
2023-11-27 19:54:12   INFO  epoch: 16/24, acc_iter=107492, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:30:23, time_cost(all): 1 day, 10:17:54/16:28:51, loss=0.37095197975891, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=4.49649401949948, lr=0.026485159629402344
2023-11-27 19:55:10   INFO  epoch: 16/24, acc_iter=107542, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:22:25, time_cost(all): 1 day, 10:18:52/16:00:09, loss=0.370844437398808, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.6728037869903254, lr=0.026445067856661758
2023-11-27 19:56:07   INFO  epoch: 16/24, acc_iter=107592, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:52, time_cost(all): 1 day, 10:19:49/16:27:00, loss=0.370736895038705, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=3.8002823108846284, lr=0.026404976083921172
2023-11-27 19:57:05   INFO  epoch: 16/24, acc_iter=107642, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:20:14, time_cost(all): 1 day, 10:20:47/16:27:22, loss=0.370629352678602, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=1.0807336586302174, lr=0.026364884311180586
2023-11-27 19:58:03   INFO  epoch: 16/24, acc_iter=107692, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:19:16, time_cost(all): 1 day, 10:21:45/16:19:57, loss=0.3705218103185, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=4.327680151397786, lr=0.02632479253844
2023-11-27 19:59:01   INFO  epoch: 16/24, acc_iter=107742, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:22:28, time_cost(all): 1 day, 10:22:43/16:55:43, loss=0.370414267958397, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.1354010458784987, lr=0.026284700765699415
2023-11-27 19:59:58   INFO  epoch: 16/24, acc_iter=107792, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:23:53, time_cost(all): 1 day, 10:23:40/16:14:01, loss=0.370306725598294, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=1.078561413644747, lr=0.026244608992958815
2023-11-27 20:00:56   INFO  epoch: 16/24, acc_iter=107842, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:17:19, time_cost(all): 1 day, 10:24:38/15:40:16, loss=0.370199183238192, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=4.482220855419061, lr=0.02620451722021823
2023-11-27 20:01:54   INFO  epoch: 16/24, acc_iter=107892, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:54, time_cost(all): 1 day, 10:25:36/15:45:51, loss=0.370091640878089, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.2792214144269733, lr=0.026164425447477643
2023-11-27 20:02:52   INFO  epoch: 16/24, acc_iter=107942, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:17:25, time_cost(all): 1 day, 10:26:34/16:43:13, loss=0.369984098517986, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=3.2081730961139967, lr=0.026124333674737057
2023-11-27 20:03:50   INFO  epoch: 16/24, acc_iter=107992, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:20:13, time_cost(all): 1 day, 10:27:32/16:07:26, loss=0.369876556157884, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=1.8379846384506344, lr=0.026084241901996458
2023-11-27 20:04:47   INFO  epoch: 16/24, acc_iter=108042, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:23, time_cost(all): 1 day, 10:28:29/16:12:57, loss=0.369769013797781, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=4.8592667128686395, lr=0.026044150129255886
2023-11-27 20:05:45   INFO  epoch: 16/24, acc_iter=108092, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:13:51, time_cost(all): 1 day, 10:29:27/16:13:27, loss=0.369661471437678, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.22069645053844, lr=0.0260040583565153
2023-11-27 20:06:43   INFO  epoch: 16/24, acc_iter=108142, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:17:14, time_cost(all): 1 day, 10:30:25/15:35:05, loss=0.369553929077576, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=1.9881518041813797, lr=0.0259639665837747
2023-11-27 20:07:41   INFO  epoch: 16/24, acc_iter=108192, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:13:02, time_cost(all): 1 day, 10:31:23/15:49:22, loss=0.369446386717473, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.447872421865799, lr=0.025923874811034114
2023-11-27 20:08:38   INFO  epoch: 16/24, acc_iter=108242, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:14:04, time_cost(all): 1 day, 10:32:20/16:15:17, loss=0.36933884435737, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=2.5506383668731663, lr=0.02588378303829353
2023-11-27 20:09:36   INFO  epoch: 16/24, acc_iter=108292, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:14:24, time_cost(all): 1 day, 10:33:18/16:06:24, loss=0.369231301997268, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=1.603271523912742, lr=0.025843691265552943
2023-11-27 20:10:34   INFO  epoch: 16/24, acc_iter=108342, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:46, time_cost(all): 1 day, 10:34:16/15:33:14, loss=0.369123759637165, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=2.317381030310594, lr=0.025803599492812357
2023-11-27 20:11:32   INFO  epoch: 16/24, acc_iter=108392, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:06:25, time_cost(all): 1 day, 10:35:14/16:50:08, loss=0.369016217277062, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=1.3650780838251686, lr=0.025763507720071757
2023-11-27 20:12:29   INFO  epoch: 16/24, acc_iter=108442, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:05:00, time_cost(all): 1 day, 10:36:11/15:54:37, loss=0.36890867491696, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.765604442084042, lr=0.02572341594733117
2023-11-27 20:13:27   INFO  epoch: 16/24, acc_iter=108492, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:09:53, time_cost(all): 1 day, 10:37:09/15:50:12, loss=0.368801132556857, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=2.47172427474148, lr=0.0256833241745906
2023-11-27 20:14:25   INFO  epoch: 16/24, acc_iter=108542, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:07:45, time_cost(all): 1 day, 10:38:07/15:28:55, loss=0.368693590196754, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=1.43112337364916, lr=0.02564323240185
2023-11-27 20:15:23   INFO  epoch: 16/24, acc_iter=108592, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:02:52, time_cost(all): 1 day, 10:39:05/15:50:54, loss=0.368586047836651, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.084994532505016, lr=0.025603140629109414
2023-11-27 20:16:20   INFO  epoch: 16/24, acc_iter=108642, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:03:02, time_cost(all): 1 day, 10:40:02/15:17:58, loss=0.368478505476549, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=2.330144639604166, lr=0.025563048856368828
2023-11-27 20:17:18   INFO  epoch: 16/24, acc_iter=108692, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:00:29, time_cost(all): 1 day, 10:41:00/16:29:35, loss=0.368370963116446, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=1.1854713992746304, lr=0.025522957083628242
2023-11-27 20:18:16   INFO  epoch: 16/24, acc_iter=108742, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/0:59:49, time_cost(all): 1 day, 10:41:58/16:48:59, loss=0.368263420756343, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=2.0024572656298663, lr=0.025482865310887642
2023-11-27 20:19:14   INFO  epoch: 16/24, acc_iter=108792, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:41, time_cost(all): 1 day, 10:42:56/16:34:52, loss=0.368155878396241, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=0.7590105086671076, lr=0.025442773538147057
2023-11-27 20:20:11   INFO  epoch: 16/24, acc_iter=108842, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:20, time_cost(all): 1 day, 10:43:53/15:46:43, loss=0.368048336036138, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=4.814941904213116, lr=0.02540268176540647
2023-11-27 20:21:09   INFO  epoch: 16/24, acc_iter=108892, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:57:28, time_cost(all): 1 day, 10:44:51/16:33:06, loss=0.367940793676035, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=1.8206426048627762, lr=0.025362589992665885
2023-11-27 20:22:07   INFO  epoch: 16/24, acc_iter=108942, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:01:13, time_cost(all): 1 day, 10:45:49/16:35:38, loss=0.367833251315933, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=4.649517858359248, lr=0.0253224982199253
2023-11-27 20:23:05   INFO  epoch: 16/24, acc_iter=108992, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:44, time_cost(all): 1 day, 10:46:47/15:18:56, loss=0.36772570895583, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.92(1.03), norm=4.990171198218885, lr=0.025282406447184713
2023-11-27 20:24:02   INFO  epoch: 16/24, acc_iter=109042, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:55:37, time_cost(all): 1 day, 10:47:44/15:33:44, loss=0.367618166595727, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=4.1624967210465655, lr=0.025242314674444127
2023-11-27 20:25:00   INFO  epoch: 16/24, acc_iter=109092, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:57:23, time_cost(all): 1 day, 10:48:42/16:23:57, loss=0.367510624235625, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.4170773690007095, lr=0.025202222901703528
2023-11-27 20:25:58   INFO  epoch: 16/24, acc_iter=109142, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:54:06, time_cost(all): 1 day, 10:49:40/15:46:47, loss=0.367403081875522, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=2.9187416137186837, lr=0.025162131128962942
2023-11-27 20:26:56   INFO  epoch: 16/24, acc_iter=109192, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:52:11, time_cost(all): 1 day, 10:50:38/16:20:40, loss=0.367295539515419, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.0043599543547788, lr=0.025122039356222356
2023-11-27 20:27:53   INFO  epoch: 16/24, acc_iter=109242, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:54:44, time_cost(all): 1 day, 10:51:35/16:37:35, loss=0.367187997155317, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.1302925918891942, lr=0.02508194758348177
2023-11-27 20:28:51   INFO  epoch: 16/24, acc_iter=109292, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:44, time_cost(all): 1 day, 10:52:33/15:35:11, loss=0.367080454795214, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=0.5404308401366149, lr=0.02504185581074117
2023-11-27 20:29:49   INFO  epoch: 16/24, acc_iter=109342, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:50, time_cost(all): 1 day, 10:53:31/16:18:27, loss=0.366972912435111, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=4.016149196315974, lr=0.0250017640380006
2023-11-27 20:30:47   INFO  epoch: 16/24, acc_iter=109392, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:49:25, time_cost(all): 1 day, 10:54:29/16:04:41, loss=0.366865370075009, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=2.8408590058049765, lr=0.024961672265260013
2023-11-27 20:31:44   INFO  epoch: 16/24, acc_iter=109442, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:56, time_cost(all): 1 day, 10:55:26/15:21:04, loss=0.366757827714906, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=3.9744245546921606, lr=0.024921580492519413
2023-11-27 20:32:42   INFO  epoch: 16/24, acc_iter=109492, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:47, time_cost(all): 1 day, 10:56:24/15:32:04, loss=0.366650285354803, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=4.420994030110053, lr=0.024881488719778827
2023-11-27 20:33:40   INFO  epoch: 16/24, acc_iter=109542, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:45:43, time_cost(all): 1 day, 10:57:22/16:33:12, loss=0.366542742994701, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=3.196570234355771, lr=0.02484139694703824
2023-11-27 20:34:38   INFO  epoch: 16/24, acc_iter=109592, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:43:47, time_cost(all): 1 day, 10:58:20/16:19:40, loss=0.366435200634598, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=2.763961873803116, lr=0.024801305174297655
2023-11-27 20:35:35   INFO  epoch: 16/24, acc_iter=109642, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:42:49, time_cost(all): 1 day, 10:59:17/16:19:15, loss=0.366327658274495, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=1.2683983702887935, lr=0.024761213401557056
2023-11-27 20:36:33   INFO  epoch: 16/24, acc_iter=109692, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:57, time_cost(all): 1 day, 11:00:15/15:16:27, loss=0.366220115914393, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=2.875818664946614, lr=0.02472112162881647
2023-11-27 20:37:31   INFO  epoch: 16/24, acc_iter=109742, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:25, time_cost(all): 1 day, 11:01:13/15:31:38, loss=0.36611257355429, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=2.764119008176641, lr=0.024681029856075884
2023-11-27 20:38:29   INFO  epoch: 16/24, acc_iter=109792, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:55, time_cost(all): 1 day, 11:02:11/16:17:53, loss=0.366005031194187, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=2.50387162312723, lr=0.024640938083335298
2023-11-27 20:39:26   INFO  epoch: 16/24, acc_iter=109842, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:41:27, time_cost(all): 1 day, 11:03:08/15:00:50, loss=0.365897488834085, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=1.4399981161063549, lr=0.024600846310594712
2023-11-27 20:40:24   INFO  epoch: 16/24, acc_iter=109892, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:38:51, time_cost(all): 1 day, 11:04:06/15:18:46, loss=0.365789946473982, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.088073996472765, lr=0.024560754537854126
2023-11-27 20:41:22   INFO  epoch: 16/24, acc_iter=109942, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:25, time_cost(all): 1 day, 11:05:04/15:44:09, loss=0.365682404113879, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=0.9750167947013135, lr=0.02452066276511354
2023-11-27 20:42:20   INFO  epoch: 16/24, acc_iter=109992, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:38:28, time_cost(all): 1 day, 11:06:02/15:03:23, loss=0.365574861753777, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.730589803323743, lr=0.02448057099237294
2023-11-27 20:43:17   INFO  epoch: 16/24, acc_iter=110042, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:24, time_cost(all): 1 day, 11:06:59/14:52:25, loss=0.365467319393674, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=0.5425496675960966, lr=0.024440479219632355
2023-11-27 20:44:15   INFO  epoch: 16/24, acc_iter=110092, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:51, time_cost(all): 1 day, 11:07:57/15:38:54, loss=0.365359777033571, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=0.850003147446009, lr=0.02440038744689177
2023-11-27 20:45:13   INFO  epoch: 16/24, acc_iter=110142, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:56, time_cost(all): 1 day, 11:08:55/16:05:36, loss=0.365252234673468, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=1.1008424710050244, lr=0.024360295674151183
2023-11-27 20:46:11   INFO  epoch: 16/24, acc_iter=110192, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:02, time_cost(all): 1 day, 11:09:53/15:28:46, loss=0.365144692313366, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=2.151755676532324, lr=0.024320203901410598
2023-11-27 20:47:08   INFO  epoch: 16/24, acc_iter=110242, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:47, time_cost(all): 1 day, 11:10:50/15:07:41, loss=0.365037149953263, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=3.080258736045809, lr=0.02428011212867001
2023-11-27 20:48:06   INFO  epoch: 16/24, acc_iter=110292, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:33:46, time_cost(all): 1 day, 11:11:48/14:49:53, loss=0.36492960759316, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=1.6320639324671027, lr=0.024240020355929426
2023-11-27 20:49:04   INFO  epoch: 16/24, acc_iter=110342, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:51, time_cost(all): 1 day, 11:12:46/16:14:25, loss=0.364822065233058, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=0.9699440651185074, lr=0.02419992858318884
2023-11-27 20:50:02   INFO  epoch: 16/24, acc_iter=110392, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:29:38, time_cost(all): 1 day, 11:13:44/15:34:41, loss=0.364714522872955, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=2.28848857790421, lr=0.02415983681044824
2023-11-27 20:50:59   INFO  epoch: 16/24, acc_iter=110442, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:02, time_cost(all): 1 day, 11:14:41/16:14:04, loss=0.364606980512852, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=3.1828221335635467, lr=0.024119745037707654
2023-11-27 20:51:57   INFO  epoch: 16/24, acc_iter=110492, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:45, time_cost(all): 1 day, 11:15:39/15:08:31, loss=0.36449943815275, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=4.650990276488213, lr=0.02407965326496707
2023-11-27 20:52:55   INFO  epoch: 16/24, acc_iter=110542, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:04, time_cost(all): 1 day, 11:16:37/15:11:19, loss=0.364391895792647, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=0.5586101175423331, lr=0.024039561492226483
2023-11-27 20:53:53   INFO  epoch: 16/24, acc_iter=110592, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:25:52, time_cost(all): 1 day, 11:17:35/15:51:38, loss=0.364284353432544, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=1.8483164244429786, lr=0.023999469719485883
2023-11-27 20:54:50   INFO  epoch: 16/24, acc_iter=110642, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:24:28, time_cost(all): 1 day, 11:18:32/15:59:26, loss=0.364176811072442, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.944972587334681, lr=0.023959377946745297
2023-11-27 20:55:48   INFO  epoch: 16/24, acc_iter=110692, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:06, time_cost(all): 1 day, 11:19:30/15:53:17, loss=0.364069268712339, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=1.9573636730295731, lr=0.023919286174004725
2023-11-27 20:56:46   INFO  epoch: 16/24, acc_iter=110742, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:07, time_cost(all): 1 day, 11:20:28/15:38:18, loss=0.363961726352236, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.1713051385405695, lr=0.023879194401264126
2023-11-27 20:57:44   INFO  epoch: 16/24, acc_iter=110792, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:16, time_cost(all): 1 day, 11:21:26/16:07:23, loss=0.363854183992134, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=4.66222130994239, lr=0.02383910262852354
2023-11-27 20:58:41   INFO  epoch: 16/24, acc_iter=110842, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:22:46, time_cost(all): 1 day, 11:22:23/14:38:26, loss=0.363746641632031, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=1.9396335665559983, lr=0.023799010855782954
2023-11-27 20:59:39   INFO  epoch: 16/24, acc_iter=110892, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:38, time_cost(all): 1 day, 11:23:21/15:35:50, loss=0.363639099271928, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=4.317587449438504, lr=0.023758919083042368
2023-11-27 21:00:37   INFO  epoch: 16/24, acc_iter=110942, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:41, time_cost(all): 1 day, 11:24:19/15:44:03, loss=0.363531556911826, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=4.78889804861248, lr=0.02371882731030177
2023-11-27 21:01:35   INFO  epoch: 16/24, acc_iter=110992, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:12, time_cost(all): 1 day, 11:25:17/15:28:21, loss=0.363424014551723, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=4.927721840626079, lr=0.023678735537561182
2023-11-27 21:02:32   INFO  epoch: 16/24, acc_iter=111042, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:10, time_cost(all): 1 day, 11:26:14/15:10:05, loss=0.36331647219162, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=2.1550717412616103, lr=0.023638643764820597
2023-11-27 21:03:30   INFO  epoch: 16/24, acc_iter=111092, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:54, time_cost(all): 1 day, 11:27:12/14:58:18, loss=0.363208929831518, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=3.6115177885531695, lr=0.02359855199208001
2023-11-27 21:04:28   INFO  epoch: 16/24, acc_iter=111142, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:10, time_cost(all): 1 day, 11:28:10/15:14:56, loss=0.363101387471415, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=4.148286875620557, lr=0.023558460219339425
2023-11-27 21:05:26   INFO  epoch: 16/24, acc_iter=111192, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:25, time_cost(all): 1 day, 11:29:08/15:28:11, loss=0.362993845111312, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=3.5529845779199944, lr=0.02351836844659884
2023-11-27 21:06:23   INFO  epoch: 16/24, acc_iter=111242, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:43, time_cost(all): 1 day, 11:30:05/14:38:48, loss=0.36288630275121, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.4271991686320606, lr=0.023478276673858253
2023-11-27 21:07:21   INFO  epoch: 16/24, acc_iter=111292, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:16, time_cost(all): 1 day, 11:31:03/15:05:44, loss=0.362778760391107, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=4.770311824355599, lr=0.023438184901117654
2023-11-27 21:08:19   INFO  epoch: 16/24, acc_iter=111342, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:48, time_cost(all): 1 day, 11:32:01/15:37:19, loss=0.362671218031004, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.206595355782318, lr=0.023398093128377068
2023-11-27 21:09:17   INFO  epoch: 16/24, acc_iter=111392, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:38, time_cost(all): 1 day, 11:32:59/15:17:49, loss=0.362563675670901, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=4.5047405881302645, lr=0.023358001355636482
2023-11-27 21:10:14   INFO  epoch: 16/24, acc_iter=111442, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:16, time_cost(all): 1 day, 11:33:56/14:37:26, loss=0.362456133310799, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=2.338003185749123, lr=0.023317909582895896
2023-11-27 21:11:12   INFO  epoch: 16/24, acc_iter=111492, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:00, time_cost(all): 1 day, 11:34:54/14:25:22, loss=0.362348590950696, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.32957333827363, lr=0.023277817810155296
2023-11-27 21:12:10   INFO  epoch: 16/24, acc_iter=111542, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:16, time_cost(all): 1 day, 11:35:52/15:12:02, loss=0.362241048590593, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.2401919464190845, lr=0.02323772603741471
2023-11-27 21:13:08   INFO  epoch: 16/24, acc_iter=111592, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:06, time_cost(all): 1 day, 11:36:50/14:55:46, loss=0.362133506230491, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=1.0176741394024014, lr=0.02319763426467414
2023-11-27 21:14:05   INFO  epoch: 16/24, acc_iter=111642, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:26, time_cost(all): 1 day, 11:37:47/15:15:26, loss=0.362025963870388, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.242180330190564, lr=0.02315754249193354
2023-11-27 21:15:03   INFO  epoch: 16/24, acc_iter=111692, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:23, time_cost(all): 1 day, 11:38:45/15:12:19, loss=0.361918421510285, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.4312395160326603, lr=0.023117450719192953
2023-11-27 21:16:01   INFO  epoch: 16/24, acc_iter=111742, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:43, time_cost(all): 1 day, 11:39:43/14:40:31, loss=0.361810879150183, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.463696703304006, lr=0.023077358946452367
2023-11-27 21:16:59   INFO  epoch: 16/24, acc_iter=111792, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:28, time_cost(all): 1 day, 11:40:41/14:48:19, loss=0.36170333679008, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.2206761364247558, lr=0.02303726717371178
2023-11-27 21:17:56   INFO  epoch: 16/24, acc_iter=111842, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:36, time_cost(all): 1 day, 11:41:38/14:32:02, loss=0.361595794429977, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=1.308787320131863, lr=0.02299717540097118
2023-11-27 21:18:54   INFO  epoch: 16/24, acc_iter=111892, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:36, time_cost(all): 1 day, 11:42:36/14:18:06, loss=0.361488252069875, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=0.7625008268542441, lr=0.022957083628230596
2023-11-27 21:19:52   INFO  epoch: 16/24, acc_iter=111942, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 1 day, 11:43:34/14:28:08, loss=0.361380709709772, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=2.1356393812592565, lr=0.02291699185549001
2023-11-27 21:20:50   INFO  epoch: 17/24, acc_iter=112029, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:01:58, time_cost(all): 1 day, 11:44:32/14:17:03, loss=0.361193586003193, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=3.0120830731083497, lr=0.022847232170921383
2023-11-27 21:21:47   INFO  epoch: 17/24, acc_iter=112079, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:01:39, time_cost(all): 1 day, 11:45:29/15:32:25, loss=0.361086043643091, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.2953967670913646, lr=0.02280714039818081
2023-11-27 21:22:45   INFO  epoch: 17/24, acc_iter=112129, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:09:39, time_cost(all): 1 day, 11:46:27/15:04:21, loss=0.360978501282988, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=3.8055247657270015, lr=0.02276704862544021
2023-11-27 21:23:43   INFO  epoch: 17/24, acc_iter=112179, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:04:50, time_cost(all): 1 day, 11:47:25/14:18:43, loss=0.360870958922885, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=1.2983405854918637, lr=0.022726956852699626
2023-11-27 21:24:41   INFO  epoch: 17/24, acc_iter=112229, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:00:19, time_cost(all): 1 day, 11:48:23/15:04:35, loss=0.360763416562783, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=2.903861831841095, lr=0.02268686507995904
2023-11-27 21:25:38   INFO  epoch: 17/24, acc_iter=112279, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:01:50, time_cost(all): 1 day, 11:49:20/15:17:22, loss=0.36065587420268, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=4.096710164041138, lr=0.022646773307218454
2023-11-27 21:26:36   INFO  epoch: 17/24, acc_iter=112329, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:02:48, time_cost(all): 1 day, 11:50:18/15:17:45, loss=0.360548331842577, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=2.2327535297689227, lr=0.022606681534477854
2023-11-27 21:27:34   INFO  epoch: 17/24, acc_iter=112379, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:04:08, time_cost(all): 1 day, 11:51:16/14:37:09, loss=0.360440789482475, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=3.827887688547349, lr=0.02256658976173727
2023-11-27 21:28:32   INFO  epoch: 17/24, acc_iter=112429, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:54:01, time_cost(all): 1 day, 11:52:14/14:47:29, loss=0.360333247122372, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=2.2769090220227106, lr=0.022526497988996683
2023-11-27 21:29:29   INFO  epoch: 17/24, acc_iter=112479, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:59:54, time_cost(all): 1 day, 11:53:11/15:13:16, loss=0.360225704762269, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=1.8249381146087948, lr=0.022486406216256097
2023-11-27 21:30:27   INFO  epoch: 17/24, acc_iter=112529, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:51:58, time_cost(all): 1 day, 11:54:09/14:29:59, loss=0.360118162402167, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=4.959923408017811, lr=0.02244631444351551
2023-11-27 21:31:25   INFO  epoch: 17/24, acc_iter=112579, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:54:43, time_cost(all): 1 day, 11:55:07/15:27:09, loss=0.360010620042064, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=4.193979440133269, lr=0.022406222670774925
2023-11-27 21:32:23   INFO  epoch: 17/24, acc_iter=112629, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:59:26, time_cost(all): 1 day, 11:56:05/15:13:41, loss=0.359903077681961, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=0.8384817626406142, lr=0.02236613089803434
2023-11-27 21:33:20   INFO  epoch: 17/24, acc_iter=112679, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:53:46, time_cost(all): 1 day, 11:57:02/14:25:53, loss=0.359795535321859, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=2.908474841581642, lr=0.02232603912529374
2023-11-27 21:34:18   INFO  epoch: 17/24, acc_iter=112729, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:51:21, time_cost(all): 1 day, 11:58:00/14:33:56, loss=0.359687992961756, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.4918240450594118, lr=0.022285947352553154
2023-11-27 21:35:16   INFO  epoch: 17/24, acc_iter=112779, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:48:58, time_cost(all): 1 day, 11:58:58/14:50:30, loss=0.359580450601653, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=2.6257352781324728, lr=0.022245855579812568
2023-11-27 21:36:14   INFO  epoch: 17/24, acc_iter=112829, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:54:58, time_cost(all): 1 day, 11:59:56/14:08:41, loss=0.35947290824155, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=3.1785315843378013, lr=0.022205763807071982
2023-11-27 21:37:11   INFO  epoch: 17/24, acc_iter=112879, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:53:01, time_cost(all): 1 day, 12:00:53/14:21:20, loss=0.359365365881448, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=1.817962176556212, lr=0.022165672034331396
2023-11-27 21:38:09   INFO  epoch: 17/24, acc_iter=112929, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:44:23, time_cost(all): 1 day, 12:01:51/15:05:48, loss=0.359257823521345, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.263016868757374, lr=0.022125580261590796
2023-11-27 21:39:07   INFO  epoch: 17/24, acc_iter=112979, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:44:58, time_cost(all): 1 day, 12:02:49/15:01:14, loss=0.359150281161242, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.005217414691691, lr=0.022085488488850225
2023-11-27 21:40:05   INFO  epoch: 17/24, acc_iter=113029, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:48:50, time_cost(all): 1 day, 12:03:47/15:14:56, loss=0.35904273880114, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.57714362293835, lr=0.02204539671610964
2023-11-27 21:41:02   INFO  epoch: 17/24, acc_iter=113079, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:46:59, time_cost(all): 1 day, 12:04:44/14:28:05, loss=0.358935196441037, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=1.1567495014157145, lr=0.02200530494336904
2023-11-27 21:42:00   INFO  epoch: 17/24, acc_iter=113129, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:46:26, time_cost(all): 1 day, 12:05:42/14:18:27, loss=0.358827654080934, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=0.7936406342853171, lr=0.021965213170628453
2023-11-27 21:42:58   INFO  epoch: 17/24, acc_iter=113179, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:46:22, time_cost(all): 1 day, 12:06:40/14:12:25, loss=0.358720111720832, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.112923383100843, lr=0.021925121397887867
2023-11-27 21:43:56   INFO  epoch: 17/24, acc_iter=113229, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:43:29, time_cost(all): 1 day, 12:07:38/13:54:37, loss=0.358612569360729, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.8094033303643329, lr=0.02188502962514728
2023-11-27 21:44:53   INFO  epoch: 17/24, acc_iter=113279, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:39:09, time_cost(all): 1 day, 12:08:35/15:04:36, loss=0.358505027000626, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=3.8824200968515186, lr=0.02184493785240668
2023-11-27 21:45:51   INFO  epoch: 17/24, acc_iter=113329, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:40:38, time_cost(all): 1 day, 12:09:33/14:42:10, loss=0.358397484640524, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=2.4850093040517898, lr=0.021804846079666096
2023-11-27 21:46:49   INFO  epoch: 17/24, acc_iter=113379, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:36:08, time_cost(all): 1 day, 12:10:31/14:11:23, loss=0.358289942280421, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.754071601817857, lr=0.02176475430692551
2023-11-27 21:47:47   INFO  epoch: 17/24, acc_iter=113429, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:37:52, time_cost(all): 1 day, 12:11:29/14:17:48, loss=0.358182399920318, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.937609404628545, lr=0.021724662534184924
2023-11-27 21:48:45   INFO  epoch: 17/24, acc_iter=113479, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:37:31, time_cost(all): 1 day, 12:12:27/15:04:25, loss=0.358074857560216, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=1.695929079556821, lr=0.02168457076144434
2023-11-27 21:49:42   INFO  epoch: 17/24, acc_iter=113529, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:38:12, time_cost(all): 1 day, 12:13:24/13:51:38, loss=0.357967315200113, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=1.1097229608743737, lr=0.021644478988703753
2023-11-27 21:50:40   INFO  epoch: 17/24, acc_iter=113579, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:40:06, time_cost(all): 1 day, 12:14:22/14:05:46, loss=0.35785977284001, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=2.145891419076034, lr=0.021604387215963167
2023-11-27 21:51:38   INFO  epoch: 17/24, acc_iter=113629, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:36:30, time_cost(all): 1 day, 12:15:20/14:12:24, loss=0.357752230479908, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=2.2979326158453057, lr=0.021564295443222567
2023-11-27 21:52:36   INFO  epoch: 17/24, acc_iter=113679, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:32:03, time_cost(all): 1 day, 12:16:18/14:07:42, loss=0.357644688119805, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=3.79951934710518, lr=0.02152420367048198
2023-11-27 21:53:33   INFO  epoch: 17/24, acc_iter=113729, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:34:39, time_cost(all): 1 day, 12:17:15/14:54:55, loss=0.357537145759702, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.1590612110304015, lr=0.021484111897741395
2023-11-27 21:54:31   INFO  epoch: 17/24, acc_iter=113779, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:29:43, time_cost(all): 1 day, 12:18:13/14:34:20, loss=0.3574296033996, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=1.8534415889581988, lr=0.02144402012500081
2023-11-27 21:55:29   INFO  epoch: 17/24, acc_iter=113829, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:29:16, time_cost(all): 1 day, 12:19:11/13:41:26, loss=0.357322061039497, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=1.3484338713079058, lr=0.02140392835226021
2023-11-27 21:56:27   INFO  epoch: 17/24, acc_iter=113879, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:29:09, time_cost(all): 1 day, 12:20:09/15:04:40, loss=0.357214518679394, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=1.2724867871503274, lr=0.021363836579519638
2023-11-27 21:57:24   INFO  epoch: 17/24, acc_iter=113929, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:36, time_cost(all): 1 day, 12:21:06/14:16:26, loss=0.357106976319292, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.7822457025066822, lr=0.021323744806779052
2023-11-27 21:58:22   INFO  epoch: 17/24, acc_iter=113979, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:28:48, time_cost(all): 1 day, 12:22:04/13:52:00, loss=0.356999433959189, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.495523446840069, lr=0.021283653034038452
2023-11-27 21:59:20   INFO  epoch: 17/24, acc_iter=114029, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:29:11, time_cost(all): 1 day, 12:23:02/13:40:03, loss=0.356891891599086, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=3.7931443123151993, lr=0.021243561261297866
2023-11-27 22:00:18   INFO  epoch: 17/24, acc_iter=114079, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:26:31, time_cost(all): 1 day, 12:24:00/14:52:16, loss=0.356784349238984, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=4.71084835144974, lr=0.02120346948855728
2023-11-27 22:01:15   INFO  epoch: 17/24, acc_iter=114129, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:29:22, time_cost(all): 1 day, 12:24:57/13:39:19, loss=0.356676806878881, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=1.0616949733284868, lr=0.021163377715816695
2023-11-27 22:02:13   INFO  epoch: 17/24, acc_iter=114179, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:24:52, time_cost(all): 1 day, 12:25:55/14:47:13, loss=0.356569264518778, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=3.3170182804209376, lr=0.021123285943076095
2023-11-27 22:03:11   INFO  epoch: 17/24, acc_iter=114229, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:27:28, time_cost(all): 1 day, 12:26:53/14:30:20, loss=0.356461722158675, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.765029042042776, lr=0.02108319417033551
2023-11-27 22:04:09   INFO  epoch: 17/24, acc_iter=114279, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:18:58, time_cost(all): 1 day, 12:27:51/14:03:50, loss=0.356354179798573, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=1.8028963394477775, lr=0.021043102397594923
2023-11-27 22:05:06   INFO  epoch: 17/24, acc_iter=114329, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:24:49, time_cost(all): 1 day, 12:28:48/13:42:42, loss=0.35624663743847, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=1.2824059857953996, lr=0.021003010624854337
2023-11-27 22:06:04   INFO  epoch: 17/24, acc_iter=114379, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:23:12, time_cost(all): 1 day, 12:29:46/13:44:51, loss=0.356139095078367, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.2811465452950688, lr=0.02096291885211375
2023-11-27 22:07:02   INFO  epoch: 17/24, acc_iter=114429, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:18:26, time_cost(all): 1 day, 12:30:44/13:52:42, loss=0.356031552718265, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=4.213778268197247, lr=0.020922827079373166
2023-11-27 22:08:00   INFO  epoch: 17/24, acc_iter=114479, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:18:12, time_cost(all): 1 day, 12:31:42/13:52:14, loss=0.355924010358162, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.828321652017878, lr=0.02088273530663258
2023-11-27 22:08:57   INFO  epoch: 17/24, acc_iter=114529, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:18:14, time_cost(all): 1 day, 12:32:39/13:28:30, loss=0.355816467998059, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=1.0542203411858164, lr=0.02084264353389198
2023-11-27 22:09:55   INFO  epoch: 17/24, acc_iter=114579, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:19:18, time_cost(all): 1 day, 12:33:37/14:24:38, loss=0.355708925637957, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=1.3094933509977371, lr=0.020802551761151394
2023-11-27 22:10:53   INFO  epoch: 17/24, acc_iter=114629, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:16:00, time_cost(all): 1 day, 12:34:35/14:36:26, loss=0.355601383277854, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=1.762772142807266, lr=0.02076245998841081
2023-11-27 22:11:51   INFO  epoch: 17/24, acc_iter=114679, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:53, time_cost(all): 1 day, 12:35:33/14:00:52, loss=0.355493840917751, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=3.1851497424319093, lr=0.020722368215670223
2023-11-27 22:12:48   INFO  epoch: 17/24, acc_iter=114729, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:17:15, time_cost(all): 1 day, 12:36:30/14:23:31, loss=0.355386298557649, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.939213147000069, lr=0.020682276442929637
2023-11-27 22:13:46   INFO  epoch: 17/24, acc_iter=114779, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:10:11, time_cost(all): 1 day, 12:37:28/14:17:08, loss=0.355278756197546, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.304586301175176, lr=0.02064218467018905
2023-11-27 22:14:44   INFO  epoch: 17/24, acc_iter=114829, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:20, time_cost(all): 1 day, 12:38:26/13:31:56, loss=0.355171213837443, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=2.2212998200889045, lr=0.020602092897448465
2023-11-27 22:15:42   INFO  epoch: 17/24, acc_iter=114879, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:22, time_cost(all): 1 day, 12:39:24/14:25:05, loss=0.355063671477341, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.4521874912701687, lr=0.020562001124707865
2023-11-27 22:16:39   INFO  epoch: 17/24, acc_iter=114929, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:06:36, time_cost(all): 1 day, 12:40:21/14:33:59, loss=0.354956129117238, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.11(1.03), norm=2.8683913224662736, lr=0.02052190935196728
2023-11-27 22:17:37   INFO  epoch: 17/24, acc_iter=114979, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:09:56, time_cost(all): 1 day, 12:41:19/13:56:31, loss=0.354848586757135, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=3.6305363673348405, lr=0.020481817579226694
2023-11-27 22:18:35   INFO  epoch: 17/24, acc_iter=115029, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:11:01, time_cost(all): 1 day, 12:42:17/14:33:16, loss=0.354741044397033, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=0.9711571390651066, lr=0.020441725806486108
2023-11-27 22:19:33   INFO  epoch: 17/24, acc_iter=115079, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:07:38, time_cost(all): 1 day, 12:43:15/13:37:11, loss=0.35463350203693, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.771423985987046, lr=0.020401634033745522
2023-11-27 22:20:30   INFO  epoch: 17/24, acc_iter=115129, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:08:33, time_cost(all): 1 day, 12:44:12/13:34:00, loss=0.354525959676827, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.146968054030752, lr=0.020361542261004922
2023-11-27 22:21:28   INFO  epoch: 17/24, acc_iter=115179, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:06:54, time_cost(all): 1 day, 12:45:10/14:14:15, loss=0.354418417316725, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=0.8227390212970798, lr=0.02032145048826435
2023-11-27 22:22:26   INFO  epoch: 17/24, acc_iter=115229, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:05:37, time_cost(all): 1 day, 12:46:08/13:19:36, loss=0.354310874956622, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=1.9264910335940295, lr=0.020281358715523765
2023-11-27 22:23:24   INFO  epoch: 17/24, acc_iter=115279, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:05:05, time_cost(all): 1 day, 12:47:06/13:22:36, loss=0.354203332596519, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.2994462280099137, lr=0.020241266942783165
2023-11-27 22:24:21   INFO  epoch: 17/24, acc_iter=115329, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:04:25, time_cost(all): 1 day, 12:48:03/13:58:51, loss=0.354095790236417, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=0.889535804830782, lr=0.02020117517004258
2023-11-27 22:25:19   INFO  epoch: 17/24, acc_iter=115379, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:03:48, time_cost(all): 1 day, 12:49:01/14:25:04, loss=0.353988247876314, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.8074189921949975, lr=0.020161083397301993
2023-11-27 22:26:17   INFO  epoch: 17/24, acc_iter=115429, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:57:39, time_cost(all): 1 day, 12:49:59/13:51:49, loss=0.353880705516211, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.239376822362777, lr=0.020120991624561407
2023-11-27 22:27:15   INFO  epoch: 17/24, acc_iter=115479, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:02:10, time_cost(all): 1 day, 12:50:57/14:31:01, loss=0.353773163156109, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=4.136242244376909, lr=0.020080899851820808
2023-11-27 22:28:12   INFO  epoch: 17/24, acc_iter=115529, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:58:32, time_cost(all): 1 day, 12:51:54/13:32:08, loss=0.353665620796006, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=1.5818520225549706, lr=0.020040808079080222
2023-11-27 22:29:10   INFO  epoch: 17/24, acc_iter=115579, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:12, time_cost(all): 1 day, 12:52:52/13:58:33, loss=0.353558078435903, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=3.2545353181411163, lr=0.020000716306339636
2023-11-27 22:30:08   INFO  epoch: 17/24, acc_iter=115629, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:58:43, time_cost(all): 1 day, 12:53:50/13:12:48, loss=0.3534505360758, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.635578355266943, lr=0.01996062453359905
2023-11-27 22:31:06   INFO  epoch: 17/24, acc_iter=115679, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:58:18, time_cost(all): 1 day, 12:54:48/13:42:18, loss=0.353342993715698, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=1.995583267759065, lr=0.019920532760858464
2023-11-27 22:32:03   INFO  epoch: 17/24, acc_iter=115729, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:38, time_cost(all): 1 day, 12:55:45/13:21:33, loss=0.353235451355595, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=0.9591795905439525, lr=0.01988044098811788
2023-11-27 22:33:01   INFO  epoch: 17/24, acc_iter=115779, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:56:17, time_cost(all): 1 day, 12:56:43/14:07:59, loss=0.353127908995492, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=2.9084528104891936, lr=0.019840349215377293
2023-11-27 22:33:59   INFO  epoch: 17/24, acc_iter=115829, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:52:31, time_cost(all): 1 day, 12:57:41/13:55:45, loss=0.35302036663539, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=2.9951239192986328, lr=0.019800257442636693
2023-11-27 22:34:57   INFO  epoch: 17/24, acc_iter=115879, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:53:11, time_cost(all): 1 day, 12:58:39/13:21:46, loss=0.352912824275287, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=4.463452403855314, lr=0.019760165669896107
2023-11-27 22:35:54   INFO  epoch: 17/24, acc_iter=115929, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:48:34, time_cost(all): 1 day, 12:59:36/13:41:42, loss=0.352805281915184, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.593376504530686, lr=0.01972007389715552
2023-11-27 22:36:52   INFO  epoch: 17/24, acc_iter=115979, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:50:55, time_cost(all): 1 day, 13:00:34/13:06:41, loss=0.352697739555082, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=2.647780225451504, lr=0.019679982124414935
2023-11-27 22:37:50   INFO  epoch: 17/24, acc_iter=116029, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:46:51, time_cost(all): 1 day, 13:01:32/14:14:31, loss=0.352590197194979, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.693523535979504, lr=0.019639890351674336
2023-11-27 22:38:48   INFO  epoch: 17/24, acc_iter=116079, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:15, time_cost(all): 1 day, 13:02:30/14:07:11, loss=0.352482654834876, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=1.6615265500540315, lr=0.019599798578933764
2023-11-27 22:39:45   INFO  epoch: 17/24, acc_iter=116129, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:44:58, time_cost(all): 1 day, 13:03:27/13:07:04, loss=0.352375112474774, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=4.836034579405817, lr=0.019559706806193178
2023-11-27 22:40:43   INFO  epoch: 17/24, acc_iter=116179, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:43:59, time_cost(all): 1 day, 13:04:25/13:59:02, loss=0.352267570114671, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=1.156077041939431, lr=0.019519615033452578
2023-11-27 22:41:41   INFO  epoch: 17/24, acc_iter=116229, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:44:04, time_cost(all): 1 day, 13:05:23/13:35:46, loss=0.352160027754568, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.827303741672371, lr=0.019479523260711992
2023-11-27 22:42:39   INFO  epoch: 17/24, acc_iter=116279, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:37, time_cost(all): 1 day, 13:06:21/13:14:38, loss=0.352052485394466, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.037889389773385, lr=0.019439431487971406
2023-11-27 22:43:36   INFO  epoch: 17/24, acc_iter=116329, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:07, time_cost(all): 1 day, 13:07:18/13:12:46, loss=0.351944943034363, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=0.7782573845957588, lr=0.01939933971523082
2023-11-27 22:44:34   INFO  epoch: 17/24, acc_iter=116379, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:37, time_cost(all): 1 day, 13:08:16/14:13:29, loss=0.35183740067426, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.18(1.03), norm=4.340537803010588, lr=0.01935924794249022
2023-11-27 22:45:32   INFO  epoch: 17/24, acc_iter=116429, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:39:31, time_cost(all): 1 day, 13:09:14/12:58:39, loss=0.351729858314158, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=1.5355999110338878, lr=0.019319156169749635
2023-11-27 22:46:30   INFO  epoch: 17/24, acc_iter=116479, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:38:33, time_cost(all): 1 day, 13:10:12/13:57:40, loss=0.351622315954055, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=4.54743041006262, lr=0.01927906439700905
2023-11-27 22:47:27   INFO  epoch: 17/24, acc_iter=116529, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:54, time_cost(all): 1 day, 13:11:09/12:58:53, loss=0.351514773593952, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=1.6385587740693004, lr=0.019238972624268463
2023-11-27 22:48:25   INFO  epoch: 17/24, acc_iter=116579, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:39:25, time_cost(all): 1 day, 13:12:07/13:58:11, loss=0.35140723123385, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=4.834384686865181, lr=0.019198880851527877
2023-11-27 22:49:23   INFO  epoch: 17/24, acc_iter=116629, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:07, time_cost(all): 1 day, 13:13:05/14:01:47, loss=0.351299688873747, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.135456468689857, lr=0.01915878907878729
2023-11-27 22:50:21   INFO  epoch: 17/24, acc_iter=116679, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:43, time_cost(all): 1 day, 13:14:03/13:21:28, loss=0.351192146513644, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=1.2885553760730704, lr=0.019118697306046706
2023-11-27 22:51:18   INFO  epoch: 17/24, acc_iter=116729, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:21, time_cost(all): 1 day, 13:15:00/13:36:07, loss=0.351084604153541, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=1.9192667415047797, lr=0.019078605533306106
2023-11-27 22:52:16   INFO  epoch: 17/24, acc_iter=116779, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:33, time_cost(all): 1 day, 13:15:58/13:03:59, loss=0.350977061793439, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=1.7280930222556032, lr=0.01903851376056552
2023-11-27 22:53:14   INFO  epoch: 17/24, acc_iter=116829, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:22, time_cost(all): 1 day, 13:16:56/13:14:47, loss=0.350869519433336, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=3.8908795671764786, lr=0.018998421987824934
2023-11-27 22:54:12   INFO  epoch: 17/24, acc_iter=116879, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:28, time_cost(all): 1 day, 13:17:54/13:31:46, loss=0.350761977073234, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=2.5067011951197626, lr=0.01895833021508435
2023-11-27 22:55:09   INFO  epoch: 17/24, acc_iter=116929, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:01, time_cost(all): 1 day, 13:18:51/13:21:47, loss=0.350654434713131, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=3.2833173820534864, lr=0.018918238442343763
2023-11-27 22:56:07   INFO  epoch: 17/24, acc_iter=116979, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:19, time_cost(all): 1 day, 13:19:49/13:00:21, loss=0.350546892353028, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.158719114469543, lr=0.018878146669603177
2023-11-27 22:57:05   INFO  epoch: 17/24, acc_iter=117029, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:16, time_cost(all): 1 day, 13:20:47/13:55:41, loss=0.350439349992925, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.4429590210920593, lr=0.01883805489686259
2023-11-27 22:58:03   INFO  epoch: 17/24, acc_iter=117079, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:37, time_cost(all): 1 day, 13:21:45/13:05:04, loss=0.350331807632823, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.3341338174563175, lr=0.018797963124122005
2023-11-27 22:59:00   INFO  epoch: 17/24, acc_iter=117129, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:41, time_cost(all): 1 day, 13:22:42/13:17:38, loss=0.35022426527272, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.1026583785241688, lr=0.018757871351381405
2023-11-27 22:59:58   INFO  epoch: 17/24, acc_iter=117179, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:26:28, time_cost(all): 1 day, 13:23:40/13:19:39, loss=0.350116722912617, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=4.28078874220987, lr=0.01871777957864082
2023-11-27 23:00:56   INFO  epoch: 17/24, acc_iter=117229, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:21, time_cost(all): 1 day, 13:24:38/13:47:11, loss=0.350009180552515, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=3.449374950569488, lr=0.018677687805900234
2023-11-27 23:01:54   INFO  epoch: 17/24, acc_iter=117279, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:50, time_cost(all): 1 day, 13:25:36/12:52:17, loss=0.349901638192412, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=4.336791784231039, lr=0.018637596033159648
2023-11-27 23:02:51   INFO  epoch: 17/24, acc_iter=117329, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:12, time_cost(all): 1 day, 13:26:33/13:23:24, loss=0.349794095832309, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=2.483273434042202, lr=0.018597504260419048
2023-11-27 23:03:49   INFO  epoch: 17/24, acc_iter=117379, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:21:53, time_cost(all): 1 day, 13:27:31/13:06:54, loss=0.349686553472207, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=0.9327204764884598, lr=0.018557412487678462
2023-11-27 23:04:47   INFO  epoch: 17/24, acc_iter=117429, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:20:53, time_cost(all): 1 day, 13:28:29/13:25:53, loss=0.349579011112104, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=0.9708758662452632, lr=0.01851732071493789
2023-11-27 23:05:45   INFO  epoch: 17/24, acc_iter=117479, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:29, time_cost(all): 1 day, 13:29:27/12:54:43, loss=0.349471468752001, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=1.0526889945883358, lr=0.01847722894219729
2023-11-27 23:06:42   INFO  epoch: 17/24, acc_iter=117529, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:26, time_cost(all): 1 day, 13:30:24/13:30:15, loss=0.349363926391899, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=1.9894425274223144, lr=0.018437137169456705
2023-11-27 23:07:40   INFO  epoch: 17/24, acc_iter=117579, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:25, time_cost(all): 1 day, 13:31:22/12:45:14, loss=0.349256384031796, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=4.175690459326078, lr=0.01839704539671612
2023-11-27 23:08:38   INFO  epoch: 17/24, acc_iter=117629, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:42, time_cost(all): 1 day, 13:32:20/13:37:38, loss=0.349148841671693, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=3.1630641522666667, lr=0.018356953623975533
2023-11-27 23:09:36   INFO  epoch: 17/24, acc_iter=117679, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:34, time_cost(all): 1 day, 13:33:18/13:24:48, loss=0.349041299311591, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=2.947659383963222, lr=0.018316861851234933
2023-11-27 23:10:33   INFO  epoch: 17/24, acc_iter=117729, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:14, time_cost(all): 1 day, 13:34:15/13:27:46, loss=0.348933756951488, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=4.579938668197064, lr=0.018276770078494348
2023-11-27 23:11:31   INFO  epoch: 17/24, acc_iter=117779, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:49, time_cost(all): 1 day, 13:35:13/13:38:02, loss=0.348826214591385, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=3.8551125886294737, lr=0.018236678305753762
2023-11-27 23:12:29   INFO  epoch: 17/24, acc_iter=117829, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:40, time_cost(all): 1 day, 13:36:11/12:45:59, loss=0.348718672231283, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.8933585070879944, lr=0.018196586533013176
2023-11-27 23:13:27   INFO  epoch: 17/24, acc_iter=117879, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:50, time_cost(all): 1 day, 13:37:09/12:38:32, loss=0.34861112987118, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=3.3333398203706, lr=0.01815649476027259
2023-11-27 23:14:24   INFO  epoch: 17/24, acc_iter=117929, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:07, time_cost(all): 1 day, 13:38:06/12:29:15, loss=0.348503587511077, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=2.079073382311093, lr=0.018116402987532004
2023-11-27 23:15:22   INFO  epoch: 17/24, acc_iter=117979, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:13, time_cost(all): 1 day, 13:39:04/13:29:40, loss=0.348396045150975, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=3.3206138840908457, lr=0.01807631121479142
2023-11-27 23:16:20   INFO  epoch: 17/24, acc_iter=118029, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:22, time_cost(all): 1 day, 13:40:02/12:49:06, loss=0.348288502790872, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.6902301690740416, lr=0.01803621944205082
2023-11-27 23:17:18   INFO  epoch: 17/24, acc_iter=118079, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:32, time_cost(all): 1 day, 13:41:00/12:56:57, loss=0.348180960430769, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=0.8753831544925579, lr=0.017996127669310233
2023-11-27 23:18:15   INFO  epoch: 17/24, acc_iter=118129, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:06, time_cost(all): 1 day, 13:41:57/12:36:38, loss=0.348073418070667, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.637523385872863, lr=0.017956035896569647
2023-11-27 23:19:13   INFO  epoch: 17/24, acc_iter=118179, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:28, time_cost(all): 1 day, 13:42:55/12:23:12, loss=0.347965875710564, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.385554084028983, lr=0.01791594412382906
2023-11-27 23:20:11   INFO  epoch: 17/24, acc_iter=118229, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:30, time_cost(all): 1 day, 13:43:53/13:29:58, loss=0.347858333350461, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.2248849190098436, lr=0.01787585235108846
2023-11-27 23:21:09   INFO  epoch: 17/24, acc_iter=118279, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:45, time_cost(all): 1 day, 13:44:51/13:22:14, loss=0.347750790990358, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=4.2169267039062674, lr=0.017835760578347876
2023-11-27 23:22:06   INFO  epoch: 17/24, acc_iter=118329, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:38, time_cost(all): 1 day, 13:45:48/13:24:54, loss=0.347643248630256, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.1568289261861375, lr=0.017795668805607304
2023-11-27 23:23:04   INFO  epoch: 17/24, acc_iter=118379, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:38, time_cost(all): 1 day, 13:46:46/13:16:05, loss=0.347535706270153, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=1.0300135643352135, lr=0.017755577032866704
2023-11-27 23:24:02   INFO  epoch: 17/24, acc_iter=118429, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:36, time_cost(all): 1 day, 13:47:44/12:42:49, loss=0.34742816391005, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.0429599241535423, lr=0.017715485260126118
2023-11-27 23:25:00   INFO  epoch: 17/24, acc_iter=118479, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:41, time_cost(all): 1 day, 13:48:42/12:27:37, loss=0.347320621549948, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=2.5907570679247782, lr=0.017675393487385532
2023-11-27 23:25:57   INFO  epoch: 17/24, acc_iter=118529, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 1 day, 13:49:39/13:05:14, loss=0.347213079189845, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=2.6451296117252157, lr=0.017635301714644946
2023-11-27 23:26:55   INFO  epoch: 18/24, acc_iter=118616, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:10:38, time_cost(all): 1 day, 13:50:37/12:19:28, loss=0.347025955483266, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=0.8188259005706714, lr=0.01756554203007632
2023-11-27 23:27:53   INFO  epoch: 18/24, acc_iter=118666, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:03:03, time_cost(all): 1 day, 13:51:35/13:22:33, loss=0.346918413123164, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=3.6851321924504363, lr=0.017525450257335734
2023-11-27 23:28:51   INFO  epoch: 18/24, acc_iter=118716, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:09:15, time_cost(all): 1 day, 13:52:33/12:41:51, loss=0.346810870763061, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.768914581686766, lr=0.017485358484595134
2023-11-27 23:29:48   INFO  epoch: 18/24, acc_iter=118766, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:08:00, time_cost(all): 1 day, 13:53:30/12:35:41, loss=0.346703328402958, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=4.958003587219773, lr=0.01744526671185455
2023-11-27 23:30:46   INFO  epoch: 18/24, acc_iter=118816, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:01:12, time_cost(all): 1 day, 13:54:28/13:12:03, loss=0.346595786042856, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.6629418654711823, lr=0.017405174939113977
2023-11-27 23:31:44   INFO  epoch: 18/24, acc_iter=118866, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:01:21, time_cost(all): 1 day, 13:55:26/12:59:55, loss=0.346488243682753, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=0.8457811881373486, lr=0.017365083166373377
2023-11-27 23:32:42   INFO  epoch: 18/24, acc_iter=118916, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:54, time_cost(all): 1 day, 13:56:24/13:09:53, loss=0.34638070132265, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=2.42810022795891, lr=0.01732499139363279
2023-11-27 23:33:40   INFO  epoch: 18/24, acc_iter=118966, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:58:29, time_cost(all): 1 day, 13:57:22/12:32:40, loss=0.346273158962548, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=1.1055506978342207, lr=0.017284899620892205
2023-11-27 23:34:37   INFO  epoch: 18/24, acc_iter=119016, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:00:37, time_cost(all): 1 day, 13:58:19/12:25:06, loss=0.346165616602445, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.7515355716985614, lr=0.01724480784815162
2023-11-27 23:35:35   INFO  epoch: 18/24, acc_iter=119066, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:58:23, time_cost(all): 1 day, 13:59:17/12:40:18, loss=0.346058074242342, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=3.6255325725832437, lr=0.01720471607541102
2023-11-27 23:36:33   INFO  epoch: 18/24, acc_iter=119116, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:52:48, time_cost(all): 1 day, 14:00:15/13:12:12, loss=0.34595053188224, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=2.802549454460973, lr=0.017164624302670434
2023-11-27 23:37:31   INFO  epoch: 18/24, acc_iter=119166, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:52:26, time_cost(all): 1 day, 14:01:13/12:53:41, loss=0.345842989522137, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=4.4717262580225645, lr=0.017124532529929848
2023-11-27 23:38:28   INFO  epoch: 18/24, acc_iter=119216, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:56:13, time_cost(all): 1 day, 14:02:10/12:49:55, loss=0.345735447162034, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.4867302176363824, lr=0.017084440757189262
2023-11-27 23:39:26   INFO  epoch: 18/24, acc_iter=119266, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:58:27, time_cost(all): 1 day, 14:03:08/13:15:40, loss=0.345627904801932, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=2.2985646404723417, lr=0.017044348984448676
2023-11-27 23:40:24   INFO  epoch: 18/24, acc_iter=119316, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:57:33, time_cost(all): 1 day, 14:04:06/12:09:19, loss=0.345520362441829, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=3.1965569444075754, lr=0.01700425721170809
2023-11-27 23:41:22   INFO  epoch: 18/24, acc_iter=119366, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:47:38, time_cost(all): 1 day, 14:05:04/12:14:53, loss=0.345412820081726, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=4.218319350694839, lr=0.016964165438967505
2023-11-27 23:42:19   INFO  epoch: 18/24, acc_iter=119416, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:47:35, time_cost(all): 1 day, 14:06:01/12:44:54, loss=0.345305277721624, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.153502109264594, lr=0.016924073666226905
2023-11-27 23:43:17   INFO  epoch: 18/24, acc_iter=119466, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:48:28, time_cost(all): 1 day, 14:06:59/12:27:43, loss=0.345197735361521, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.542127364484286, lr=0.01688398189348632
2023-11-27 23:44:15   INFO  epoch: 18/24, acc_iter=119516, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:47:58, time_cost(all): 1 day, 14:07:57/12:41:51, loss=0.345090193001418, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=1.9798439199692326, lr=0.016843890120745733
2023-11-27 23:45:13   INFO  epoch: 18/24, acc_iter=119566, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:49:16, time_cost(all): 1 day, 14:08:55/12:55:41, loss=0.344982650641316, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=0.5426877824157676, lr=0.016803798348005147
2023-11-27 23:46:10   INFO  epoch: 18/24, acc_iter=119616, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:41:32, time_cost(all): 1 day, 14:09:52/12:59:01, loss=0.344875108281213, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.472708461806682, lr=0.016763706575264548
2023-11-27 23:47:08   INFO  epoch: 18/24, acc_iter=119666, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:46:35, time_cost(all): 1 day, 14:10:50/12:29:25, loss=0.34476756592111, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=4.38198163663522, lr=0.01672361480252396
2023-11-27 23:48:06   INFO  epoch: 18/24, acc_iter=119716, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:40:28, time_cost(all): 1 day, 14:11:48/12:56:59, loss=0.344660023561008, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=4.290754194932748, lr=0.01668352302978339
2023-11-27 23:49:04   INFO  epoch: 18/24, acc_iter=119766, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:40:14, time_cost(all): 1 day, 14:12:46/12:42:07, loss=0.344552481200905, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=3.865464453289623, lr=0.01664343125704279
2023-11-27 23:50:01   INFO  epoch: 18/24, acc_iter=119816, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:45:12, time_cost(all): 1 day, 14:13:43/12:42:05, loss=0.344444938840802, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=2.304842798266078, lr=0.016603339484302204
2023-11-27 23:50:59   INFO  epoch: 18/24, acc_iter=119866, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:43:03, time_cost(all): 1 day, 14:14:41/12:06:45, loss=0.344337396480699, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=3.625535520335276, lr=0.01656324771156162
2023-11-27 23:51:57   INFO  epoch: 18/24, acc_iter=119916, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:43:08, time_cost(all): 1 day, 14:15:39/12:33:09, loss=0.344229854120597, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.530049160535106, lr=0.016523155938821033
2023-11-27 23:52:55   INFO  epoch: 18/24, acc_iter=119966, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:44:37, time_cost(all): 1 day, 14:16:37/12:18:18, loss=0.344122311760494, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.669011613725775, lr=0.016483064166080447
2023-11-27 23:53:52   INFO  epoch: 18/24, acc_iter=120016, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:36:32, time_cost(all): 1 day, 14:17:34/13:00:31, loss=0.344014769400391, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=0.5983103783295732, lr=0.016442972393339847
2023-11-27 23:54:50   INFO  epoch: 18/24, acc_iter=120066, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:40:33, time_cost(all): 1 day, 14:18:32/12:35:25, loss=0.343907227040289, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=1.4369112403289779, lr=0.01640288062059926
2023-11-27 23:55:48   INFO  epoch: 18/24, acc_iter=120116, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:34:03, time_cost(all): 1 day, 14:19:30/12:14:09, loss=0.343799684680186, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=2.8823471861005743, lr=0.016362788847858675
2023-11-27 23:56:46   INFO  epoch: 18/24, acc_iter=120166, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:35:35, time_cost(all): 1 day, 14:20:28/12:57:09, loss=0.343692142320083, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=3.9106409407234737, lr=0.01632269707511809
2023-11-27 23:57:43   INFO  epoch: 18/24, acc_iter=120216, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:32:32, time_cost(all): 1 day, 14:21:25/12:51:25, loss=0.343584599959981, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=0.7372432977574761, lr=0.016282605302377504
2023-11-27 23:58:41   INFO  epoch: 18/24, acc_iter=120266, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:38:24, time_cost(all): 1 day, 14:22:23/12:02:07, loss=0.343477057599878, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=3.4567519247795486, lr=0.016242513529636918
2023-11-27 23:59:39   INFO  epoch: 18/24, acc_iter=120316, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:33:53, time_cost(all): 1 day, 14:23:21/11:44:26, loss=0.343369515239775, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=3.645626995756578, lr=0.016202421756896332
2023-11-28 00:00:37   INFO  epoch: 18/24, acc_iter=120366, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:27:47, time_cost(all): 1 day, 14:24:19/12:47:57, loss=0.343261972879673, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=3.5019658849895223, lr=0.016162329984155732
2023-11-28 00:01:34   INFO  epoch: 18/24, acc_iter=120416, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:31:15, time_cost(all): 1 day, 14:25:16/12:01:28, loss=0.34315443051957, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.875438706061661, lr=0.016122238211415146
2023-11-28 00:02:32   INFO  epoch: 18/24, acc_iter=120466, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:27:20, time_cost(all): 1 day, 14:26:14/11:51:59, loss=0.343046888159467, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=0.9006338388827748, lr=0.01608214643867456
2023-11-28 00:03:30   INFO  epoch: 18/24, acc_iter=120516, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:33:09, time_cost(all): 1 day, 14:27:12/12:02:49, loss=0.342939345799365, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=3.470269122858596, lr=0.016042054665933975
2023-11-28 00:04:28   INFO  epoch: 18/24, acc_iter=120566, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:31:36, time_cost(all): 1 day, 14:28:10/12:26:15, loss=0.342831803439262, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=2.2632470850238646, lr=0.016001962893193375
2023-11-28 00:05:25   INFO  epoch: 18/24, acc_iter=120616, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:31:04, time_cost(all): 1 day, 14:29:07/12:17:53, loss=0.342724261079159, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=4.171222749019403, lr=0.015961871120452803
2023-11-28 00:06:23   INFO  epoch: 18/24, acc_iter=120666, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:29:06, time_cost(all): 1 day, 14:30:05/12:07:44, loss=0.342616718719057, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=4.040659853551766, lr=0.015921779347712217
2023-11-28 00:07:21   INFO  epoch: 18/24, acc_iter=120716, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:28:17, time_cost(all): 1 day, 14:31:03/11:56:48, loss=0.342509176358954, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.896206724766971, lr=0.015881687574971617
2023-11-28 00:08:19   INFO  epoch: 18/24, acc_iter=120766, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:55, time_cost(all): 1 day, 14:32:01/11:54:51, loss=0.342401633998851, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=2.9110684496280417, lr=0.01584159580223103
2023-11-28 00:09:16   INFO  epoch: 18/24, acc_iter=120816, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:22:29, time_cost(all): 1 day, 14:32:58/11:47:31, loss=0.342294091638749, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.703880285027194, lr=0.015801504029490446
2023-11-28 00:10:14   INFO  epoch: 18/24, acc_iter=120866, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:19:19, time_cost(all): 1 day, 14:33:56/11:40:54, loss=0.342186549278646, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.088811527332868, lr=0.01576141225674986
2023-11-28 00:11:12   INFO  epoch: 18/24, acc_iter=120916, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:19:52, time_cost(all): 1 day, 14:34:54/12:04:31, loss=0.342079006918543, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.717265720406534, lr=0.01572132048400926
2023-11-28 00:12:10   INFO  epoch: 18/24, acc_iter=120966, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:22:15, time_cost(all): 1 day, 14:35:52/11:35:52, loss=0.341971464558441, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=2.5343138661288567, lr=0.015681228711268674
2023-11-28 00:13:07   INFO  epoch: 18/24, acc_iter=121016, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:19:01, time_cost(all): 1 day, 14:36:49/12:33:02, loss=0.341863922198338, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=4.201910874683527, lr=0.015641136938528102
2023-11-28 00:14:05   INFO  epoch: 18/24, acc_iter=121066, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:16:32, time_cost(all): 1 day, 14:37:47/11:29:44, loss=0.341756379838235, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.5712445303570681, lr=0.015601045165787503
2023-11-28 00:15:03   INFO  epoch: 18/24, acc_iter=121116, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:17, time_cost(all): 1 day, 14:38:45/12:06:35, loss=0.341648837478132, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=3.3771590062896384, lr=0.015560953393046917
2023-11-28 00:16:01   INFO  epoch: 18/24, acc_iter=121166, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:51, time_cost(all): 1 day, 14:39:43/11:55:48, loss=0.34154129511803, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=4.170026137951031, lr=0.015520861620306331
2023-11-28 00:16:58   INFO  epoch: 18/24, acc_iter=121216, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:16, time_cost(all): 1 day, 14:40:40/12:00:41, loss=0.341433752757927, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=0.6613289897262427, lr=0.015480769847565745
2023-11-28 00:17:56   INFO  epoch: 18/24, acc_iter=121266, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:15:11, time_cost(all): 1 day, 14:41:38/11:31:31, loss=0.341326210397824, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=0.7072037376804001, lr=0.015440678074825145
2023-11-28 00:18:54   INFO  epoch: 18/24, acc_iter=121316, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:10:24, time_cost(all): 1 day, 14:42:36/11:50:59, loss=0.341218668037722, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=2.6963749570761353, lr=0.01540058630208456
2023-11-28 00:19:52   INFO  epoch: 18/24, acc_iter=121366, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:14:17, time_cost(all): 1 day, 14:43:34/11:50:04, loss=0.341111125677619, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.704328969549595, lr=0.015360494529343974
2023-11-28 00:20:49   INFO  epoch: 18/24, acc_iter=121416, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:14:50, time_cost(all): 1 day, 14:44:31/11:36:48, loss=0.341003583317516, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.05(1.03), norm=4.342786519194063, lr=0.015320402756603388
2023-11-28 00:21:47   INFO  epoch: 18/24, acc_iter=121466, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:14:28, time_cost(all): 1 day, 14:45:29/12:08:20, loss=0.340896040957414, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=0.7929250293440373, lr=0.015280310983862802
2023-11-28 00:22:45   INFO  epoch: 18/24, acc_iter=121516, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:12:38, time_cost(all): 1 day, 14:46:27/12:22:14, loss=0.340788498597311, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.460809520377965, lr=0.015240219211122216
2023-11-28 00:23:43   INFO  epoch: 18/24, acc_iter=121566, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:49, time_cost(all): 1 day, 14:47:25/11:42:11, loss=0.340680956237208, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=4.246680733269161, lr=0.01520012743838163
2023-11-28 00:24:40   INFO  epoch: 18/24, acc_iter=121616, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:11:01, time_cost(all): 1 day, 14:48:22/12:17:39, loss=0.340573413877106, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=1.2657986337725673, lr=0.01516003566564103
2023-11-28 00:25:38   INFO  epoch: 18/24, acc_iter=121666, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:04:47, time_cost(all): 1 day, 14:49:20/11:28:32, loss=0.340465871517003, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.115323181825354, lr=0.015119943892900445
2023-11-28 00:26:36   INFO  epoch: 18/24, acc_iter=121716, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:04:53, time_cost(all): 1 day, 14:50:18/11:28:03, loss=0.3403583291569, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=4.096982381028419, lr=0.015079852120159859
2023-11-28 00:27:34   INFO  epoch: 18/24, acc_iter=121766, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:04:30, time_cost(all): 1 day, 14:51:16/12:07:25, loss=0.340250786796798, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=4.914525148434499, lr=0.015039760347419273
2023-11-28 00:28:31   INFO  epoch: 18/24, acc_iter=121816, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:03:22, time_cost(all): 1 day, 14:52:13/12:03:54, loss=0.340143244436695, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=4.17363100887544, lr=0.014999668574678687
2023-11-28 00:29:29   INFO  epoch: 18/24, acc_iter=121866, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:03:46, time_cost(all): 1 day, 14:53:11/12:22:39, loss=0.340035702076592, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=0.6301381932356611, lr=0.014959576801938088
2023-11-28 00:30:27   INFO  epoch: 18/24, acc_iter=121916, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:19, time_cost(all): 1 day, 14:54:09/12:13:03, loss=0.33992815971649, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=4.3331295262001035, lr=0.014919485029197516
2023-11-28 00:31:25   INFO  epoch: 18/24, acc_iter=121966, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:42, time_cost(all): 1 day, 14:55:07/11:26:56, loss=0.339820617356387, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=3.7931566677676436, lr=0.01487939325645693
2023-11-28 00:32:22   INFO  epoch: 18/24, acc_iter=122016, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:51, time_cost(all): 1 day, 14:56:04/11:37:41, loss=0.339713074996284, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=2.6194300291446737, lr=0.01483930148371633
2023-11-28 00:33:20   INFO  epoch: 18/24, acc_iter=122066, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:58, time_cost(all): 1 day, 14:57:02/11:57:28, loss=0.339605532636182, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.761193049327661, lr=0.014799209710975744
2023-11-28 00:34:18   INFO  epoch: 18/24, acc_iter=122116, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:01:06, time_cost(all): 1 day, 14:58:00/11:51:21, loss=0.339497990276079, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=3.955283085813466, lr=0.014759117938235158
2023-11-28 00:35:16   INFO  epoch: 18/24, acc_iter=122166, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:54:41, time_cost(all): 1 day, 14:58:58/12:13:17, loss=0.339390447915976, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=2.5895308472480973, lr=0.014719026165494573
2023-11-28 00:36:13   INFO  epoch: 18/24, acc_iter=122216, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:18, time_cost(all): 1 day, 14:59:55/11:22:43, loss=0.339282905555874, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.702124668123289, lr=0.014678934392753973
2023-11-28 00:37:11   INFO  epoch: 18/24, acc_iter=122266, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:00, time_cost(all): 1 day, 15:00:53/11:48:57, loss=0.339175363195771, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=3.1226203518170355, lr=0.014638842620013387
2023-11-28 00:38:09   INFO  epoch: 18/24, acc_iter=122316, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:52:07, time_cost(all): 1 day, 15:01:51/11:13:48, loss=0.339067820835668, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=4.893600283049759, lr=0.014598750847272801
2023-11-28 00:39:07   INFO  epoch: 18/24, acc_iter=122366, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:28, time_cost(all): 1 day, 15:02:49/11:06:00, loss=0.338960278475565, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.8840200826522158, lr=0.014558659074532215
2023-11-28 00:40:04   INFO  epoch: 18/24, acc_iter=122416, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:53:11, time_cost(all): 1 day, 15:03:46/11:46:34, loss=0.338852736115463, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=1.998068518547108, lr=0.01451856730179163
2023-11-28 00:41:02   INFO  epoch: 18/24, acc_iter=122466, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:27, time_cost(all): 1 day, 15:04:44/11:22:17, loss=0.33874519375536, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=2.3660980568817576, lr=0.014478475529051044
2023-11-28 00:42:00   INFO  epoch: 18/24, acc_iter=122516, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:52:53, time_cost(all): 1 day, 15:05:42/12:04:06, loss=0.338637651395257, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=1.4370457425568706, lr=0.014438383756310458
2023-11-28 00:42:58   INFO  epoch: 18/24, acc_iter=122566, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:48:48, time_cost(all): 1 day, 15:06:40/11:40:13, loss=0.338530109035155, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=4.959988937969615, lr=0.014398291983569858
2023-11-28 00:43:55   INFO  epoch: 18/24, acc_iter=122616, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:51:12, time_cost(all): 1 day, 15:07:37/11:32:50, loss=0.338422566675052, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.83(1.03), norm=2.285404900702921, lr=0.014358200210829272
2023-11-28 00:44:53   INFO  epoch: 18/24, acc_iter=122666, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:38, time_cost(all): 1 day, 15:08:35/12:06:51, loss=0.338315024314949, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.982835639930685, lr=0.014318108438088686
2023-11-28 00:45:51   INFO  epoch: 18/24, acc_iter=122716, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:34, time_cost(all): 1 day, 15:09:33/11:37:14, loss=0.338207481954847, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=3.644162002473956, lr=0.0142780166653481
2023-11-28 00:46:49   INFO  epoch: 18/24, acc_iter=122766, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:47:03, time_cost(all): 1 day, 15:10:31/11:12:20, loss=0.338099939594744, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=1.324553872986665, lr=0.0142379248926075
2023-11-28 00:47:46   INFO  epoch: 18/24, acc_iter=122816, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:21, time_cost(all): 1 day, 15:11:28/12:03:08, loss=0.337992397234641, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=1.7121975745354905, lr=0.014197833119866929
2023-11-28 00:48:44   INFO  epoch: 18/24, acc_iter=122866, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:59, time_cost(all): 1 day, 15:12:26/11:26:42, loss=0.337884854874539, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=2.1055306016730344, lr=0.014157741347126343
2023-11-28 00:49:42   INFO  epoch: 18/24, acc_iter=122916, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:25, time_cost(all): 1 day, 15:13:24/11:11:51, loss=0.337777312514436, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.95(1.03), norm=3.682009847664324, lr=0.014117649574385743
2023-11-28 00:50:40   INFO  epoch: 18/24, acc_iter=122966, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:00, time_cost(all): 1 day, 15:14:22/11:35:34, loss=0.337669770154333, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=2.153218685111521, lr=0.014077557801645157
2023-11-28 00:51:37   INFO  epoch: 18/24, acc_iter=123016, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:37, time_cost(all): 1 day, 15:15:19/10:54:51, loss=0.337562227794231, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=2.201184283717165, lr=0.014037466028904572
2023-11-28 00:52:35   INFO  epoch: 18/24, acc_iter=123066, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:31, time_cost(all): 1 day, 15:16:17/11:16:13, loss=0.337454685434128, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=3.3828066557367187, lr=0.013997374256163986
2023-11-28 00:53:33   INFO  epoch: 18/24, acc_iter=123116, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:38:02, time_cost(all): 1 day, 15:17:15/10:51:48, loss=0.337347143074025, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.381623817152427, lr=0.013957282483423386
2023-11-28 00:54:31   INFO  epoch: 18/24, acc_iter=123166, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:23, time_cost(all): 1 day, 15:18:13/11:38:29, loss=0.337239600713923, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=4.0783569486789215, lr=0.0139171907106828
2023-11-28 00:55:28   INFO  epoch: 18/24, acc_iter=123216, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:57, time_cost(all): 1 day, 15:19:10/11:34:35, loss=0.33713205835382, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=1.2362852168676708, lr=0.013877098937942214
2023-11-28 00:56:26   INFO  epoch: 18/24, acc_iter=123266, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:35:31, time_cost(all): 1 day, 15:20:08/11:24:51, loss=0.337024515993717, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=2.87652978739236, lr=0.013837007165201629
2023-11-28 00:57:24   INFO  epoch: 18/24, acc_iter=123316, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:34:55, time_cost(all): 1 day, 15:21:06/10:49:18, loss=0.336916973633615, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.7791440140668184, lr=0.013796915392461043
2023-11-28 00:58:22   INFO  epoch: 18/24, acc_iter=123366, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:07, time_cost(all): 1 day, 15:22:04/11:14:16, loss=0.336809431273512, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=3.3392222143574966, lr=0.013756823619720457
2023-11-28 00:59:19   INFO  epoch: 18/24, acc_iter=123416, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:47, time_cost(all): 1 day, 15:23:01/11:07:36, loss=0.336701888913409, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=2.3531734279070005, lr=0.013716731846979871
2023-11-28 01:00:17   INFO  epoch: 18/24, acc_iter=123466, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:38, time_cost(all): 1 day, 15:23:59/11:00:58, loss=0.336594346553307, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=4.9676996184486155, lr=0.013676640074239271
2023-11-28 01:01:15   INFO  epoch: 18/24, acc_iter=123516, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:47, time_cost(all): 1 day, 15:24:57/10:57:55, loss=0.336486804193204, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=1.7872953718344633, lr=0.013636548301498685
2023-11-28 01:02:13   INFO  epoch: 18/24, acc_iter=123566, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:17, time_cost(all): 1 day, 15:25:55/11:08:53, loss=0.336379261833101, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.973509150793663, lr=0.0135964565287581
2023-11-28 01:03:10   INFO  epoch: 18/24, acc_iter=123616, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:29:11, time_cost(all): 1 day, 15:26:52/11:38:39, loss=0.336271719472999, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=3.9227752830224776, lr=0.013556364756017514
2023-11-28 01:04:08   INFO  epoch: 18/24, acc_iter=123666, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:45, time_cost(all): 1 day, 15:27:50/10:52:10, loss=0.336164177112896, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=1.6092528044646988, lr=0.013516272983276928
2023-11-28 01:05:06   INFO  epoch: 18/24, acc_iter=123716, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:06, time_cost(all): 1 day, 15:28:48/11:47:40, loss=0.336056634752793, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.3260276023311104, lr=0.013476181210536342
2023-11-28 01:06:04   INFO  epoch: 18/24, acc_iter=123766, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:47, time_cost(all): 1 day, 15:29:46/10:56:08, loss=0.33594909239269, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.791046485235641, lr=0.013436089437795756
2023-11-28 01:07:01   INFO  epoch: 18/24, acc_iter=123816, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:06, time_cost(all): 1 day, 15:30:43/11:07:21, loss=0.335841550032588, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=4.718933975288104, lr=0.01339599766505517
2023-11-28 01:07:59   INFO  epoch: 18/24, acc_iter=123866, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:29, time_cost(all): 1 day, 15:31:41/11:38:45, loss=0.335734007672485, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=1.994032805465433, lr=0.01335590589231457
2023-11-28 01:08:57   INFO  epoch: 18/24, acc_iter=123916, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:41, time_cost(all): 1 day, 15:32:39/11:03:52, loss=0.335626465312382, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=0.892106188951058, lr=0.013315814119573985
2023-11-28 01:09:55   INFO  epoch: 18/24, acc_iter=123966, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:10, time_cost(all): 1 day, 15:33:37/11:32:02, loss=0.33551892295228, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=4.220040678383023, lr=0.013275722346833399
2023-11-28 01:10:52   INFO  epoch: 18/24, acc_iter=124016, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:42, time_cost(all): 1 day, 15:34:34/11:07:20, loss=0.335411380592177, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=4.445081279516862, lr=0.013235630574092813
2023-11-28 01:11:50   INFO  epoch: 18/24, acc_iter=124066, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:56, time_cost(all): 1 day, 15:35:32/11:35:06, loss=0.335303838232074, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.3725884086673545, lr=0.013195538801352213
2023-11-28 01:12:48   INFO  epoch: 18/24, acc_iter=124116, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:39, time_cost(all): 1 day, 15:36:30/11:23:02, loss=0.335196295871972, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.581345556645613, lr=0.013155447028611628
2023-11-28 01:13:46   INFO  epoch: 18/24, acc_iter=124166, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:17, time_cost(all): 1 day, 15:37:28/10:43:29, loss=0.335088753511869, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=0.80746203498175, lr=0.013115355255871056
2023-11-28 01:14:44   INFO  epoch: 18/24, acc_iter=124216, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:00, time_cost(all): 1 day, 15:38:26/11:31:23, loss=0.334981211151766, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=4.68362762304465, lr=0.013075263483130456
2023-11-28 01:15:41   INFO  epoch: 18/24, acc_iter=124266, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:53, time_cost(all): 1 day, 15:39:23/10:32:13, loss=0.334873668791664, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.2777218455669335, lr=0.01303517171038987
2023-11-28 01:16:39   INFO  epoch: 18/24, acc_iter=124316, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:48, time_cost(all): 1 day, 15:40:21/10:33:27, loss=0.334766126431561, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.739800846048963, lr=0.012995079937649284
2023-11-28 01:17:37   INFO  epoch: 18/24, acc_iter=124366, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:30, time_cost(all): 1 day, 15:41:19/10:33:07, loss=0.334658584071458, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=4.595815100345656, lr=0.012954988164908698
2023-11-28 01:18:35   INFO  epoch: 18/24, acc_iter=124416, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:36, time_cost(all): 1 day, 15:42:17/11:13:54, loss=0.334551041711356, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=0.6073474459189252, lr=0.012914896392168099
2023-11-28 01:19:32   INFO  epoch: 18/24, acc_iter=124466, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:37, time_cost(all): 1 day, 15:43:14/11:25:24, loss=0.334443499351253, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=3.8580824062161696, lr=0.012874804619427513
2023-11-28 01:20:30   INFO  epoch: 18/24, acc_iter=124516, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:44, time_cost(all): 1 day, 15:44:12/10:37:46, loss=0.33433595699115, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.5055506120257185, lr=0.012834712846686927
2023-11-28 01:21:28   INFO  epoch: 18/24, acc_iter=124566, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:55, time_cost(all): 1 day, 15:45:10/10:58:39, loss=0.334228414631048, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=2.783451498600983, lr=0.012794621073946341
2023-11-28 01:22:26   INFO  epoch: 18/24, acc_iter=124616, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:21, time_cost(all): 1 day, 15:46:08/10:28:58, loss=0.334120872270945, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.7161385335373085, lr=0.012754529301205755
2023-11-28 01:23:23   INFO  epoch: 18/24, acc_iter=124666, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:22, time_cost(all): 1 day, 15:47:05/11:20:28, loss=0.334013329910842, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=3.295695087866463, lr=0.01271443752846517
2023-11-28 01:24:21   INFO  epoch: 18/24, acc_iter=124716, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:01, time_cost(all): 1 day, 15:48:03/10:52:35, loss=0.33390578755074, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.6539728418557451, lr=0.012674345755724584
2023-11-28 01:25:19   INFO  epoch: 18/24, acc_iter=124766, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:08, time_cost(all): 1 day, 15:49:01/10:46:36, loss=0.333798245190637, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.2055521184973355, lr=0.012634253982983984
2023-11-28 01:26:17   INFO  epoch: 18/24, acc_iter=124816, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:18, time_cost(all): 1 day, 15:49:59/10:26:06, loss=0.333690702830534, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=1.844086509676885, lr=0.012594162210243398
2023-11-28 01:27:14   INFO  epoch: 18/24, acc_iter=124866, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:20, time_cost(all): 1 day, 15:50:56/11:07:05, loss=0.333583160470432, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=3.2268090522824284, lr=0.012554070437502812
2023-11-28 01:28:12   INFO  epoch: 18/24, acc_iter=124916, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:29, time_cost(all): 1 day, 15:51:54/10:21:02, loss=0.333475618110329, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.693139096374137, lr=0.012513978664762226
2023-11-28 01:29:10   INFO  epoch: 18/24, acc_iter=124966, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:29, time_cost(all): 1 day, 15:52:52/11:17:34, loss=0.333368075750226, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=3.36719805504875, lr=0.012473886892021627
2023-11-28 01:30:08   INFO  epoch: 18/24, acc_iter=125016, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:39, time_cost(all): 1 day, 15:53:50/10:50:52, loss=0.333260533390124, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.83(1.03), norm=3.1703789610934985, lr=0.01243379511928104
2023-11-28 01:31:05   INFO  epoch: 18/24, acc_iter=125066, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:37, time_cost(all): 1 day, 15:54:47/10:58:44, loss=0.333152991030021, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=4.844241942521931, lr=0.012393703346540469
2023-11-28 01:32:03   INFO  epoch: 18/24, acc_iter=125116, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:41, time_cost(all): 1 day, 15:55:45/10:36:02, loss=0.333045448669918, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=0.9759718543243797, lr=0.01235361157379987
2023-11-28 01:33:01   INFO  epoch: 19/24, acc_iter=125203, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:01:12, time_cost(all): 1 day, 15:56:43/10:58:18, loss=0.33285832496334, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.50142958100922, lr=0.012283851889231256
2023-11-28 01:33:59   INFO  epoch: 19/24, acc_iter=125253, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:02:32, time_cost(all): 1 day, 15:57:41/11:09:34, loss=0.332750782603237, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=3.9413129756525627, lr=0.012243760116490657
2023-11-28 01:34:56   INFO  epoch: 19/24, acc_iter=125303, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:03:02, time_cost(all): 1 day, 15:58:38/10:26:10, loss=0.332643240243134, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.710548543441429, lr=0.012203668343750071
2023-11-28 01:35:54   INFO  epoch: 19/24, acc_iter=125353, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:00:04, time_cost(all): 1 day, 15:59:36/10:51:59, loss=0.332535697883032, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=0.8125638713486489, lr=0.012163576571009485
2023-11-28 01:36:52   INFO  epoch: 19/24, acc_iter=125403, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:56:56, time_cost(all): 1 day, 16:00:34/11:06:08, loss=0.332428155522929, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=3.2008773896471623, lr=0.0121234847982689
2023-11-28 01:37:50   INFO  epoch: 19/24, acc_iter=125453, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:06:02, time_cost(all): 1 day, 16:01:32/10:58:05, loss=0.332320613162826, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=1.973948340469884, lr=0.0120833930255283
2023-11-28 01:38:47   INFO  epoch: 19/24, acc_iter=125503, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:04:28, time_cost(all): 1 day, 16:02:29/10:10:15, loss=0.332213070802723, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=2.306267256385805, lr=0.012043301252787714
2023-11-28 01:39:45   INFO  epoch: 19/24, acc_iter=125553, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:02:37, time_cost(all): 1 day, 16:03:27/10:24:03, loss=0.332105528442621, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=3.380505728869101, lr=0.012003209480047142
2023-11-28 01:40:43   INFO  epoch: 19/24, acc_iter=125603, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:00:20, time_cost(all): 1 day, 16:04:25/10:44:18, loss=0.331997986082518, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.6384618546097416, lr=0.011963117707306542
2023-11-28 01:41:41   INFO  epoch: 19/24, acc_iter=125653, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:54:46, time_cost(all): 1 day, 16:05:23/10:17:18, loss=0.331890443722415, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.2403957085972364, lr=0.011923025934565956
2023-11-28 01:42:38   INFO  epoch: 19/24, acc_iter=125703, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:56:55, time_cost(all): 1 day, 16:06:20/10:49:27, loss=0.331782901362313, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=3.0191184652325873, lr=0.01188293416182537
2023-11-28 01:43:36   INFO  epoch: 19/24, acc_iter=125753, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:57:54, time_cost(all): 1 day, 16:07:18/11:05:28, loss=0.33167535900221, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.1661599741764979, lr=0.011842842389084784
2023-11-28 01:44:34   INFO  epoch: 19/24, acc_iter=125803, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:58:07, time_cost(all): 1 day, 16:08:16/10:17:31, loss=0.331567816642107, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.199397784717644, lr=0.011802750616344185
2023-11-28 01:45:32   INFO  epoch: 19/24, acc_iter=125853, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:48:00, time_cost(all): 1 day, 16:09:14/10:02:52, loss=0.331460274282005, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.6546625526420327, lr=0.011762658843603599
2023-11-28 01:46:29   INFO  epoch: 19/24, acc_iter=125903, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:53:58, time_cost(all): 1 day, 16:10:11/10:46:01, loss=0.331352731921902, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=2.4433753917772645, lr=0.011722567070863013
2023-11-28 01:47:27   INFO  epoch: 19/24, acc_iter=125953, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:49:48, time_cost(all): 1 day, 16:11:09/10:40:09, loss=0.331245189561799, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=0.9395062453077455, lr=0.011682475298122427
2023-11-28 01:48:25   INFO  epoch: 19/24, acc_iter=126003, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:48:27, time_cost(all): 1 day, 16:12:07/10:41:36, loss=0.331137647201697, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.1599895457213485, lr=0.011642383525381841
2023-11-28 01:49:23   INFO  epoch: 19/24, acc_iter=126053, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:51:49, time_cost(all): 1 day, 16:13:05/10:05:07, loss=0.331030104841594, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.7176582557699217, lr=0.011602291752641256
2023-11-28 01:50:20   INFO  epoch: 19/24, acc_iter=126103, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:43:39, time_cost(all): 1 day, 16:14:02/10:04:09, loss=0.330922562481491, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=4.017715950800441, lr=0.01156219997990067
2023-11-28 01:51:18   INFO  epoch: 19/24, acc_iter=126153, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:50:43, time_cost(all): 1 day, 16:15:00/10:17:24, loss=0.330815020121389, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=2.549885444215674, lr=0.01152210820716007
2023-11-28 01:52:16   INFO  epoch: 19/24, acc_iter=126203, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:49:53, time_cost(all): 1 day, 16:15:58/10:01:34, loss=0.330707477761286, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=1.6618135031390417, lr=0.011482016434419484
2023-11-28 01:53:14   INFO  epoch: 19/24, acc_iter=126253, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:45:37, time_cost(all): 1 day, 16:16:56/10:57:23, loss=0.330599935401183, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=4.513436238880585, lr=0.011441924661678898
2023-11-28 01:54:11   INFO  epoch: 19/24, acc_iter=126303, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:48:34, time_cost(all): 1 day, 16:17:53/10:30:06, loss=0.330492393041081, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=4.603598078636105, lr=0.011401832888938312
2023-11-28 01:55:09   INFO  epoch: 19/24, acc_iter=126353, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:39:45, time_cost(all): 1 day, 16:18:51/10:14:47, loss=0.330384850680978, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.952366561783213, lr=0.011361741116197713
2023-11-28 01:56:07   INFO  epoch: 19/24, acc_iter=126403, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:42:08, time_cost(all): 1 day, 16:19:49/10:45:39, loss=0.330277308320875, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.625445394619062, lr=0.011321649343457127
2023-11-28 01:57:05   INFO  epoch: 19/24, acc_iter=126453, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:42:34, time_cost(all): 1 day, 16:20:47/10:09:26, loss=0.330169765960773, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=2.415733019135421, lr=0.011281557570716555
2023-11-28 01:58:02   INFO  epoch: 19/24, acc_iter=126503, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:45:36, time_cost(all): 1 day, 16:21:44/10:46:33, loss=0.33006222360067, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=1.4892460068173379, lr=0.011241465797975955
2023-11-28 01:59:00   INFO  epoch: 19/24, acc_iter=126553, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:40:38, time_cost(all): 1 day, 16:22:42/10:46:45, loss=0.329954681240567, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=3.055710822518924, lr=0.01120137402523537
2023-11-28 01:59:58   INFO  epoch: 19/24, acc_iter=126603, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:43:46, time_cost(all): 1 day, 16:23:40/10:30:10, loss=0.329847138880465, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=2.092359777085002, lr=0.011161282252494784
2023-11-28 02:00:56   INFO  epoch: 19/24, acc_iter=126653, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:35:58, time_cost(all): 1 day, 16:24:38/10:12:07, loss=0.329739596520362, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.8183135597392277, lr=0.011121190479754198
2023-11-28 02:01:53   INFO  epoch: 19/24, acc_iter=126703, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:40:25, time_cost(all): 1 day, 16:25:35/10:29:38, loss=0.329632054160259, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=4.49129373435596, lr=0.011081098707013612
2023-11-28 02:02:51   INFO  epoch: 19/24, acc_iter=126753, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:38:03, time_cost(all): 1 day, 16:26:33/10:47:05, loss=0.329524511800156, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=0.9774622233970673, lr=0.011041006934273012
2023-11-28 02:03:49   INFO  epoch: 19/24, acc_iter=126803, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:38:35, time_cost(all): 1 day, 16:27:31/9:57:39, loss=0.329416969440054, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=0.6142186936264545, lr=0.011000915161532426
2023-11-28 02:04:47   INFO  epoch: 19/24, acc_iter=126853, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:20, time_cost(all): 1 day, 16:28:29/9:47:29, loss=0.329309427079951, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=1.1933096939417074, lr=0.010960823388791854
2023-11-28 02:05:44   INFO  epoch: 19/24, acc_iter=126903, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:33:40, time_cost(all): 1 day, 16:29:26/9:59:09, loss=0.329201884719848, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=4.758348202873854, lr=0.010920731616051255
2023-11-28 02:06:42   INFO  epoch: 19/24, acc_iter=126953, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:27:39, time_cost(all): 1 day, 16:30:24/9:42:49, loss=0.329094342359746, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=1.613002535987449, lr=0.010880639843310669
2023-11-28 02:07:40   INFO  epoch: 19/24, acc_iter=127003, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:26:45, time_cost(all): 1 day, 16:31:22/9:43:24, loss=0.328986799999643, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=4.658664815803193, lr=0.010840548070570083
2023-11-28 02:08:38   INFO  epoch: 19/24, acc_iter=127053, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:27:47, time_cost(all): 1 day, 16:32:20/9:53:41, loss=0.32887925763954, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=3.9198416085955996, lr=0.010800456297829497
2023-11-28 02:09:35   INFO  epoch: 19/24, acc_iter=127103, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:25:02, time_cost(all): 1 day, 16:33:17/9:41:17, loss=0.328771715279438, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=3.028482714125891, lr=0.010760364525088897
2023-11-28 02:10:33   INFO  epoch: 19/24, acc_iter=127153, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:26:47, time_cost(all): 1 day, 16:34:15/9:39:06, loss=0.328664172919335, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=2.821010712081309, lr=0.010720272752348312
2023-11-28 02:11:31   INFO  epoch: 19/24, acc_iter=127203, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:31:17, time_cost(all): 1 day, 16:35:13/10:07:06, loss=0.328556630559232, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.21092560280281, lr=0.010680180979607726
2023-11-28 02:12:29   INFO  epoch: 19/24, acc_iter=127253, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:25, time_cost(all): 1 day, 16:36:11/9:41:33, loss=0.32844908819913, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=2.22625793217367, lr=0.01064008920686714
2023-11-28 02:13:26   INFO  epoch: 19/24, acc_iter=127303, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:25:58, time_cost(all): 1 day, 16:37:08/10:12:40, loss=0.328341545839027, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=0.7424612607977868, lr=0.010599997434126554
2023-11-28 02:14:24   INFO  epoch: 19/24, acc_iter=127353, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:00, time_cost(all): 1 day, 16:38:06/9:57:27, loss=0.328234003478924, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=2.817310672732579, lr=0.010559905661385968
2023-11-28 02:15:22   INFO  epoch: 19/24, acc_iter=127403, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:26:44, time_cost(all): 1 day, 16:39:04/10:02:18, loss=0.328126461118822, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=4.720691121366301, lr=0.010519813888645382
2023-11-28 02:16:20   INFO  epoch: 19/24, acc_iter=127453, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:26:35, time_cost(all): 1 day, 16:40:02/9:47:13, loss=0.328018918758719, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.638345858086193, lr=0.010479722115904783
2023-11-28 02:17:17   INFO  epoch: 19/24, acc_iter=127503, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:19:14, time_cost(all): 1 day, 16:40:59/10:26:35, loss=0.327911376398616, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=0.5639262703757787, lr=0.010439630343164197
2023-11-28 02:18:15   INFO  epoch: 19/24, acc_iter=127553, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:20:03, time_cost(all): 1 day, 16:41:57/10:05:47, loss=0.327803834038514, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=1.571086906968817, lr=0.010399538570423611
2023-11-28 02:19:13   INFO  epoch: 19/24, acc_iter=127603, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:16:36, time_cost(all): 1 day, 16:42:55/10:02:08, loss=0.327696291678411, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=2.658683920279695, lr=0.010359446797683025
2023-11-28 02:20:11   INFO  epoch: 19/24, acc_iter=127653, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:29, time_cost(all): 1 day, 16:43:53/10:10:12, loss=0.327588749318308, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.12(1.03), norm=4.551140028896447, lr=0.010319355024942425
2023-11-28 02:21:08   INFO  epoch: 19/24, acc_iter=127703, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:14:32, time_cost(all): 1 day, 16:44:50/10:09:51, loss=0.327481206958206, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=2.048603215819492, lr=0.01027926325220184
2023-11-28 02:22:06   INFO  epoch: 19/24, acc_iter=127753, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:20:27, time_cost(all): 1 day, 16:45:48/9:55:21, loss=0.327373664598103, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=4.451162845957894, lr=0.010239171479461268
2023-11-28 02:23:04   INFO  epoch: 19/24, acc_iter=127803, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:19:13, time_cost(all): 1 day, 16:46:46/9:47:46, loss=0.327266122238, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.3812154406902337, lr=0.010199079706720668
2023-11-28 02:24:02   INFO  epoch: 19/24, acc_iter=127853, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:11:57, time_cost(all): 1 day, 16:47:44/9:55:43, loss=0.327158579877898, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=1.3719162134096545, lr=0.010158987933980082
2023-11-28 02:24:59   INFO  epoch: 19/24, acc_iter=127903, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:12:44, time_cost(all): 1 day, 16:48:41/10:12:35, loss=0.327051037517795, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=0.5293859431982991, lr=0.010118896161239496
2023-11-28 02:25:57   INFO  epoch: 19/24, acc_iter=127953, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:13:39, time_cost(all): 1 day, 16:49:39/10:21:27, loss=0.326943495157692, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.084319106023252, lr=0.01007880438849891
2023-11-28 02:26:55   INFO  epoch: 19/24, acc_iter=128003, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:08:46, time_cost(all): 1 day, 16:50:37/10:16:27, loss=0.326835952797589, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=3.8668469711708937, lr=0.01003871261575831
2023-11-28 02:27:53   INFO  epoch: 19/24, acc_iter=128053, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:11:05, time_cost(all): 1 day, 16:51:35/9:39:55, loss=0.326728410437487, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.870134801502299, lr=0.009999279998928376
2023-11-28 02:28:50   INFO  epoch: 19/24, acc_iter=128103, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:09:31, time_cost(all): 1 day, 16:52:32/10:04:01, loss=0.326620868077384, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=0.5346355501959554, lr=0.009978349735218217
2023-11-28 02:29:48   INFO  epoch: 19/24, acc_iter=128153, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:01, time_cost(all): 1 day, 16:53:30/9:40:58, loss=0.326513325717281, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=0.7649443741592403, lr=0.009957419471508057
2023-11-28 02:30:46   INFO  epoch: 19/24, acc_iter=128203, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:10:00, time_cost(all): 1 day, 16:54:28/9:36:15, loss=0.326405783357179, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.484830694161177, lr=0.009936489207797897
2023-11-28 02:31:44   INFO  epoch: 19/24, acc_iter=128253, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:04:52, time_cost(all): 1 day, 16:55:26/9:48:06, loss=0.326298240997076, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=4.330443694022954, lr=0.009915558944087736
2023-11-28 02:32:41   INFO  epoch: 19/24, acc_iter=128303, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:04:22, time_cost(all): 1 day, 16:56:23/10:09:49, loss=0.326190698636973, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.5087096270232356, lr=0.009894628680377576
2023-11-28 02:33:39   INFO  epoch: 19/24, acc_iter=128353, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:31, time_cost(all): 1 day, 16:57:21/9:50:21, loss=0.326083156276871, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=1.962515478103012, lr=0.009873698416667416
2023-11-28 02:34:37   INFO  epoch: 19/24, acc_iter=128403, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:06:23, time_cost(all): 1 day, 16:58:19/9:18:09, loss=0.325975613916768, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.3665501577582946, lr=0.009852768152957256
2023-11-28 02:35:35   INFO  epoch: 19/24, acc_iter=128453, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:04:36, time_cost(all): 1 day, 16:59:17/9:21:07, loss=0.325868071556665, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=3.942086130678679, lr=0.009831837889247097
2023-11-28 02:36:32   INFO  epoch: 19/24, acc_iter=128503, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/0:59:47, time_cost(all): 1 day, 17:00:14/9:20:03, loss=0.325760529196563, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=1.5920744858653169, lr=0.009810907625536937
2023-11-28 02:37:30   INFO  epoch: 19/24, acc_iter=128553, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:55, time_cost(all): 1 day, 17:01:12/9:52:20, loss=0.32565298683646, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=2.2809336264589604, lr=0.009789977361826777
2023-11-28 02:38:28   INFO  epoch: 19/24, acc_iter=128603, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:00:39, time_cost(all): 1 day, 17:02:10/10:07:35, loss=0.325545444476357, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=2.0620746229363522, lr=0.009769047098116617
2023-11-28 02:39:26   INFO  epoch: 19/24, acc_iter=128653, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:48, time_cost(all): 1 day, 17:03:08/9:48:38, loss=0.325437902116255, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=1.1641256339475898, lr=0.009748116834406457
2023-11-28 02:40:23   INFO  epoch: 19/24, acc_iter=128703, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:59:39, time_cost(all): 1 day, 17:04:05/9:56:43, loss=0.325330359756152, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=1.9904587110651748, lr=0.009727186570696296
2023-11-28 02:41:21   INFO  epoch: 19/24, acc_iter=128753, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:45, time_cost(all): 1 day, 17:05:03/9:57:28, loss=0.325222817396049, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=4.141747270716275, lr=0.009706256306986136
2023-11-28 02:42:19   INFO  epoch: 19/24, acc_iter=128803, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:52, time_cost(all): 1 day, 17:06:01/10:02:47, loss=0.325115275035947, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=3.5058109421691404, lr=0.009685326043275978
2023-11-28 02:43:17   INFO  epoch: 19/24, acc_iter=128853, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:53:16, time_cost(all): 1 day, 17:06:59/9:25:41, loss=0.325007732675844, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.8053733228974767, lr=0.009664395779565817
2023-11-28 02:44:14   INFO  epoch: 19/24, acc_iter=128903, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:02, time_cost(all): 1 day, 17:07:56/9:12:51, loss=0.324900190315741, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=4.296237410528466, lr=0.009643465515855657
2023-11-28 02:45:12   INFO  epoch: 19/24, acc_iter=128953, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:53:11, time_cost(all): 1 day, 17:08:54/9:41:03, loss=0.324792647955639, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=3.2240343412879175, lr=0.009622535252145497
2023-11-28 02:46:10   INFO  epoch: 19/24, acc_iter=129003, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:50:29, time_cost(all): 1 day, 17:09:52/9:21:36, loss=0.324685105595536, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=2.807909770174736, lr=0.009601604988435337
2023-11-28 02:47:08   INFO  epoch: 19/24, acc_iter=129053, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:14, time_cost(all): 1 day, 17:10:50/9:27:56, loss=0.324577563235433, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=2.790119641648503, lr=0.009580674724725177
2023-11-28 02:48:05   INFO  epoch: 19/24, acc_iter=129103, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:48:55, time_cost(all): 1 day, 17:11:47/9:38:59, loss=0.324470020875331, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=1.359229475922003, lr=0.009559744461015016
2023-11-28 02:49:03   INFO  epoch: 19/24, acc_iter=129153, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:50:29, time_cost(all): 1 day, 17:12:45/9:38:35, loss=0.324362478515228, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=0.5297760132353349, lr=0.009538814197304858
2023-11-28 02:50:01   INFO  epoch: 19/24, acc_iter=129203, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:50:36, time_cost(all): 1 day, 17:13:43/9:28:43, loss=0.324254936155125, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=2.5979915784957464, lr=0.009517883933594698
2023-11-28 02:50:59   INFO  epoch: 19/24, acc_iter=129253, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:47:27, time_cost(all): 1 day, 17:14:41/9:09:32, loss=0.324147393795023, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=1.4597559655118226, lr=0.009496953669884537
2023-11-28 02:51:56   INFO  epoch: 19/24, acc_iter=129303, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:48:24, time_cost(all): 1 day, 17:15:38/9:04:46, loss=0.32403985143492, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=3.0537486550790556, lr=0.009476023406174377
2023-11-28 02:52:54   INFO  epoch: 19/24, acc_iter=129353, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:13, time_cost(all): 1 day, 17:16:36/9:19:31, loss=0.323932309074817, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=1.9467945513392977, lr=0.009455093142464217
2023-11-28 02:53:52   INFO  epoch: 19/24, acc_iter=129403, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:44:15, time_cost(all): 1 day, 17:17:34/9:32:14, loss=0.323824766714714, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=4.9332381078898395, lr=0.009434162878754057
2023-11-28 02:54:50   INFO  epoch: 19/24, acc_iter=129453, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:42:44, time_cost(all): 1 day, 17:18:32/9:45:00, loss=0.323717224354612, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.942925628844201, lr=0.009413232615043897
2023-11-28 02:55:47   INFO  epoch: 19/24, acc_iter=129503, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:41:53, time_cost(all): 1 day, 17:19:29/9:13:39, loss=0.323609681994509, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=0.656911187875804, lr=0.009392302351333738
2023-11-28 02:56:45   INFO  epoch: 19/24, acc_iter=129553, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:41:29, time_cost(all): 1 day, 17:20:27/9:17:29, loss=0.323502139634406, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.3084662027605574, lr=0.009371372087623578
2023-11-28 02:57:43   INFO  epoch: 19/24, acc_iter=129603, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:41:17, time_cost(all): 1 day, 17:21:25/9:40:56, loss=0.323394597274304, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.346421309973309, lr=0.009350441823913418
2023-11-28 02:58:41   INFO  epoch: 19/24, acc_iter=129653, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:39:58, time_cost(all): 1 day, 17:22:23/9:27:55, loss=0.323287054914201, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.0974081904421755, lr=0.009329511560203257
2023-11-28 02:59:39   INFO  epoch: 19/24, acc_iter=129703, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:37:38, time_cost(all): 1 day, 17:23:21/9:17:34, loss=0.323179512554098, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=4.5901023309068245, lr=0.009308581296493097
2023-11-28 03:00:36   INFO  epoch: 19/24, acc_iter=129753, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:28, time_cost(all): 1 day, 17:24:18/9:33:26, loss=0.323071970193996, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.3617493679365613, lr=0.009287651032782937
2023-11-28 03:01:34   INFO  epoch: 19/24, acc_iter=129803, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:19, time_cost(all): 1 day, 17:25:16/9:40:40, loss=0.322964427833893, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=1.2585691378276205, lr=0.009266720769072777
2023-11-28 03:02:32   INFO  epoch: 19/24, acc_iter=129853, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:36:49, time_cost(all): 1 day, 17:26:14/8:51:46, loss=0.32285688547379, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.040146003845072, lr=0.009245790505362618
2023-11-28 03:03:30   INFO  epoch: 19/24, acc_iter=129903, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:16, time_cost(all): 1 day, 17:27:12/9:01:36, loss=0.322749343113688, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=4.999570095018598, lr=0.009224860241652458
2023-11-28 03:04:27   INFO  epoch: 19/24, acc_iter=129953, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:45, time_cost(all): 1 day, 17:28:09/9:00:02, loss=0.322641800753585, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.843253277147091, lr=0.009203929977942298
2023-11-28 03:05:25   INFO  epoch: 19/24, acc_iter=130003, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:02, time_cost(all): 1 day, 17:29:07/9:06:12, loss=0.322534258393482, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=1.5702377864930397, lr=0.009182999714232138
2023-11-28 03:06:23   INFO  epoch: 19/24, acc_iter=130053, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:57, time_cost(all): 1 day, 17:30:05/9:33:59, loss=0.32242671603338, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=2.996487819710375, lr=0.009162069450521977
2023-11-28 03:07:21   INFO  epoch: 19/24, acc_iter=130103, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:39, time_cost(all): 1 day, 17:31:03/9:24:26, loss=0.322319173673277, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.8775501230194367, lr=0.009141139186811817
2023-11-28 03:08:18   INFO  epoch: 19/24, acc_iter=130153, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:02, time_cost(all): 1 day, 17:32:00/9:37:00, loss=0.322211631313174, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=2.7480617512559404, lr=0.009120208923101657
2023-11-28 03:09:16   INFO  epoch: 19/24, acc_iter=130203, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:14, time_cost(all): 1 day, 17:32:58/9:32:12, loss=0.322104088953072, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=0.6242607131615989, lr=0.009099278659391499
2023-11-28 03:10:14   INFO  epoch: 19/24, acc_iter=130253, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:34, time_cost(all): 1 day, 17:33:56/9:00:41, loss=0.321996546592969, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=0.6662687874550697, lr=0.009078348395681338
2023-11-28 03:11:12   INFO  epoch: 19/24, acc_iter=130303, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:41, time_cost(all): 1 day, 17:34:54/9:27:58, loss=0.321889004232866, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=3.019246725134262, lr=0.009057418131971178
2023-11-28 03:12:09   INFO  epoch: 19/24, acc_iter=130353, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:49, time_cost(all): 1 day, 17:35:51/9:14:44, loss=0.321781461872764, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=3.1778446675097474, lr=0.009036487868261018
2023-11-28 03:13:07   INFO  epoch: 19/24, acc_iter=130403, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:24:40, time_cost(all): 1 day, 17:36:49/8:40:25, loss=0.321673919512661, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=2.228623630513905, lr=0.009015557604550858
2023-11-28 03:14:05   INFO  epoch: 19/24, acc_iter=130453, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:24:32, time_cost(all): 1 day, 17:37:47/9:02:26, loss=0.321566377152558, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=4.809900721036079, lr=0.008994627340840698
2023-11-28 03:15:03   INFO  epoch: 19/24, acc_iter=130503, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:27, time_cost(all): 1 day, 17:38:45/9:03:53, loss=0.321458834792456, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=0.723838174053653, lr=0.008973697077130537
2023-11-28 03:16:00   INFO  epoch: 19/24, acc_iter=130553, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:35, time_cost(all): 1 day, 17:39:42/9:18:35, loss=0.321351292432353, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.951676176306209, lr=0.008952766813420379
2023-11-28 03:16:58   INFO  epoch: 19/24, acc_iter=130603, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:19, time_cost(all): 1 day, 17:40:40/9:29:10, loss=0.32124375007225, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.9632102919668606, lr=0.008931836549710219
2023-11-28 03:17:56   INFO  epoch: 19/24, acc_iter=130653, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:47, time_cost(all): 1 day, 17:41:38/9:03:42, loss=0.321136207712148, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.0512373887564666, lr=0.008910906286000058
2023-11-28 03:18:54   INFO  epoch: 19/24, acc_iter=130703, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:24, time_cost(all): 1 day, 17:42:36/9:23:57, loss=0.321028665352045, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=2.5495258426264247, lr=0.008889976022289898
2023-11-28 03:19:51   INFO  epoch: 19/24, acc_iter=130753, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:03, time_cost(all): 1 day, 17:43:33/9:21:14, loss=0.320921122991942, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=4.986900852393376, lr=0.008869045758579738
2023-11-28 03:20:49   INFO  epoch: 19/24, acc_iter=130803, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:52, time_cost(all): 1 day, 17:44:31/8:37:27, loss=0.32081358063184, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.4568723275992195, lr=0.008848115494869578
2023-11-28 03:21:47   INFO  epoch: 19/24, acc_iter=130853, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:26, time_cost(all): 1 day, 17:45:29/8:40:03, loss=0.320706038271737, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=4.676813011511933, lr=0.00882718523115942
2023-11-28 03:22:45   INFO  epoch: 19/24, acc_iter=130903, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:41, time_cost(all): 1 day, 17:46:27/8:57:16, loss=0.320598495911634, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.2542922762999564, lr=0.008806254967449259
2023-11-28 03:23:42   INFO  epoch: 19/24, acc_iter=130953, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:24, time_cost(all): 1 day, 17:47:24/9:07:59, loss=0.320490953551531, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=4.978372746629342, lr=0.008785324703739099
2023-11-28 03:24:40   INFO  epoch: 19/24, acc_iter=131003, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:57, time_cost(all): 1 day, 17:48:22/8:54:01, loss=0.320383411191429, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.5657909360440545, lr=0.008764394440028939
2023-11-28 03:25:38   INFO  epoch: 19/24, acc_iter=131053, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:42, time_cost(all): 1 day, 17:49:20/8:49:32, loss=0.320275868831326, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.968472798988586, lr=0.008743464176318778
2023-11-28 03:26:36   INFO  epoch: 19/24, acc_iter=131103, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:35, time_cost(all): 1 day, 17:50:18/9:02:31, loss=0.320168326471223, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=0.9634532172468225, lr=0.008722533912608618
2023-11-28 03:27:33   INFO  epoch: 19/24, acc_iter=131153, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:21, time_cost(all): 1 day, 17:51:15/8:42:41, loss=0.320060784111121, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=3.256557837005955, lr=0.008701603648898458
2023-11-28 03:28:31   INFO  epoch: 19/24, acc_iter=131203, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:05, time_cost(all): 1 day, 17:52:13/8:30:16, loss=0.319953241751018, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=2.4072885895841734, lr=0.008680673385188298
2023-11-28 03:29:29   INFO  epoch: 19/24, acc_iter=131253, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:12, time_cost(all): 1 day, 17:53:11/8:37:54, loss=0.319845699390915, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.601071596710141, lr=0.00865974312147814
2023-11-28 03:30:27   INFO  epoch: 19/24, acc_iter=131303, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:16, time_cost(all): 1 day, 17:54:09/8:25:00, loss=0.319738157030813, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=1.4356318741744925, lr=0.008638812857767979
2023-11-28 03:31:24   INFO  epoch: 19/24, acc_iter=131353, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:26, time_cost(all): 1 day, 17:55:06/8:24:54, loss=0.31963061467071, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.341537702149012, lr=0.008617882594057819
2023-11-28 03:32:22   INFO  epoch: 19/24, acc_iter=131403, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:39, time_cost(all): 1 day, 17:56:04/8:54:52, loss=0.319523072310607, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=3.3227412264799607, lr=0.008596952330347659
2023-11-28 03:33:20   INFO  epoch: 19/24, acc_iter=131453, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:20, time_cost(all): 1 day, 17:57:02/8:36:07, loss=0.319415529950505, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=1.2648204103996092, lr=0.008576022066637498
2023-11-28 03:34:18   INFO  epoch: 19/24, acc_iter=131503, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:32, time_cost(all): 1 day, 17:58:00/8:30:56, loss=0.319307987590402, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=3.1685906871755387, lr=0.008555091802927338
2023-11-28 03:35:15   INFO  epoch: 19/24, acc_iter=131553, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:38, time_cost(all): 1 day, 17:58:57/8:24:49, loss=0.319200445230299, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=4.586127291484949, lr=0.00853416153921718
2023-11-28 03:36:13   INFO  epoch: 19/24, acc_iter=131603, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:33, time_cost(all): 1 day, 17:59:55/8:36:26, loss=0.319092902870197, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=1.7711680475585623, lr=0.00851323127550702
2023-11-28 03:37:11   INFO  epoch: 19/24, acc_iter=131653, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:38, time_cost(all): 1 day, 18:00:53/8:33:48, loss=0.318985360510094, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.207578284068608, lr=0.00849230101179686
2023-11-28 03:38:09   INFO  epoch: 19/24, acc_iter=131703, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:40, time_cost(all): 1 day, 18:01:51/8:37:06, loss=0.318877818149991, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=0.6823861600444268, lr=0.008471370748086699
2023-11-28 03:39:06   INFO  epoch: 20/24, acc_iter=131790, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:01:42, time_cost(all): 1 day, 18:02:48/8:25:13, loss=0.318690694443413, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=1.4610357058850474, lr=0.008434952089231021
2023-11-28 03:40:04   INFO  epoch: 20/24, acc_iter=131840, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:10:00, time_cost(all): 1 day, 18:03:46/8:25:32, loss=0.31858315208331, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=3.1520377229799155, lr=0.00841402182552086
2023-11-28 03:41:02   INFO  epoch: 20/24, acc_iter=131890, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:02:48, time_cost(all): 1 day, 18:04:44/9:03:27, loss=0.318475609723207, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.261710037726887, lr=0.0083930915618107
2023-11-28 03:42:00   INFO  epoch: 20/24, acc_iter=131940, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:06:11, time_cost(all): 1 day, 18:05:42/8:23:09, loss=0.318368067363105, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=4.662085455618388, lr=0.00837216129810054
2023-11-28 03:42:57   INFO  epoch: 20/24, acc_iter=131990, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:04:50, time_cost(all): 1 day, 18:06:39/8:53:07, loss=0.318260525003002, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=4.326477287818592, lr=0.00835123103439038
2023-11-28 03:43:55   INFO  epoch: 20/24, acc_iter=132040, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:55:37, time_cost(all): 1 day, 18:07:37/8:55:33, loss=0.318152982642899, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.7866980245954527, lr=0.008330300770680222
2023-11-28 03:44:53   INFO  epoch: 20/24, acc_iter=132090, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:59:04, time_cost(all): 1 day, 18:08:35/8:11:49, loss=0.318045440282797, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=1.5926151838259792, lr=0.008309370506970061
2023-11-28 03:45:51   INFO  epoch: 20/24, acc_iter=132140, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:02:08, time_cost(all): 1 day, 18:09:33/8:47:09, loss=0.317937897922694, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=2.335052315166915, lr=0.008288440243259901
2023-11-28 03:46:48   INFO  epoch: 20/24, acc_iter=132190, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:00:30, time_cost(all): 1 day, 18:10:30/8:36:33, loss=0.317830355562591, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.528658389089881, lr=0.008267509979549741
2023-11-28 03:47:46   INFO  epoch: 20/24, acc_iter=132240, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:01:14, time_cost(all): 1 day, 18:11:28/8:51:42, loss=0.317722813202489, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.075692530094658, lr=0.00824657971583958
2023-11-28 03:48:44   INFO  epoch: 20/24, acc_iter=132290, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:53:18, time_cost(all): 1 day, 18:12:26/8:40:27, loss=0.317615270842386, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=1.8986089914509665, lr=0.00822564945212942
2023-11-28 03:49:42   INFO  epoch: 20/24, acc_iter=132340, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:50:05, time_cost(all): 1 day, 18:13:24/8:50:54, loss=0.317507728482283, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.020657118523724, lr=0.00820471918841926
2023-11-28 03:50:39   INFO  epoch: 20/24, acc_iter=132390, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:50:12, time_cost(all): 1 day, 18:14:21/8:20:58, loss=0.31740018612218, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=4.618077559838246, lr=0.008183788924709102
2023-11-28 03:51:37   INFO  epoch: 20/24, acc_iter=132440, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:53:23, time_cost(all): 1 day, 18:15:19/8:27:52, loss=0.317292643762078, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.287467551716395, lr=0.008162858660998942
2023-11-28 03:52:35   INFO  epoch: 20/24, acc_iter=132490, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:52:26, time_cost(all): 1 day, 18:16:17/8:38:21, loss=0.317185101401975, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.7884750919495902, lr=0.008141928397288781
2023-11-28 03:53:33   INFO  epoch: 20/24, acc_iter=132540, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:52:20, time_cost(all): 1 day, 18:17:15/8:22:39, loss=0.317077559041872, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.9089351516107897, lr=0.008120998133578621
2023-11-28 03:54:30   INFO  epoch: 20/24, acc_iter=132590, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:48:02, time_cost(all): 1 day, 18:18:12/8:11:12, loss=0.31697001668177, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=4.67435297013535, lr=0.008100067869868461
2023-11-28 03:55:28   INFO  epoch: 20/24, acc_iter=132640, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:46:15, time_cost(all): 1 day, 18:19:10/8:27:57, loss=0.316862474321667, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=4.234589955991147, lr=0.0080791376061583
2023-11-28 03:56:26   INFO  epoch: 20/24, acc_iter=132690, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:50:46, time_cost(all): 1 day, 18:20:08/8:04:23, loss=0.316754931961564, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.319154370826387, lr=0.00805820734244814
2023-11-28 03:57:24   INFO  epoch: 20/24, acc_iter=132740, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:44:32, time_cost(all): 1 day, 18:21:06/8:41:37, loss=0.316647389601462, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=1.578087693760621, lr=0.00803727707873798
2023-11-28 03:58:21   INFO  epoch: 20/24, acc_iter=132790, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:46:46, time_cost(all): 1 day, 18:22:03/8:00:13, loss=0.316539847241359, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.6031078385113666, lr=0.008016346815027822
2023-11-28 03:59:19   INFO  epoch: 20/24, acc_iter=132840, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:40:53, time_cost(all): 1 day, 18:23:01/8:35:21, loss=0.316432304881256, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=1.7124869784726933, lr=0.007995416551317662
2023-11-28 04:00:17   INFO  epoch: 20/24, acc_iter=132890, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:43:54, time_cost(all): 1 day, 18:23:59/8:21:27, loss=0.316324762521154, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=3.4999077814171775, lr=0.007974486287607501
2023-11-28 04:01:15   INFO  epoch: 20/24, acc_iter=132940, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:40:32, time_cost(all): 1 day, 18:24:57/8:39:50, loss=0.316217220161051, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=2.8018407287250446, lr=0.007953556023897341
2023-11-28 04:02:12   INFO  epoch: 20/24, acc_iter=132990, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:40:22, time_cost(all): 1 day, 18:25:54/8:07:12, loss=0.316109677800948, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=2.5253441922987707, lr=0.007932625760187181
2023-11-28 04:03:10   INFO  epoch: 20/24, acc_iter=133040, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:38:47, time_cost(all): 1 day, 18:26:52/8:24:24, loss=0.316002135440846, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=4.6618519344331295, lr=0.007911695496477023
2023-11-28 04:04:08   INFO  epoch: 20/24, acc_iter=133090, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:38:08, time_cost(all): 1 day, 18:27:50/7:56:41, loss=0.315894593080743, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=0.9909624320245127, lr=0.007890765232766862
2023-11-28 04:05:06   INFO  epoch: 20/24, acc_iter=133140, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:44:50, time_cost(all): 1 day, 18:28:48/8:24:56, loss=0.31578705072064, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.794053021987503, lr=0.007869834969056702
2023-11-28 04:06:03   INFO  epoch: 20/24, acc_iter=133190, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:43:43, time_cost(all): 1 day, 18:29:45/8:17:20, loss=0.315679508360538, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=4.14432079108361, lr=0.007848904705346542
2023-11-28 04:07:01   INFO  epoch: 20/24, acc_iter=133240, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:35:39, time_cost(all): 1 day, 18:30:43/8:32:34, loss=0.315571966000435, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=2.446844529550747, lr=0.007827974441636382
2023-11-28 04:07:59   INFO  epoch: 20/24, acc_iter=133290, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:36:09, time_cost(all): 1 day, 18:31:41/7:49:57, loss=0.315464423640332, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=2.166788189956685, lr=0.007807044177926221
2023-11-28 04:08:57   INFO  epoch: 20/24, acc_iter=133340, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:40:13, time_cost(all): 1 day, 18:32:39/8:06:49, loss=0.31535688128023, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.5570908746191052, lr=0.007786113914216061
2023-11-28 04:09:54   INFO  epoch: 20/24, acc_iter=133390, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:36:02, time_cost(all): 1 day, 18:33:36/8:05:14, loss=0.315249338920127, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=2.4982688741131267, lr=0.007765183650505902
2023-11-28 04:10:52   INFO  epoch: 20/24, acc_iter=133440, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:37:36, time_cost(all): 1 day, 18:34:34/7:55:04, loss=0.315141796560024, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=1.2064093548516783, lr=0.007744253386795742
2023-11-28 04:11:50   INFO  epoch: 20/24, acc_iter=133490, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:33:37, time_cost(all): 1 day, 18:35:32/8:02:31, loss=0.315034254199922, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=2.1364901281077167, lr=0.007723323123085581
2023-11-28 04:12:48   INFO  epoch: 20/24, acc_iter=133540, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:29:58, time_cost(all): 1 day, 18:36:30/8:14:54, loss=0.314926711839819, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=0.6246635741438572, lr=0.007702392859375422
2023-11-28 04:13:45   INFO  epoch: 20/24, acc_iter=133590, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:32:52, time_cost(all): 1 day, 18:37:27/7:56:30, loss=0.314819169479716, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.8877563757926117, lr=0.007681462595665262
2023-11-28 04:14:43   INFO  epoch: 20/24, acc_iter=133640, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:33:57, time_cost(all): 1 day, 18:38:25/8:01:54, loss=0.314711627119613, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.83(1.03), norm=1.1338642201060876, lr=0.007660532331955102
2023-11-28 04:15:41   INFO  epoch: 20/24, acc_iter=133690, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:26:53, time_cost(all): 1 day, 18:39:23/7:57:36, loss=0.314604084759511, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=0.6056562786165038, lr=0.007639602068244941
2023-11-28 04:16:39   INFO  epoch: 20/24, acc_iter=133740, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:26:53, time_cost(all): 1 day, 18:40:21/8:17:54, loss=0.314496542399408, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=4.967440810846322, lr=0.007618671804534782
2023-11-28 04:17:36   INFO  epoch: 20/24, acc_iter=133790, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:29:16, time_cost(all): 1 day, 18:41:18/7:58:52, loss=0.314389000039305, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.797941292333152, lr=0.007597741540824622
2023-11-28 04:18:34   INFO  epoch: 20/24, acc_iter=133840, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:22:51, time_cost(all): 1 day, 18:42:16/8:19:35, loss=0.314281457679203, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.331714547056685, lr=0.007576811277114463
2023-11-28 04:19:32   INFO  epoch: 20/24, acc_iter=133890, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:22:40, time_cost(all): 1 day, 18:43:14/8:19:05, loss=0.3141739153191, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.3003634797535257, lr=0.007555881013404302
2023-11-28 04:20:30   INFO  epoch: 20/24, acc_iter=133940, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:26, time_cost(all): 1 day, 18:44:12/7:47:34, loss=0.314066372958997, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=2.186809768222826, lr=0.007534950749694142
2023-11-28 04:21:27   INFO  epoch: 20/24, acc_iter=133990, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:19:59, time_cost(all): 1 day, 18:45:09/7:43:51, loss=0.313958830598895, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.7929678182474627, lr=0.007514020485983982
2023-11-28 04:22:25   INFO  epoch: 20/24, acc_iter=134040, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:21:31, time_cost(all): 1 day, 18:46:07/8:06:05, loss=0.313851288238792, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.83(1.03), norm=4.707098925200259, lr=0.007493090222273822
2023-11-28 04:23:23   INFO  epoch: 20/24, acc_iter=134090, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:22:22, time_cost(all): 1 day, 18:47:05/7:44:34, loss=0.313743745878689, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.333934669753393, lr=0.007472159958563662
2023-11-28 04:24:21   INFO  epoch: 20/24, acc_iter=134140, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:22:22, time_cost(all): 1 day, 18:48:03/8:06:39, loss=0.313636203518587, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=1.5299247908552835, lr=0.007451229694853503
2023-11-28 04:25:18   INFO  epoch: 20/24, acc_iter=134190, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:21:01, time_cost(all): 1 day, 18:49:00/8:12:10, loss=0.313528661158484, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=1.1618723341858666, lr=0.007430299431143343
2023-11-28 04:26:16   INFO  epoch: 20/24, acc_iter=134240, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:34, time_cost(all): 1 day, 18:49:58/7:43:16, loss=0.313421118798381, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=1.325496192822655, lr=0.007409369167433183
2023-11-28 04:27:14   INFO  epoch: 20/24, acc_iter=134290, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:20:12, time_cost(all): 1 day, 18:50:56/7:46:11, loss=0.313313576438279, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=0.974700697028291, lr=0.007388438903723022
2023-11-28 04:28:12   INFO  epoch: 20/24, acc_iter=134340, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:18:08, time_cost(all): 1 day, 18:51:54/7:43:43, loss=0.313206034078176, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=4.566133559546511, lr=0.007367508640012862
2023-11-28 04:29:09   INFO  epoch: 20/24, acc_iter=134390, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:14:42, time_cost(all): 1 day, 18:52:51/7:52:43, loss=0.313098491718073, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=4.474489498599032, lr=0.007346578376302702
2023-11-28 04:30:07   INFO  epoch: 20/24, acc_iter=134440, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:15:33, time_cost(all): 1 day, 18:53:49/8:04:43, loss=0.312990949357971, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=1.4982940387873889, lr=0.007325648112592542
2023-11-28 04:31:05   INFO  epoch: 20/24, acc_iter=134490, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:10:40, time_cost(all): 1 day, 18:54:47/7:47:41, loss=0.312883406997868, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=2.3741984493527752, lr=0.007304717848882382
2023-11-28 04:32:03   INFO  epoch: 20/24, acc_iter=134540, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:15:59, time_cost(all): 1 day, 18:55:45/7:33:09, loss=0.312775864637765, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=2.5560293980813404, lr=0.007283787585172223
2023-11-28 04:33:00   INFO  epoch: 20/24, acc_iter=134590, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:12:58, time_cost(all): 1 day, 18:56:42/8:05:34, loss=0.312668322277663, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.115219724723911, lr=0.007262857321462063
2023-11-28 04:33:58   INFO  epoch: 20/24, acc_iter=134640, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:10:17, time_cost(all): 1 day, 18:57:40/7:37:27, loss=0.31256077991756, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=3.8862621831916373, lr=0.007241927057751903
2023-11-28 04:34:56   INFO  epoch: 20/24, acc_iter=134690, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:12:10, time_cost(all): 1 day, 18:58:38/7:26:00, loss=0.312453237557457, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=2.858932101801988, lr=0.007220996794041742
2023-11-28 04:35:54   INFO  epoch: 20/24, acc_iter=134740, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:06:08, time_cost(all): 1 day, 18:59:36/7:53:26, loss=0.312345695197355, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=1.7469843627945854, lr=0.007200066530331582
2023-11-28 04:36:51   INFO  epoch: 20/24, acc_iter=134790, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:08:40, time_cost(all): 1 day, 19:00:33/7:43:09, loss=0.312238152837252, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=0.6538857257017167, lr=0.007179136266621423
2023-11-28 04:37:49   INFO  epoch: 20/24, acc_iter=134840, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:03:52, time_cost(all): 1 day, 19:01:31/7:23:22, loss=0.312130610477149, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.982942093793465, lr=0.007158206002911263
2023-11-28 04:38:47   INFO  epoch: 20/24, acc_iter=134890, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:03:31, time_cost(all): 1 day, 19:02:29/7:25:26, loss=0.312023068117047, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.354008839155084, lr=0.007137275739201103
2023-11-28 04:39:45   INFO  epoch: 20/24, acc_iter=134940, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:08:24, time_cost(all): 1 day, 19:03:27/7:32:34, loss=0.311915525756944, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=1.4376801706903328, lr=0.007116345475490943
2023-11-28 04:40:43   INFO  epoch: 20/24, acc_iter=134990, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:02:39, time_cost(all): 1 day, 19:04:25/8:00:41, loss=0.311807983396841, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=1.1907317296256135, lr=0.007095415211780783
2023-11-28 04:41:40   INFO  epoch: 20/24, acc_iter=135040, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:59, time_cost(all): 1 day, 19:05:22/7:18:06, loss=0.311700441036738, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.147541335247775, lr=0.007074484948070623
2023-11-28 04:42:38   INFO  epoch: 20/24, acc_iter=135090, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:56, time_cost(all): 1 day, 19:06:20/7:21:05, loss=0.311592898676636, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=1.6767389300215019, lr=0.007053554684360463
2023-11-28 04:43:36   INFO  epoch: 20/24, acc_iter=135140, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:58:25, time_cost(all): 1 day, 19:07:18/7:53:22, loss=0.311485356316533, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=3.0578925440027818, lr=0.007032624420650303
2023-11-28 04:44:34   INFO  epoch: 20/24, acc_iter=135190, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/1:01:14, time_cost(all): 1 day, 19:08:16/7:34:01, loss=0.31137781395643, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=1.569780310876106, lr=0.007011694156940144
2023-11-28 04:45:31   INFO  epoch: 20/24, acc_iter=135240, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:34, time_cost(all): 1 day, 19:09:13/7:49:55, loss=0.311270271596328, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=0.677408089240455, lr=0.006990763893229984
2023-11-28 04:46:29   INFO  epoch: 20/24, acc_iter=135290, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:00:32, time_cost(all): 1 day, 19:10:11/7:46:12, loss=0.311162729236225, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=2.9397620153436543, lr=0.006969833629519823
2023-11-28 04:47:27   INFO  epoch: 20/24, acc_iter=135340, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:57:48, time_cost(all): 1 day, 19:11:09/7:20:31, loss=0.311055186876122, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.6101039964981885, lr=0.006948903365809663
2023-11-28 04:48:25   INFO  epoch: 20/24, acc_iter=135390, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:58:14, time_cost(all): 1 day, 19:12:07/7:16:31, loss=0.31094764451602, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.77524275222604, lr=0.006927973102099503
2023-11-28 04:49:22   INFO  epoch: 20/24, acc_iter=135440, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:18, time_cost(all): 1 day, 19:13:04/7:12:41, loss=0.310840102155917, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=0.5033003750496634, lr=0.006907042838389343
2023-11-28 04:50:20   INFO  epoch: 20/24, acc_iter=135490, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:53:51, time_cost(all): 1 day, 19:14:02/7:23:21, loss=0.310732559795814, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=2.3198783114158648, lr=0.006886112574679183
2023-11-28 04:51:18   INFO  epoch: 20/24, acc_iter=135540, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:47, time_cost(all): 1 day, 19:15:00/7:30:06, loss=0.310625017435712, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=3.831147801253511, lr=0.006865182310969023
2023-11-28 04:52:16   INFO  epoch: 20/24, acc_iter=135590, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:53:50, time_cost(all): 1 day, 19:15:58/7:32:44, loss=0.310517475075609, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=3.3252213112725504, lr=0.006844252047258864
2023-11-28 04:53:13   INFO  epoch: 20/24, acc_iter=135640, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:51:27, time_cost(all): 1 day, 19:16:55/7:19:43, loss=0.310409932715506, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=4.675585002035555, lr=0.006823321783548704
2023-11-28 04:54:11   INFO  epoch: 20/24, acc_iter=135690, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:50:19, time_cost(all): 1 day, 19:17:53/7:45:10, loss=0.310302390355404, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=3.991787984759828, lr=0.006802391519838543
2023-11-28 04:55:09   INFO  epoch: 20/24, acc_iter=135740, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:50:41, time_cost(all): 1 day, 19:18:51/7:18:10, loss=0.310194847995301, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=4.701489492959396, lr=0.006781461256128383
2023-11-28 04:56:07   INFO  epoch: 20/24, acc_iter=135790, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:49:42, time_cost(all): 1 day, 19:19:49/7:19:18, loss=0.310087305635198, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=1.0455348962094047, lr=0.006760530992418223
2023-11-28 04:57:04   INFO  epoch: 20/24, acc_iter=135840, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:46:03, time_cost(all): 1 day, 19:20:46/7:27:00, loss=0.309979763275096, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=0.9594046846969708, lr=0.006739600728708064
2023-11-28 04:58:02   INFO  epoch: 20/24, acc_iter=135890, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:47:57, time_cost(all): 1 day, 19:21:44/7:39:00, loss=0.309872220914993, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=1.7973393118171903, lr=0.006718670464997904
2023-11-28 04:59:00   INFO  epoch: 20/24, acc_iter=135940, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:46:48, time_cost(all): 1 day, 19:22:42/7:09:30, loss=0.30976467855489, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=2.817774187221872, lr=0.006697740201287744
2023-11-28 04:59:58   INFO  epoch: 20/24, acc_iter=135990, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:46:12, time_cost(all): 1 day, 19:23:40/7:02:47, loss=0.309657136194788, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=2.391588029534801, lr=0.006676809937577584
2023-11-28 05:00:55   INFO  epoch: 20/24, acc_iter=136040, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:17, time_cost(all): 1 day, 19:24:37/7:25:26, loss=0.309549593834685, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=3.160336831915139, lr=0.006655879673867424
2023-11-28 05:01:53   INFO  epoch: 20/24, acc_iter=136090, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:45:06, time_cost(all): 1 day, 19:25:35/7:33:39, loss=0.309442051474582, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=3.4369856293195733, lr=0.006634949410157263
2023-11-28 05:02:51   INFO  epoch: 20/24, acc_iter=136140, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:40:14, time_cost(all): 1 day, 19:26:33/7:27:45, loss=0.30933450911448, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.2040878748676453, lr=0.006614019146447104
2023-11-28 05:03:49   INFO  epoch: 20/24, acc_iter=136190, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:39:18, time_cost(all): 1 day, 19:27:31/7:33:02, loss=0.309226966754377, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=4.859270375597565, lr=0.006593088882736944
2023-11-28 05:04:46   INFO  epoch: 20/24, acc_iter=136240, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:23, time_cost(all): 1 day, 19:28:28/7:20:13, loss=0.309119424394274, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=1.8029464868737257, lr=0.006572158619026785
2023-11-28 05:05:44   INFO  epoch: 20/24, acc_iter=136290, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:40:37, time_cost(all): 1 day, 19:29:26/7:25:24, loss=0.309011882034172, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=1.1349456007871654, lr=0.006551228355316623
2023-11-28 05:06:42   INFO  epoch: 20/24, acc_iter=136340, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:36:38, time_cost(all): 1 day, 19:30:24/7:23:27, loss=0.308904339674069, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=1.6553547933484876, lr=0.006530298091606464
2023-11-28 05:07:40   INFO  epoch: 20/24, acc_iter=136390, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:26, time_cost(all): 1 day, 19:31:22/6:59:21, loss=0.308796797313966, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=2.202255454178249, lr=0.006509367827896304
2023-11-28 05:08:37   INFO  epoch: 20/24, acc_iter=136440, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:34:52, time_cost(all): 1 day, 19:32:19/7:29:08, loss=0.308689254953863, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=3.520370824033751, lr=0.006488437564186144
2023-11-28 05:09:35   INFO  epoch: 20/24, acc_iter=136490, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:19, time_cost(all): 1 day, 19:33:17/7:15:38, loss=0.308581712593761, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.4732335892571293, lr=0.006467507300475983
2023-11-28 05:10:33   INFO  epoch: 20/24, acc_iter=136540, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:34:24, time_cost(all): 1 day, 19:34:15/7:25:35, loss=0.308474170233658, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.1901494505553143, lr=0.006446577036765824
2023-11-28 05:11:31   INFO  epoch: 20/24, acc_iter=136590, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:46, time_cost(all): 1 day, 19:35:13/7:23:43, loss=0.308366627873555, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=4.405656606988007, lr=0.006425646773055664
2023-11-28 05:12:28   INFO  epoch: 20/24, acc_iter=136640, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:33:05, time_cost(all): 1 day, 19:36:10/7:11:21, loss=0.308259085513453, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=1.7083978914508986, lr=0.006404716509345505
2023-11-28 05:13:26   INFO  epoch: 20/24, acc_iter=136690, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:31:33, time_cost(all): 1 day, 19:37:08/7:25:47, loss=0.30815154315335, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=0.7040194761063727, lr=0.006383786245635344
2023-11-28 05:14:24   INFO  epoch: 20/24, acc_iter=136740, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:22, time_cost(all): 1 day, 19:38:06/6:50:17, loss=0.308044000793247, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=4.110522548788481, lr=0.006362855981925184
2023-11-28 05:15:22   INFO  epoch: 20/24, acc_iter=136790, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:24, time_cost(all): 1 day, 19:39:04/7:12:15, loss=0.307936458433145, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.277441879986147, lr=0.006341925718215024
2023-11-28 05:16:19   INFO  epoch: 20/24, acc_iter=136840, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:07, time_cost(all): 1 day, 19:40:01/7:08:53, loss=0.307828916073042, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=1.049132446988182, lr=0.006320995454504864
2023-11-28 05:17:17   INFO  epoch: 20/24, acc_iter=136890, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:06, time_cost(all): 1 day, 19:40:59/7:15:43, loss=0.307721373712939, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.6759542034960329, lr=0.006300065190794704
2023-11-28 05:18:15   INFO  epoch: 20/24, acc_iter=136940, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:10, time_cost(all): 1 day, 19:41:57/7:10:27, loss=0.307613831352837, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=3.0675673997139428, lr=0.006279134927084545
2023-11-28 05:19:13   INFO  epoch: 20/24, acc_iter=136990, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:53, time_cost(all): 1 day, 19:42:55/7:09:16, loss=0.307506288992734, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=3.343020012117301, lr=0.006258204663374385
2023-11-28 05:20:10   INFO  epoch: 20/24, acc_iter=137040, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:32, time_cost(all): 1 day, 19:43:52/7:15:18, loss=0.307398746632631, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=4.2965114444973125, lr=0.006237274399664225
2023-11-28 05:21:08   INFO  epoch: 20/24, acc_iter=137090, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:21, time_cost(all): 1 day, 19:44:50/6:37:50, loss=0.307291204272529, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=1.7289954016647582, lr=0.006216344135954064
2023-11-28 05:22:06   INFO  epoch: 20/24, acc_iter=137140, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:04, time_cost(all): 1 day, 19:45:48/6:37:56, loss=0.307183661912426, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.9695283494216795, lr=0.006195413872243904
2023-11-28 05:23:04   INFO  epoch: 20/24, acc_iter=137190, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:20:58, time_cost(all): 1 day, 19:46:46/7:14:45, loss=0.307076119552323, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.4586383107273364, lr=0.006174483608533745
2023-11-28 05:24:01   INFO  epoch: 20/24, acc_iter=137240, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:49, time_cost(all): 1 day, 19:47:43/6:39:10, loss=0.306968577192221, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=3.944843396420377, lr=0.006153553344823585
2023-11-28 05:24:59   INFO  epoch: 20/24, acc_iter=137290, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:50, time_cost(all): 1 day, 19:48:41/7:11:40, loss=0.306861034832118, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=3.9321956897719597, lr=0.006132623081113424
2023-11-28 05:25:57   INFO  epoch: 20/24, acc_iter=137340, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:55, time_cost(all): 1 day, 19:49:39/6:58:57, loss=0.306753492472015, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=4.864705322314527, lr=0.006111692817403265
2023-11-28 05:26:55   INFO  epoch: 20/24, acc_iter=137390, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:36, time_cost(all): 1 day, 19:50:37/7:03:51, loss=0.306645950111913, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=3.4452848073161713, lr=0.006090762553693105
2023-11-28 05:27:52   INFO  epoch: 20/24, acc_iter=137440, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:18, time_cost(all): 1 day, 19:51:34/6:48:07, loss=0.30653840775181, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=1.6674597765881205, lr=0.006069832289982945
2023-11-28 05:28:50   INFO  epoch: 20/24, acc_iter=137490, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:46, time_cost(all): 1 day, 19:52:32/6:42:27, loss=0.306430865391707, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=0.8218212617670952, lr=0.006048902026272784
2023-11-28 05:29:48   INFO  epoch: 20/24, acc_iter=137540, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:47, time_cost(all): 1 day, 19:53:30/6:52:22, loss=0.306323323031605, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.084716639361584, lr=0.006027971762562625
2023-11-28 05:30:46   INFO  epoch: 20/24, acc_iter=137590, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:41, time_cost(all): 1 day, 19:54:28/7:07:58, loss=0.306215780671502, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=0.7593408936842285, lr=0.006007041498852465
2023-11-28 05:31:43   INFO  epoch: 20/24, acc_iter=137640, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:43, time_cost(all): 1 day, 19:55:25/6:48:44, loss=0.306108238311399, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=2.1979334679893436, lr=0.005986111235142305
2023-11-28 05:32:41   INFO  epoch: 20/24, acc_iter=137690, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:52, time_cost(all): 1 day, 19:56:23/6:37:36, loss=0.306000695951296, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=3.9287355739013807, lr=0.005965180971432145
2023-11-28 05:33:39   INFO  epoch: 20/24, acc_iter=137740, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:15, time_cost(all): 1 day, 19:57:21/6:52:31, loss=0.305893153591194, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=3.133629756407129, lr=0.005944250707721985
2023-11-28 05:34:37   INFO  epoch: 20/24, acc_iter=137790, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:55, time_cost(all): 1 day, 19:58:19/6:32:34, loss=0.305785611231091, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=4.647722969747953, lr=0.005923320444011825
2023-11-28 05:35:34   INFO  epoch: 20/24, acc_iter=137840, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:39, time_cost(all): 1 day, 19:59:16/6:45:57, loss=0.305678068870988, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=3.5719954068077318, lr=0.005902390180301666
2023-11-28 05:36:32   INFO  epoch: 20/24, acc_iter=137890, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:32, time_cost(all): 1 day, 20:00:14/6:55:28, loss=0.305570526510886, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=1.567563230029945, lr=0.005881459916591505
2023-11-28 05:37:30   INFO  epoch: 20/24, acc_iter=137940, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:18, time_cost(all): 1 day, 20:01:12/6:37:01, loss=0.305462984150783, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=1.63331547099136, lr=0.005860529652881345
2023-11-28 05:38:28   INFO  epoch: 20/24, acc_iter=137990, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:17, time_cost(all): 1 day, 20:02:10/6:30:43, loss=0.30535544179068, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=1.5102020150509354, lr=0.005839599389171185
2023-11-28 05:39:25   INFO  epoch: 20/24, acc_iter=138040, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:25, time_cost(all): 1 day, 20:03:07/6:45:40, loss=0.305247899430578, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=1.3697368777940142, lr=0.005818669125461026
2023-11-28 05:40:23   INFO  epoch: 20/24, acc_iter=138090, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:39, time_cost(all): 1 day, 20:04:05/6:52:49, loss=0.305140357070475, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.008102472251824, lr=0.005797738861750865
2023-11-28 05:41:21   INFO  epoch: 20/24, acc_iter=138140, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:32, time_cost(all): 1 day, 20:05:03/6:53:25, loss=0.305032814710372, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=3.982133580701426, lr=0.005776808598040705
2023-11-28 05:42:19   INFO  epoch: 20/24, acc_iter=138190, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:43, time_cost(all): 1 day, 20:06:01/6:27:31, loss=0.30492527235027, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=4.214151088375123, lr=0.005755878334330546
2023-11-28 05:43:16   INFO  epoch: 20/24, acc_iter=138240, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:39, time_cost(all): 1 day, 20:06:58/6:20:47, loss=0.304817729990167, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=3.485526012294768, lr=0.005734948070620385
2023-11-28 05:44:14   INFO  epoch: 20/24, acc_iter=138290, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 1 day, 20:07:56/6:49:04, loss=0.304710187630064, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=1.2379192097434555, lr=0.005714017806910225
2023-11-28 05:45:12   INFO  epoch: 21/24, acc_iter=138377, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:03:01, time_cost(all): 1 day, 20:08:54/6:50:27, loss=0.304523063923486, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=4.1963429675226145, lr=0.005677599148054547
2023-11-28 05:46:10   INFO  epoch: 21/24, acc_iter=138427, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:02:20, time_cost(all): 1 day, 20:09:52/6:44:11, loss=0.304415521563383, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=2.276957902743459, lr=0.005656668884344387
2023-11-28 05:47:07   INFO  epoch: 21/24, acc_iter=138477, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/1:58:48, time_cost(all): 1 day, 20:10:49/6:18:21, loss=0.30430797920328, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=4.157870028917558, lr=0.005635738620634227
2023-11-28 05:48:05   INFO  epoch: 21/24, acc_iter=138527, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:00:19, time_cost(all): 1 day, 20:11:47/6:41:09, loss=0.304200436843178, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=3.203524148108277, lr=0.005614808356924067
2023-11-28 05:49:03   INFO  epoch: 21/24, acc_iter=138577, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/1:56:48, time_cost(all): 1 day, 20:12:45/6:22:54, loss=0.304092894483075, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=2.29199331127517, lr=0.005593878093213907
2023-11-28 05:50:01   INFO  epoch: 21/24, acc_iter=138627, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:06:31, time_cost(all): 1 day, 20:13:43/6:17:39, loss=0.303985352122972, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=4.129778890999894, lr=0.005572947829503747
2023-11-28 05:50:58   INFO  epoch: 21/24, acc_iter=138677, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/2:01:53, time_cost(all): 1 day, 20:14:40/6:29:54, loss=0.30387780976287, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=3.082097209009376, lr=0.005552017565793588
2023-11-28 05:51:56   INFO  epoch: 21/24, acc_iter=138727, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/1:56:59, time_cost(all): 1 day, 20:15:38/6:36:53, loss=0.303770267402767, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.7528844556566634, lr=0.005531087302083427
2023-11-28 05:52:54   INFO  epoch: 21/24, acc_iter=138777, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/2:01:27, time_cost(all): 1 day, 20:16:36/6:22:56, loss=0.303662725042664, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=2.885446677264385, lr=0.005510157038373267
2023-11-28 05:53:52   INFO  epoch: 21/24, acc_iter=138827, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:59:35, time_cost(all): 1 day, 20:17:34/6:21:01, loss=0.303555182682562, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=4.413913592563395, lr=0.005489226774663108
2023-11-28 05:54:49   INFO  epoch: 21/24, acc_iter=138877, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:52:40, time_cost(all): 1 day, 20:18:31/6:17:17, loss=0.303447640322459, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.83(1.03), norm=4.106079352658085, lr=0.005468296510952948
2023-11-28 05:55:47   INFO  epoch: 21/24, acc_iter=138927, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:50:36, time_cost(all): 1 day, 20:19:29/6:14:37, loss=0.303340097962356, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=2.614801242264192, lr=0.005447366247242787
2023-11-28 05:56:45   INFO  epoch: 21/24, acc_iter=138977, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:49:40, time_cost(all): 1 day, 20:20:27/6:16:10, loss=0.303232555602254, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=4.501530337010512, lr=0.005426435983532628
2023-11-28 05:57:43   INFO  epoch: 21/24, acc_iter=139027, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:50:20, time_cost(all): 1 day, 20:21:25/6:15:56, loss=0.303125013242151, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=4.787577605460689, lr=0.005405505719822468
2023-11-28 05:58:40   INFO  epoch: 21/24, acc_iter=139077, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:56:30, time_cost(all): 1 day, 20:22:22/6:19:10, loss=0.303017470882048, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=0.6619589729194513, lr=0.005384575456112308
2023-11-28 05:59:38   INFO  epoch: 21/24, acc_iter=139127, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:50:11, time_cost(all): 1 day, 20:23:20/6:22:27, loss=0.302909928521945, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=3.1895555382017404, lr=0.005363645192402148
2023-11-28 06:00:36   INFO  epoch: 21/24, acc_iter=139177, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:51:21, time_cost(all): 1 day, 20:24:18/6:31:33, loss=0.302802386161843, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=3.1671099918562047, lr=0.005342714928691988
2023-11-28 06:01:34   INFO  epoch: 21/24, acc_iter=139227, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:46:47, time_cost(all): 1 day, 20:25:16/6:06:51, loss=0.30269484380174, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=4.874473132367509, lr=0.005321784664981828
2023-11-28 06:02:31   INFO  epoch: 21/24, acc_iter=139277, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:49:55, time_cost(all): 1 day, 20:26:13/6:04:17, loss=0.302587301441637, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=1.9923935284045173, lr=0.005300854401271668
2023-11-28 06:03:29   INFO  epoch: 21/24, acc_iter=139327, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:52:30, time_cost(all): 1 day, 20:27:11/6:11:34, loss=0.302479759081535, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=4.311169373899456, lr=0.005279924137561508
2023-11-28 06:04:27   INFO  epoch: 21/24, acc_iter=139377, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:45:29, time_cost(all): 1 day, 20:28:09/6:31:20, loss=0.302372216721432, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=0.6553948772724817, lr=0.005258993873851348
2023-11-28 06:05:25   INFO  epoch: 21/24, acc_iter=139427, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:40:53, time_cost(all): 1 day, 20:29:07/6:31:25, loss=0.302264674361329, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=0.9225273056054528, lr=0.005238063610141188
2023-11-28 06:06:22   INFO  epoch: 21/24, acc_iter=139477, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:40:19, time_cost(all): 1 day, 20:30:04/6:16:17, loss=0.302157132001227, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.9660389719982754, lr=0.005217133346431028
2023-11-28 06:07:20   INFO  epoch: 21/24, acc_iter=139527, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:41:48, time_cost(all): 1 day, 20:31:02/6:17:33, loss=0.302049589641124, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.0882122218023187, lr=0.005196203082720867
2023-11-28 06:08:18   INFO  epoch: 21/24, acc_iter=139577, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:44:44, time_cost(all): 1 day, 20:32:00/6:28:41, loss=0.301942047281021, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=0.5445876355549476, lr=0.005175272819010707
2023-11-28 06:09:16   INFO  epoch: 21/24, acc_iter=139627, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:39:03, time_cost(all): 1 day, 20:32:58/5:54:38, loss=0.301834504920919, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=2.1949477111426847, lr=0.005154342555300548
2023-11-28 06:10:13   INFO  epoch: 21/24, acc_iter=139677, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:44:23, time_cost(all): 1 day, 20:33:55/6:15:19, loss=0.301726962560816, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=2.083583231738627, lr=0.005133412291590388
2023-11-28 06:11:11   INFO  epoch: 21/24, acc_iter=139727, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:37:58, time_cost(all): 1 day, 20:34:53/6:23:43, loss=0.301619420200713, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.9298348603970186, lr=0.005112482027880227
2023-11-28 06:12:09   INFO  epoch: 21/24, acc_iter=139777, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:38:25, time_cost(all): 1 day, 20:35:51/6:13:13, loss=0.301511877840611, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.9585096210297253, lr=0.005091551764170068
2023-11-28 06:13:07   INFO  epoch: 21/24, acc_iter=139827, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:40:30, time_cost(all): 1 day, 20:36:49/6:04:53, loss=0.301404335480508, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.389095634883763, lr=0.005070621500459908
2023-11-28 06:14:04   INFO  epoch: 21/24, acc_iter=139877, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:36:41, time_cost(all): 1 day, 20:37:46/5:49:50, loss=0.301296793120405, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=2.3014985109680897, lr=0.005049691236749748
2023-11-28 06:15:02   INFO  epoch: 21/24, acc_iter=139927, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:37:28, time_cost(all): 1 day, 20:38:44/6:20:43, loss=0.301189250760303, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.1659750125449935, lr=0.005028760973039587
2023-11-28 06:16:00   INFO  epoch: 21/24, acc_iter=139977, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:34:05, time_cost(all): 1 day, 20:39:42/6:01:23, loss=0.3010817084002, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=2.60103218844079, lr=0.005007830709329428
2023-11-28 06:16:58   INFO  epoch: 21/24, acc_iter=140027, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:38:42, time_cost(all): 1 day, 20:40:40/6:19:36, loss=0.300974166040097, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=1.7299380937501485, lr=0.004986900445619268
2023-11-28 06:17:55   INFO  epoch: 21/24, acc_iter=140077, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:32:26, time_cost(all): 1 day, 20:41:37/5:46:48, loss=0.300866623679995, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=2.0535895986806736, lr=0.004965970181909108
2023-11-28 06:18:53   INFO  epoch: 21/24, acc_iter=140127, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:27:45, time_cost(all): 1 day, 20:42:35/6:11:45, loss=0.300759081319892, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=1.393201245356685, lr=0.004945039918198948
2023-11-28 06:19:51   INFO  epoch: 21/24, acc_iter=140177, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:27:29, time_cost(all): 1 day, 20:43:33/5:58:29, loss=0.300651538959789, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=3.79308094619808, lr=0.004924109654488788
2023-11-28 06:20:49   INFO  epoch: 21/24, acc_iter=140227, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:31:20, time_cost(all): 1 day, 20:44:31/6:10:45, loss=0.300543996599687, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=0.6240012872323033, lr=0.004903179390778628
2023-11-28 06:21:46   INFO  epoch: 21/24, acc_iter=140277, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:25:43, time_cost(all): 1 day, 20:45:28/6:02:51, loss=0.300436454239584, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=1.6395574760954146, lr=0.004882249127068469
2023-11-28 06:22:44   INFO  epoch: 21/24, acc_iter=140327, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:28:28, time_cost(all): 1 day, 20:46:26/6:09:31, loss=0.300328911879481, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=4.050674428226879, lr=0.004861318863358308
2023-11-28 06:23:42   INFO  epoch: 21/24, acc_iter=140377, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:31:12, time_cost(all): 1 day, 20:47:24/6:12:15, loss=0.300221369519379, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=2.944527708260548, lr=0.004840388599648148
2023-11-28 06:24:40   INFO  epoch: 21/24, acc_iter=140427, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:29:36, time_cost(all): 1 day, 20:48:22/6:01:34, loss=0.300113827159276, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=1.7195283757429913, lr=0.004819458335937989
2023-11-28 06:25:38   INFO  epoch: 21/24, acc_iter=140477, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:23:58, time_cost(all): 1 day, 20:49:20/5:47:36, loss=0.300006284799173, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=1.5879569949539976, lr=0.004798528072227829
2023-11-28 06:26:35   INFO  epoch: 21/24, acc_iter=140527, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:23:21, time_cost(all): 1 day, 20:50:17/6:03:43, loss=0.29989874243907, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=4.905173184540919, lr=0.004777597808517668
2023-11-28 06:27:33   INFO  epoch: 21/24, acc_iter=140577, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:19:39, time_cost(all): 1 day, 20:51:15/5:39:47, loss=0.299791200078968, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.8803685917789625, lr=0.004756667544807509
2023-11-28 06:28:31   INFO  epoch: 21/24, acc_iter=140627, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:19:30, time_cost(all): 1 day, 20:52:13/5:52:27, loss=0.299683657718865, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=4.069586958579731, lr=0.004735737281097349
2023-11-28 06:29:29   INFO  epoch: 21/24, acc_iter=140677, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:20:02, time_cost(all): 1 day, 20:53:11/5:44:48, loss=0.299576115358762, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=1.3471288839116349, lr=0.004714807017387189
2023-11-28 06:30:26   INFO  epoch: 21/24, acc_iter=140727, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:20:40, time_cost(all): 1 day, 20:54:08/5:54:48, loss=0.29946857299866, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.446486006866668, lr=0.004693876753677029
2023-11-28 06:31:24   INFO  epoch: 21/24, acc_iter=140777, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:18:00, time_cost(all): 1 day, 20:55:06/5:45:55, loss=0.299361030638557, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=1.6257292542591526, lr=0.004672946489966869
2023-11-28 06:32:22   INFO  epoch: 21/24, acc_iter=140827, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:57, time_cost(all): 1 day, 20:56:04/5:55:46, loss=0.299253488278454, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=2.3905290106407264, lr=0.004652016226256709
2023-11-28 06:33:20   INFO  epoch: 21/24, acc_iter=140877, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:14:06, time_cost(all): 1 day, 20:57:02/5:42:23, loss=0.299145945918352, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=1.5355432526032582, lr=0.004631085962546549
2023-11-28 06:34:17   INFO  epoch: 21/24, acc_iter=140927, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:44, time_cost(all): 1 day, 20:57:59/5:38:45, loss=0.299038403558249, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.2036196354841415, lr=0.004610155698836389
2023-11-28 06:35:15   INFO  epoch: 21/24, acc_iter=140977, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:19, time_cost(all): 1 day, 20:58:57/5:48:41, loss=0.298930861198146, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=1.7157271784214352, lr=0.004589225435126229
2023-11-28 06:36:13   INFO  epoch: 21/24, acc_iter=141027, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:48, time_cost(all): 1 day, 20:59:55/5:26:50, loss=0.298823318838044, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.16(1.03), norm=1.7234403132609515, lr=0.004568295171416069
2023-11-28 06:37:11   INFO  epoch: 21/24, acc_iter=141077, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:15:05, time_cost(all): 1 day, 21:00:53/5:55:16, loss=0.298715776477941, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.663178495089652, lr=0.004547364907705909
2023-11-28 06:38:08   INFO  epoch: 21/24, acc_iter=141127, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:11:19, time_cost(all): 1 day, 21:01:50/5:46:08, loss=0.298608234117838, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=3.3922761527408403, lr=0.004526434643995749
2023-11-28 06:39:06   INFO  epoch: 21/24, acc_iter=141177, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:13:56, time_cost(all): 1 day, 21:02:48/5:35:18, loss=0.298500691757736, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=3.494603356622711, lr=0.004505504380285589
2023-11-28 06:40:04   INFO  epoch: 21/24, acc_iter=141227, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:12:45, time_cost(all): 1 day, 21:03:46/5:48:05, loss=0.298393149397633, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.83(1.03), norm=4.158838094982885, lr=0.00448457411657543
2023-11-28 06:41:02   INFO  epoch: 21/24, acc_iter=141277, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:26, time_cost(all): 1 day, 21:04:44/5:52:01, loss=0.29828560703753, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=3.628383908373884, lr=0.00446364385286527
2023-11-28 06:41:59   INFO  epoch: 21/24, acc_iter=141327, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:00, time_cost(all): 1 day, 21:05:41/5:29:29, loss=0.298178064677428, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=1.8540646194958277, lr=0.004442713589155109
2023-11-28 06:42:57   INFO  epoch: 21/24, acc_iter=141377, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:06:03, time_cost(all): 1 day, 21:06:39/5:25:22, loss=0.298070522317325, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=3.4575433580367902, lr=0.00442178332544495
2023-11-28 06:43:55   INFO  epoch: 21/24, acc_iter=141427, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:08:48, time_cost(all): 1 day, 21:07:37/5:37:43, loss=0.297962979957222, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=3.246632282945021, lr=0.004400853061734789
2023-11-28 06:44:53   INFO  epoch: 21/24, acc_iter=141477, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:09:28, time_cost(all): 1 day, 21:08:35/5:29:15, loss=0.29785543759712, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=0.8153750667949693, lr=0.004379922798024629
2023-11-28 06:45:50   INFO  epoch: 21/24, acc_iter=141527, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:05:36, time_cost(all): 1 day, 21:09:32/5:32:54, loss=0.297747895237017, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.09(1.03), norm=0.8607666857203616, lr=0.004358992534314468
2023-11-28 06:46:48   INFO  epoch: 21/24, acc_iter=141577, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:07:11, time_cost(all): 1 day, 21:10:30/5:39:39, loss=0.297640352876914, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=4.409079624946159, lr=0.004338062270604309
2023-11-28 06:47:46   INFO  epoch: 21/24, acc_iter=141627, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:01:34, time_cost(all): 1 day, 21:11:28/5:31:22, loss=0.297532810516812, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.3099149601084197, lr=0.004317132006894149
2023-11-28 06:48:44   INFO  epoch: 21/24, acc_iter=141677, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:02:09, time_cost(all): 1 day, 21:12:26/5:25:56, loss=0.297425268156709, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.8951299354516484, lr=0.004296201743183989
2023-11-28 06:49:41   INFO  epoch: 21/24, acc_iter=141727, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/0:59:10, time_cost(all): 1 day, 21:13:23/5:32:32, loss=0.297317725796606, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=2.8520049612010965, lr=0.004275271479473829
2023-11-28 06:50:39   INFO  epoch: 21/24, acc_iter=141777, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:57:29, time_cost(all): 1 day, 21:14:21/5:40:55, loss=0.297210183436504, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.8751042910968425, lr=0.004254341215763669
2023-11-28 06:51:37   INFO  epoch: 21/24, acc_iter=141827, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:57:05, time_cost(all): 1 day, 21:15:19/5:26:24, loss=0.297102641076401, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=1.1940303393266367, lr=0.004233410952053509
2023-11-28 06:52:35   INFO  epoch: 21/24, acc_iter=141877, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:58:04, time_cost(all): 1 day, 21:16:17/5:25:48, loss=0.296995098716298, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=1.7502835227824771, lr=0.00421248068834335
2023-11-28 06:53:32   INFO  epoch: 21/24, acc_iter=141927, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:02, time_cost(all): 1 day, 21:17:14/5:36:00, loss=0.296887556356196, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=4.414389671859648, lr=0.004191550424633189
2023-11-28 06:54:30   INFO  epoch: 21/24, acc_iter=141977, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:17, time_cost(all): 1 day, 21:18:12/5:23:52, loss=0.296780013996093, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=2.7837248285459166, lr=0.004170620160923029
2023-11-28 06:55:28   INFO  epoch: 21/24, acc_iter=142027, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:55:38, time_cost(all): 1 day, 21:19:10/5:35:43, loss=0.29667247163599, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=4.578774335343033, lr=0.00414968989721287
2023-11-28 06:56:26   INFO  epoch: 21/24, acc_iter=142077, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:52:37, time_cost(all): 1 day, 21:20:08/5:22:02, loss=0.296564929275887, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=2.748983166081249, lr=0.00412875963350271
2023-11-28 06:57:23   INFO  epoch: 21/24, acc_iter=142127, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:52:31, time_cost(all): 1 day, 21:21:05/5:24:27, loss=0.296457386915785, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=1.4860245639347673, lr=0.004107829369792549
2023-11-28 06:58:21   INFO  epoch: 21/24, acc_iter=142177, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:55:18, time_cost(all): 1 day, 21:22:03/5:28:16, loss=0.296349844555682, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=3.362454885105465, lr=0.00408689910608239
2023-11-28 06:59:19   INFO  epoch: 21/24, acc_iter=142227, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:50:02, time_cost(all): 1 day, 21:23:01/5:09:53, loss=0.296242302195579, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.30925165624992, lr=0.00406596884237223
2023-11-28 07:00:17   INFO  epoch: 21/24, acc_iter=142277, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:46, time_cost(all): 1 day, 21:23:59/5:22:10, loss=0.296134759835477, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.536506666027933, lr=0.00404503857866207
2023-11-28 07:01:14   INFO  epoch: 21/24, acc_iter=142327, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:47:46, time_cost(all): 1 day, 21:24:56/5:31:39, loss=0.296027217475374, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=3.626346934595519, lr=0.00402410831495191
2023-11-28 07:02:12   INFO  epoch: 21/24, acc_iter=142377, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:50:34, time_cost(all): 1 day, 21:25:54/5:16:54, loss=0.295919675115271, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=1.7963475882956947, lr=0.00400317805124175
2023-11-28 07:03:10   INFO  epoch: 21/24, acc_iter=142427, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:45, time_cost(all): 1 day, 21:26:52/5:21:52, loss=0.295812132755169, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=1.2426245315631042, lr=0.00398224778753159
2023-11-28 07:04:08   INFO  epoch: 21/24, acc_iter=142477, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:44:52, time_cost(all): 1 day, 21:27:50/5:06:43, loss=0.295704590395066, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=2.712376737713915, lr=0.00396131752382143
2023-11-28 07:05:05   INFO  epoch: 21/24, acc_iter=142527, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:44:37, time_cost(all): 1 day, 21:28:47/5:18:26, loss=0.295597048034963, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=2.999216701094559, lr=0.00394038726011127
2023-11-28 07:06:03   INFO  epoch: 21/24, acc_iter=142577, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:05, time_cost(all): 1 day, 21:29:45/4:58:20, loss=0.295489505674861, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=1.4310738014341264, lr=0.00391945699640111
2023-11-28 07:07:01   INFO  epoch: 21/24, acc_iter=142627, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:43:53, time_cost(all): 1 day, 21:30:43/5:08:37, loss=0.295381963314758, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=0.6150548081901224, lr=0.00389852673269095
2023-11-28 07:07:59   INFO  epoch: 21/24, acc_iter=142677, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:45:07, time_cost(all): 1 day, 21:31:41/4:58:23, loss=0.295274420954655, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=1.7511160880081245, lr=0.00387759646898079
2023-11-28 07:08:56   INFO  epoch: 21/24, acc_iter=142727, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:41:58, time_cost(all): 1 day, 21:32:38/5:19:54, loss=0.295166878594553, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=1.026555872234501, lr=0.00385666620527063
2023-11-28 07:09:54   INFO  epoch: 21/24, acc_iter=142777, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:42:01, time_cost(all): 1 day, 21:33:36/5:23:14, loss=0.29505933623445, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.3080011896392407, lr=0.00383573594156047
2023-11-28 07:10:52   INFO  epoch: 21/24, acc_iter=142827, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:41:53, time_cost(all): 1 day, 21:34:34/5:05:07, loss=0.294951793874347, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=3.3207895527445253, lr=0.003814805677850311
2023-11-28 07:11:50   INFO  epoch: 21/24, acc_iter=142877, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:39:51, time_cost(all): 1 day, 21:35:32/5:05:45, loss=0.294844251514245, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=4.876978730052827, lr=0.00379387541414015
2023-11-28 07:12:47   INFO  epoch: 21/24, acc_iter=142927, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:39:17, time_cost(all): 1 day, 21:36:29/5:05:20, loss=0.294736709154142, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=3.9478056847068737, lr=0.00377294515042999
2023-11-28 07:13:45   INFO  epoch: 21/24, acc_iter=142977, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:37:42, time_cost(all): 1 day, 21:37:27/5:06:44, loss=0.294629166794039, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=0.7188037759170626, lr=0.003752014886719831
2023-11-28 07:14:43   INFO  epoch: 21/24, acc_iter=143027, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:33, time_cost(all): 1 day, 21:38:25/5:13:41, loss=0.294521624433937, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=3.864448355038133, lr=0.003731084623009671
2023-11-28 07:15:41   INFO  epoch: 21/24, acc_iter=143077, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:36:08, time_cost(all): 1 day, 21:39:23/5:16:25, loss=0.294414082073834, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.1833708318791603, lr=0.003710154359299511
2023-11-28 07:16:38   INFO  epoch: 21/24, acc_iter=143127, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:08, time_cost(all): 1 day, 21:40:20/5:02:34, loss=0.294306539713731, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.97(1.03), norm=0.6539730511922693, lr=0.003689224095589351
2023-11-28 07:17:36   INFO  epoch: 21/24, acc_iter=143177, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:51, time_cost(all): 1 day, 21:41:18/4:54:28, loss=0.294198997353629, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=4.636013200037816, lr=0.003668293831879191
2023-11-28 07:18:34   INFO  epoch: 21/24, acc_iter=143227, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:31:35, time_cost(all): 1 day, 21:42:16/5:09:26, loss=0.294091454993526, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=1.252214461811706, lr=0.003647363568169031
2023-11-28 07:19:32   INFO  epoch: 21/24, acc_iter=143277, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:33:04, time_cost(all): 1 day, 21:43:14/4:57:32, loss=0.293983912633423, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=2.6541336907818813, lr=0.003626433304458871
2023-11-28 07:20:29   INFO  epoch: 21/24, acc_iter=143327, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:30:40, time_cost(all): 1 day, 21:44:11/5:01:43, loss=0.29387637027332, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.23(1.03), norm=2.886656890110778, lr=0.003605503040748711
2023-11-28 07:21:27   INFO  epoch: 21/24, acc_iter=143377, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:06, time_cost(all): 1 day, 21:45:09/5:00:42, loss=0.293768827913218, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.137720938840433, lr=0.00358457277703855
2023-11-28 07:22:25   INFO  epoch: 21/24, acc_iter=143427, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:27:59, time_cost(all): 1 day, 21:46:07/4:49:44, loss=0.293661285553115, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.9293139095101883, lr=0.00356364251332839
2023-11-28 07:23:23   INFO  epoch: 21/24, acc_iter=143477, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:27:07, time_cost(all): 1 day, 21:47:05/5:01:09, loss=0.293553743193012, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=3.3271697662096997, lr=0.003542712249618231
2023-11-28 07:24:20   INFO  epoch: 21/24, acc_iter=143527, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:44, time_cost(all): 1 day, 21:48:02/4:54:04, loss=0.29344620083291, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.8785829718518072, lr=0.00352178198590807
2023-11-28 07:25:18   INFO  epoch: 21/24, acc_iter=143577, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:53, time_cost(all): 1 day, 21:49:00/4:58:09, loss=0.293338658472807, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.442040326067752, lr=0.00350085172219791
2023-11-28 07:26:16   INFO  epoch: 21/24, acc_iter=143627, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:33, time_cost(all): 1 day, 21:49:58/4:41:40, loss=0.293231116112704, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=3.140099839065887, lr=0.003479921458487751
2023-11-28 07:27:14   INFO  epoch: 21/24, acc_iter=143677, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:26, time_cost(all): 1 day, 21:50:56/4:43:40, loss=0.293123573752602, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.755988349393841, lr=0.003458991194777591
2023-11-28 07:28:11   INFO  epoch: 21/24, acc_iter=143727, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:07, time_cost(all): 1 day, 21:51:53/5:04:19, loss=0.293016031392499, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=3.0896528001940555, lr=0.00343806093106743
2023-11-28 07:29:09   INFO  epoch: 21/24, acc_iter=143777, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:26, time_cost(all): 1 day, 21:52:51/4:37:03, loss=0.292908489032396, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=3.815000625762834, lr=0.003417130667357271
2023-11-28 07:30:07   INFO  epoch: 21/24, acc_iter=143827, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:21:38, time_cost(all): 1 day, 21:53:49/4:52:05, loss=0.292800946672294, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=2.7016430692115128, lr=0.003396200403647111
2023-11-28 07:31:05   INFO  epoch: 21/24, acc_iter=143877, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:19:45, time_cost(all): 1 day, 21:54:47/4:34:12, loss=0.292693404312191, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=2.8670518277518973, lr=0.003375270139936951
2023-11-28 07:32:02   INFO  epoch: 21/24, acc_iter=143927, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:05, time_cost(all): 1 day, 21:55:44/4:46:27, loss=0.292585861952088, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=3.931555051983247, lr=0.003354339876226791
2023-11-28 07:33:00   INFO  epoch: 21/24, acc_iter=143977, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:37, time_cost(all): 1 day, 21:56:42/4:56:20, loss=0.292478319591986, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=3.016306231010447, lr=0.003333409612516631
2023-11-28 07:33:58   INFO  epoch: 21/24, acc_iter=144027, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:20, time_cost(all): 1 day, 21:57:40/4:48:02, loss=0.292370777231883, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.3488636973931107, lr=0.003312479348806471
2023-11-28 07:34:56   INFO  epoch: 21/24, acc_iter=144077, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:12, time_cost(all): 1 day, 21:58:38/4:35:46, loss=0.29226323487178, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=4.640966986865321, lr=0.003291549085096311
2023-11-28 07:35:53   INFO  epoch: 21/24, acc_iter=144127, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:15:19, time_cost(all): 1 day, 21:59:35/4:45:00, loss=0.292155692511678, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=2.396389793602274, lr=0.003270618821386151
2023-11-28 07:36:51   INFO  epoch: 21/24, acc_iter=144177, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:13:52, time_cost(all): 1 day, 22:00:33/4:54:55, loss=0.292048150151575, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.991203588410368, lr=0.003249688557675991
2023-11-28 07:37:49   INFO  epoch: 21/24, acc_iter=144227, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:39, time_cost(all): 1 day, 22:01:31/4:54:21, loss=0.291940607791472, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=1.4804650370869448, lr=0.003228758293965831
2023-11-28 07:38:47   INFO  epoch: 21/24, acc_iter=144277, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:15, time_cost(all): 1 day, 22:02:29/4:32:56, loss=0.29183306543137, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=4.6066054344788645, lr=0.003207828030255671
2023-11-28 07:39:44   INFO  epoch: 21/24, acc_iter=144327, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:10:59, time_cost(all): 1 day, 22:03:26/4:28:11, loss=0.291725523071267, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=4.784685678737107, lr=0.003186897766545511
2023-11-28 07:40:42   INFO  epoch: 21/24, acc_iter=144377, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:41, time_cost(all): 1 day, 22:04:24/4:49:20, loss=0.291617980711164, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.3665246656782015, lr=0.003165967502835351
2023-11-28 07:41:40   INFO  epoch: 21/24, acc_iter=144427, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:19, time_cost(all): 1 day, 22:05:22/4:27:32, loss=0.291510438351062, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=2.0406859757536377, lr=0.003145037239125192
2023-11-28 07:42:38   INFO  epoch: 21/24, acc_iter=144477, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:28, time_cost(all): 1 day, 22:06:20/4:45:24, loss=0.291402895990959, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.1756211111634838, lr=0.003124106975415031
2023-11-28 07:43:35   INFO  epoch: 21/24, acc_iter=144527, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:20, time_cost(all): 1 day, 22:07:17/4:27:02, loss=0.291295353630856, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=1.8479057147735494, lr=0.003103176711704871
2023-11-28 07:44:33   INFO  epoch: 21/24, acc_iter=144577, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:37, time_cost(all): 1 day, 22:08:15/4:30:53, loss=0.291187811270753, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=3.9760099653453995, lr=0.003082246447994712
2023-11-28 07:45:31   INFO  epoch: 21/24, acc_iter=144627, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:18, time_cost(all): 1 day, 22:09:13/4:43:22, loss=0.291080268910651, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=4.361821354858566, lr=0.003061316184284552
2023-11-28 07:46:29   INFO  epoch: 21/24, acc_iter=144677, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:41, time_cost(all): 1 day, 22:10:11/4:23:26, loss=0.290972726550548, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=3.1972837849666162, lr=0.003040385920574392
2023-11-28 07:47:26   INFO  epoch: 21/24, acc_iter=144727, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:35, time_cost(all): 1 day, 22:11:08/4:35:47, loss=0.290865184190445, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=0.794511141312547, lr=0.003019455656864232
2023-11-28 07:48:24   INFO  epoch: 21/24, acc_iter=144777, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:41, time_cost(all): 1 day, 22:12:06/4:38:58, loss=0.290757641830343, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=2.8614013073973283, lr=0.002998525393154072
2023-11-28 07:49:22   INFO  epoch: 21/24, acc_iter=144827, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:43, time_cost(all): 1 day, 22:13:04/4:30:11, loss=0.29065009947024, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=0.7811958716957262, lr=0.002977595129443912
2023-11-28 07:50:20   INFO  epoch: 21/24, acc_iter=144877, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:43, time_cost(all): 1 day, 22:14:02/4:36:45, loss=0.290542557110137, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=4.1748779461168954, lr=0.002956664865733752
2023-11-28 07:51:17   INFO  epoch: 22/24, acc_iter=144964, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:05:40, time_cost(all): 1 day, 22:14:59/4:28:36, loss=0.290355433403559, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=1.3217005490012914, lr=0.002920246206878074
2023-11-28 07:52:15   INFO  epoch: 22/24, acc_iter=145014, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:05:43, time_cost(all): 1 day, 22:15:57/4:22:27, loss=0.290247891043456, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.9932207587100335, lr=0.002899315943167914
2023-11-28 07:53:13   INFO  epoch: 22/24, acc_iter=145064, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:03:55, time_cost(all): 1 day, 22:16:55/4:39:02, loss=0.290140348683353, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.39453238657655, lr=0.002878385679457754
2023-11-28 07:54:11   INFO  epoch: 22/24, acc_iter=145114, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:07:52, time_cost(all): 1 day, 22:17:53/4:35:30, loss=0.290032806323251, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=2.644470032810222, lr=0.002857455415747593
2023-11-28 07:55:08   INFO  epoch: 22/24, acc_iter=145164, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:04:51, time_cost(all): 1 day, 22:18:50/4:22:46, loss=0.289925263963148, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=3.191453559708939, lr=0.002836525152037433
2023-11-28 07:56:06   INFO  epoch: 22/24, acc_iter=145214, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/1:55:33, time_cost(all): 1 day, 22:19:48/4:24:21, loss=0.289817721603045, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=3.0710483102579498, lr=0.002815594888327273
2023-11-28 07:57:04   INFO  epoch: 22/24, acc_iter=145264, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:55:40, time_cost(all): 1 day, 22:20:46/4:32:08, loss=0.289710179242943, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=1.7714428992840259, lr=0.002794664624617113
2023-11-28 07:58:02   INFO  epoch: 22/24, acc_iter=145314, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:03:14, time_cost(all): 1 day, 22:21:44/4:30:12, loss=0.28960263688284, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=0.6661746125835676, lr=0.002773734360906954
2023-11-28 07:58:59   INFO  epoch: 22/24, acc_iter=145364, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:58:00, time_cost(all): 1 day, 22:22:41/4:10:53, loss=0.289495094522737, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=0.805003690026551, lr=0.002752804097196793
2023-11-28 07:59:57   INFO  epoch: 22/24, acc_iter=145414, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/2:02:01, time_cost(all): 1 day, 22:23:39/4:10:54, loss=0.289387552162635, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=1.8732973689403214, lr=0.002731873833486633
2023-11-28 08:00:55   INFO  epoch: 22/24, acc_iter=145464, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:58:54, time_cost(all): 1 day, 22:24:37/4:17:41, loss=0.289280009802532, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=1.1180493067295207, lr=0.002710943569776474
2023-11-28 08:01:53   INFO  epoch: 22/24, acc_iter=145514, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:49:56, time_cost(all): 1 day, 22:25:35/4:07:40, loss=0.289172467442429, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=2.3384561565699946, lr=0.002690013306066314
2023-11-28 08:02:50   INFO  epoch: 22/24, acc_iter=145564, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:56:51, time_cost(all): 1 day, 22:26:32/4:05:33, loss=0.289064925082327, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=4.287612486703093, lr=0.002669083042356153
2023-11-28 08:03:48   INFO  epoch: 22/24, acc_iter=145614, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:47:45, time_cost(all): 1 day, 22:27:30/4:21:10, loss=0.288957382722224, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=4.198743512963047, lr=0.002648152778645994
2023-11-28 08:04:46   INFO  epoch: 22/24, acc_iter=145664, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:57:01, time_cost(all): 1 day, 22:28:28/4:08:45, loss=0.288849840362121, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.7625100847305073, lr=0.002627222514935834
2023-11-28 08:05:44   INFO  epoch: 22/24, acc_iter=145714, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:53:00, time_cost(all): 1 day, 22:29:26/4:13:18, loss=0.288742298002019, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=2.580559773678345, lr=0.002606292251225674
2023-11-28 08:06:41   INFO  epoch: 22/24, acc_iter=145764, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:52:48, time_cost(all): 1 day, 22:30:23/4:22:36, loss=0.288634755641916, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.9869690131212885, lr=0.002585361987515514
2023-11-28 08:07:39   INFO  epoch: 22/24, acc_iter=145814, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:45:38, time_cost(all): 1 day, 22:31:21/4:21:53, loss=0.288527213281813, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=2.9926742507124366, lr=0.002564431723805354
2023-11-28 08:08:37   INFO  epoch: 22/24, acc_iter=145864, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:52:29, time_cost(all): 1 day, 22:32:19/4:09:45, loss=0.288419670921711, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=3.0609900717876672, lr=0.002543501460095194
2023-11-28 08:09:35   INFO  epoch: 22/24, acc_iter=145914, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:42:58, time_cost(all): 1 day, 22:33:17/4:05:47, loss=0.288312128561608, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=4.303280717083843, lr=0.002522571196385034
2023-11-28 08:10:33   INFO  epoch: 22/24, acc_iter=145964, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:47:49, time_cost(all): 1 day, 22:34:15/3:57:54, loss=0.288204586201505, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.05766901364884, lr=0.002501640932674874
2023-11-28 08:11:30   INFO  epoch: 22/24, acc_iter=146014, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:49:14, time_cost(all): 1 day, 22:35:12/4:09:52, loss=0.288097043841403, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.3371934984569798, lr=0.002480710668964714
2023-11-28 08:12:28   INFO  epoch: 22/24, acc_iter=146064, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:48:20, time_cost(all): 1 day, 22:36:10/4:08:23, loss=0.2879895014813, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=2.8687429782423712, lr=0.002459780405254554
2023-11-28 08:13:26   INFO  epoch: 22/24, acc_iter=146114, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:48:09, time_cost(all): 1 day, 22:37:08/4:15:03, loss=0.287881959121197, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.7075208116953464, lr=0.002438850141544394
2023-11-28 08:14:24   INFO  epoch: 22/24, acc_iter=146164, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:46:16, time_cost(all): 1 day, 22:38:06/3:55:32, loss=0.287774416761094, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=2.785516741966176, lr=0.002417919877834234
2023-11-28 08:15:21   INFO  epoch: 22/24, acc_iter=146214, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:40:50, time_cost(all): 1 day, 22:39:03/4:00:54, loss=0.287666874400992, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=4.354038887542254, lr=0.002396989614124074
2023-11-28 08:16:19   INFO  epoch: 22/24, acc_iter=146264, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:40:57, time_cost(all): 1 day, 22:40:01/3:51:46, loss=0.287559332040889, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=2.7858334555661246, lr=0.002376059350413915
2023-11-28 08:17:17   INFO  epoch: 22/24, acc_iter=146314, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:41:59, time_cost(all): 1 day, 22:40:59/3:54:28, loss=0.287451789680786, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.8383469421428718, lr=0.002355129086703754
2023-11-28 08:18:15   INFO  epoch: 22/24, acc_iter=146364, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:43:23, time_cost(all): 1 day, 22:41:57/3:53:47, loss=0.287344247320684, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.6438783872139455, lr=0.002334198822993594
2023-11-28 08:19:12   INFO  epoch: 22/24, acc_iter=146414, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:35:18, time_cost(all): 1 day, 22:42:54/4:12:05, loss=0.287236704960581, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=2.9579954531701835, lr=0.002313268559283435
2023-11-28 08:20:10   INFO  epoch: 22/24, acc_iter=146464, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:32:39, time_cost(all): 1 day, 22:43:52/4:11:23, loss=0.287129162600478, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=4.45159755692499, lr=0.002292338295573275
2023-11-28 08:21:08   INFO  epoch: 22/24, acc_iter=146514, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:34:05, time_cost(all): 1 day, 22:44:50/3:56:15, loss=0.287021620240376, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=4.681456067177115, lr=0.002271408031863115
2023-11-28 08:22:06   INFO  epoch: 22/24, acc_iter=146564, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:34:40, time_cost(all): 1 day, 22:45:48/4:06:46, loss=0.286914077880273, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=1.9144399326503694, lr=0.002250477768152955
2023-11-28 08:23:03   INFO  epoch: 22/24, acc_iter=146614, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:31:38, time_cost(all): 1 day, 22:46:45/3:49:18, loss=0.28680653552017, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=4.065097706511, lr=0.002229547504442795
2023-11-28 08:24:01   INFO  epoch: 22/24, acc_iter=146664, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:30:15, time_cost(all): 1 day, 22:47:43/3:51:19, loss=0.286698993160068, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=4.745962796357388, lr=0.002208617240732635
2023-11-28 08:24:59   INFO  epoch: 22/24, acc_iter=146714, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:29:18, time_cost(all): 1 day, 22:48:41/4:03:29, loss=0.286591450799965, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.457256326745648, lr=0.002187686977022475
2023-11-28 08:25:57   INFO  epoch: 22/24, acc_iter=146764, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:32:15, time_cost(all): 1 day, 22:49:39/4:04:25, loss=0.286483908439862, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.769431305961731, lr=0.002166756713312314
2023-11-28 08:26:54   INFO  epoch: 22/24, acc_iter=146814, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:26:07, time_cost(all): 1 day, 22:50:36/3:49:22, loss=0.28637636607976, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=4.750758900528408, lr=0.002145826449602156
2023-11-28 08:27:52   INFO  epoch: 22/24, acc_iter=146864, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:30:26, time_cost(all): 1 day, 22:51:34/3:45:48, loss=0.286268823719657, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.22(1.03), norm=2.787612408163117, lr=0.002124896185891996
2023-11-28 08:28:50   INFO  epoch: 22/24, acc_iter=146914, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:29:49, time_cost(all): 1 day, 22:52:32/4:02:18, loss=0.286161281359554, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=0.7951582885597093, lr=0.002103965922181835
2023-11-28 08:29:48   INFO  epoch: 22/24, acc_iter=146964, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:24:11, time_cost(all): 1 day, 22:53:30/3:55:44, loss=0.286053738999452, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=3.086057559911033, lr=0.002083035658471675
2023-11-28 08:30:45   INFO  epoch: 22/24, acc_iter=147014, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:07, time_cost(all): 1 day, 22:54:27/3:51:38, loss=0.285946196639349, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=2.6813777582741203, lr=0.002062105394761515
2023-11-28 08:31:43   INFO  epoch: 22/24, acc_iter=147064, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:28:35, time_cost(all): 1 day, 22:55:25/3:49:44, loss=0.285838654279246, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=2.3450956275265016, lr=0.002041175131051355
2023-11-28 08:32:41   INFO  epoch: 22/24, acc_iter=147114, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:27:46, time_cost(all): 1 day, 22:56:23/3:49:47, loss=0.285731111919144, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=4.229282978297991, lr=0.002020244867341195
2023-11-28 08:33:39   INFO  epoch: 22/24, acc_iter=147164, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:24:13, time_cost(all): 1 day, 22:57:21/3:47:45, loss=0.285623569559041, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=4.298918225548356, lr=0.001999314603631034
2023-11-28 08:34:36   INFO  epoch: 22/24, acc_iter=147214, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:19:13, time_cost(all): 1 day, 22:58:18/3:44:32, loss=0.285516027198938, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.724402389484557, lr=0.001978384339920874
2023-11-28 08:35:34   INFO  epoch: 22/24, acc_iter=147264, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:23:59, time_cost(all): 1 day, 22:59:16/3:55:05, loss=0.285408484838836, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=1.6902013544590588, lr=0.001957454076210714
2023-11-28 08:36:32   INFO  epoch: 22/24, acc_iter=147314, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:23:02, time_cost(all): 1 day, 23:00:14/3:37:32, loss=0.285300942478733, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=3.9767321390271855, lr=0.001936523812500555
2023-11-28 08:37:30   INFO  epoch: 22/24, acc_iter=147364, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:20:14, time_cost(all): 1 day, 23:01:12/3:50:05, loss=0.28519340011863, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=4.354804718773878, lr=0.001915593548790395
2023-11-28 08:38:27   INFO  epoch: 22/24, acc_iter=147414, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:17:36, time_cost(all): 1 day, 23:02:09/3:44:55, loss=0.285085857758528, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.92(1.03), norm=2.596233400781233, lr=0.001894663285080235
2023-11-28 08:39:25   INFO  epoch: 22/24, acc_iter=147464, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:16:23, time_cost(all): 1 day, 23:03:07/3:40:00, loss=0.284978315398425, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=4.9366055217089055, lr=0.001873733021370075
2023-11-28 08:40:23   INFO  epoch: 22/24, acc_iter=147514, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:15:13, time_cost(all): 1 day, 23:04:05/3:32:55, loss=0.284870773038322, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=1.703659635836909, lr=0.001852802757659915
2023-11-28 08:41:21   INFO  epoch: 22/24, acc_iter=147564, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:41, time_cost(all): 1 day, 23:05:03/3:32:09, loss=0.28476323067822, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=2.0304567860243026, lr=0.001831872493949754
2023-11-28 08:42:18   INFO  epoch: 22/24, acc_iter=147614, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:18:31, time_cost(all): 1 day, 23:06:00/3:34:42, loss=0.284655688318117, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=3.1873629897213775, lr=0.001810942230239596
2023-11-28 08:43:16   INFO  epoch: 22/24, acc_iter=147664, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:13:11, time_cost(all): 1 day, 23:06:58/3:33:11, loss=0.284548145958014, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=2.969546828610068, lr=0.001790011966529436
2023-11-28 08:44:14   INFO  epoch: 22/24, acc_iter=147714, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:09:35, time_cost(all): 1 day, 23:07:56/3:30:12, loss=0.284440603597911, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=2.434885220269316, lr=0.001769081702819275
2023-11-28 08:45:12   INFO  epoch: 22/24, acc_iter=147764, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:08:27, time_cost(all): 1 day, 23:08:54/3:41:41, loss=0.284333061237809, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=2.1114947000366637, lr=0.001748151439109115
2023-11-28 08:46:09   INFO  epoch: 22/24, acc_iter=147814, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:13:13, time_cost(all): 1 day, 23:09:51/3:32:17, loss=0.284225518877706, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.436499752937082, lr=0.001727221175398955
2023-11-28 08:47:07   INFO  epoch: 22/24, acc_iter=147864, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:08:22, time_cost(all): 1 day, 23:10:49/3:34:51, loss=0.284117976517603, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=4.979315870743318, lr=0.001706290911688795
2023-11-28 08:48:05   INFO  epoch: 22/24, acc_iter=147914, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:12:13, time_cost(all): 1 day, 23:11:47/3:25:52, loss=0.284010434157501, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=3.4433415421670133, lr=0.001685360647978636
2023-11-28 08:49:03   INFO  epoch: 22/24, acc_iter=147964, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:11:19, time_cost(all): 1 day, 23:12:45/3:34:58, loss=0.283902891797398, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=1.2351097589053963, lr=0.001664430384268476
2023-11-28 08:50:00   INFO  epoch: 22/24, acc_iter=148014, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:08:05, time_cost(all): 1 day, 23:13:42/3:19:08, loss=0.283795349437295, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.876570882742738, lr=0.001643500120558316
2023-11-28 08:50:58   INFO  epoch: 22/24, acc_iter=148064, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:07:12, time_cost(all): 1 day, 23:14:40/3:29:39, loss=0.283687807077193, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=2.2987534124348663, lr=0.001622569856848156
2023-11-28 08:51:56   INFO  epoch: 22/24, acc_iter=148114, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:02:55, time_cost(all): 1 day, 23:15:38/3:24:47, loss=0.28358026471709, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=3.9039445032543365, lr=0.001601639593137996
2023-11-28 08:52:54   INFO  epoch: 22/24, acc_iter=148164, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:02:48, time_cost(all): 1 day, 23:16:36/3:36:29, loss=0.283472722356987, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.3124970168222347, lr=0.001580709329427835
2023-11-28 08:53:51   INFO  epoch: 22/24, acc_iter=148214, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:03:20, time_cost(all): 1 day, 23:17:33/3:30:13, loss=0.283365179996885, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=1.3346476104757559, lr=0.001559779065717675
2023-11-28 08:54:49   INFO  epoch: 22/24, acc_iter=148264, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:00:36, time_cost(all): 1 day, 23:18:31/3:32:10, loss=0.283257637636782, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=4.463680946053719, lr=0.001538848802007517
2023-11-28 08:55:47   INFO  epoch: 22/24, acc_iter=148314, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:02:31, time_cost(all): 1 day, 23:19:29/3:22:16, loss=0.283150095276679, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.6084694751117876, lr=0.001517918538297356
2023-11-28 08:56:45   INFO  epoch: 22/24, acc_iter=148364, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:57:50, time_cost(all): 1 day, 23:20:27/3:26:20, loss=0.283042552916577, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=0.7793955491483097, lr=0.001496988274587196
2023-11-28 08:57:42   INFO  epoch: 22/24, acc_iter=148414, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/1:01:39, time_cost(all): 1 day, 23:21:24/3:26:38, loss=0.282935010556474, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=2.558939966510732, lr=0.001476058010877036
2023-11-28 08:58:40   INFO  epoch: 22/24, acc_iter=148464, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/0:57:43, time_cost(all): 1 day, 23:22:22/3:22:26, loss=0.282827468196371, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=2.3695420075909333, lr=0.001455127747166876
2023-11-28 08:59:38   INFO  epoch: 22/24, acc_iter=148514, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:54:48, time_cost(all): 1 day, 23:23:20/3:27:37, loss=0.282719925836269, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=4.073639902573152, lr=0.001434197483456716
2023-11-28 09:00:36   INFO  epoch: 22/24, acc_iter=148564, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:57:57, time_cost(all): 1 day, 23:24:18/3:23:32, loss=0.282612383476166, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=4.489967163394201, lr=0.001413267219746557
2023-11-28 09:01:33   INFO  epoch: 22/24, acc_iter=148614, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:54:01, time_cost(all): 1 day, 23:25:15/3:13:11, loss=0.282504841116063, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=3.3991460420726534, lr=0.001392336956036397
2023-11-28 09:02:31   INFO  epoch: 22/24, acc_iter=148664, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:56:13, time_cost(all): 1 day, 23:26:13/3:11:17, loss=0.282397298755961, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=0.8219493015731509, lr=0.001371406692326237
2023-11-28 09:03:29   INFO  epoch: 22/24, acc_iter=148714, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:54:20, time_cost(all): 1 day, 23:27:11/3:24:45, loss=0.282289756395858, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.9118484492252663, lr=0.001350476428616076
2023-11-28 09:04:27   INFO  epoch: 22/24, acc_iter=148764, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:51:13, time_cost(all): 1 day, 23:28:09/3:20:06, loss=0.282182214035755, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=2.1392679633345164, lr=0.001329546164905916
2023-11-28 09:05:24   INFO  epoch: 22/24, acc_iter=148814, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:49:14, time_cost(all): 1 day, 23:29:06/3:05:51, loss=0.282074671675653, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.029461796127335, lr=0.001308615901195756
2023-11-28 09:06:22   INFO  epoch: 22/24, acc_iter=148864, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:08, time_cost(all): 1 day, 23:30:04/3:12:27, loss=0.28196712931555, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=3.33805036075712, lr=0.001287685637485598
2023-11-28 09:07:20   INFO  epoch: 22/24, acc_iter=148914, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:51:13, time_cost(all): 1 day, 23:31:02/3:05:16, loss=0.281859586955447, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=0.6436365927222254, lr=0.001266755373775437
2023-11-28 09:08:18   INFO  epoch: 22/24, acc_iter=148964, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:49:41, time_cost(all): 1 day, 23:32:00/3:07:22, loss=0.281752044595344, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=0.7456098538403304, lr=0.001245825110065275
2023-11-28 09:09:15   INFO  epoch: 22/24, acc_iter=149014, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:48:36, time_cost(all): 1 day, 23:32:57/3:01:52, loss=0.281644502235242, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=1.981413910950348, lr=0.001224894846355115
2023-11-28 09:10:13   INFO  epoch: 22/24, acc_iter=149064, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:48:30, time_cost(all): 1 day, 23:33:55/3:08:49, loss=0.281536959875139, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=4.134286772744729, lr=0.001203964582644957
2023-11-28 09:11:11   INFO  epoch: 22/24, acc_iter=149114, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:44:03, time_cost(all): 1 day, 23:34:53/3:15:18, loss=0.281429417515036, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=3.182815612707044, lr=0.001183034318934796
2023-11-28 09:12:09   INFO  epoch: 22/24, acc_iter=149164, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:44:18, time_cost(all): 1 day, 23:35:51/3:16:36, loss=0.281321875154934, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=3.1240834559023236, lr=0.001162104055224636
2023-11-28 09:13:06   INFO  epoch: 22/24, acc_iter=149214, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:44:08, time_cost(all): 1 day, 23:36:48/3:13:21, loss=0.281214332794831, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=0.7654295749594285, lr=0.001141173791514476
2023-11-28 09:14:04   INFO  epoch: 22/24, acc_iter=149264, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:42:25, time_cost(all): 1 day, 23:37:46/3:06:56, loss=0.281106790434728, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=0.6781272784876294, lr=0.001120243527804316
2023-11-28 09:15:02   INFO  epoch: 22/24, acc_iter=149314, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:42:51, time_cost(all): 1 day, 23:38:44/2:59:13, loss=0.280999248074626, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=1.0925546795083234, lr=0.001099313264094156
2023-11-28 09:16:00   INFO  epoch: 22/24, acc_iter=149364, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:40:48, time_cost(all): 1 day, 23:39:42/3:10:32, loss=0.280891705714523, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=4.628302817751238, lr=0.001078383000383997
2023-11-28 09:16:57   INFO  epoch: 22/24, acc_iter=149414, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:22, time_cost(all): 1 day, 23:40:39/2:54:15, loss=0.28078416335442, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=4.962867642321555, lr=0.001057452736673837
2023-11-28 09:17:55   INFO  epoch: 22/24, acc_iter=149464, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:39:38, time_cost(all): 1 day, 23:41:37/2:54:13, loss=0.280676620994318, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.4052475921908654, lr=0.001036522472963677
2023-11-28 09:18:53   INFO  epoch: 22/24, acc_iter=149514, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:37:44, time_cost(all): 1 day, 23:42:35/2:53:09, loss=0.280569078634215, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.4235958899255365, lr=0.001015592209253516
2023-11-28 09:19:51   INFO  epoch: 22/24, acc_iter=149564, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:36:35, time_cost(all): 1 day, 23:43:33/2:55:11, loss=0.280461536274112, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=1.2625610121134767, lr=0.000998506372821887
2023-11-28 09:20:48   INFO  epoch: 22/24, acc_iter=149614, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:34:53, time_cost(all): 1 day, 23:44:30/3:03:48, loss=0.28035399391401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=1.449316990967483, lr=0.000992649929996794
2023-11-28 09:21:46   INFO  epoch: 22/24, acc_iter=149664, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:33:52, time_cost(all): 1 day, 23:45:28/3:03:44, loss=0.280246451553907, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.9946644223974284, lr=0.000986793487171701
2023-11-28 09:22:44   INFO  epoch: 22/24, acc_iter=149714, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:35:57, time_cost(all): 1 day, 23:46:26/2:51:19, loss=0.280138909193804, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=2.281376941265025, lr=0.000980937044346608
2023-11-28 09:23:42   INFO  epoch: 22/24, acc_iter=149764, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:34:33, time_cost(all): 1 day, 23:47:24/3:03:41, loss=0.280031366833702, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=4.763474059575732, lr=0.000975080601521515
2023-11-28 09:24:39   INFO  epoch: 22/24, acc_iter=149814, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:32:31, time_cost(all): 1 day, 23:48:21/3:01:12, loss=0.279923824473599, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.148118612512283, lr=0.000969224158696421
2023-11-28 09:25:37   INFO  epoch: 22/24, acc_iter=149864, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:32:30, time_cost(all): 1 day, 23:49:19/2:59:56, loss=0.279816282113496, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=4.524002900416889, lr=0.000963367715871328
2023-11-28 09:26:35   INFO  epoch: 22/24, acc_iter=149914, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:25, time_cost(all): 1 day, 23:50:17/2:45:57, loss=0.279708739753394, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=4.319613965989959, lr=0.000957511273046235
2023-11-28 09:27:33   INFO  epoch: 22/24, acc_iter=149964, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:10, time_cost(all): 1 day, 23:51:15/2:51:55, loss=0.279601197393291, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=4.758778348908166, lr=0.000951654830221142
2023-11-28 09:28:30   INFO  epoch: 22/24, acc_iter=150014, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:29:18, time_cost(all): 1 day, 23:52:12/2:47:48, loss=0.279493655033188, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=2.4407137040306153, lr=0.000945798387396049
2023-11-28 09:29:28   INFO  epoch: 22/24, acc_iter=150064, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:26:28, time_cost(all): 1 day, 23:53:10/2:45:45, loss=0.279386112673086, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=2.417279094217401, lr=0.000939941944570955
2023-11-28 09:30:26   INFO  epoch: 22/24, acc_iter=150114, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:28:01, time_cost(all): 1 day, 23:54:08/2:41:42, loss=0.279278570312983, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=1.3179366729548714, lr=0.000934085501745862
2023-11-28 09:31:24   INFO  epoch: 22/24, acc_iter=150164, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:26:03, time_cost(all): 1 day, 23:55:06/2:56:26, loss=0.27917102795288, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.55172939846292, lr=0.000928229058920769
2023-11-28 09:32:21   INFO  epoch: 22/24, acc_iter=150214, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:25:54, time_cost(all): 1 day, 23:56:03/2:53:47, loss=0.279063485592777, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.973774394113489, lr=0.000922372616095676
2023-11-28 09:33:19   INFO  epoch: 22/24, acc_iter=150264, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:24:19, time_cost(all): 1 day, 23:57:01/2:53:43, loss=0.278955943232675, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=1.1995120299927022, lr=0.000916516173270583
2023-11-28 09:34:17   INFO  epoch: 22/24, acc_iter=150314, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:23:58, time_cost(all): 1 day, 23:57:59/2:39:27, loss=0.278848400872572, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=3.7320750487874212, lr=0.000910659730445489
2023-11-28 09:35:15   INFO  epoch: 22/24, acc_iter=150364, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:20:56, time_cost(all): 1 day, 23:58:57/2:36:43, loss=0.278740858512469, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.8648975022001104, lr=0.000904803287620396
2023-11-28 09:36:12   INFO  epoch: 22/24, acc_iter=150414, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:04, time_cost(all): 1 day, 23:59:54/2:42:56, loss=0.278633316152367, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.86(1.03), norm=1.0886127041804157, lr=0.000898946844795303
2023-11-28 09:37:10   INFO  epoch: 22/24, acc_iter=150464, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:47, time_cost(all): 2 days, 0:00:52/2:37:48, loss=0.278525773792264, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=1.536387944213351, lr=0.00089309040197021
2023-11-28 09:38:08   INFO  epoch: 22/24, acc_iter=150514, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:18:27, time_cost(all): 2 days, 0:01:50/2:46:23, loss=0.278418231432161, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=0.7853078310729277, lr=0.000887233959145117
2023-11-28 09:39:06   INFO  epoch: 22/24, acc_iter=150564, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:18:42, time_cost(all): 2 days, 0:02:48/2:37:26, loss=0.278310689072059, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=3.7623137303763237, lr=0.000881377516320024
2023-11-28 09:40:03   INFO  epoch: 22/24, acc_iter=150614, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:16:47, time_cost(all): 2 days, 0:03:45/2:41:12, loss=0.278203146711956, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=1.8348548765578863, lr=0.00087552107349493
2023-11-28 09:41:01   INFO  epoch: 22/24, acc_iter=150664, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:16:39, time_cost(all): 2 days, 0:04:43/2:34:02, loss=0.278095604351853, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=1.6538074491328854, lr=0.000869664630669837
2023-11-28 09:41:59   INFO  epoch: 22/24, acc_iter=150714, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:32, time_cost(all): 2 days, 0:05:41/2:40:37, loss=0.277988061991751, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=4.429405471605955, lr=0.000863808187844744
2023-11-28 09:42:57   INFO  epoch: 22/24, acc_iter=150764, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:38, time_cost(all): 2 days, 0:06:39/2:37:03, loss=0.277880519631648, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.4854939055782896, lr=0.000857951745019651
2023-11-28 09:43:54   INFO  epoch: 22/24, acc_iter=150814, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:36, time_cost(all): 2 days, 0:07:36/2:32:34, loss=0.277772977271545, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.08146853812323, lr=0.000852095302194558
2023-11-28 09:44:52   INFO  epoch: 22/24, acc_iter=150864, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:11:54, time_cost(all): 2 days, 0:08:34/2:34:11, loss=0.277665434911443, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=2.446283877298642, lr=0.000846238859369464
2023-11-28 09:45:50   INFO  epoch: 22/24, acc_iter=150914, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:28, time_cost(all): 2 days, 0:09:32/2:36:35, loss=0.27755789255134, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=1.0159071044687433, lr=0.000840382416544371
2023-11-28 09:46:48   INFO  epoch: 22/24, acc_iter=150964, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:10:35, time_cost(all): 2 days, 0:10:30/2:28:23, loss=0.277450350191237, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=3.226604749179523, lr=0.000834525973719278
2023-11-28 09:47:45   INFO  epoch: 22/24, acc_iter=151014, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:11, time_cost(all): 2 days, 0:11:27/2:35:09, loss=0.277342807831135, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=2.258962026752309, lr=0.000828669530894185
2023-11-28 09:48:43   INFO  epoch: 22/24, acc_iter=151064, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:38, time_cost(all): 2 days, 0:12:25/2:27:02, loss=0.277235265471032, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.234457119766146, lr=0.000822813088069092
2023-11-28 09:49:41   INFO  epoch: 22/24, acc_iter=151114, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:14, time_cost(all): 2 days, 0:13:23/2:26:32, loss=0.277127723110929, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=3.001298744639779, lr=0.000816956645243998
2023-11-28 09:50:39   INFO  epoch: 22/24, acc_iter=151164, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:42, time_cost(all): 2 days, 0:14:21/2:32:44, loss=0.277020180750827, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=3.7670681667837407, lr=0.000811100202418905
2023-11-28 09:51:37   INFO  epoch: 22/24, acc_iter=151214, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:38, time_cost(all): 2 days, 0:15:19/2:29:30, loss=0.276912638390724, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.635255220602006, lr=0.000805243759593812
2023-11-28 09:52:34   INFO  epoch: 22/24, acc_iter=151264, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:31, time_cost(all): 2 days, 0:16:16/2:23:15, loss=0.276805096030621, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.9796632697950463, lr=0.000799387316768719
2023-11-28 09:53:32   INFO  epoch: 22/24, acc_iter=151314, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:41, time_cost(all): 2 days, 0:17:14/2:26:04, loss=0.276697553670519, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.5705686574384594, lr=0.000793530873943626
2023-11-28 09:54:30   INFO  epoch: 22/24, acc_iter=151364, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:35, time_cost(all): 2 days, 0:18:12/2:25:19, loss=0.276590011310416, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=4.546860206234191, lr=0.000787674431118533
2023-11-28 09:55:28   INFO  epoch: 22/24, acc_iter=151414, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:42, time_cost(all): 2 days, 0:19:10/2:17:33, loss=0.276482468950313, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.6576439722296112, lr=0.000781817988293439
2023-11-28 09:56:25   INFO  epoch: 22/24, acc_iter=151464, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:41, time_cost(all): 2 days, 0:20:07/2:21:25, loss=0.276374926590211, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.9303004013749998, lr=0.000775961545468346
2023-11-28 09:57:23   INFO  epoch: 23/24, acc_iter=151551, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:57/2:06:18, time_cost(all): 2 days, 0:21:05/2:23:07, loss=0.276187802883632, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=3.8674554843233784, lr=0.000765771334952684
2023-11-28 09:58:21   INFO  epoch: 23/24, acc_iter=151601, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:55/2:03:59, time_cost(all): 2 days, 0:22:03/2:16:53, loss=0.276080260523529, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.972982088863273, lr=0.000759914892127591
2023-11-28 09:59:19   INFO  epoch: 23/24, acc_iter=151651, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:53/2:04:20, time_cost(all): 2 days, 0:23:01/2:18:06, loss=0.275972718163427, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.0703340977475295, lr=0.000754058449302497
2023-11-28 10:00:16   INFO  epoch: 23/24, acc_iter=151701, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:51/2:02:18, time_cost(all): 2 days, 0:23:58/2:14:23, loss=0.275865175803324, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.815398807213715, lr=0.000748202006477404
2023-11-28 10:01:14   INFO  epoch: 23/24, acc_iter=151751, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:48/2:08:03, time_cost(all): 2 days, 0:24:56/2:16:18, loss=0.275757633443221, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.8077961578336232, lr=0.000742345563652311
2023-11-28 10:02:12   INFO  epoch: 23/24, acc_iter=151801, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:46/2:02:29, time_cost(all): 2 days, 0:25:54/2:22:03, loss=0.275650091083118, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.9017839805736867, lr=0.000736489120827218
2023-11-28 10:03:10   INFO  epoch: 23/24, acc_iter=151851, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:44/1:58:32, time_cost(all): 2 days, 0:26:52/2:13:23, loss=0.275542548723016, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.925455117232655, lr=0.000730632678002125
2023-11-28 10:04:07   INFO  epoch: 23/24, acc_iter=151901, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:42/2:04:38, time_cost(all): 2 days, 0:27:49/2:09:39, loss=0.275435006362913, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=2.4531416408467126, lr=0.000724776235177032
2023-11-28 10:05:05   INFO  epoch: 23/24, acc_iter=151951, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:39/1:57:44, time_cost(all): 2 days, 0:28:47/2:21:12, loss=0.27532746400281, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=1.6144723670338224, lr=0.000718919792351938
2023-11-28 10:06:03   INFO  epoch: 23/24, acc_iter=152001, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:37/1:51:51, time_cost(all): 2 days, 0:29:45/2:12:18, loss=0.275219921642708, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=2.795789782126841, lr=0.000713063349526845
2023-11-28 10:07:01   INFO  epoch: 23/24, acc_iter=152051, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:35/1:51:36, time_cost(all): 2 days, 0:30:43/2:11:51, loss=0.275112379282605, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=0.9819617870002593, lr=0.000707206906701752
2023-11-28 10:07:58   INFO  epoch: 23/24, acc_iter=152101, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:33/1:59:04, time_cost(all): 2 days, 0:31:40/2:17:25, loss=0.275004836922502, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.967902615090377, lr=0.000701350463876659
2023-11-28 10:08:56   INFO  epoch: 23/24, acc_iter=152151, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:30/1:51:27, time_cost(all): 2 days, 0:32:38/2:04:19, loss=0.2748972945624, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=3.571428285457597, lr=0.000695494021051566
2023-11-28 10:09:54   INFO  epoch: 23/24, acc_iter=152201, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:28/1:53:23, time_cost(all): 2 days, 0:33:36/2:13:27, loss=0.274789752202297, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.3628826065467368, lr=0.000689637578226472
2023-11-28 10:10:52   INFO  epoch: 23/24, acc_iter=152251, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:26/1:54:09, time_cost(all): 2 days, 0:34:34/2:14:25, loss=0.274682209842194, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=2.914334362535645, lr=0.000683781135401379
2023-11-28 10:11:49   INFO  epoch: 23/24, acc_iter=152301, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:24/1:48:34, time_cost(all): 2 days, 0:35:31/2:13:54, loss=0.274574667482092, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.0313875575739315, lr=0.000677924692576286
2023-11-28 10:12:47   INFO  epoch: 23/24, acc_iter=152351, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:21/1:53:18, time_cost(all): 2 days, 0:36:29/2:10:37, loss=0.274467125121989, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=1.2639821214958396, lr=0.000672068249751193
2023-11-28 10:13:45   INFO  epoch: 23/24, acc_iter=152401, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:19/1:51:49, time_cost(all): 2 days, 0:37:27/2:04:53, loss=0.274359582761886, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=4.8839856152743515, lr=0.0006662118069261
2023-11-28 10:14:43   INFO  epoch: 23/24, acc_iter=152451, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:17/1:44:16, time_cost(all): 2 days, 0:38:25/2:00:29, loss=0.274252040401784, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=0.7770648752105453, lr=0.000660355364101007
2023-11-28 10:15:40   INFO  epoch: 23/24, acc_iter=152501, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:15/1:42:38, time_cost(all): 2 days, 0:39:22/2:08:28, loss=0.274144498041681, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=3.722319202629386, lr=0.000654498921275913
2023-11-28 10:16:38   INFO  epoch: 23/24, acc_iter=152551, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:12/1:43:43, time_cost(all): 2 days, 0:40:20/2:04:02, loss=0.274036955681578, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.209358430693718, lr=0.00064864247845082
2023-11-28 10:17:36   INFO  epoch: 23/24, acc_iter=152601, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:10/1:48:59, time_cost(all): 2 days, 0:41:18/2:02:50, loss=0.273929413321476, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=1.4813437275230068, lr=0.000642786035625727
2023-11-28 10:18:34   INFO  epoch: 23/24, acc_iter=152651, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:08/1:48:36, time_cost(all): 2 days, 0:42:16/2:06:25, loss=0.273821870961373, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.913113333016181, lr=0.000636929592800634
2023-11-28 10:19:31   INFO  epoch: 23/24, acc_iter=152701, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:06/1:42:47, time_cost(all): 2 days, 0:43:13/1:56:12, loss=0.27371432860127, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.667193389362558, lr=0.000631073149975541
2023-11-28 10:20:29   INFO  epoch: 23/24, acc_iter=152751, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:03/1:38:55, time_cost(all): 2 days, 0:44:11/1:56:07, loss=0.273606786241168, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=1.9355809539996165, lr=0.000625216707150447
2023-11-28 10:21:27   INFO  epoch: 23/24, acc_iter=152801, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:01/1:39:25, time_cost(all): 2 days, 0:45:09/2:00:43, loss=0.273499243881065, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=4.1112692186579585, lr=0.000619360264325354
2023-11-28 10:22:25   INFO  epoch: 23/24, acc_iter=152851, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:25:59/1:40:31, time_cost(all): 2 days, 0:46:07/2:01:23, loss=0.273391701520962, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=3.235527256486907, lr=0.000613503821500261
2023-11-28 10:23:22   INFO  epoch: 23/24, acc_iter=152901, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:26:57/1:38:56, time_cost(all): 2 days, 0:47:04/1:53:22, loss=0.27328415916086, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=3.5214608601848805, lr=0.000607647378675168
2023-11-28 10:24:20   INFO  epoch: 23/24, acc_iter=152951, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:27:54/1:40:37, time_cost(all): 2 days, 0:48:02/1:55:13, loss=0.273176616800757, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.079973455417438, lr=0.000601790935850075
2023-11-28 10:25:18   INFO  epoch: 23/24, acc_iter=153001, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:28:52/1:35:25, time_cost(all): 2 days, 0:49:00/1:51:44, loss=0.273069074440654, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=1.1674554193963413, lr=0.000595934493024981
2023-11-28 10:26:16   INFO  epoch: 23/24, acc_iter=153051, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:29:50/1:37:57, time_cost(all): 2 days, 0:49:58/1:48:34, loss=0.272961532080552, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=3.831597819395761, lr=0.000590078050199888
2023-11-28 10:27:13   INFO  epoch: 23/24, acc_iter=153101, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:30:48/1:35:00, time_cost(all): 2 days, 0:50:55/1:54:56, loss=0.272853989720449, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=1.558240422001563, lr=0.000584221607374795
2023-11-28 10:28:11   INFO  epoch: 23/24, acc_iter=153151, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:31:45/1:37:28, time_cost(all): 2 days, 0:51:53/1:52:24, loss=0.272746447360346, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=0.7085046782361071, lr=0.000578365164549702
2023-11-28 10:29:09   INFO  epoch: 23/24, acc_iter=153201, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:32:43/1:36:02, time_cost(all): 2 days, 0:52:51/1:46:58, loss=0.272638905000243, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.0910831735085478, lr=0.000572508721724609
2023-11-28 10:30:07   INFO  epoch: 23/24, acc_iter=153251, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:33:41/1:34:08, time_cost(all): 2 days, 0:53:49/1:48:45, loss=0.272531362640141, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=0.8961126889931936, lr=0.000566652278899515
2023-11-28 10:31:04   INFO  epoch: 23/24, acc_iter=153301, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:34:39/1:29:12, time_cost(all): 2 days, 0:54:46/1:50:17, loss=0.272423820280038, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=2.658212840269479, lr=0.000560795836074422
2023-11-28 10:32:02   INFO  epoch: 23/24, acc_iter=153351, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:35:36/1:30:10, time_cost(all): 2 days, 0:55:44/1:48:00, loss=0.272316277919935, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=1.549444438085242, lr=0.000554939393249329
2023-11-28 10:33:00   INFO  epoch: 23/24, acc_iter=153401, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:36:34/1:26:50, time_cost(all): 2 days, 0:56:42/1:43:51, loss=0.272208735559833, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=4.225052657712907, lr=0.000549082950424236
2023-11-28 10:33:58   INFO  epoch: 23/24, acc_iter=153451, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:37:32/1:32:05, time_cost(all): 2 days, 0:57:40/1:40:38, loss=0.27210119319973, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.798172969950036, lr=0.000543226507599143
2023-11-28 10:34:55   INFO  epoch: 23/24, acc_iter=153501, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:38:30/1:31:23, time_cost(all): 2 days, 0:58:37/1:46:58, loss=0.271993650839627, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=1.5819347870314604, lr=0.000537370064774049
2023-11-28 10:35:53   INFO  epoch: 23/24, acc_iter=153551, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:39:27/1:25:43, time_cost(all): 2 days, 0:59:35/1:45:00, loss=0.271886108479525, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=4.202532342618586, lr=0.000531513621948956
2023-11-28 10:36:51   INFO  epoch: 23/24, acc_iter=153601, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:40:25/1:23:58, time_cost(all): 2 days, 1:00:33/1:41:00, loss=0.271778566119422, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=3.011996871021552, lr=0.000525657179123863
2023-11-28 10:37:49   INFO  epoch: 23/24, acc_iter=153651, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:41:23/1:23:28, time_cost(all): 2 days, 1:01:31/1:46:35, loss=0.271671023759319, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=4.080500468093604, lr=0.00051980073629877
2023-11-28 10:38:46   INFO  epoch: 23/24, acc_iter=153701, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:42:21/1:28:35, time_cost(all): 2 days, 1:02:28/1:45:16, loss=0.271563481399217, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=3.3918957163204566, lr=0.000513944293473677
2023-11-28 10:39:44   INFO  epoch: 23/24, acc_iter=153751, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:43:18/1:22:32, time_cost(all): 2 days, 1:03:26/1:37:34, loss=0.271455939039114, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=1.9669005111952373, lr=0.000508087850648584
2023-11-28 10:40:42   INFO  epoch: 23/24, acc_iter=153801, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:44:16/1:26:26, time_cost(all): 2 days, 1:04:24/1:34:03, loss=0.271348396679011, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=3.303302267765239, lr=0.00050223140782349
2023-11-28 10:41:40   INFO  epoch: 23/24, acc_iter=153851, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:45:14/1:25:26, time_cost(all): 2 days, 1:05:22/1:42:08, loss=0.271240854318909, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=3.60489738069052, lr=0.000496374964998397
2023-11-28 10:42:37   INFO  epoch: 23/24, acc_iter=153901, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:46:12/1:20:28, time_cost(all): 2 days, 1:06:19/1:41:39, loss=0.271133311958806, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=0.9146306081000388, lr=0.000490518522173304
2023-11-28 10:43:35   INFO  epoch: 23/24, acc_iter=153951, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:47:09/1:16:30, time_cost(all): 2 days, 1:07:17/1:32:20, loss=0.271025769598703, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.03(1.03), norm=1.6390293058721552, lr=0.000484662079348211
2023-11-28 10:44:33   INFO  epoch: 23/24, acc_iter=154001, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:48:07/1:16:27, time_cost(all): 2 days, 1:08:15/1:36:51, loss=0.270918227238601, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.849255975347452, lr=0.000478805636523118
2023-11-28 10:45:31   INFO  epoch: 23/24, acc_iter=154051, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:49:05/1:14:44, time_cost(all): 2 days, 1:09:13/1:35:15, loss=0.270810684878498, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=1.1128261375446793, lr=0.000472949193698024
2023-11-28 10:46:28   INFO  epoch: 23/24, acc_iter=154101, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:50:03/1:19:59, time_cost(all): 2 days, 1:10:10/1:33:11, loss=0.270703142518395, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=2.577607269405622, lr=0.000467092750872931
2023-11-28 10:47:26   INFO  epoch: 23/24, acc_iter=154151, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:51:00/1:18:02, time_cost(all): 2 days, 1:11:08/1:36:05, loss=0.270595600158293, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=4.661066403292739, lr=0.000461236308047838
2023-11-28 10:48:24   INFO  epoch: 23/24, acc_iter=154201, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:51:58/1:14:35, time_cost(all): 2 days, 1:12:06/1:33:59, loss=0.27048805779819, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=4.729202601919355, lr=0.000455379865222745
2023-11-28 10:49:22   INFO  epoch: 23/24, acc_iter=154251, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:52:56/1:14:07, time_cost(all): 2 days, 1:13:04/1:25:52, loss=0.270380515438087, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=4.243784416709037, lr=0.000449523422397652
2023-11-28 10:50:19   INFO  epoch: 23/24, acc_iter=154301, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:53:54/1:13:01, time_cost(all): 2 days, 1:14:01/1:25:08, loss=0.270272973077985, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=4.416482214086836, lr=0.000443666979572558
2023-11-28 10:51:17   INFO  epoch: 23/24, acc_iter=154351, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:54:51/1:10:28, time_cost(all): 2 days, 1:14:59/1:25:34, loss=0.270165430717882, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.1462721067346227, lr=0.000437810536747465
2023-11-28 10:52:15   INFO  epoch: 23/24, acc_iter=154401, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:55:49/1:08:31, time_cost(all): 2 days, 1:15:57/1:24:57, loss=0.270057888357779, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=2.435867177765121, lr=0.000431954093922372
2023-11-28 10:53:13   INFO  epoch: 23/24, acc_iter=154451, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:56:47/1:13:19, time_cost(all): 2 days, 1:16:55/1:24:21, loss=0.269950345997676, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=0.7801518074123126, lr=0.000426097651097279
2023-11-28 10:54:10   INFO  epoch: 23/24, acc_iter=154501, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:57:45/1:07:34, time_cost(all): 2 days, 1:17:52/1:28:10, loss=0.269842803637574, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=4.987723453029673, lr=0.000420241208272186
2023-11-28 10:55:08   INFO  epoch: 23/24, acc_iter=154551, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:58:42/1:10:22, time_cost(all): 2 days, 1:18:50/1:23:34, loss=0.269735261277471, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=4.937764117023448, lr=0.000414384765447092
2023-11-28 10:56:06   INFO  epoch: 23/24, acc_iter=154601, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 0:59:40/1:05:14, time_cost(all): 2 days, 1:19:48/1:19:55, loss=0.269627718917368, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=1.1565184143104839, lr=0.000408528322621999
2023-11-28 10:57:04   INFO  epoch: 23/24, acc_iter=154651, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:00:38/1:08:08, time_cost(all): 2 days, 1:20:46/1:26:14, loss=0.269520176557266, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.099563501572607, lr=0.000402671879796906
2023-11-28 10:58:01   INFO  epoch: 23/24, acc_iter=154701, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:01:36/1:07:02, time_cost(all): 2 days, 1:21:43/1:23:46, loss=0.269412634197163, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=0.7200253040260265, lr=0.000396815436971813
2023-11-28 10:58:59   INFO  epoch: 23/24, acc_iter=154751, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:02:33/1:02:15, time_cost(all): 2 days, 1:22:41/1:22:24, loss=0.26930509183706, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=2.131092749498241, lr=0.00039095899414672
2023-11-28 10:59:57   INFO  epoch: 23/24, acc_iter=154801, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:03:31/1:05:10, time_cost(all): 2 days, 1:23:39/1:22:52, loss=0.269197549476958, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.5820438202893525, lr=0.000385102551321626
2023-11-28 11:00:55   INFO  epoch: 23/24, acc_iter=154851, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:04:29/1:01:26, time_cost(all): 2 days, 1:24:37/1:19:57, loss=0.269090007116855, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.701157142094674, lr=0.000379246108496533
2023-11-28 11:01:52   INFO  epoch: 23/24, acc_iter=154901, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:05:27/1:00:43, time_cost(all): 2 days, 1:25:34/1:18:11, loss=0.268982464756752, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=3.5421299269889204, lr=0.00037338966567144
2023-11-28 11:02:50   INFO  epoch: 23/24, acc_iter=154951, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:06:24/0:58:41, time_cost(all): 2 days, 1:26:32/1:18:29, loss=0.26887492239665, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=3.011521197599146, lr=0.000367533222846347
2023-11-28 11:03:48   INFO  epoch: 23/24, acc_iter=155001, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:07:22/0:59:20, time_cost(all): 2 days, 1:27:30/1:19:00, loss=0.268767380036547, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.746237904337777, lr=0.000361676780021254
2023-11-28 11:04:46   INFO  epoch: 23/24, acc_iter=155051, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:08:20/1:01:14, time_cost(all): 2 days, 1:28:28/1:12:51, loss=0.268659837676444, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=2.346261832319607, lr=0.00035582033719616
2023-11-28 11:05:43   INFO  epoch: 23/24, acc_iter=155101, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:09:18/0:55:24, time_cost(all): 2 days, 1:29:25/1:10:46, loss=0.268552295316342, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=2.0031333057641216, lr=0.000349963894371067
2023-11-28 11:06:41   INFO  epoch: 23/24, acc_iter=155151, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:10:15/0:56:28, time_cost(all): 2 days, 1:30:23/1:13:52, loss=0.268444752956239, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.613338105896708, lr=0.000344107451545974
2023-11-28 11:07:39   INFO  epoch: 23/24, acc_iter=155201, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:11:13/0:56:11, time_cost(all): 2 days, 1:31:21/1:12:40, loss=0.268337210596136, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.897159255450989, lr=0.000338251008720881
2023-11-28 11:08:37   INFO  epoch: 23/24, acc_iter=155251, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:12:11/0:55:53, time_cost(all): 2 days, 1:32:19/1:12:38, loss=0.268229668236034, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=1.5733464298072803, lr=0.000332394565895788
2023-11-28 11:09:34   INFO  epoch: 23/24, acc_iter=155301, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:13:09/0:51:41, time_cost(all): 2 days, 1:33:16/1:11:04, loss=0.268122125875931, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=2.4646496107273705, lr=0.000326538123070695
2023-11-28 11:10:32   INFO  epoch: 23/24, acc_iter=155351, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:14:06/0:54:55, time_cost(all): 2 days, 1:34:14/1:11:22, loss=0.268014583515828, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=4.019519720839384, lr=0.000320681680245601
2023-11-28 11:11:30   INFO  epoch: 23/24, acc_iter=155401, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:15:04/0:52:06, time_cost(all): 2 days, 1:35:12/1:07:42, loss=0.267907041155726, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=3.5819225143259548, lr=0.000314825237420508
2023-11-28 11:12:28   INFO  epoch: 23/24, acc_iter=155451, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:16:02/0:51:09, time_cost(all): 2 days, 1:36:10/1:07:34, loss=0.267799498795623, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.9194813933010033, lr=0.000308968794595415
2023-11-28 11:13:25   INFO  epoch: 23/24, acc_iter=155501, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:17:00/0:52:10, time_cost(all): 2 days, 1:37:07/1:08:07, loss=0.26769195643552, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=4.9401263482397795, lr=0.000303112351770322
2023-11-28 11:14:23   INFO  epoch: 23/24, acc_iter=155551, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:17:57/0:51:11, time_cost(all): 2 days, 1:38:05/1:07:47, loss=0.267584414075418, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.02425383507304, lr=0.000297255908945229
2023-11-28 11:15:21   INFO  epoch: 23/24, acc_iter=155601, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:18:55/0:50:09, time_cost(all): 2 days, 1:39:03/1:05:47, loss=0.267476871715315, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=2.054481764376254, lr=0.000291399466120135
2023-11-28 11:16:19   INFO  epoch: 23/24, acc_iter=155651, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:19:53/0:48:31, time_cost(all): 2 days, 1:40:01/1:05:46, loss=0.267369329355212, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.5570499264303126, lr=0.000285543023295042
2023-11-28 11:17:16   INFO  epoch: 23/24, acc_iter=155701, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:20:51/0:43:59, time_cost(all): 2 days, 1:40:58/0:59:31, loss=0.267261786995109, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=2.81009950164013, lr=0.000279686580469949
2023-11-28 11:18:14   INFO  epoch: 23/24, acc_iter=155751, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:21:48/0:45:05, time_cost(all): 2 days, 1:41:56/1:01:14, loss=0.267154244635007, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=0.6233438906049888, lr=0.000273830137644856
2023-11-28 11:19:12   INFO  epoch: 23/24, acc_iter=155801, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:22:46/0:42:46, time_cost(all): 2 days, 1:42:54/0:59:35, loss=0.267046702274904, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.3799567438767344, lr=0.000267973694819763
2023-11-28 11:20:10   INFO  epoch: 23/24, acc_iter=155851, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:23:44/0:44:49, time_cost(all): 2 days, 1:43:52/1:02:11, loss=0.266939159914801, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.6200942005421117, lr=0.000262117251994669
2023-11-28 11:21:07   INFO  epoch: 23/24, acc_iter=155901, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:24:42/0:39:59, time_cost(all): 2 days, 1:44:49/0:56:56, loss=0.266831617554699, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.89(1.03), norm=4.693935387615444, lr=0.000256260809169576
2023-11-28 11:22:05   INFO  epoch: 23/24, acc_iter=155951, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:25:39/0:41:34, time_cost(all): 2 days, 1:45:47/0:55:23, loss=0.266724075194596, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.1485789400929, lr=0.000250404366344483
2023-11-28 11:23:03   INFO  epoch: 23/24, acc_iter=156001, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:26:37/0:40:27, time_cost(all): 2 days, 1:46:45/0:56:41, loss=0.266616532834493, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=2.7157730985715265, lr=0.00024454792351939
2023-11-28 11:24:01   INFO  epoch: 23/24, acc_iter=156051, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:27:35/0:39:25, time_cost(all): 2 days, 1:47:43/0:56:45, loss=0.266508990474391, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=1.7392166941724576, lr=0.000238691480694297
2023-11-28 11:24:58   INFO  epoch: 23/24, acc_iter=156101, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:28:33/0:39:14, time_cost(all): 2 days, 1:48:40/0:56:57, loss=0.266401448114288, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=2.5808594262795372, lr=0.000232835037869204
2023-11-28 11:25:56   INFO  epoch: 23/24, acc_iter=156151, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:29:30/0:38:30, time_cost(all): 2 days, 1:49:38/0:54:49, loss=0.266293905754185, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=1.8564890415883992, lr=0.00022697859504411
2023-11-28 11:26:54   INFO  epoch: 23/24, acc_iter=156201, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:30:28/0:37:32, time_cost(all): 2 days, 1:50:36/0:51:30, loss=0.266186363394083, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=1.2252091205689317, lr=0.000221122152219017
2023-11-28 11:27:52   INFO  epoch: 23/24, acc_iter=156251, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:31:26/0:35:27, time_cost(all): 2 days, 1:51:34/0:54:02, loss=0.26607882103398, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=0.7787614373909055, lr=0.000215265709393924
2023-11-28 11:28:49   INFO  epoch: 23/24, acc_iter=156301, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:32:24/0:33:24, time_cost(all): 2 days, 1:52:31/0:52:34, loss=0.265971278673877, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.34184609317764, lr=0.000209409266568831
2023-11-28 11:29:47   INFO  epoch: 23/24, acc_iter=156351, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:33:21/0:32:13, time_cost(all): 2 days, 1:53:29/0:47:24, loss=0.265863736313775, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=1.0898729328995191, lr=0.000203552823743738
2023-11-28 11:30:45   INFO  epoch: 23/24, acc_iter=156401, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:34:19/0:33:02, time_cost(all): 2 days, 1:54:27/0:49:34, loss=0.265756193953672, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=0.8538737610608924, lr=0.000197696380918644
2023-11-28 11:31:43   INFO  epoch: 23/24, acc_iter=156451, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:35:17/0:30:24, time_cost(all): 2 days, 1:55:25/0:46:02, loss=0.265648651593569, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=2.4402412242551446, lr=0.000191839938093551
2023-11-28 11:32:40   INFO  epoch: 23/24, acc_iter=156501, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:23, time_cost(all): 2 days, 1:56:22/0:46:10, loss=0.265541109233467, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=0.8403707085198928, lr=0.000185983495268458
2023-11-28 11:33:38   INFO  epoch: 23/24, acc_iter=156551, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:37:12/0:30:59, time_cost(all): 2 days, 1:57:20/0:45:10, loss=0.265433566873364, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=2.8318184110199964, lr=0.000180127052443365
2023-11-28 11:34:36   INFO  epoch: 23/24, acc_iter=156601, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:38:10/0:28:59, time_cost(all): 2 days, 1:58:18/0:43:05, loss=0.265326024513261, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=0.6992080171114518, lr=0.000174270609618271
2023-11-28 11:35:34   INFO  epoch: 23/24, acc_iter=156651, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:39:08/0:28:29, time_cost(all): 2 days, 1:59:16/0:43:24, loss=0.265218482153159, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.159915329339974, lr=0.000168414166793178
2023-11-28 11:36:32   INFO  epoch: 23/24, acc_iter=156701, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:40:06/0:27:34, time_cost(all): 2 days, 2:00:14/0:41:19, loss=0.265110939793056, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.93924674700078, lr=0.000162557723968085
2023-11-28 11:37:29   INFO  epoch: 23/24, acc_iter=156751, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:41:03/0:25:16, time_cost(all): 2 days, 2:01:11/0:43:53, loss=0.265003397432953, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.108868489786488, lr=0.000156701281142992
2023-11-28 11:38:27   INFO  epoch: 23/24, acc_iter=156801, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:42:01/0:23:48, time_cost(all): 2 days, 2:02:09/0:41:15, loss=0.264895855072851, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=4.265599413914992, lr=0.000150844838317899
2023-11-28 11:39:25   INFO  epoch: 23/24, acc_iter=156851, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:42:59/0:23:18, time_cost(all): 2 days, 2:03:07/0:39:53, loss=0.264788312712748, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=1.8152467885770138, lr=0.000144988395492806
2023-11-28 11:40:23   INFO  epoch: 23/24, acc_iter=156901, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:43:57/0:22:23, time_cost(all): 2 days, 2:04:05/0:39:24, loss=0.264680770352645, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=0.593537308418709, lr=0.000139131952667712
2023-11-28 11:41:20   INFO  epoch: 23/24, acc_iter=156951, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:44:55/0:21:22, time_cost(all): 2 days, 2:05:02/0:37:37, loss=0.264573227992543, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=0.7252874196003501, lr=0.000133275509842619
2023-11-28 11:42:18   INFO  epoch: 23/24, acc_iter=157001, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:45:52/0:20:53, time_cost(all): 2 days, 2:06:00/0:36:00, loss=0.26446568563244, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=1.9278331668680908, lr=0.000127419067017526
2023-11-28 11:43:16   INFO  epoch: 23/24, acc_iter=157051, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:46:50/0:20:27, time_cost(all): 2 days, 2:06:58/0:34:36, loss=0.264358143272337, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=4.591196490574561, lr=0.000121562624192433
2023-11-28 11:44:14   INFO  epoch: 23/24, acc_iter=157101, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:47:48/0:19:44, time_cost(all): 2 days, 2:07:56/0:35:34, loss=0.264250600912235, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=1.5003084611618736, lr=0.00011570618136734
2023-11-28 11:45:11   INFO  epoch: 23/24, acc_iter=157151, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:48:46/0:17:40, time_cost(all): 2 days, 2:08:53/0:35:16, loss=0.264143058552132, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=2.263517514296912, lr=0.000109849738542246
2023-11-28 11:46:09   INFO  epoch: 23/24, acc_iter=157201, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:49:43/0:17:07, time_cost(all): 2 days, 2:09:51/0:34:04, loss=0.264035516192029, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.4691955574648103, lr=0.000103993295717153
2023-11-28 11:47:07   INFO  epoch: 23/24, acc_iter=157251, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:50:41/0:15:51, time_cost(all): 2 days, 2:10:49/0:32:20, loss=0.263927973831927, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=1.0765603855922108, lr=9.813685289206e-05
2023-11-28 11:48:05   INFO  epoch: 23/24, acc_iter=157301, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:51:39/0:14:55, time_cost(all): 2 days, 2:11:47/0:30:39, loss=0.263820431471824, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.13(1.03), norm=4.739190106784057, lr=9.2280410066967e-05
2023-11-28 11:49:02   INFO  epoch: 23/24, acc_iter=157351, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:52:37/0:14:17, time_cost(all): 2 days, 2:12:44/0:30:32, loss=0.263712889111721, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.771509018429455, lr=8.6423967241874e-05
2023-11-28 11:50:00   INFO  epoch: 23/24, acc_iter=157401, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:53:34/0:13:46, time_cost(all): 2 days, 2:13:42/0:28:20, loss=0.263605346751618, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=4.290291398544152, lr=8.056752441678e-05
2023-11-28 11:50:58   INFO  epoch: 23/24, acc_iter=157451, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:54:32/0:12:21, time_cost(all): 2 days, 2:14:40/0:29:36, loss=0.263497804391516, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.376354248750722, lr=7.4711081591687e-05
2023-11-28 11:51:56   INFO  epoch: 23/24, acc_iter=157501, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:55:30/0:11:03, time_cost(all): 2 days, 2:15:38/0:28:13, loss=0.263390262031413, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.256928969767996, lr=6.8854638766594e-05
2023-11-28 11:52:53   INFO  epoch: 23/24, acc_iter=157551, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:56:28/0:09:53, time_cost(all): 2 days, 2:16:35/0:27:09, loss=0.26328271967131, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=1.7830445887425506, lr=6.2998195941501e-05
2023-11-28 11:53:51   INFO  epoch: 23/24, acc_iter=157601, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:57:25/0:09:29, time_cost(all): 2 days, 2:17:33/0:26:36, loss=0.263175177311208, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.16730792627253, lr=5.7141753116408e-05
2023-11-28 11:54:49   INFO  epoch: 23/24, acc_iter=157651, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 1:58:23/0:08:43, time_cost(all): 2 days, 2:18:31/0:25:47, loss=0.263067634951105, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=2.121431651090784, lr=5.1285310291315e-05
2023-11-28 11:55:47   INFO  epoch: 23/24, acc_iter=157701, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 1:59:21/0:07:12, time_cost(all): 2 days, 2:19:29/0:23:23, loss=0.262960092591002, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=3.671320733191861, lr=4.5428867466221e-05
2023-11-28 11:56:44   INFO  epoch: 23/24, acc_iter=157751, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:00:19/0:06:46, time_cost(all): 2 days, 2:20:26/0:22:22, loss=0.2628525502309, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=3.7775405726481464, lr=3.9572424641128e-05
2023-11-28 11:57:42   INFO  epoch: 23/24, acc_iter=157801, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:01:16/0:05:31, time_cost(all): 2 days, 2:21:24/0:20:51, loss=0.262745007870797, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=0.85643586944239, lr=3.3715981816035e-05
2023-11-28 11:58:40   INFO  epoch: 23/24, acc_iter=157851, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:02:14/0:04:42, time_cost(all): 2 days, 2:22:22/0:20:07, loss=0.262637465510694, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=2.0603189168702754, lr=2.7859538990942e-05
2023-11-28 11:59:38   INFO  epoch: 23/24, acc_iter=157901, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:03:12/0:03:26, time_cost(all): 2 days, 2:23:20/0:20:01, loss=0.262529923150592, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=1.6013291876465372, lr=2.2003096165849e-05
2023-11-28 12:00:35   INFO  epoch: 23/24, acc_iter=157951, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:04:10/0:02:32, time_cost(all): 2 days, 2:24:17/0:19:42, loss=0.262422380790489, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=0.5643603486838134, lr=1.6146653340755e-05
2023-11-28 12:01:33   INFO  epoch: 23/24, acc_iter=158001, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:05:07/0:01:39, time_cost(all): 2 days, 2:25:15/0:17:37, loss=0.262314838430386, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=3.6405207557868793, lr=1.0290210515662e-05
2023-11-28 12:02:31   INFO  epoch: 23/24, acc_iter=158051, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:06:05/0:00:44, time_cost(all): 2 days, 2:26:13/0:16:30, loss=0.262207296070284, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.4824297797625245, lr=4.433767690569e-06
2023-11-28 12:02:31   INFO  **********************End training cfgs/picture_model/picture_waymo_detection(detection)**********************



2023-11-28 12:02:31   INFO  **********************Start evaluation cfgs/picture_model/picture_waymo_detection(detection)**********************
2023-11-28 12:02:31   INFO  Loading Waymo dataset
2023-11-28 12:02:31   INFO  Total skipped info 0
2023-11-28 12:02:31   INFO  Total samples for Waymo dataset: 39987
2023-11-28 12:02:31   INFO  ==> Loading parameters from checkpoint xxxxxx to CPU
2023-11-28 12:02:31   INFO  ==> Checkpoint trained from version: pcdet+0.6.0+0000000
2023-11-28 12:02:31   INFO  ==> Done (loaded 448/448)
2023-11-28 12:02:31   INFO  *************** EPOCH 24 EVALUATION *****************
2023-11-28 12:12:42   INFO  *************** Performance of EPOCH 24 *****************
2023-11-28 12:12:42   INFO  Generate label finished(sec_per_example: 0.0151 second).
2023-11-28 12:12:42   INFO  recall_roi_0.3: 0.000000
2023-11-28 12:12:42   INFO  recall_rcnn_0.3: 0.847721
2023-11-28 12:12:42   INFO  recall_roi_0.5: 0.000000
2023-11-28 12:12:42   INFO  recall_rcnn_0.5: 0.804182
2023-11-28 12:12:42   INFO  recall_roi_0.7: 0.000000
2023-11-28 12:12:42   INFO  recall_rcnn_0.7: 0.585841
2023-11-28 12:12:42   INFO  Average predicted number of objects(39987 samples): 120.153
2023-11-28 12:30:18   INFO  
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/AP: 0.8055 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/APH: 0.8026 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/APL: 0.8055 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/AP: 0.7293 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/APH: 0.7245 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/APL: 0.7293 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/AP: 0.8567 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/APH: 0.7842 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/APL: 0.8567 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/AP: 0.7718 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/APH: 0.7166 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/APL: 0.7718 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/AP: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/APH: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/APL: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/AP: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/APH: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/APL: 0.0000 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/AP: 0.7785 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/APH: 0.7687 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/APL: 0.7785 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/AP: 0.7527 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/APH: 0.7396 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/APL: 0.7527 

2023-11-28 12:30:18   INFO  Result is saved to xxxxxxxxxxxxxxxxx
2023-11-28 12:30:18   INFO  ****************Evaluation done.*****************
2023-11-28 12:30:18   INFO  Epoch 24 has been evaluated
2023-11-28 12:30:18   INFO  **********************End evaluation cfgs/picture_model/picture_waymo_detection(detection)**********************
