2023-11-14 13:34:45   INFO  **********************Start logging**********************
2023-11-14 13:34:45   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-11-14 13:34:45   INFO  total_batch_size: 24
2023-11-14 13:34:45   INFO  cfg_file         cfgs/picture_models/picture_waymo_detection_0.2.yaml
2023-11-14 13:34:45   INFO  batch_size       3
2023-11-14 13:34:45   INFO  epochs           24
2023-11-14 13:34:45   INFO  workers          4
2023-11-14 13:34:45   INFO  extra_tag        detection
2023-11-14 13:34:45   INFO  ckpt             None
2023-11-14 13:34:45   INFO  pretrained_model waymo_pretrain_model_0.2.pth
2023-11-14 13:34:45   INFO  launcher         pytorch
2023-11-14 13:34:45   INFO  tcp_port         18888
2023-11-14 13:34:45   INFO  sync_bn          True
2023-11-14 13:34:45   INFO  fix_random_seed  False
2023-11-14 13:34:45   INFO  ckpt_save_interval 1
2023-11-14 13:34:45   INFO  local_rank       0
2023-11-14 13:34:45   INFO  max_ckpt_save_num 30
2023-11-14 13:34:45   INFO  merge_all_iters_to_one_epoch False
2023-11-14 13:34:45   INFO  set_cfgs         None
2023-11-14 13:34:45   INFO  max_waiting_mins 0
2023-11-14 13:34:45   INFO  start_epoch      0
2023-11-14 13:34:45   INFO  num_epochs_to_eval 0
2023-11-14 13:34:45   INFO  save_to_file     False
2023-11-14 13:34:45   INFO  use_tqdm_to_record False
2023-11-14 13:34:45   INFO  logger_iter_interval 50
2023-11-14 13:34:45   INFO  ckpt_save_time_interval 300
2023-11-14 13:34:45   INFO  wo_gpu_stat      False
2023-11-14 13:34:45   INFO  fp16             False
2023-11-14 13:34:45   INFO  cfg.LOCAL_RANK: 0
2023-11-14 13:34:45   INFO  cfg.CLASS_NAMES: ['Vehicle', 'Pedestrian', 'Cyclist']
2023-11-14 13:34:45   INFO  
cfg.DATA_CONFIG = edict()
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATASET: WaymoDataset
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/waymo
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.PROCESSED_DATA_TAG: waymo_processed_data_v0_5_0
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-74.88, -74.88, -2, 74.88, 74.88, 4.0]
2023-11-14 13:34:45   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-11-14 13:34:45   INFO  
cfg.DATA_CONFIG.SAMPLED_INTERVAL = edict()
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.train: 1
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.SAMPLED_INTERVAL.test: 1
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.FILTER_EMPTY_BOXES_FOR_TRAIN: True
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DISABLE_NLZ_FLAG_ON_POINTS: True
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.USE_SHARED_MEMORY: False
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.SHARED_MEMORY_FILE_LIMIT: 35000
2023-11-14 13:34:45   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.AUG_CONFIG_LIST: [{'NAME': 'gt_sampling', 'USE_ROAD_PLANE': False, 'DB_INFO_PATH': ['waymo_processed_data_v0_5_0_waymo_dbinfos_train_sampled_1.pkl'], 'USE_SHARED_MEMORY': True, 'DB_DATA_PATH': ['waymo_processed_data_v0_5_0_gt_database_train_sampled_1_global.npy'], 'BACKUP_DB_INFO': {'DB_INFO_PATH': 'waymo_processed_data_v0_5_0_waymo_dbinfos_train_sampled_1_multiframe_-4_to_0.pkl', 'DB_DATA_PATH': 'waymo_processed_data_v0_5_0_gt_database_train_sampled_1_multiframe_-4_to_0_global.npy', 'NUM_POINT_FEATURES': 6}, 'PREPARE': {'filter_by_min_points': ['Vehicle:5', 'Pedestrian:10', 'Cyclist:10'], 'filter_by_difficulty': [-1]}, 'SAMPLE_GROUPS': ['Vehicle:15', 'Pedestrian:10', 'Cyclist:10'], 'NUM_POINT_FEATURES': 5, 'REMOVE_EXTRA_WIDTH': [0.0, 0.0, 0.0], 'LIMIT_WHOLE_SCENE': True}, {'NAME': 'random_world_flip', 'ALONG_AXIS_LIST': ['x', 'y']}, {'NAME': 'random_world_rotation', 'WORLD_ROT_ANGLE': [-0.78539816, 0.78539816]}, {'NAME': 'random_world_scaling', 'WORLD_SCALE_RANGE': [0.95, 1.05]}, {'NAME': 'random_world_translation', 'NOISE_TRANSLATE_STD': [0.5, 0.5, 0.5]}]
2023-11-14 13:34:45   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'elongation']
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_range', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.32, 0.32, 0.1875]}]
2023-11-14 13:34:45   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/waymo_dataset.yaml
2023-11-14 13:34:45   INFO  
cfg.MODEL = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.NAME: CenterPoint
2023-11-14 13:34:45   INFO  
cfg.MODEL.VFE = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.VFE.NAME: DynPillarVFE3D
2023-11-14 13:34:45   INFO  cfg.MODEL.VFE.WITH_DISTANCE: False
2023-11-14 13:34:45   INFO  cfg.MODEL.VFE.USE_ABSLOTE_XYZ: True
2023-11-14 13:34:45   INFO  cfg.MODEL.VFE.USE_NORM: True
2023-11-14 13:34:45   INFO  cfg.MODEL.VFE.NUM_FILTERS: [192, 192]
2023-11-14 13:34:45   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVT
2023-11-14 13:34:45   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [ 468, 468, 32 ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: [ [ 1, 1, 4 ], [ 1, 1, 4 ], [ 1, 1, 2 ] ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [ 192, 192, 192, 192 ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [ [ 12, 12, 32 ], [ 12, 12, 8 ], [ 12, 12, 2 ], [ 12, 12, 1 ] ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [2, 2, 1]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [ [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ], [ [ 0, 0, 0 ], [ 6, 6, 0 ] ] ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-11-14 13:34:45   INFO  
cfg.MODEL.BACKBONE_3D.MASK_CONFIG = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.block_name: [ 'DSVTBlock','DSVTBlock','DSVTBlock','DSVTBlock' ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.set_info: [ [ 48, 1 ], [ 48, 1 ], [ 48, 1 ], [ 48, 1 ] ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.d_model: [ 192, 192, 192, 192 ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.nhead: [ 8, 8, 8, 8 ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [ 384, 384, 384, 384 ]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [468, 468]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 192
2023-11-14 13:34:45   INFO  
cfg.MODEL.MAP_TO_BEV = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.MAP_TO_BEV.NAME: PointPillarScatter3d
2023-11-14 13:34:45   INFO  cfg.MODEL.MAP_TO_BEV.INPUT_SHAPE: [468, 468, 1]
2023-11-14 13:34:45   INFO  cfg.MODEL.MAP_TO_BEV.NUM_BEV_FEATURES: 192
2023-11-14 13:34:45   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.NAME: BaseBEVResBackbone
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.LAYER_NUMS: [1, 2, 2]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.LAYER_STRIDES: [1, 2, 2]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.NUM_FILTERS: [128, 128, 256]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.UPSAMPLE_STRIDES: [1, 2, 4]
2023-11-14 13:34:45   INFO  cfg.MODEL.BACKBONE_2D.NUM_UPSAMPLE_FILTERS: [128, 128, 128]
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.NAME: CenterHead
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.CLASS_NAMES_EACH_HEAD: [['Vehicle', 'Pedestrian', 'Cyclist']]
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SHARED_CONV_CHANNEL: 64
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.USE_BIAS_BEFORE_NORM: False
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.NUM_HM_CONV: 2
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.BN_EPS: 0.001
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.BN_MOM: 0.01
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_ORDER: ['center', 'center_z', 'dim', 'rot']
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT = edict()
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center.out_channels: 2
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center.num_conv: 2
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z.out_channels: 1
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.center_z.num_conv: 2
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim.out_channels: 3
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.dim.num_conv: 2
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot.out_channels: 2
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.rot.num_conv: 2
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou.out_channels: 1
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.SEPARATE_HEAD_CFG.HEAD_DICT.iou.num_conv: 2
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.FEATURE_MAP_STRIDE: 1
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.NUM_MAX_OBJS: 500
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.GAUSSIAN_OVERLAP: 0.1
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.TARGET_ASSIGNER_CONFIG.MIN_RADIUS: 2
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.IOU_REG_LOSS: True
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.LOSS_CONFIG = edict()
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.cls_weight: 1.0
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.loc_weight: 2.0
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.LOSS_CONFIG.LOSS_WEIGHTS.code_weights: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0]
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.POST_PROCESSING = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.SCORE_THRESH: 0.1
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.POST_CENTER_LIMIT_RANGE: [-80, -80, -10.0, 80, 80, 10.0]
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.MAX_OBJ_PER_SAMPLE: 500
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.USE_IOU_TO_RECTIFY_SCORE: True
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.IOU_RECTIFIER: [0.68, 0.71, 0.65]
2023-11-14 13:34:45   INFO  
cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_TYPE: multi_class_nms
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_THRESH: [0.7, 0.6, 0.55]
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_PRE_MAXSIZE: [4096, 4096, 4096]
2023-11-14 13:34:45   INFO  cfg.MODEL.DENSE_HEAD.POST_PROCESSING.NMS_CONFIG.NMS_POST_MAXSIZE: [500, 500, 500]
2023-11-14 13:34:45   INFO  
cfg.MODEL.POST_PROCESSING = edict()
2023-11-14 13:34:45   INFO  cfg.MODEL.POST_PROCESSING.RECALL_THRESH_LIST: [0.3, 0.5, 0.7]
2023-11-14 13:34:45   INFO  cfg.MODEL.POST_PROCESSING.EVAL_METRIC: waymo
2023-11-14 13:34:45   INFO  
cfg.OPTIMIZATION = edict()
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 3
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 24
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.LR: 0.001
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.PCT_START: 0.1
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 100
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 10
2023-11-14 13:34:45   INFO  cfg.OPTIMIZATION.LOSS_SCALE_FP16: 32.0
2023-11-14 13:34:45   INFO  
cfg.HOOK = edict()
2023-11-14 13:34:45   INFO  
cfg.HOOK.DisableAugmentationHook = edict()
2023-11-14 13:34:45   INFO  cfg.HOOK.DisableAugmentationHook.DISABLE_AUG_LIST: ['gt_sampling', 'random_world_flip', 'random_world_rotation', 'random_world_scaling', 'random_world_translation']
2023-11-14 13:34:45   INFO  cfg.HOOK.DisableAugmentationHook.NUM_LAST_EPOCHS: 1
2023-11-14 13:34:45   INFO  cfg.TAG: picture_waymo_detection_0.2
2023-11-14 13:34:45   INFO  cfg.EXP_GROUP_PATH: cfgs/picture_model
2023-11-14 13:34:51   INFO  Loading GT database to shared memory
2023-11-14 13:34:51   INFO  GT database has been saved to shared memory
2023-11-14 13:34:51   INFO  Loading Waymo dataset
2023-11-14 13:34:55   INFO  Total skipped info 0
2023-11-14 13:34:55   INFO  Total samples for Waymo dataset: 158081
2023-11-14 13:34:55   INFO  DistributedDataParallel(
  (module): CenterPoint(
    (vfe): DynamicPillarVFE_3d(
      (pfn_layers): ModuleList(
        (0): PFNLayerV2(
          (linear): Linear(in_features=11, out_features=96, bias=False)
          (norm): SyncBatchNorm(96, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
        (1): PFNLayerV2(
          (linear): Linear(in_features=192, out_features=192, bias=False)
          (norm): SyncBatchNorm(192, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
      )
    )
    (backbone_3d): DSVTBackboneMAE(
    (input_layer): DSVTInputLayer(
      (posembed_layers): ModuleList(
        (0): ModuleList(
          (0): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (1): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (2): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
          (3): ModuleList(
            (0): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
            (1): PositionEmbeddingLearned(
              (position_embedding_head): Sequential(
                (0): Linear(in_features=2, out_features=192, bias=True)
                (1): BatchNorm1d(192, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                (2): ReLU(inplace=True)
                (3): Linear(in_features=192, out_features=192, bias=True)
              )
            )
          )
        )
      )
    )
    (stage_0): ModuleList(
      (0): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (1): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (2): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
      (3): DSVTBlock(
        (encoder_list): ModuleList(
          (0): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
          (1): DSVT_EncoderLayer(
            (win_attn): SetAttention(
              (self_attn): MultiheadAttention(
                (out_proj): NonDynamicallyQuantizableLinear(in_features=192, out_features=192, bias=True)
              )
              (linear1): Linear(in_features=192, out_features=384, bias=True)
              (dropout): Dropout(p=0, inplace=False)
              (linear2): Linear(in_features=384, out_features=192, bias=True)
              (norm1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (norm2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
              (dropout1): Identity()
              (dropout2): Identity()
            )
            (norm): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
          )
        )
      )
    )
    (residual_norm_stage_0): ModuleList(
      (0): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (1): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (2): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
      (3): LayerNorm((192,), eps=1e-05, elementwise_affine=True)
    )
  )
    (map_to_bev_module): PointPillarScatter3d()
    (pfe): None
    (backbone_2d): BaseBEVResBackbone(
      (blocks): ModuleList(
        (0): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(192, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(192, 128, kernel_size=(1, 1), stride=(1, 1), bias=False)
              (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (1): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(128, 128, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (2): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(128, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(128, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
      )
      (deblocks): ModuleList(
        (0): Sequential(
          (0): ConvTranspose2d(128, 128, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (1): Sequential(
          (0): ConvTranspose2d(128, 128, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (2): Sequential(
          (0): ConvTranspose2d(256, 128, kernel_size=(4, 4), stride=(4, 4), bias=False)
          (1): SyncBatchNorm(128, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
      )
    )
    (dense_head): CenterHead(
      (shared_conv): Sequential(
        (0): Conv2d(384, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
        (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
        (2): ReLU()
      )
      (heads_list): ModuleList(
        (0): SeparateHead(
          (center): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 2, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (center_z): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 1, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (dim): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (rot): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 2, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (iou): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 1, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
          (hm): Sequential(
            (0): Sequential(
              (0): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
              (1): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
              (2): ReLU()
            )
            (1): Conv2d(64, 3, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          )
        )
      )
      (hm_loss_func): FocalLossCenterNet()
      (reg_loss_func): RegLossCenterNet()
    )
    (point_head): None
    (roi_head): None
  )
)
2023-11-14 13:35:52   INFO  Total number of parameters: 9236651
2023-11-14 13:35:52   INFO  **********************Start training cfgs/picture_model/picture_waymo_detection_0.2(detection)**********************
2023-11-14 13:37:16   INFO  epoch: 0/24, acc_iter=50, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:05:27, time_cost(all): 0:00:58/2 days, 2:34:03, loss=3.104935538361489, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=1.2586236257523087, lr=0.001142325793229088
2023-11-14 13:38:15   INFO  epoch: 0/24, acc_iter=100, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:13:21, time_cost(all): 0:01:57/2 days, 2:06:53, loss=2.973096825816147, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=2.345392574451636, lr=0.001284651586458175
2023-11-14 13:39:14   INFO  epoch: 0/24, acc_iter=150, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:12:19, time_cost(all): 0:02:56/2 days, 5:35:41, loss=2.841258113270806, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=3.6224441774756517, lr=0.001426977379687263
2023-11-14 13:40:13   INFO  epoch: 0/24, acc_iter=200, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:08:56, time_cost(all): 0:03:55/2 days, 4:18:55, loss=2.709419400725464, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=3.2003538820679664, lr=0.00156930317291635
2023-11-14 13:41:12   INFO  epoch: 0/24, acc_iter=250, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:06:03, time_cost(all): 0:04:54/2 days, 5:58:02, loss=2.577580688180123, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=1.2204495051455437, lr=0.001711628966145438
2023-11-14 13:42:11   INFO  epoch: 0/24, acc_iter=300, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:09:22, time_cost(all): 0:05:53/2 days, 2:04:33, loss=2.445741975634781, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.935077127700883, lr=0.001853954759374526
2023-11-14 13:43:10   INFO  epoch: 0/24, acc_iter=350, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/1:59:08, time_cost(all): 0:06:52/2 days, 4:46:37, loss=2.31390326308944, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=3.911904013687293, lr=0.001996280552603613
2023-11-14 13:44:09   INFO  epoch: 0/24, acc_iter=400, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:58:42, time_cost(all): 0:07:51/2 days, 2:49:21, loss=2.182064550544098, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.5217171992771876, lr=0.002138606345832701
2023-11-14 13:45:08   INFO  epoch: 0/24, acc_iter=450, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:58:47, time_cost(all): 0:08:50/2 days, 1:12:48, loss=2.050225837998757, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.26526371229766, lr=0.002280932139061788
2023-11-14 13:46:07   INFO  epoch: 0/24, acc_iter=500, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:04:44, time_cost(all): 0:09:49/2 days, 1:21:15, loss=1.918387125453415, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=1.7517736155710573, lr=0.002423257932290876
2023-11-14 13:47:06   INFO  epoch: 0/24, acc_iter=550, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:01:38, time_cost(all): 0:10:48/2 days, 1:35:17, loss=1.786548412908074, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=3.5177816884284296, lr=0.002565583725519963
2023-11-14 13:48:05   INFO  epoch: 0/24, acc_iter=600, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:55:02, time_cost(all): 0:11:47/2 days, 1:20:16, loss=1.654709700362732, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.139892483293098, lr=0.002707909518749051
2023-11-14 13:49:04   INFO  epoch: 0/24, acc_iter=650, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:55:54, time_cost(all): 0:12:46/2 days, 5:10:32, loss=1.522870987817391, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=2.3375658205376673, lr=0.002850235311978139
2023-11-14 13:50:03   INFO  epoch: 0/24, acc_iter=700, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:59:15, time_cost(all): 0:13:45/2 days, 2:37:31, loss=1.391032275272049, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=1.0374356107788716, lr=0.002992561105207227
2023-11-14 13:51:01   INFO  epoch: 0/24, acc_iter=750, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:58:28, time_cost(all): 0:14:43/2 days, 4:04:59, loss=1.259193562726708, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.7491932302240403, lr=0.003134886898436314
2023-11-14 13:52:00   INFO  epoch: 0/24, acc_iter=800, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:56:11, time_cost(all): 0:15:42/2 days, 4:39:46, loss=1.127354850181366, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=3.7674019954402107, lr=0.003277212691665402
2023-11-14 13:52:59   INFO  epoch: 0/24, acc_iter=850, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:55:04, time_cost(all): 0:16:41/2 days, 5:35:29, loss=0.995516137636025, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=0.902154919580606, lr=0.003419538484894489
2023-11-14 13:53:58   INFO  epoch: 0/24, acc_iter=900, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:46:46, time_cost(all): 0:17:40/2 days, 4:28:00, loss=0.863677425090683, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=2.5201502193874106, lr=0.003561864278123577
2023-11-14 13:54:57   INFO  epoch: 0/24, acc_iter=950, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:51:23, time_cost(all): 0:18:39/2 days, 5:49:01, loss=0.731838712545342, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=3.601596903879295, lr=0.003704190071352665
2023-11-14 13:55:56   INFO  epoch: 0/24, acc_iter=1000, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:50:57, time_cost(all): 0:19:38/2 days, 2:27:39, loss=0.602479833704938, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=4.549417368106818, lr=0.003846515864581752
2023-11-14 13:56:55   INFO  epoch: 0/24, acc_iter=1050, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:47:53, time_cost(all): 0:20:37/2 days, 1:47:23, loss=0.599889057851823, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=0.7076754421805722, lr=0.00398884165781084
2023-11-14 13:57:54   INFO  epoch: 0/24, acc_iter=1100, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:48, time_cost(all): 0:21:36/2 days, 4:59:46, loss=0.599778115703646, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=3.6789525524293256, lr=0.004131167451039927
2023-11-14 13:58:53   INFO  epoch: 0/24, acc_iter=1150, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:44:07, time_cost(all): 0:22:35/2 days, 5:17:21, loss=0.59966717355547, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=3.3309335294148026, lr=0.004273493244269014
2023-11-14 13:59:52   INFO  epoch: 0/24, acc_iter=1200, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:41:44, time_cost(all): 0:23:34/2 days, 3:40:01, loss=0.599556231407293, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=4.626898325982464, lr=0.004415819037498102
2023-11-14 14:00:51   INFO  epoch: 0/24, acc_iter=1250, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:43:07, time_cost(all): 0:24:33/2 days, 2:21:30, loss=0.599445289259116, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=1.4542698115120243, lr=0.00455814483072719
2023-11-14 14:01:50   INFO  epoch: 0/24, acc_iter=1300, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:40:00, time_cost(all): 0:25:32/2 days, 2:55:29, loss=0.599334347110939, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=3.0626470084651474, lr=0.004700470623956277
2023-11-14 14:02:49   INFO  epoch: 0/24, acc_iter=1350, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:43:12, time_cost(all): 0:26:31/2 days, 2:34:59, loss=0.599223404962763, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=3.20718530818562, lr=0.004842796417185365
2023-11-14 14:03:48   INFO  epoch: 0/24, acc_iter=1400, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:41:46, time_cost(all): 0:27:30/2 days, 1:41:35, loss=0.599112462814586, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.097069265202906, lr=0.004985122210414453
2023-11-14 14:04:46   INFO  epoch: 0/24, acc_iter=1450, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:42:23, time_cost(all): 0:28:28/2 days, 4:08:51, loss=0.599001520666409, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=2.939403205394585, lr=0.005127448003643541
2023-11-14 14:05:45   INFO  epoch: 0/24, acc_iter=1500, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:40:52, time_cost(all): 0:29:27/2 days, 5:44:14, loss=0.598890578518232, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=1.1776872222469401, lr=0.005269773796872628
2023-11-14 14:06:44   INFO  epoch: 0/24, acc_iter=1550, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:36:24, time_cost(all): 0:30:26/2 days, 0:41:24, loss=0.598779636370056, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.5336440164121055, lr=0.005412099590101716
2023-11-14 14:07:43   INFO  epoch: 0/24, acc_iter=1600, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:37:46, time_cost(all): 0:31:25/2 days, 1:30:42, loss=0.598668694221879, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=4.8576736686818505, lr=0.005554425383330804
2023-11-14 14:08:42   INFO  epoch: 0/24, acc_iter=1650, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:45, time_cost(all): 0:32:24/2 days, 1:42:11, loss=0.598557752073702, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.736684801908348, lr=0.005696751176559891
2023-11-14 14:09:41   INFO  epoch: 0/24, acc_iter=1700, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:38:34, time_cost(all): 0:33:23/2 days, 5:22:29, loss=0.598446809925525, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=2.7731873188229827, lr=0.005839076969788979
2023-11-14 14:10:40   INFO  epoch: 0/24, acc_iter=1750, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:36:53, time_cost(all): 0:34:22/2 days, 1:07:26, loss=0.598335867777349, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.228818439952755, lr=0.005981402763018067
2023-11-14 14:11:39   INFO  epoch: 0/24, acc_iter=1800, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:38:11, time_cost(all): 0:35:21/2 days, 1:45:57, loss=0.598224925629172, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.790716444015827, lr=0.006123728556247154
2023-11-14 14:12:38   INFO  epoch: 0/24, acc_iter=1850, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:35:30, time_cost(all): 0:36:20/2 days, 1:18:35, loss=0.598113983480995, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=3.531240844643934, lr=0.006266054349476241
2023-11-14 14:13:37   INFO  epoch: 0/24, acc_iter=1900, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:29:17, time_cost(all): 0:37:19/2 days, 0:38:21, loss=0.598003041332818, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.187078000445373, lr=0.006408380142705329
2023-11-14 14:14:36   INFO  epoch: 0/24, acc_iter=1950, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:27:22, time_cost(all): 0:38:18/2 days, 3:09:57, loss=0.597892099184642, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=1.2320051655929238, lr=0.006550705935934416
2023-11-14 14:15:35   INFO  epoch: 0/24, acc_iter=2000, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:30:02, time_cost(all): 0:39:17/2 days, 2:59:30, loss=0.597781157036465, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=3.4096305643773124, lr=0.006693031729163504
2023-11-14 14:16:34   INFO  epoch: 0/24, acc_iter=2050, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:28:26, time_cost(all): 0:40:16/2 days, 5:31:52, loss=0.597670214888288, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=3.0171278360643843, lr=0.006835357522392592
2023-11-14 14:17:33   INFO  epoch: 0/24, acc_iter=2100, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:25:32, time_cost(all): 0:41:15/2 days, 4:38:55, loss=0.597559272740111, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=0.6708411321753885, lr=0.006977683315621679
2023-11-14 14:18:31   INFO  epoch: 0/24, acc_iter=2150, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:24:11, time_cost(all): 0:42:13/2 days, 3:18:56, loss=0.597448330591935, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=3.984303821302449, lr=0.007120009108850767
2023-11-14 14:19:30   INFO  epoch: 0/24, acc_iter=2200, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:28:21, time_cost(all): 0:43:12/2 days, 3:53:10, loss=0.597337388443758, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=1.6740750239495765, lr=0.007262334902079854
2023-11-14 14:20:29   INFO  epoch: 0/24, acc_iter=2250, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:28:03, time_cost(all): 0:44:11/2 days, 0:31:49, loss=0.597226446295581, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.664703315458704, lr=0.007404660695308942
2023-11-14 14:21:28   INFO  epoch: 0/24, acc_iter=2300, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:28:17, time_cost(all): 0:45:10/2 days, 3:35:49, loss=0.597115504147404, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=1.493300475552848, lr=0.00754698648853803
2023-11-14 14:22:27   INFO  epoch: 0/24, acc_iter=2350, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:25:36, time_cost(all): 0:46:09/2 days, 2:51:00, loss=0.597004561999228, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=2.8382259336819406, lr=0.007689312281767117
2023-11-14 14:23:26   INFO  epoch: 0/24, acc_iter=2400, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:21:44, time_cost(all): 0:47:08/2 days, 4:02:49, loss=0.596893619851051, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=2.6100969329374677, lr=0.007831638074996206
2023-11-14 14:24:25   INFO  epoch: 0/24, acc_iter=2450, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:22:17, time_cost(all): 0:48:07/2 days, 1:17:09, loss=0.596782677702874, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=3.1368801559861272, lr=0.007973963868225293
2023-11-14 14:25:24   INFO  epoch: 0/24, acc_iter=2500, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:07, time_cost(all): 0:49:06/2 days, 2:02:44, loss=0.596671735554697, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=2.589960970488426, lr=0.00811628966145438
2023-11-14 14:26:23   INFO  epoch: 0/24, acc_iter=2550, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:32, time_cost(all): 0:50:05/2 days, 1:31:09, loss=0.596560793406521, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=1.6480306229741573, lr=0.008258615454683468
2023-11-14 14:27:22   INFO  epoch: 0/24, acc_iter=2600, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:38, time_cost(all): 0:51:04/2 days, 3:21:29, loss=0.596449851258344, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.43218613274432, lr=0.008400941247912555
2023-11-14 14:28:21   INFO  epoch: 0/24, acc_iter=2650, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:16:13, time_cost(all): 0:52:03/2 days, 2:17:26, loss=0.596338909110167, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=1.3296366428343491, lr=0.008543267041141642
2023-11-14 14:29:20   INFO  epoch: 0/24, acc_iter=2700, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:15:54, time_cost(all): 0:53:02/2 days, 1:37:09, loss=0.59622796696199, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=4.376663357349404, lr=0.008685592834370731
2023-11-14 14:30:19   INFO  epoch: 0/24, acc_iter=2750, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:18:05, time_cost(all): 0:54:01/2 days, 1:55:49, loss=0.596117024813814, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.662331130826567, lr=0.008827918627599816
2023-11-14 14:31:18   INFO  epoch: 0/24, acc_iter=2800, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:16:31, time_cost(all): 0:55:00/2 days, 3:54:44, loss=0.596006082665637, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.2397633217100392, lr=0.008970244420828905
2023-11-14 14:32:16   INFO  epoch: 0/24, acc_iter=2850, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:13:13, time_cost(all): 0:55:58/2 days, 1:54:35, loss=0.59589514051746, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.92(1.03), norm=2.067672018823277, lr=0.009112570214057994
2023-11-14 14:33:15   INFO  epoch: 0/24, acc_iter=2900, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:13:20, time_cost(all): 0:56:57/2 days, 4:56:01, loss=0.595784198369283, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=3.6563588885939593, lr=0.009254896007287083
2023-11-14 14:34:14   INFO  epoch: 0/24, acc_iter=2950, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:08:13, time_cost(all): 0:57:56/2 days, 3:45:29, loss=0.595673256221107, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.9529943719700533, lr=0.009397221800516168
2023-11-14 14:35:13   INFO  epoch: 0/24, acc_iter=3000, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:09:54, time_cost(all): 0:58:55/2 days, 0:45:30, loss=0.59556231407293, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.2662782337753393, lr=0.009539547593745257
2023-11-14 14:36:12   INFO  epoch: 0/24, acc_iter=3050, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:43, time_cost(all): 0:59:54/2 days, 4:15:48, loss=0.595451371924753, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=2.993874323406328, lr=0.009681873386974345
2023-11-14 14:37:11   INFO  epoch: 0/24, acc_iter=3100, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:10:16, time_cost(all): 1:00:53/2 days, 3:43:16, loss=0.595340429776576, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=4.5050103877736785, lr=0.00982419918020343
2023-11-14 14:38:10   INFO  epoch: 0/24, acc_iter=3150, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:08:31, time_cost(all): 1:01:52/2 days, 3:47:23, loss=0.5952294876284, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=2.1557362126601047, lr=0.00996652497343252
2023-11-14 14:39:09   INFO  epoch: 0/24, acc_iter=3200, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:04:01, time_cost(all): 1:02:51/2 days, 4:27:33, loss=0.595118545480223, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=3.730342475204625, lr=0.010272126916654014
2023-11-14 14:40:08   INFO  epoch: 0/24, acc_iter=3250, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:59, time_cost(all): 1:03:50/2 days, 4:49:41, loss=0.595007603332046, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.8877552728095832, lr=0.010627941399726733
2023-11-14 14:41:07   INFO  epoch: 0/24, acc_iter=3300, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:07:28, time_cost(all): 1:04:49/2 days, 1:19:53, loss=0.594896661183869, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.9738123870551503, lr=0.010983755882799453
2023-11-14 14:42:06   INFO  epoch: 0/24, acc_iter=3350, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:35, time_cost(all): 1:05:48/2 days, 3:14:15, loss=0.594785719035693, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=2.3861737260594187, lr=0.011339570365872171
2023-11-14 14:43:05   INFO  epoch: 0/24, acc_iter=3400, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/0:59:35, time_cost(all): 1:06:47/2 days, 3:28:17, loss=0.594674776887516, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=3.43206430842439, lr=0.01169538484894489
2023-11-14 14:44:04   INFO  epoch: 0/24, acc_iter=3450, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:59, time_cost(all): 1:07:46/2 days, 1:10:45, loss=0.594563834739339, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=4.279572345416701, lr=0.01205119933201761
2023-11-14 14:45:03   INFO  epoch: 0/24, acc_iter=3500, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:58:26, time_cost(all): 1:08:45/2 days, 1:32:30, loss=0.594452892591162, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.688770905337641, lr=0.012407013815090328
2023-11-14 14:46:01   INFO  epoch: 0/24, acc_iter=3550, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:58, time_cost(all): 1:09:43/2 days, 1:39:35, loss=0.594341950442986, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=2.231142902931552, lr=0.012762828298163047
2023-11-14 14:47:00   INFO  epoch: 0/24, acc_iter=3600, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:56:56, time_cost(all): 1:10:42/2 days, 2:46:19, loss=0.594231008294809, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=2.4435906089053914, lr=0.013118642781235767
2023-11-14 14:47:59   INFO  epoch: 0/24, acc_iter=3650, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:57:31, time_cost(all): 1:11:41/2 days, 3:58:20, loss=0.594120066146632, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=2.0462700971822905, lr=0.013474457264308485
2023-11-14 14:48:58   INFO  epoch: 0/24, acc_iter=3700, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:30, time_cost(all): 1:12:40/2 days, 4:05:04, loss=0.594009123998455, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=2.9628298800259234, lr=0.013830271747381204
2023-11-14 14:49:57   INFO  epoch: 0/24, acc_iter=3750, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:54:36, time_cost(all): 1:13:39/2 days, 2:39:27, loss=0.593898181850279, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=3.6690987907992096, lr=0.014186086230453924
2023-11-14 14:50:56   INFO  epoch: 0/24, acc_iter=3800, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:54:25, time_cost(all): 1:14:38/2 days, 3:42:47, loss=0.593787239702102, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.3728143699123736, lr=0.014541900713526642
2023-11-14 14:51:55   INFO  epoch: 0/24, acc_iter=3850, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:56:23, time_cost(all): 1:15:37/2 days, 4:44:18, loss=0.593676297553925, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=1.9110221984838187, lr=0.01489771519659936
2023-11-14 14:52:54   INFO  epoch: 0/24, acc_iter=3900, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:53:12, time_cost(all): 1:16:36/2 days, 3:00:16, loss=0.593565355405748, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=3.4043947675058006, lr=0.01525352967967208
2023-11-14 14:53:53   INFO  epoch: 0/24, acc_iter=3950, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:14, time_cost(all): 1:17:35/2 days, 1:38:15, loss=0.593454413257572, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=1.9277688278593836, lr=0.0156093441627448
2023-11-14 14:54:52   INFO  epoch: 0/24, acc_iter=4000, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:52:34, time_cost(all): 1:18:34/2 days, 0:10:57, loss=0.593343471109395, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.3442759146399306, lr=0.015965158645817518
2023-11-14 14:55:51   INFO  epoch: 0/24, acc_iter=4050, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:29, time_cost(all): 1:19:33/2 days, 2:53:21, loss=0.593232528961218, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=3.8569716692045413, lr=0.016320973128890238
2023-11-14 14:56:50   INFO  epoch: 0/24, acc_iter=4100, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:49:08, time_cost(all): 1:20:32/2 days, 4:38:27, loss=0.593121586813041, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=0.5239411125314424, lr=0.016676787611962958
2023-11-14 14:57:49   INFO  epoch: 0/24, acc_iter=4150, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:49:44, time_cost(all): 1:21:31/2 days, 2:15:29, loss=0.593010644664865, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.8569088308317623, lr=0.017032602095035675
2023-11-14 14:58:48   INFO  epoch: 0/24, acc_iter=4200, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:25, time_cost(all): 1:22:30/2 days, 4:45:42, loss=0.592899702516688, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=0.6544662084890795, lr=0.017388416578108395
2023-11-14 14:59:46   INFO  epoch: 0/24, acc_iter=4250, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:09, time_cost(all): 1:23:28/2 days, 0:41:32, loss=0.592788760368511, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=4.30706067374521, lr=0.017744231061181115
2023-11-14 15:00:45   INFO  epoch: 0/24, acc_iter=4300, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:31, time_cost(all): 1:24:27/2 days, 2:18:04, loss=0.592677818220334, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=3.012625818286763, lr=0.01810004554425383
2023-11-14 15:01:44   INFO  epoch: 0/24, acc_iter=4350, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:14, time_cost(all): 1:25:26/2 days, 0:32:48, loss=0.592566876072158, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=3.615166415736949, lr=0.018455860027326552
2023-11-14 15:02:43   INFO  epoch: 0/24, acc_iter=4400, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:12, time_cost(all): 1:26:25/2 days, 0:48:29, loss=0.592455933923981, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.3891914942271195, lr=0.018811674510399272
2023-11-14 15:03:42   INFO  epoch: 0/24, acc_iter=4450, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:42:38, time_cost(all): 1:27:24/2 days, 4:30:39, loss=0.592344991775804, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=3.246236634399199, lr=0.01916748899347199
2023-11-14 15:04:41   INFO  epoch: 0/24, acc_iter=4500, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:42:10, time_cost(all): 1:28:23/2 days, 1:01:50, loss=0.592234049627627, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.8623448974551007, lr=0.01952330347654471
2023-11-14 15:05:40   INFO  epoch: 0/24, acc_iter=4550, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:50, time_cost(all): 1:29:22/2 days, 0:16:30, loss=0.592123107479451, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=3.33832934359992, lr=0.01987911795961743
2023-11-14 15:06:39   INFO  epoch: 0/24, acc_iter=4600, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:21, time_cost(all): 1:30:21/2 days, 3:58:01, loss=0.592012165331274, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=2.559642936971003, lr=0.020234932442690146
2023-11-14 15:07:38   INFO  epoch: 0/24, acc_iter=4650, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:25, time_cost(all): 1:31:20/2 days, 4:39:02, loss=0.591901223183097, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=3.0669814756890537, lr=0.020590746925762866
2023-11-14 15:08:37   INFO  epoch: 0/24, acc_iter=4700, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:28, time_cost(all): 1:32:19/2 days, 1:20:29, loss=0.59179028103492, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=2.0840965398005853, lr=0.020946561408835586
2023-11-14 15:09:36   INFO  epoch: 0/24, acc_iter=4750, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:44, time_cost(all): 1:33:18/2 days, 0:22:21, loss=0.591679338886744, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=2.2704613993400926, lr=0.021302375891908303
2023-11-14 15:10:35   INFO  epoch: 0/24, acc_iter=4800, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:33:27, time_cost(all): 1:34:17/2 days, 3:20:38, loss=0.591568396738567, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=3.7071732831552175, lr=0.021658190374981026
2023-11-14 15:11:34   INFO  epoch: 0/24, acc_iter=4850, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:35:15, time_cost(all): 1:35:16/2 days, 0:11:57, loss=0.59145745459039, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=2.635609772599768, lr=0.022014004858053743
2023-11-14 15:12:33   INFO  epoch: 0/24, acc_iter=4900, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:39, time_cost(all): 1:36:15/2 days, 1:13:22, loss=0.591346512442213, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=3.969734325241197, lr=0.02236981934112646
2023-11-14 15:13:31   INFO  epoch: 0/24, acc_iter=4950, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:05, time_cost(all): 1:37:13/2 days, 3:50:50, loss=0.591235570294037, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=3.5052568889422333, lr=0.02272563382419918
2023-11-14 15:14:30   INFO  epoch: 0/24, acc_iter=5000, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:56, time_cost(all): 1:38:12/2 days, 2:38:18, loss=0.59112462814586, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.1626747276722005, lr=0.0230814483072719
2023-11-14 15:15:29   INFO  epoch: 0/24, acc_iter=5050, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:01, time_cost(all): 1:39:11/2 days, 4:29:47, loss=0.591013685997683, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.88(1.03), norm=3.718995959633845, lr=0.023437262790344617
2023-11-14 15:16:28   INFO  epoch: 0/24, acc_iter=5100, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:02, time_cost(all): 1:40:10/2 days, 1:06:01, loss=0.590902743849506, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.400812815041887, lr=0.023793077273417337
2023-11-14 15:17:27   INFO  epoch: 0/24, acc_iter=5150, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:25, time_cost(all): 1:41:09/2 days, 0:57:15, loss=0.59079180170133, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.266557563643046, lr=0.024148891756490057
2023-11-14 15:18:26   INFO  epoch: 0/24, acc_iter=5200, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:23, time_cost(all): 1:42:08/2 days, 2:13:39, loss=0.590680859553153, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=4.590716845450897, lr=0.024504706239562773
2023-11-14 15:19:25   INFO  epoch: 0/24, acc_iter=5250, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:27, time_cost(all): 1:43:07/2 days, 4:14:58, loss=0.590569917404976, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=1.7409403410219997, lr=0.024860520722635494
2023-11-14 15:20:24   INFO  epoch: 0/24, acc_iter=5300, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:56, time_cost(all): 1:44:06/2 days, 3:29:06, loss=0.590458975256799, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=2.7635450848916356, lr=0.025216335205708214
2023-11-14 15:21:23   INFO  epoch: 0/24, acc_iter=5350, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:03, time_cost(all): 1:45:05/2 days, 2:02:05, loss=0.590348033108623, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=2.0356025458397617, lr=0.02557214968878093
2023-11-14 15:22:22   INFO  epoch: 0/24, acc_iter=5400, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:12, time_cost(all): 1:46:04/2 days, 1:24:57, loss=0.590237090960446, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=3.6363336315665022, lr=0.025927964171853654
2023-11-14 15:23:21   INFO  epoch: 0/24, acc_iter=5450, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:29, time_cost(all): 1:47:03/2 days, 0:31:22, loss=0.590126148812269, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=4.892738609916884, lr=0.02628377865492637
2023-11-14 15:24:20   INFO  epoch: 0/24, acc_iter=5500, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:35, time_cost(all): 1:48:02/2 days, 1:57:08, loss=0.590015206664092, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.697506038250242, lr=0.026639593137999087
2023-11-14 15:25:19   INFO  epoch: 0/24, acc_iter=5550, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:15, time_cost(all): 1:49:01/1 day, 23:35:25, loss=0.589904264515916, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=3.91396798327959, lr=0.02699540762107181
2023-11-14 15:26:18   INFO  epoch: 0/24, acc_iter=5600, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:24, time_cost(all): 1:50:00/2 days, 3:57:42, loss=0.589793322367739, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=2.3454263549312993, lr=0.027351222104144528
2023-11-14 15:27:16   INFO  epoch: 0/24, acc_iter=5650, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:19:00, time_cost(all): 1:50:58/2 days, 3:50:55, loss=0.589682380219562, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.402072829876762, lr=0.027707036587217244
2023-11-14 15:28:15   INFO  epoch: 0/24, acc_iter=5700, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:35, time_cost(all): 1:51:57/2 days, 1:35:14, loss=0.589571438071385, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=1.6723626030875336, lr=0.02806285107028996
2023-11-14 15:29:14   INFO  epoch: 0/24, acc_iter=5750, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:41, time_cost(all): 1:52:56/1 day, 23:44:38, loss=0.589460495923209, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=4.956511792492809, lr=0.028418665553362685
2023-11-14 15:30:13   INFO  epoch: 0/24, acc_iter=5800, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:58, time_cost(all): 1:53:55/2 days, 1:42:35, loss=0.589349553775032, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=1.1108553865065158, lr=0.0287744800364354
2023-11-14 15:31:12   INFO  epoch: 0/24, acc_iter=5850, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:44, time_cost(all): 1:54:54/2 days, 3:00:05, loss=0.589238611626855, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=0.8877047765369739, lr=0.029130294519508118
2023-11-14 15:32:11   INFO  epoch: 0/24, acc_iter=5900, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:57, time_cost(all): 1:55:53/2 days, 3:23:55, loss=0.589127669478678, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.4820110907950204, lr=0.02948610900258084
2023-11-14 15:33:10   INFO  epoch: 0/24, acc_iter=5950, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:13:01, time_cost(all): 1:56:52/2 days, 1:33:07, loss=0.589016727330502, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.8715189632735656, lr=0.02984192348565356
2023-11-14 15:34:09   INFO  epoch: 0/24, acc_iter=6000, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:06, time_cost(all): 1:57:51/1 day, 23:30:05, loss=0.588905785182325, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=4.244239332647666, lr=0.030197737968726275
2023-11-14 15:35:08   INFO  epoch: 0/24, acc_iter=6050, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:04, time_cost(all): 1:58:50/2 days, 3:50:51, loss=0.588794843034148, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.3484605150120574, lr=0.030553552451799
2023-11-14 15:36:07   INFO  epoch: 0/24, acc_iter=6100, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:59, time_cost(all): 1:59:49/1 day, 23:30:06, loss=0.588683900885971, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.432503164823574, lr=0.030909366934871715
2023-11-14 15:37:06   INFO  epoch: 0/24, acc_iter=6150, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:51, time_cost(all): 2:00:48/2 days, 2:03:23, loss=0.588572958737795, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.4903068220823583, lr=0.03126518141794443
2023-11-14 15:38:05   INFO  epoch: 0/24, acc_iter=6200, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:45, time_cost(all): 2:01:47/2 days, 4:01:48, loss=0.588462016589618, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.6757575815875265, lr=0.031620995901017156
2023-11-14 15:39:04   INFO  epoch: 0/24, acc_iter=6250, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:17, time_cost(all): 2:02:46/2 days, 3:35:22, loss=0.588351074441441, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.6481883899343677, lr=0.03197681038408987
2023-11-14 15:40:03   INFO  epoch: 0/24, acc_iter=6300, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:25, time_cost(all): 2:03:45/2 days, 0:29:02, loss=0.588240132293264, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=4.2767002057617685, lr=0.03233262486716259
2023-11-14 15:41:01   INFO  epoch: 0/24, acc_iter=6350, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:28, time_cost(all): 2:04:43/2 days, 2:42:52, loss=0.588129190145088, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=3.155365086177352, lr=0.03268843935023531
2023-11-14 15:42:00   INFO  epoch: 0/24, acc_iter=6400, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:40, time_cost(all): 2:05:42/2 days, 1:01:45, loss=0.588018247996911, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=3.7391675362926002, lr=0.03304425383330803
2023-11-14 15:42:59   INFO  epoch: 0/24, acc_iter=6450, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:39, time_cost(all): 2:06:41/2 days, 2:55:37, loss=0.587907305848734, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=0.5832708417126256, lr=0.03340006831638075
2023-11-14 15:43:58   INFO  epoch: 0/24, acc_iter=6500, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 2:07:40/2 days, 1:07:29, loss=0.587796363700557, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=3.048271985197299, lr=0.03375588279945347
2023-11-14 15:44:57   INFO  epoch: 0/24, acc_iter=6550, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 2:08:39/2 days, 1:23:17, loss=0.587685421552381, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=2.4117150278197905, lr=0.034111697282526186
2023-11-14 15:45:56   INFO  epoch: 1/24, acc_iter=6637, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:11:55, time_cost(all): 2:09:38/2 days, 1:43:50, loss=0.587492382214553, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=3.5277481834233133, lr=0.03473081448307272
2023-11-14 15:46:55   INFO  epoch: 1/24, acc_iter=6687, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:01:12, time_cost(all): 2:10:37/2 days, 3:08:34, loss=0.587381440066376, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=1.1815182520366578, lr=0.035086628966145436
2023-11-14 15:47:54   INFO  epoch: 1/24, acc_iter=6737, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:06:46, time_cost(all): 2:11:36/2 days, 3:39:16, loss=0.5872704979182, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.587326282660257, lr=0.03544244344921816
2023-11-14 15:48:53   INFO  epoch: 1/24, acc_iter=6787, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:00:49, time_cost(all): 2:12:35/2 days, 2:57:38, loss=0.587159555770023, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=2.8604969907572038, lr=0.035798257932290876
2023-11-14 15:49:52   INFO  epoch: 1/24, acc_iter=6837, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:05:38, time_cost(all): 2:13:34/2 days, 1:13:43, loss=0.587048613621846, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=4.219369476910469, lr=0.03615407241536359
2023-11-14 15:50:51   INFO  epoch: 1/24, acc_iter=6887, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:32, time_cost(all): 2:14:33/2 days, 1:39:06, loss=0.586937671473669, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=3.032966707962559, lr=0.03650988689843632
2023-11-14 15:51:50   INFO  epoch: 1/24, acc_iter=6937, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:04:47, time_cost(all): 2:15:32/2 days, 3:27:42, loss=0.586826729325493, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=2.4915639508654888, lr=0.03686570138150903
2023-11-14 15:52:49   INFO  epoch: 1/24, acc_iter=6987, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:58:34, time_cost(all): 2:16:31/2 days, 0:13:27, loss=0.586715787177316, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=2.7159482783041904, lr=0.03722151586458175
2023-11-14 15:53:48   INFO  epoch: 1/24, acc_iter=7037, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:04:52, time_cost(all): 2:17:30/2 days, 1:34:03, loss=0.586604845029139, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.5967438052346146, lr=0.037577330347654474
2023-11-14 15:54:46   INFO  epoch: 1/24, acc_iter=7087, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:05:01, time_cost(all): 2:18:28/2 days, 0:20:43, loss=0.586493902880962, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=3.5066340017170594, lr=0.03793314483072719
2023-11-14 15:55:45   INFO  epoch: 1/24, acc_iter=7137, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:04:03, time_cost(all): 2:19:27/1 day, 23:05:20, loss=0.586382960732786, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=2.935306094176169, lr=0.03828895931379991
2023-11-14 15:56:44   INFO  epoch: 1/24, acc_iter=7187, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:06, time_cost(all): 2:20:26/2 days, 2:16:24, loss=0.586272018584609, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=3.234931537359976, lr=0.03864477379687263
2023-11-14 15:57:43   INFO  epoch: 1/24, acc_iter=7237, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/2:01:26, time_cost(all): 2:21:25/2 days, 1:33:16, loss=0.586161076436432, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=0.7256868405688066, lr=0.03900058827994535
2023-11-14 15:58:42   INFO  epoch: 1/24, acc_iter=7287, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:50:02, time_cost(all): 2:22:24/2 days, 0:28:03, loss=0.586050134288255, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.280132320486835, lr=0.039356402763018064
2023-11-14 15:59:41   INFO  epoch: 1/24, acc_iter=7337, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:50:37, time_cost(all): 2:23:23/2 days, 3:36:07, loss=0.585939192140079, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.97(1.03), norm=3.72292067468111, lr=0.03971221724609079
2023-11-14 16:00:40   INFO  epoch: 1/24, acc_iter=7387, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:52:55, time_cost(all): 2:24:22/2 days, 0:24:09, loss=0.585828249991902, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=1.2098956432806827, lr=0.040068031729163504
2023-11-14 16:01:39   INFO  epoch: 1/24, acc_iter=7437, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:56:50, time_cost(all): 2:25:21/2 days, 3:23:09, loss=0.585717307843725, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=3.526249739826929, lr=0.04042384621223622
2023-11-14 16:02:38   INFO  epoch: 1/24, acc_iter=7487, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:50:47, time_cost(all): 2:26:20/2 days, 3:38:35, loss=0.585606365695548, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=1.5930225961355744, lr=0.040779660695308945
2023-11-14 16:03:37   INFO  epoch: 1/24, acc_iter=7537, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:48:10, time_cost(all): 2:27:19/2 days, 1:52:34, loss=0.585495423547371, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=4.959840752105265, lr=0.04113547517838166
2023-11-14 16:04:36   INFO  epoch: 1/24, acc_iter=7587, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:54:55, time_cost(all): 2:28:18/1 day, 23:33:10, loss=0.585384481399195, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=2.885216544869983, lr=0.041491289661454385
2023-11-14 16:05:35   INFO  epoch: 1/24, acc_iter=7637, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:49:58, time_cost(all): 2:29:17/2 days, 1:26:27, loss=0.585273539251018, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.708731445129996, lr=0.0418471041445271
2023-11-14 16:06:34   INFO  epoch: 1/24, acc_iter=7687, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:00, time_cost(all): 2:30:16/2 days, 1:13:33, loss=0.585162597102841, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=0.6016195592362388, lr=0.04220291862759982
2023-11-14 16:07:33   INFO  epoch: 1/24, acc_iter=7737, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:51:44, time_cost(all): 2:31:15/2 days, 2:36:03, loss=0.585051654954664, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=1.422896091099186, lr=0.04255873311067254
2023-11-14 16:08:31   INFO  epoch: 1/24, acc_iter=7787, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:53, time_cost(all): 2:32:13/2 days, 2:25:56, loss=0.584940712806488, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=3.0324183502345465, lr=0.04291454759374526
2023-11-14 16:09:30   INFO  epoch: 1/24, acc_iter=7837, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:46:56, time_cost(all): 2:33:12/2 days, 0:00:27, loss=0.584829770658311, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.062364483033368, lr=0.043270362076817975
2023-11-14 16:10:29   INFO  epoch: 1/24, acc_iter=7887, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:39:24, time_cost(all): 2:34:11/1 day, 23:32:13, loss=0.584718828510134, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=3.957630397541162, lr=0.0436261765598907
2023-11-14 16:11:28   INFO  epoch: 1/24, acc_iter=7937, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:45:46, time_cost(all): 2:35:10/2 days, 3:35:22, loss=0.584607886361957, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.709339871665876, lr=0.043981991042963416
2023-11-14 16:12:27   INFO  epoch: 1/24, acc_iter=7987, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:44:57, time_cost(all): 2:36:09/2 days, 1:30:07, loss=0.584496944213781, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=4.8776807320195825, lr=0.04433780552603613
2023-11-14 16:13:26   INFO  epoch: 1/24, acc_iter=8037, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:36:48, time_cost(all): 2:37:08/1 day, 23:52:34, loss=0.584386002065604, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=3.6441038355141404, lr=0.044693620009108856
2023-11-14 16:14:25   INFO  epoch: 1/24, acc_iter=8087, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:42:46, time_cost(all): 2:38:07/2 days, 3:16:47, loss=0.584275059917427, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.3016734410540027, lr=0.04504943449218157
2023-11-14 16:15:24   INFO  epoch: 1/24, acc_iter=8137, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:35:24, time_cost(all): 2:39:06/1 day, 23:34:06, loss=0.58416411776925, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=2.298009225747987, lr=0.04540524897525429
2023-11-14 16:16:23   INFO  epoch: 1/24, acc_iter=8187, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:39:45, time_cost(all): 2:40:05/2 days, 1:06:34, loss=0.584053175621074, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=0.5663408068671081, lr=0.045761063458327006
2023-11-14 16:17:22   INFO  epoch: 1/24, acc_iter=8237, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:36, time_cost(all): 2:41:04/1 day, 23:04:05, loss=0.583942233472897, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.414646440992586, lr=0.04611687794139973
2023-11-14 16:18:21   INFO  epoch: 1/24, acc_iter=8287, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:40:08, time_cost(all): 2:42:03/1 day, 22:40:36, loss=0.58383129132472, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.189451292485387, lr=0.046472692424472446
2023-11-14 16:19:20   INFO  epoch: 1/24, acc_iter=8337, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:34:11, time_cost(all): 2:43:02/2 days, 2:06:32, loss=0.583720349176543, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=2.91550945381102, lr=0.04682850690754516
2023-11-14 16:20:19   INFO  epoch: 1/24, acc_iter=8387, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:35:03, time_cost(all): 2:44:01/1 day, 23:29:16, loss=0.583609407028367, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=3.6060784186075554, lr=0.04718432139061789
2023-11-14 16:21:18   INFO  epoch: 1/24, acc_iter=8437, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:37:08, time_cost(all): 2:45:00/2 days, 0:15:32, loss=0.58349846488019, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=1.7513856314945073, lr=0.0475401358736906
2023-11-14 16:22:16   INFO  epoch: 1/24, acc_iter=8487, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:29:10, time_cost(all): 2:45:58/2 days, 2:13:00, loss=0.583387522732013, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=4.957095830775016, lr=0.04789595035676332
2023-11-14 16:23:15   INFO  epoch: 1/24, acc_iter=8537, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:35:07, time_cost(all): 2:46:57/2 days, 3:10:04, loss=0.583276580583836, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=3.207756906523523, lr=0.048251764839836044
2023-11-14 16:24:14   INFO  epoch: 1/24, acc_iter=8587, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:29:22, time_cost(all): 2:47:56/2 days, 2:44:33, loss=0.58316563843566, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=2.1529980044302213, lr=0.04860757932290876
2023-11-14 16:25:13   INFO  epoch: 1/24, acc_iter=8637, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:30:36, time_cost(all): 2:48:55/2 days, 1:06:53, loss=0.583054696287483, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.840171345000325, lr=0.04896339380598148
2023-11-14 16:26:12   INFO  epoch: 1/24, acc_iter=8687, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:28:22, time_cost(all): 2:49:54/2 days, 1:32:19, loss=0.582943754139306, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=2.8023558811441034, lr=0.0493192082890542
2023-11-14 16:27:11   INFO  epoch: 1/24, acc_iter=8737, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:29:22, time_cost(all): 2:50:53/2 days, 2:41:55, loss=0.582832811991129, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=4.515294840443733, lr=0.04967502277212692
2023-11-14 16:28:10   INFO  epoch: 1/24, acc_iter=8787, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:25:23, time_cost(all): 2:51:52/2 days, 1:32:12, loss=0.582721869842953, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=0.5025470094270135, lr=0.050030837255199634
2023-11-14 16:29:09   INFO  epoch: 1/24, acc_iter=8837, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:21:25, time_cost(all): 2:52:51/2 days, 0:55:36, loss=0.582610927694776, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=0.8121047834626043, lr=0.05038665173827236
2023-11-14 16:30:08   INFO  epoch: 1/24, acc_iter=8887, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:25:20, time_cost(all): 2:53:50/2 days, 1:33:34, loss=0.582499985546599, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=0.712933162114897, lr=0.050742466221345074
2023-11-14 16:31:07   INFO  epoch: 1/24, acc_iter=8937, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:23:09, time_cost(all): 2:54:49/2 days, 3:09:27, loss=0.582389043398422, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=3.353850613509795, lr=0.05109828070441779
2023-11-14 16:32:06   INFO  epoch: 1/24, acc_iter=8987, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:25:05, time_cost(all): 2:55:48/2 days, 2:49:33, loss=0.582278101250246, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.0916747566893292, lr=0.051454095187490514
2023-11-14 16:33:05   INFO  epoch: 1/24, acc_iter=9037, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:35, time_cost(all): 2:56:47/1 day, 23:55:14, loss=0.582167159102069, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=3.295969247329047, lr=0.05180990967056323
2023-11-14 16:34:04   INFO  epoch: 1/24, acc_iter=9087, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:21:27, time_cost(all): 2:57:46/1 day, 23:16:26, loss=0.582056216953892, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=0.6741842777623148, lr=0.05216572415363595
2023-11-14 16:35:03   INFO  epoch: 1/24, acc_iter=9137, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:19:36, time_cost(all): 2:58:45/1 day, 23:05:52, loss=0.581945274805715, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=2.121996622387865, lr=0.05252153863670867
2023-11-14 16:36:01   INFO  epoch: 1/24, acc_iter=9187, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:24, time_cost(all): 2:59:43/1 day, 22:42:55, loss=0.581834332657539, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.503020228382073, lr=0.05287735311978139
2023-11-14 16:37:00   INFO  epoch: 1/24, acc_iter=9237, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:08, time_cost(all): 3:00:42/1 day, 23:00:04, loss=0.581723390509362, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.8528113861788955, lr=0.053233167602854105
2023-11-14 16:37:59   INFO  epoch: 1/24, acc_iter=9287, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:27, time_cost(all): 3:01:41/2 days, 1:32:26, loss=0.581612448361185, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=3.6877881876149536, lr=0.05358898208592683
2023-11-14 16:38:58   INFO  epoch: 1/24, acc_iter=9337, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:16:15, time_cost(all): 3:02:40/2 days, 1:00:25, loss=0.581501506213008, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=1.3608372542738185, lr=0.053944796568999545
2023-11-14 16:39:57   INFO  epoch: 1/24, acc_iter=9387, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:16, time_cost(all): 3:03:39/1 day, 23:11:03, loss=0.581390564064832, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=1.5840743096215417, lr=0.05430061105207226
2023-11-14 16:40:56   INFO  epoch: 1/24, acc_iter=9437, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:13:49, time_cost(all): 3:04:38/2 days, 2:15:53, loss=0.581279621916655, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.915832537457683, lr=0.054656425535144985
2023-11-14 16:41:55   INFO  epoch: 1/24, acc_iter=9487, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:14:12, time_cost(all): 3:05:37/2 days, 1:36:09, loss=0.581168679768478, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=4.476360495955269, lr=0.05501224001821771
2023-11-14 16:42:54   INFO  epoch: 1/24, acc_iter=9537, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:14:34, time_cost(all): 3:06:36/1 day, 23:25:08, loss=0.581057737620301, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=0.5456541124875728, lr=0.05536805450129042
2023-11-14 16:43:53   INFO  epoch: 1/24, acc_iter=9587, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:33, time_cost(all): 3:07:35/1 day, 22:40:05, loss=0.580946795472125, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=1.4555220680601475, lr=0.05572386898436314
2023-11-14 16:44:52   INFO  epoch: 1/24, acc_iter=9637, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:10:17, time_cost(all): 3:08:34/1 day, 23:01:37, loss=0.580835853323948, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=3.114910116499007, lr=0.05607968346743586
2023-11-14 16:45:51   INFO  epoch: 1/24, acc_iter=9687, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:10:00, time_cost(all): 3:09:33/2 days, 2:31:35, loss=0.580724911175771, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=2.776720025258733, lr=0.05643549795050858
2023-11-14 16:46:50   INFO  epoch: 1/24, acc_iter=9737, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:08:48, time_cost(all): 3:10:32/1 day, 23:49:24, loss=0.580613969027594, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=4.874309379474873, lr=0.0567913124335813
2023-11-14 16:47:49   INFO  epoch: 1/24, acc_iter=9787, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:04:17, time_cost(all): 3:11:31/2 days, 2:09:25, loss=0.580503026879418, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=3.7431952202269714, lr=0.05714712691665402
2023-11-14 16:48:48   INFO  epoch: 1/24, acc_iter=9837, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:04:28, time_cost(all): 3:12:30/1 day, 23:08:46, loss=0.580392084731241, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=4.172164491309206, lr=0.05750294139972673
2023-11-14 16:49:46   INFO  epoch: 1/24, acc_iter=9887, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:06:51, time_cost(all): 3:13:28/1 day, 23:40:36, loss=0.580281142583064, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.9935311796409851, lr=0.057858755882799456
2023-11-14 16:50:45   INFO  epoch: 1/24, acc_iter=9937, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:48, time_cost(all): 3:14:27/2 days, 0:16:38, loss=0.580170200434887, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.9967618961426938, lr=0.05821457036587217
2023-11-14 16:51:44   INFO  epoch: 1/24, acc_iter=9987, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:00:00, time_cost(all): 3:15:26/2 days, 2:30:48, loss=0.580059258286711, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=1.3353570350684907, lr=0.0585703848489449
2023-11-14 16:52:43   INFO  epoch: 1/24, acc_iter=10037, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/0:59:17, time_cost(all): 3:16:25/2 days, 0:08:30, loss=0.579948316138534, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=2.393970301011672, lr=0.05892619933201761
2023-11-14 16:53:42   INFO  epoch: 1/24, acc_iter=10087, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:02:35, time_cost(all): 3:17:24/2 days, 2:21:49, loss=0.579837373990357, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=1.7666309284618216, lr=0.05928201381509034
2023-11-14 16:54:41   INFO  epoch: 1/24, acc_iter=10137, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:15, time_cost(all): 3:18:23/2 days, 0:35:46, loss=0.57972643184218, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.723383974025825, lr=0.05963782829816305
2023-11-14 16:55:40   INFO  epoch: 1/24, acc_iter=10187, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:56:59, time_cost(all): 3:19:22/1 day, 23:03:32, loss=0.579615489694004, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=0.8919468939456445, lr=0.05999364278123577
2023-11-14 16:56:39   INFO  epoch: 1/24, acc_iter=10237, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:15, time_cost(all): 3:20:21/1 day, 22:57:57, loss=0.579504547545827, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=3.0327331252546244, lr=0.06034945726430849
2023-11-14 16:57:38   INFO  epoch: 1/24, acc_iter=10287, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:57, time_cost(all): 3:21:20/2 days, 1:50:24, loss=0.57939360539765, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.7247710133013834, lr=0.06070527174738121
2023-11-14 16:58:37   INFO  epoch: 1/24, acc_iter=10337, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:57:44, time_cost(all): 3:22:19/2 days, 1:26:10, loss=0.579282663249473, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.5394624203147265, lr=0.06106108623045393
2023-11-14 16:59:36   INFO  epoch: 1/24, acc_iter=10387, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:57:24, time_cost(all): 3:23:18/2 days, 1:46:18, loss=0.579171721101297, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=4.908170469133446, lr=0.06141690071352665
2023-11-14 17:00:35   INFO  epoch: 1/24, acc_iter=10437, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:39, time_cost(all): 3:24:17/1 day, 23:43:48, loss=0.57906077895312, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.0729687046066583, lr=0.06177271519659936
2023-11-14 17:01:34   INFO  epoch: 1/24, acc_iter=10487, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:21, time_cost(all): 3:25:16/2 days, 0:53:21, loss=0.578949836804943, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.029366468923463, lr=0.062128529679672084
2023-11-14 17:02:33   INFO  epoch: 1/24, acc_iter=10537, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:50:58, time_cost(all): 3:26:15/1 day, 22:55:56, loss=0.578838894656766, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=4.289893978082946, lr=0.0624843441627448
2023-11-14 17:03:31   INFO  epoch: 1/24, acc_iter=10587, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:50, time_cost(all): 3:27:13/2 days, 1:53:03, loss=0.57872795250859, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.9967598306623846, lr=0.06284015864581752
2023-11-14 17:04:30   INFO  epoch: 1/24, acc_iter=10637, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:52:16, time_cost(all): 3:28:12/2 days, 1:54:21, loss=0.578617010360413, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=4.747909027094879, lr=0.06319597312889023
2023-11-14 17:05:29   INFO  epoch: 1/24, acc_iter=10687, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:25, time_cost(all): 3:29:11/1 day, 22:43:19, loss=0.578506068212236, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=0.6083027077898429, lr=0.06355178761196296
2023-11-14 17:06:28   INFO  epoch: 1/24, acc_iter=10737, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:58, time_cost(all): 3:30:10/2 days, 1:40:26, loss=0.578395126064059, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=2.6524786916840375, lr=0.06390760209503567
2023-11-14 17:07:27   INFO  epoch: 1/24, acc_iter=10787, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:44:34, time_cost(all): 3:31:09/1 day, 22:48:45, loss=0.578284183915883, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.4952978801015173, lr=0.0642634165781084
2023-11-14 17:08:26   INFO  epoch: 1/24, acc_iter=10837, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:43:51, time_cost(all): 3:32:08/2 days, 0:34:44, loss=0.578173241767706, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=2.656644233451041, lr=0.06461923106118111
2023-11-14 17:09:25   INFO  epoch: 1/24, acc_iter=10887, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:44:18, time_cost(all): 3:33:07/1 day, 22:56:07, loss=0.578062299619529, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=2.6804400683345095, lr=0.06497504554425383
2023-11-14 17:10:24   INFO  epoch: 1/24, acc_iter=10937, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:34, time_cost(all): 3:34:06/1 day, 21:55:05, loss=0.577951357471352, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=1.5677403954534606, lr=0.06533086002732655
2023-11-14 17:11:23   INFO  epoch: 1/24, acc_iter=10987, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:05, time_cost(all): 3:35:05/1 day, 22:16:02, loss=0.577840415323176, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=4.268607933395733, lr=0.06568667451039926
2023-11-14 17:12:22   INFO  epoch: 1/24, acc_iter=11037, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:44:00, time_cost(all): 3:36:04/2 days, 2:16:48, loss=0.577729473174999, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=0.8823394697247007, lr=0.06604248899347198
2023-11-14 17:13:21   INFO  epoch: 1/24, acc_iter=11087, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:42:49, time_cost(all): 3:37:03/2 days, 0:39:36, loss=0.577618531026822, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=1.2815687590341087, lr=0.06639830347654471
2023-11-14 17:14:20   INFO  epoch: 1/24, acc_iter=11137, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:59, time_cost(all): 3:38:02/1 day, 23:50:03, loss=0.577507588878645, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.383362004336393, lr=0.06675411795961743
2023-11-14 17:15:19   INFO  epoch: 1/24, acc_iter=11187, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:37:53, time_cost(all): 3:39:01/1 day, 22:35:59, loss=0.577396646730469, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=2.9024084297593946, lr=0.06710993244269015
2023-11-14 17:16:18   INFO  epoch: 1/24, acc_iter=11237, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:07, time_cost(all): 3:40:00/2 days, 0:21:51, loss=0.577285704582292, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=0.9457507531051763, lr=0.06746574692576286
2023-11-14 17:17:16   INFO  epoch: 1/24, acc_iter=11287, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:55, time_cost(all): 3:40:58/2 days, 2:04:52, loss=0.577174762434115, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=3.871071770656102, lr=0.06782156140883558
2023-11-14 17:18:15   INFO  epoch: 1/24, acc_iter=11337, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:48, time_cost(all): 3:41:57/2 days, 0:02:57, loss=0.577063820285938, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=1.288338628082151, lr=0.0681773758919083
2023-11-14 17:19:14   INFO  epoch: 1/24, acc_iter=11387, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:51, time_cost(all): 3:42:56/1 day, 21:59:15, loss=0.576952878137762, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=3.2013308296228415, lr=0.06853319037498103
2023-11-14 17:20:13   INFO  epoch: 1/24, acc_iter=11437, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:14, time_cost(all): 3:43:55/1 day, 22:57:09, loss=0.576841935989585, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=4.731686521834937, lr=0.06888900485805374
2023-11-14 17:21:12   INFO  epoch: 1/24, acc_iter=11487, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:48, time_cost(all): 3:44:54/2 days, 1:39:21, loss=0.576730993841408, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.173612476510532, lr=0.06924481934112646
2023-11-14 17:22:11   INFO  epoch: 1/24, acc_iter=11537, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:27, time_cost(all): 3:45:53/1 day, 23:11:37, loss=0.576620051693231, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=2.1015865195211045, lr=0.06960063382419918
2023-11-14 17:23:10   INFO  epoch: 1/24, acc_iter=11587, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:30:36, time_cost(all): 3:46:52/2 days, 0:04:42, loss=0.576509109545055, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=2.618246137013658, lr=0.06995644830727189
2023-11-14 17:24:09   INFO  epoch: 1/24, acc_iter=11637, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:22, time_cost(all): 3:47:51/1 day, 22:03:42, loss=0.576398167396878, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=1.4487540975219086, lr=0.07031226279034462
2023-11-14 17:25:08   INFO  epoch: 1/24, acc_iter=11687, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:30:05, time_cost(all): 3:48:50/1 day, 21:45:55, loss=0.576287225248701, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=4.5116961966419105, lr=0.07066807727341734
2023-11-14 17:26:07   INFO  epoch: 1/24, acc_iter=11737, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:44, time_cost(all): 3:49:49/2 days, 1:32:15, loss=0.576176283100524, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=4.561365308040807, lr=0.07102389175649006
2023-11-14 17:27:06   INFO  epoch: 1/24, acc_iter=11787, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:28:21, time_cost(all): 3:50:48/1 day, 23:15:00, loss=0.576065340952348, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=3.489966885628699, lr=0.07137970623956277
2023-11-14 17:28:05   INFO  epoch: 1/24, acc_iter=11837, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:36, time_cost(all): 3:51:47/2 days, 0:28:29, loss=0.575954398804171, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=3.8596831167106855, lr=0.0717355207226355
2023-11-14 17:29:04   INFO  epoch: 1/24, acc_iter=11887, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:32, time_cost(all): 3:52:46/1 day, 21:46:04, loss=0.575843456655994, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.187108666333871, lr=0.0720913352057082
2023-11-14 17:30:03   INFO  epoch: 1/24, acc_iter=11937, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:34, time_cost(all): 3:53:45/1 day, 22:28:07, loss=0.575732514507817, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=1.0662297206528935, lr=0.07244714968878094
2023-11-14 17:31:02   INFO  epoch: 1/24, acc_iter=11987, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:53, time_cost(all): 3:54:44/2 days, 0:16:01, loss=0.575621572359641, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=2.585672578043509, lr=0.07280296417185364
2023-11-14 17:32:00   INFO  epoch: 1/24, acc_iter=12037, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:18, time_cost(all): 3:55:42/1 day, 23:46:47, loss=0.575510630211464, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=1.9142030854024161, lr=0.07315877865492637
2023-11-14 17:32:59   INFO  epoch: 1/24, acc_iter=12087, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:25, time_cost(all): 3:56:41/1 day, 21:38:21, loss=0.575399688063287, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=3.2515675931616514, lr=0.07351459313799909
2023-11-14 17:33:58   INFO  epoch: 1/24, acc_iter=12137, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:18, time_cost(all): 3:57:40/1 day, 21:56:23, loss=0.57528874591511, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=1.3967865114007143, lr=0.0738704076210718
2023-11-14 17:34:57   INFO  epoch: 1/24, acc_iter=12187, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:53, time_cost(all): 3:58:39/2 days, 1:09:40, loss=0.575177803766934, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=0.9628440730918624, lr=0.07422622210414452
2023-11-14 17:35:56   INFO  epoch: 1/24, acc_iter=12237, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:41, time_cost(all): 3:59:38/2 days, 1:30:42, loss=0.575066861618757, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=1.9613952315481662, lr=0.07458203658721725
2023-11-14 17:36:55   INFO  epoch: 1/24, acc_iter=12287, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:55, time_cost(all): 4:00:37/1 day, 22:49:36, loss=0.57495591947058, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=2.774599506673768, lr=0.07493785107028995
2023-11-14 17:37:54   INFO  epoch: 1/24, acc_iter=12337, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:17:14, time_cost(all): 4:01:36/2 days, 0:41:44, loss=0.574844977322403, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=1.3654997518217922, lr=0.07529366555336268
2023-11-14 17:38:53   INFO  epoch: 1/24, acc_iter=12387, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:55, time_cost(all): 4:02:35/1 day, 23:49:02, loss=0.574734035174227, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=2.393146998824629, lr=0.0756494800364354
2023-11-14 17:39:52   INFO  epoch: 1/24, acc_iter=12437, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:46, time_cost(all): 4:03:34/1 day, 23:32:19, loss=0.57462309302605, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=2.229291165316398, lr=0.07600529451950812
2023-11-14 17:40:51   INFO  epoch: 1/24, acc_iter=12487, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:38, time_cost(all): 4:04:33/1 day, 23:50:55, loss=0.574512150877873, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.6256550912231944, lr=0.07636110900258083
2023-11-14 17:41:50   INFO  epoch: 1/24, acc_iter=12537, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:52, time_cost(all): 4:05:32/2 days, 0:20:45, loss=0.574401208729696, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=0.5112747432587741, lr=0.07671692348565357
2023-11-14 17:42:49   INFO  epoch: 1/24, acc_iter=12587, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:36, time_cost(all): 4:06:31/2 days, 1:46:18, loss=0.57429026658152, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=3.597954294361007, lr=0.07707273796872627
2023-11-14 17:43:48   INFO  epoch: 1/24, acc_iter=12637, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:50, time_cost(all): 4:07:30/1 day, 22:23:15, loss=0.574179324433343, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.449280009452422, lr=0.077428552451799
2023-11-14 17:44:47   INFO  epoch: 1/24, acc_iter=12687, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:25, time_cost(all): 4:08:29/1 day, 22:50:55, loss=0.574068382285166, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=2.85955187506674, lr=0.07778436693487172
2023-11-14 17:45:45   INFO  epoch: 1/24, acc_iter=12737, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:48, time_cost(all): 4:09:27/1 day, 23:10:43, loss=0.573957440136989, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=0.9822228778571758, lr=0.07814018141794443
2023-11-14 17:46:44   INFO  epoch: 1/24, acc_iter=12787, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:36, time_cost(all): 4:10:26/1 day, 23:46:23, loss=0.573846497988813, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=0.8950386167187141, lr=0.07849599590101715
2023-11-14 17:47:43   INFO  epoch: 1/24, acc_iter=12837, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:56, time_cost(all): 4:11:25/2 days, 0:17:42, loss=0.573735555840636, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=3.002440306660049, lr=0.07885181038408988
2023-11-14 17:48:42   INFO  epoch: 1/24, acc_iter=12887, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:52, time_cost(all): 4:12:24/1 day, 21:25:30, loss=0.573624613692459, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=4.138105778346006, lr=0.07920762486716258
2023-11-14 17:49:41   INFO  epoch: 1/24, acc_iter=12937, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:53, time_cost(all): 4:13:23/1 day, 22:34:54, loss=0.573513671544282, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=4.68037697683999, lr=0.07956343935023531
2023-11-14 17:50:40   INFO  epoch: 1/24, acc_iter=12987, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:34, time_cost(all): 4:14:22/1 day, 21:51:54, loss=0.573402729396106, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=1.0661083403550053, lr=0.07991925383330803
2023-11-14 17:51:39   INFO  epoch: 1/24, acc_iter=13037, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:34, time_cost(all): 4:15:21/2 days, 0:29:14, loss=0.573291787247929, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=4.772801969898303, lr=0.08027506831638075
2023-11-14 17:52:38   INFO  epoch: 1/24, acc_iter=13087, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:43, time_cost(all): 4:16:20/1 day, 22:37:37, loss=0.573180845099752, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=1.1273389160156448, lr=0.08063088279945346
2023-11-14 17:53:37   INFO  epoch: 1/24, acc_iter=13137, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:45, time_cost(all): 4:17:19/2 days, 0:42:27, loss=0.573069902951575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=1.9944589082444388, lr=0.0809866972825262
2023-11-14 17:54:36   INFO  epoch: 2/24, acc_iter=13224, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:08:28, time_cost(all): 4:18:18/2 days, 0:20:20, loss=0.572876863613748, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=1.268993514029269, lr=0.08160581448307272
2023-11-14 17:55:35   INFO  epoch: 2/24, acc_iter=13274, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:02:16, time_cost(all): 4:19:17/1 day, 21:50:21, loss=0.572765921465571, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=2.6173134341566695, lr=0.08196162896614544
2023-11-14 17:56:34   INFO  epoch: 2/24, acc_iter=13324, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:10:15, time_cost(all): 4:20:16/2 days, 0:03:36, loss=0.572654979317394, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.2713785647376905, lr=0.08231744344921815
2023-11-14 17:57:33   INFO  epoch: 2/24, acc_iter=13374, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:08:54, time_cost(all): 4:21:15/1 day, 22:33:17, loss=0.572544037169218, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=1.4092758261610143, lr=0.08267325793229087
2023-11-14 17:58:32   INFO  epoch: 2/24, acc_iter=13424, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:00:23, time_cost(all): 4:22:14/2 days, 0:07:29, loss=0.572433095021041, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=2.327615239797826, lr=0.08302907241536359
2023-11-14 17:59:30   INFO  epoch: 2/24, acc_iter=13474, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:57:40, time_cost(all): 4:23:12/2 days, 1:36:24, loss=0.572322152872864, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=4.401036620915712, lr=0.08338488689843632
2023-11-14 18:00:29   INFO  epoch: 2/24, acc_iter=13524, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:00:07, time_cost(all): 4:24:11/2 days, 0:56:35, loss=0.572211210724687, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=1.1565776579263733, lr=0.08374070138150903
2023-11-14 18:01:28   INFO  epoch: 2/24, acc_iter=13574, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:03:52, time_cost(all): 4:25:10/1 day, 21:23:02, loss=0.572100268576511, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=1.8827508451650863, lr=0.08409651586458175
2023-11-14 18:02:27   INFO  epoch: 2/24, acc_iter=13624, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:59:03, time_cost(all): 4:26:09/1 day, 22:28:20, loss=0.571989326428334, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=4.386940261693263, lr=0.08445233034765447
2023-11-14 18:03:26   INFO  epoch: 2/24, acc_iter=13674, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:55:09, time_cost(all): 4:27:08/2 days, 0:09:17, loss=0.571878384280157, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.602041423953247, lr=0.08480814483072718
2023-11-14 18:04:25   INFO  epoch: 2/24, acc_iter=13724, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:02:56, time_cost(all): 4:28:07/1 day, 23:53:58, loss=0.57176744213198, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=1.9394596838508054, lr=0.0851639593137999
2023-11-14 18:05:24   INFO  epoch: 2/24, acc_iter=13774, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:58:55, time_cost(all): 4:29:06/1 day, 23:05:29, loss=0.571656499983804, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=4.231688497173507, lr=0.08551977379687263
2023-11-14 18:06:23   INFO  epoch: 2/24, acc_iter=13824, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:56:19, time_cost(all): 4:30:05/2 days, 0:59:13, loss=0.571545557835627, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.519921812771461, lr=0.08587558827994535
2023-11-14 18:07:22   INFO  epoch: 2/24, acc_iter=13874, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:58:50, time_cost(all): 4:31:04/1 day, 22:54:26, loss=0.57143461568745, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=3.4857958812538774, lr=0.08623140276301806
2023-11-14 18:08:21   INFO  epoch: 2/24, acc_iter=13924, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:55:17, time_cost(all): 4:32:03/1 day, 22:28:58, loss=0.571323673539273, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=0.6014885865212678, lr=0.08658721724609078
2023-11-14 18:09:20   INFO  epoch: 2/24, acc_iter=13974, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:54:00, time_cost(all): 4:33:02/1 day, 21:20:35, loss=0.571212731391097, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=2.8134844494347195, lr=0.0869430317291635
2023-11-14 18:10:19   INFO  epoch: 2/24, acc_iter=14024, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:56:25, time_cost(all): 4:34:01/1 day, 23:59:26, loss=0.57110178924292, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=4.42672336828195, lr=0.08729884621223621
2023-11-14 18:11:18   INFO  epoch: 2/24, acc_iter=14074, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:48:54, time_cost(all): 4:35:00/1 day, 23:45:57, loss=0.570990847094743, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=1.5616157324941977, lr=0.08765466069530894
2023-11-14 18:12:17   INFO  epoch: 2/24, acc_iter=14124, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:54:30, time_cost(all): 4:35:59/1 day, 22:37:37, loss=0.570879904946566, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=4.207922967033848, lr=0.08801047517838166
2023-11-14 18:13:15   INFO  epoch: 2/24, acc_iter=14174, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:45:41, time_cost(all): 4:36:57/1 day, 23:03:07, loss=0.57076896279839, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=2.616182458535116, lr=0.08836628966145438
2023-11-14 18:14:14   INFO  epoch: 2/24, acc_iter=14224, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:43:19, time_cost(all): 4:37:56/2 days, 0:04:29, loss=0.570658020650213, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=1.016241550104242, lr=0.0887221041445271
2023-11-14 18:15:13   INFO  epoch: 2/24, acc_iter=14274, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:46:00, time_cost(all): 4:38:55/1 day, 23:15:11, loss=0.570547078502036, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=1.1895209118617909, lr=0.08907791862759981
2023-11-14 18:16:12   INFO  epoch: 2/24, acc_iter=14324, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:48:06, time_cost(all): 4:39:54/1 day, 20:52:22, loss=0.570436136353859, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=4.523503091308432, lr=0.08943373311067253
2023-11-14 18:17:11   INFO  epoch: 2/24, acc_iter=14374, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:40, time_cost(all): 4:40:53/1 day, 22:29:30, loss=0.570325194205683, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.07(1.03), norm=1.4808950764120778, lr=0.08978954759374526
2023-11-14 18:18:10   INFO  epoch: 2/24, acc_iter=14424, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:44:14, time_cost(all): 4:41:52/1 day, 22:30:20, loss=0.570214252057506, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=3.6893955501138764, lr=0.09014536207681798
2023-11-14 18:19:09   INFO  epoch: 2/24, acc_iter=14474, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:43:43, time_cost(all): 4:42:51/1 day, 21:38:47, loss=0.570103309909329, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=2.0133469294967705, lr=0.09050117655989069
2023-11-14 18:20:08   INFO  epoch: 2/24, acc_iter=14524, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:38:35, time_cost(all): 4:43:50/2 days, 0:00:56, loss=0.569992367761152, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.055631316635522, lr=0.09085699104296341
2023-11-14 18:21:07   INFO  epoch: 2/24, acc_iter=14574, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:46:52, time_cost(all): 4:44:49/1 day, 22:55:36, loss=0.569881425612976, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.169089529791097, lr=0.09121280552603613
2023-11-14 18:22:06   INFO  epoch: 2/24, acc_iter=14624, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:39:06, time_cost(all): 4:45:48/2 days, 0:41:01, loss=0.569770483464799, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=2.5121240789135117, lr=0.09156862000910884
2023-11-14 18:23:05   INFO  epoch: 2/24, acc_iter=14674, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:39:28, time_cost(all): 4:46:47/2 days, 0:19:10, loss=0.569659541316622, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=1.1054838067185127, lr=0.09192443449218157
2023-11-14 18:24:04   INFO  epoch: 2/24, acc_iter=14724, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:39:15, time_cost(all): 4:47:46/1 day, 21:42:49, loss=0.569548599168445, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=3.143839327335616, lr=0.09228024897525428
2023-11-14 18:25:03   INFO  epoch: 2/24, acc_iter=14774, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:40:58, time_cost(all): 4:48:45/1 day, 21:01:45, loss=0.569437657020268, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.164250607951531, lr=0.092636063458327
2023-11-14 18:26:02   INFO  epoch: 2/24, acc_iter=14824, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:33:56, time_cost(all): 4:49:44/1 day, 21:40:13, loss=0.569326714872092, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=2.2355861252753213, lr=0.09299187794139972
2023-11-14 18:27:00   INFO  epoch: 2/24, acc_iter=14874, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:40:40, time_cost(all): 4:50:42/1 day, 22:37:02, loss=0.569215772723915, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=4.101038469408605, lr=0.09334769242447244
2023-11-14 18:27:59   INFO  epoch: 2/24, acc_iter=14924, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:37:07, time_cost(all): 4:51:41/2 days, 0:46:36, loss=0.569104830575738, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=1.0536522923138159, lr=0.09370350690754516
2023-11-14 18:28:58   INFO  epoch: 2/24, acc_iter=14974, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:31:05, time_cost(all): 4:52:40/2 days, 0:00:22, loss=0.568993888427561, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.1220064569994754, lr=0.09405932139061789
2023-11-14 18:29:57   INFO  epoch: 2/24, acc_iter=15024, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:37:15, time_cost(all): 4:53:39/2 days, 0:38:22, loss=0.568882946279385, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=1.8429045424407646, lr=0.09441513587369059
2023-11-14 18:30:56   INFO  epoch: 2/24, acc_iter=15074, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:33:49, time_cost(all): 4:54:38/1 day, 20:40:30, loss=0.568772004131208, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=4.780431985079809, lr=0.09477095035676332
2023-11-14 18:31:55   INFO  epoch: 2/24, acc_iter=15124, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:31:40, time_cost(all): 4:55:37/1 day, 23:15:06, loss=0.568661061983031, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=4.98052820923596, lr=0.09512676483983604
2023-11-14 18:32:54   INFO  epoch: 2/24, acc_iter=15174, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:26:14, time_cost(all): 4:56:36/1 day, 21:33:39, loss=0.568550119834854, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=3.201644342067988, lr=0.09548257932290875
2023-11-14 18:33:53   INFO  epoch: 2/24, acc_iter=15224, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:32:38, time_cost(all): 4:57:35/1 day, 23:48:28, loss=0.568439177686678, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=3.726030594198916, lr=0.09583839380598147
2023-11-14 18:34:52   INFO  epoch: 2/24, acc_iter=15274, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:32:05, time_cost(all): 4:58:34/1 day, 20:48:54, loss=0.568328235538501, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=2.1179674525653414, lr=0.0961942082890542
2023-11-14 18:35:51   INFO  epoch: 2/24, acc_iter=15324, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:30:42, time_cost(all): 4:59:33/1 day, 20:46:44, loss=0.568217293390324, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=3.1206909888837524, lr=0.0965500227721269
2023-11-14 18:36:50   INFO  epoch: 2/24, acc_iter=15374, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:30:18, time_cost(all): 5:00:32/1 day, 22:28:47, loss=0.568106351242147, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=4.385775751202763, lr=0.09690583725519963
2023-11-14 18:37:49   INFO  epoch: 2/24, acc_iter=15424, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:57, time_cost(all): 5:01:31/1 day, 23:09:48, loss=0.567995409093971, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=1.3848958861782075, lr=0.09726165173827235
2023-11-14 18:38:48   INFO  epoch: 2/24, acc_iter=15474, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:22:50, time_cost(all): 5:02:30/1 day, 21:23:43, loss=0.567884466945794, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=0.6362444696444463, lr=0.09761746622134507
2023-11-14 18:39:47   INFO  epoch: 2/24, acc_iter=15524, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:26:12, time_cost(all): 5:03:29/2 days, 0:32:54, loss=0.567773524797617, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=2.006921885646988, lr=0.09797328070441778
2023-11-14 18:40:45   INFO  epoch: 2/24, acc_iter=15574, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:20:19, time_cost(all): 5:04:27/1 day, 21:57:17, loss=0.56766258264944, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=0.5254096161824333, lr=0.09832909518749051
2023-11-14 18:41:44   INFO  epoch: 2/24, acc_iter=15624, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:29, time_cost(all): 5:05:26/1 day, 20:43:53, loss=0.567551640501264, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.83(1.03), norm=4.692098675419684, lr=0.09868490967056323
2023-11-14 18:42:43   INFO  epoch: 2/24, acc_iter=15674, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:20:41, time_cost(all): 5:06:25/1 day, 21:31:48, loss=0.567440698353087, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=3.244950134711544, lr=0.09904072415363595
2023-11-14 18:43:42   INFO  epoch: 2/24, acc_iter=15724, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:00, time_cost(all): 5:07:24/1 day, 22:17:03, loss=0.56732975620491, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=1.0440980671895241, lr=0.09939653863670866
2023-11-14 18:44:41   INFO  epoch: 2/24, acc_iter=15774, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:15:22, time_cost(all): 5:08:23/1 day, 21:00:16, loss=0.567218814056733, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=1.2269243928050497, lr=0.09975235311978138
2023-11-14 18:45:40   INFO  epoch: 2/24, acc_iter=15824, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:14:49, time_cost(all): 5:09:22/1 day, 23:31:00, loss=0.567107871908557, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.8823164400867864, lr=0.09998781210108687
2023-11-14 18:46:39   INFO  epoch: 2/24, acc_iter=15874, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:15:38, time_cost(all): 5:10:21/1 day, 22:19:28, loss=0.56699692976038, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=2.6034571041780743, lr=0.09994772032834628
2023-11-14 18:47:38   INFO  epoch: 2/24, acc_iter=15924, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:15:01, time_cost(all): 5:11:20/1 day, 21:01:04, loss=0.566885987612203, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.152920018957226, lr=0.09990762855560568
2023-11-14 18:48:37   INFO  epoch: 2/24, acc_iter=15974, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:16:41, time_cost(all): 5:12:19/1 day, 20:44:20, loss=0.566775045464026, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=3.0293706970900764, lr=0.0998675367828651
2023-11-14 18:49:36   INFO  epoch: 2/24, acc_iter=16024, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:18, time_cost(all): 5:13:18/1 day, 23:50:32, loss=0.56666410331585, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=1.0579500492045146, lr=0.09982744501012451
2023-11-14 18:50:35   INFO  epoch: 2/24, acc_iter=16074, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:09:08, time_cost(all): 5:14:17/1 day, 23:27:23, loss=0.566553161167673, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.4038679711277577, lr=0.09978735323738393
2023-11-14 18:51:34   INFO  epoch: 2/24, acc_iter=16124, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:02, time_cost(all): 5:15:16/1 day, 21:32:09, loss=0.566442219019496, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=1.5377698394346015, lr=0.09974726146464334
2023-11-14 18:52:33   INFO  epoch: 2/24, acc_iter=16174, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:13:03, time_cost(all): 5:16:15/1 day, 20:43:31, loss=0.566331276871319, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=3.1368702916215683, lr=0.09970716969190276
2023-11-14 18:53:32   INFO  epoch: 2/24, acc_iter=16224, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:10:48, time_cost(all): 5:17:14/2 days, 0:02:24, loss=0.566220334723143, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.1295359347969, lr=0.09966707791916217
2023-11-14 18:54:30   INFO  epoch: 2/24, acc_iter=16274, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:27, time_cost(all): 5:18:12/1 day, 20:15:17, loss=0.566109392574966, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=1.9100807094849754, lr=0.09962698614642157
2023-11-14 18:55:29   INFO  epoch: 2/24, acc_iter=16324, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:04:27, time_cost(all): 5:19:11/1 day, 21:18:38, loss=0.565998450426789, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.549377710553852, lr=0.09958689437368098
2023-11-14 18:56:28   INFO  epoch: 2/24, acc_iter=16374, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:06:04, time_cost(all): 5:20:10/1 day, 20:27:03, loss=0.565887508278612, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.0834478072366123, lr=0.0995468026009404
2023-11-14 18:57:27   INFO  epoch: 2/24, acc_iter=16424, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:06:44, time_cost(all): 5:21:09/1 day, 22:37:54, loss=0.565776566130436, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=3.8564280492087506, lr=0.09950671082819981
2023-11-14 18:58:26   INFO  epoch: 2/24, acc_iter=16474, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:05:05, time_cost(all): 5:22:08/2 days, 0:27:46, loss=0.565665623982259, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=2.022206208930534, lr=0.09946661905545923
2023-11-14 18:59:25   INFO  epoch: 2/24, acc_iter=16524, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:58, time_cost(all): 5:23:07/2 days, 0:22:27, loss=0.565554681834082, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.4731692114314852, lr=0.09942652728271864
2023-11-14 19:00:24   INFO  epoch: 2/24, acc_iter=16574, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:55, time_cost(all): 5:24:06/2 days, 0:40:08, loss=0.565443739685905, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.4795375222754952, lr=0.09938643550997804
2023-11-14 19:01:23   INFO  epoch: 2/24, acc_iter=16624, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:01:37, time_cost(all): 5:25:05/1 day, 22:39:08, loss=0.565332797537729, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=0.7533731212554848, lr=0.09934634373723746
2023-11-14 19:02:22   INFO  epoch: 2/24, acc_iter=16674, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:57:43, time_cost(all): 5:26:04/1 day, 23:39:51, loss=0.565221855389552, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=3.9612566258580184, lr=0.09930625196449687
2023-11-14 19:03:21   INFO  epoch: 2/24, acc_iter=16724, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:58:40, time_cost(all): 5:27:03/1 day, 22:34:30, loss=0.565110913241375, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.854381317799646, lr=0.09926616019175628
2023-11-14 19:04:20   INFO  epoch: 2/24, acc_iter=16774, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:58:34, time_cost(all): 5:28:02/2 days, 0:06:44, loss=0.564999971093198, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.6083854671211797, lr=0.0992260684190157
2023-11-14 19:05:19   INFO  epoch: 2/24, acc_iter=16824, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:57:43, time_cost(all): 5:29:01/2 days, 0:10:17, loss=0.564889028945022, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.476012008655118, lr=0.09918597664627511
2023-11-14 19:06:18   INFO  epoch: 2/24, acc_iter=16874, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:59, time_cost(all): 5:30:00/1 day, 22:12:06, loss=0.564778086796845, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.382233786019926, lr=0.09914588487353451
2023-11-14 19:07:17   INFO  epoch: 2/24, acc_iter=16924, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:49, time_cost(all): 5:30:59/1 day, 20:55:14, loss=0.564667144648668, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=0.6161978304555648, lr=0.09910579310079393
2023-11-14 19:08:15   INFO  epoch: 2/24, acc_iter=16974, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:52:55, time_cost(all): 5:31:57/1 day, 20:57:14, loss=0.564556202500491, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=4.3669061146736485, lr=0.09906570132805334
2023-11-14 19:09:14   INFO  epoch: 2/24, acc_iter=17024, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:54:08, time_cost(all): 5:32:56/1 day, 23:32:24, loss=0.564445260352315, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=3.6265743967024173, lr=0.09902560955531275
2023-11-14 19:10:13   INFO  epoch: 2/24, acc_iter=17074, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:53:02, time_cost(all): 5:33:55/1 day, 20:01:13, loss=0.564334318204138, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=2.1012945116398463, lr=0.09898551778257217
2023-11-14 19:11:12   INFO  epoch: 2/24, acc_iter=17124, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:51:54, time_cost(all): 5:34:54/1 day, 23:32:25, loss=0.564223376055961, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.0869645489860438, lr=0.09894542600983158
2023-11-14 19:12:11   INFO  epoch: 2/24, acc_iter=17174, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:48:57, time_cost(all): 5:35:53/1 day, 22:26:40, loss=0.564112433907784, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=1.9046698678724128, lr=0.098905334237091
2023-11-14 19:13:10   INFO  epoch: 2/24, acc_iter=17224, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:48:05, time_cost(all): 5:36:52/1 day, 23:12:04, loss=0.564001491759608, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=4.207779360678089, lr=0.0988652424643504
2023-11-14 19:14:09   INFO  epoch: 2/24, acc_iter=17274, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:47:08, time_cost(all): 5:37:51/1 day, 22:50:00, loss=0.563890549611431, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=3.5686042851944793, lr=0.09882515069160981
2023-11-14 19:15:08   INFO  epoch: 2/24, acc_iter=17324, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:40, time_cost(all): 5:38:50/1 day, 20:34:09, loss=0.563779607463254, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.1139403341185607, lr=0.09878505891886923
2023-11-14 19:16:07   INFO  epoch: 2/24, acc_iter=17374, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:49:07, time_cost(all): 5:39:49/1 day, 21:26:16, loss=0.563668665315077, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=4.22135642022785, lr=0.09874496714612864
2023-11-14 19:17:06   INFO  epoch: 2/24, acc_iter=17424, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:18, time_cost(all): 5:40:48/1 day, 21:26:37, loss=0.563557723166901, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=3.9025137421406946, lr=0.09870487537338805
2023-11-14 19:18:05   INFO  epoch: 2/24, acc_iter=17474, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:42:44, time_cost(all): 5:41:47/1 day, 23:50:20, loss=0.563446781018724, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=4.463078375207154, lr=0.09866478360064747
2023-11-14 19:19:04   INFO  epoch: 2/24, acc_iter=17524, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:29, time_cost(all): 5:42:46/1 day, 22:48:38, loss=0.563335838870547, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.743124513020807, lr=0.09862469182790687
2023-11-14 19:20:03   INFO  epoch: 2/24, acc_iter=17574, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:18, time_cost(all): 5:43:45/1 day, 20:18:40, loss=0.56322489672237, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=3.1611035448587312, lr=0.09858460005516628
2023-11-14 19:21:02   INFO  epoch: 2/24, acc_iter=17624, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:44, time_cost(all): 5:44:44/1 day, 21:52:51, loss=0.563113954574194, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=4.720809204502024, lr=0.0985445082824257
2023-11-14 19:22:00   INFO  epoch: 2/24, acc_iter=17674, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:41:53, time_cost(all): 5:45:42/1 day, 20:53:55, loss=0.563003012426017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=1.1375321182239178, lr=0.09850441650968511
2023-11-14 19:22:59   INFO  epoch: 2/24, acc_iter=17724, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:19, time_cost(all): 5:46:41/1 day, 22:10:44, loss=0.56289207027784, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.9817288194776106, lr=0.09846432473694453
2023-11-14 19:23:58   INFO  epoch: 2/24, acc_iter=17774, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:52, time_cost(all): 5:47:40/1 day, 20:21:35, loss=0.562781128129663, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=0.9234195826490833, lr=0.09842423296420394
2023-11-14 19:24:57   INFO  epoch: 2/24, acc_iter=17824, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:18, time_cost(all): 5:48:39/1 day, 22:45:32, loss=0.562670185981487, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.7336703546144347, lr=0.09838414119146334
2023-11-14 19:25:56   INFO  epoch: 2/24, acc_iter=17874, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:37:19, time_cost(all): 5:49:38/1 day, 20:37:08, loss=0.56255924383331, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=0.60451951771973, lr=0.09834404941872275
2023-11-14 19:26:55   INFO  epoch: 2/24, acc_iter=17924, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:51, time_cost(all): 5:50:37/1 day, 22:14:22, loss=0.562448301685133, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=2.752823048058811, lr=0.09830395764598217
2023-11-14 19:27:54   INFO  epoch: 2/24, acc_iter=17974, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:27, time_cost(all): 5:51:36/1 day, 23:50:05, loss=0.562337359536956, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=3.257524728064291, lr=0.09826386587324158
2023-11-14 19:28:53   INFO  epoch: 2/24, acc_iter=18024, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:06, time_cost(all): 5:52:35/1 day, 23:04:08, loss=0.56222641738878, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=0.6948095161571011, lr=0.098223774100501
2023-11-14 19:29:52   INFO  epoch: 2/24, acc_iter=18074, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:23, time_cost(all): 5:53:34/1 day, 22:26:17, loss=0.562115475240603, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=3.99079788534349, lr=0.09818368232776041
2023-11-14 19:30:51   INFO  epoch: 2/24, acc_iter=18124, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:26, time_cost(all): 5:54:33/1 day, 22:09:18, loss=0.562004533092426, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=3.0921574260241975, lr=0.09814359055501981
2023-11-14 19:31:50   INFO  epoch: 2/24, acc_iter=18174, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:30:14, time_cost(all): 5:55:32/1 day, 21:55:05, loss=0.561893590944249, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=3.4567116345874793, lr=0.09810349878227922
2023-11-14 19:32:49   INFO  epoch: 2/24, acc_iter=18224, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:29:56, time_cost(all): 5:56:31/1 day, 21:27:46, loss=0.561782648796073, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=1.0442491154267541, lr=0.09806340700953864
2023-11-14 19:33:48   INFO  epoch: 2/24, acc_iter=18274, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:22, time_cost(all): 5:57:30/1 day, 23:02:06, loss=0.561671706647896, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=4.95623541753414, lr=0.09802331523679805
2023-11-14 19:34:47   INFO  epoch: 2/24, acc_iter=18324, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:07, time_cost(all): 5:58:29/1 day, 22:30:04, loss=0.561560764499719, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=3.6129000469186434, lr=0.09798322346405747
2023-11-14 19:35:45   INFO  epoch: 2/24, acc_iter=18374, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:32, time_cost(all): 5:59:27/1 day, 22:13:06, loss=0.561449822351542, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.04(1.03), norm=2.8795207538892513, lr=0.09794313169131688
2023-11-14 19:36:44   INFO  epoch: 2/24, acc_iter=18424, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:08, time_cost(all): 6:00:26/1 day, 22:38:24, loss=0.561338880203366, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.041824006790593, lr=0.0979030399185763
2023-11-14 19:37:43   INFO  epoch: 2/24, acc_iter=18474, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:09, time_cost(all): 6:01:25/1 day, 20:17:07, loss=0.561227938055189, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=0.7209872279444288, lr=0.0978629481458357
2023-11-14 19:38:42   INFO  epoch: 2/24, acc_iter=18524, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:18, time_cost(all): 6:02:24/1 day, 22:58:31, loss=0.561116995907012, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.728383345181168, lr=0.09782285637309511
2023-11-14 19:39:41   INFO  epoch: 2/24, acc_iter=18574, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:48, time_cost(all): 6:03:23/1 day, 19:30:28, loss=0.561006053758835, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=2.5810416411238766, lr=0.09778276460035452
2023-11-14 19:40:40   INFO  epoch: 2/24, acc_iter=18624, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:20, time_cost(all): 6:04:22/1 day, 23:07:47, loss=0.560895111610659, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=1.2901735712561444, lr=0.09774267282761394
2023-11-14 19:41:39   INFO  epoch: 2/24, acc_iter=18674, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:22:08, time_cost(all): 6:05:21/1 day, 22:29:35, loss=0.560784169462482, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=2.0463987934787995, lr=0.09770258105487335
2023-11-14 19:42:38   INFO  epoch: 2/24, acc_iter=18724, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:24, time_cost(all): 6:06:20/1 day, 23:47:38, loss=0.560673227314305, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=3.551254751791593, lr=0.09766248928213277
2023-11-14 19:43:37   INFO  epoch: 2/24, acc_iter=18774, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:59, time_cost(all): 6:07:19/1 day, 20:16:16, loss=0.560562285166128, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=4.471435230706462, lr=0.09762239750939217
2023-11-14 19:44:36   INFO  epoch: 2/24, acc_iter=18824, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:05, time_cost(all): 6:08:18/1 day, 19:41:47, loss=0.560451343017952, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=2.030019303570268, lr=0.09758230573665158
2023-11-14 19:45:35   INFO  epoch: 2/24, acc_iter=18874, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:42, time_cost(all): 6:09:17/1 day, 20:16:49, loss=0.560340400869775, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=0.6241211900551007, lr=0.097542213963911
2023-11-14 19:46:34   INFO  epoch: 2/24, acc_iter=18924, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:40, time_cost(all): 6:10:16/1 day, 23:35:46, loss=0.560229458721598, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=4.117246824170294, lr=0.09750212219117041
2023-11-14 19:47:33   INFO  epoch: 2/24, acc_iter=18974, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:46, time_cost(all): 6:11:15/1 day, 22:22:33, loss=0.560118516573421, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.1855961295941813, lr=0.09746203041842982
2023-11-14 19:48:32   INFO  epoch: 2/24, acc_iter=19024, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:46, time_cost(all): 6:12:14/1 day, 19:59:28, loss=0.560007574425245, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=4.275373900113353, lr=0.09742193864568924
2023-11-14 19:49:30   INFO  epoch: 2/24, acc_iter=19074, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:05, time_cost(all): 6:13:12/1 day, 20:32:00, loss=0.559896632277068, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.099490111924658, lr=0.09738184687294864
2023-11-14 19:50:29   INFO  epoch: 2/24, acc_iter=19124, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:24, time_cost(all): 6:14:11/1 day, 23:03:33, loss=0.559785690128891, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=2.7106924313060388, lr=0.09734175510020805
2023-11-14 19:51:28   INFO  epoch: 2/24, acc_iter=19174, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:30, time_cost(all): 6:15:10/1 day, 23:41:09, loss=0.559674747980714, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=3.275320238434752, lr=0.09730166332746747
2023-11-14 19:52:27   INFO  epoch: 2/24, acc_iter=19224, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:10, time_cost(all): 6:16:09/1 day, 20:12:00, loss=0.559563805832538, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=2.2779530929442657, lr=0.09726157155472688
2023-11-14 19:53:26   INFO  epoch: 2/24, acc_iter=19274, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:10:01, time_cost(all): 6:17:08/1 day, 19:50:46, loss=0.559452863684361, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=3.5958028251429823, lr=0.0972214797819863
2023-11-14 19:54:25   INFO  epoch: 2/24, acc_iter=19324, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:34, time_cost(all): 6:18:07/1 day, 23:35:23, loss=0.559341921536184, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=3.0770055403314704, lr=0.09718138800924571
2023-11-14 19:55:24   INFO  epoch: 2/24, acc_iter=19374, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:44, time_cost(all): 6:19:06/1 day, 23:03:07, loss=0.559230979388007, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.083031820999881, lr=0.09714129623650511
2023-11-14 19:56:23   INFO  epoch: 2/24, acc_iter=19424, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:47, time_cost(all): 6:20:05/1 day, 21:22:30, loss=0.559120037239831, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=0.8986575591573143, lr=0.09710120446376452
2023-11-14 19:57:22   INFO  epoch: 2/24, acc_iter=19474, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:37, time_cost(all): 6:21:04/1 day, 21:10:33, loss=0.559009095091654, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=0.5038761897335247, lr=0.09706111269102394
2023-11-14 19:58:21   INFO  epoch: 2/24, acc_iter=19524, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:37, time_cost(all): 6:22:03/1 day, 22:52:51, loss=0.558898152943477, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.943821274100075, lr=0.09702102091828335
2023-11-14 19:59:20   INFO  epoch: 2/24, acc_iter=19574, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:48, time_cost(all): 6:23:02/1 day, 19:13:05, loss=0.5587872107953, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=2.903042139845058, lr=0.09698092914554277
2023-11-14 20:00:19   INFO  epoch: 2/24, acc_iter=19624, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:37, time_cost(all): 6:24:01/1 day, 22:06:34, loss=0.558676268647124, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=1.2704498285682522, lr=0.09694083737280218
2023-11-14 20:01:18   INFO  epoch: 2/24, acc_iter=19674, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 6:25:00/1 day, 21:35:57, loss=0.558565326498947, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.923308248352993, lr=0.0969007456000616
2023-11-14 20:02:17   INFO  epoch: 2/24, acc_iter=19724, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 6:25:59/1 day, 19:31:20, loss=0.55845438435077, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=2.5719446070193683, lr=0.096860653827321
2023-11-14 20:03:15   INFO  epoch: 3/24, acc_iter=19811, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:03:33, time_cost(all): 6:26:57/1 day, 19:59:10, loss=0.558261345012943, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=4.4872563592570796, lr=0.09679089414275238
2023-11-14 20:04:14   INFO  epoch: 3/24, acc_iter=19861, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:13:35, time_cost(all): 6:27:56/1 day, 21:53:51, loss=0.558150402864766, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=4.064110641609576, lr=0.09675080237001178
2023-11-14 20:05:13   INFO  epoch: 3/24, acc_iter=19911, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:12:26, time_cost(all): 6:28:55/1 day, 22:00:24, loss=0.558039460716589, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=4.587663703511719, lr=0.0967107105972712
2023-11-14 20:06:12   INFO  epoch: 3/24, acc_iter=19961, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:00:39, time_cost(all): 6:29:54/1 day, 20:24:03, loss=0.557928518568412, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=4.174930257636822, lr=0.09667061882453061
2023-11-14 20:07:11   INFO  epoch: 3/24, acc_iter=20011, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:03:28, time_cost(all): 6:30:53/1 day, 20:48:53, loss=0.557817576420236, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=1.4249813185413343, lr=0.09663052705179002
2023-11-14 20:08:10   INFO  epoch: 3/24, acc_iter=20061, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:46, time_cost(all): 6:31:52/1 day, 20:00:55, loss=0.557706634272059, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.578147671979814, lr=0.09659043527904944
2023-11-14 20:09:09   INFO  epoch: 3/24, acc_iter=20111, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:06:28, time_cost(all): 6:32:51/1 day, 21:36:22, loss=0.557595692123882, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=1.640929541539092, lr=0.09655034350630885
2023-11-14 20:10:08   INFO  epoch: 3/24, acc_iter=20161, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:03:29, time_cost(all): 6:33:50/1 day, 22:57:25, loss=0.557484749975705, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=4.669341431465693, lr=0.09651025173356825
2023-11-14 20:11:07   INFO  epoch: 3/24, acc_iter=20211, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:06:11, time_cost(all): 6:34:49/1 day, 22:22:01, loss=0.557373807827529, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.6903173446994049, lr=0.09647015996082767
2023-11-14 20:12:06   INFO  epoch: 3/24, acc_iter=20261, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:55:15, time_cost(all): 6:35:48/1 day, 19:39:25, loss=0.557262865679352, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=2.853673479754445, lr=0.09643006818808708
2023-11-14 20:13:05   INFO  epoch: 3/24, acc_iter=20311, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:53:01, time_cost(all): 6:36:47/1 day, 20:20:14, loss=0.557151923531175, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=1.4812463214734506, lr=0.0963899764153465
2023-11-14 20:14:04   INFO  epoch: 3/24, acc_iter=20361, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:05, time_cost(all): 6:37:46/1 day, 23:13:14, loss=0.557040981382998, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.83(1.03), norm=3.9495145978557433, lr=0.09634988464260591
2023-11-14 20:15:03   INFO  epoch: 3/24, acc_iter=20411, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:58:33, time_cost(all): 6:38:45/1 day, 21:56:41, loss=0.556930039234822, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=4.70298304136222, lr=0.09630979286986532
2023-11-14 20:16:02   INFO  epoch: 3/24, acc_iter=20461, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/2:01:17, time_cost(all): 6:39:44/1 day, 22:22:56, loss=0.556819097086645, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=3.1088814499265807, lr=0.09626970109712474
2023-11-14 20:17:00   INFO  epoch: 3/24, acc_iter=20511, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:55:30, time_cost(all): 6:40:42/1 day, 20:58:05, loss=0.556708154938468, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=2.8479816644285934, lr=0.09622960932438414
2023-11-14 20:17:59   INFO  epoch: 3/24, acc_iter=20561, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:49:34, time_cost(all): 6:41:41/1 day, 22:27:10, loss=0.556597212790291, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=3.834400069236743, lr=0.09618951755164355
2023-11-14 20:18:58   INFO  epoch: 3/24, acc_iter=20611, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:57:19, time_cost(all): 6:42:40/1 day, 19:57:33, loss=0.556486270642115, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=3.8478005455683095, lr=0.09614942577890297
2023-11-14 20:19:57   INFO  epoch: 3/24, acc_iter=20661, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:55:54, time_cost(all): 6:43:39/1 day, 19:08:42, loss=0.556375328493938, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=3.4349688423425393, lr=0.09610933400616238
2023-11-14 20:20:56   INFO  epoch: 3/24, acc_iter=20711, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:47:51, time_cost(all): 6:44:38/1 day, 21:55:30, loss=0.556264386345761, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=1.797988860216611, lr=0.0960692422334218
2023-11-14 20:21:55   INFO  epoch: 3/24, acc_iter=20761, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:52:11, time_cost(all): 6:45:37/1 day, 21:56:27, loss=0.556153444197584, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=1.3282745222633938, lr=0.09602915046068121
2023-11-14 20:22:54   INFO  epoch: 3/24, acc_iter=20811, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:09, time_cost(all): 6:46:36/1 day, 19:18:56, loss=0.556042502049408, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=2.309146281879726, lr=0.09598905868794061
2023-11-14 20:23:53   INFO  epoch: 3/24, acc_iter=20861, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:48:54, time_cost(all): 6:47:35/1 day, 18:50:51, loss=0.555931559901231, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.6064797296156565, lr=0.09594896691520002
2023-11-14 20:24:52   INFO  epoch: 3/24, acc_iter=20911, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:45:58, time_cost(all): 6:48:34/1 day, 21:34:30, loss=0.555820617753054, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.916814455178821, lr=0.09590887514245944
2023-11-14 20:25:51   INFO  epoch: 3/24, acc_iter=20961, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:44:07, time_cost(all): 6:49:33/1 day, 20:21:31, loss=0.555709675604877, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=2.798648458737262, lr=0.09586878336971885
2023-11-14 20:26:50   INFO  epoch: 3/24, acc_iter=21011, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:46:24, time_cost(all): 6:50:32/1 day, 19:59:54, loss=0.555598733456701, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=1.4208298531186352, lr=0.09582869159697827
2023-11-14 20:27:49   INFO  epoch: 3/24, acc_iter=21061, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:40:49, time_cost(all): 6:51:31/1 day, 20:15:36, loss=0.555487791308524, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.05(1.03), norm=1.0927084581283342, lr=0.09578859982423768
2023-11-14 20:28:48   INFO  epoch: 3/24, acc_iter=21111, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:46:49, time_cost(all): 6:52:30/1 day, 20:23:37, loss=0.555376849160347, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.236955303118799, lr=0.09574850805149708
2023-11-14 20:29:47   INFO  epoch: 3/24, acc_iter=21161, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:40:39, time_cost(all): 6:53:29/1 day, 21:12:57, loss=0.55526590701217, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.404429540328117, lr=0.0957084162787565
2023-11-14 20:30:45   INFO  epoch: 3/24, acc_iter=21211, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:37:01, time_cost(all): 6:54:27/1 day, 22:02:16, loss=0.555154964863994, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.6240709438225545, lr=0.09566832450601591
2023-11-14 20:31:44   INFO  epoch: 3/24, acc_iter=21261, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:41:06, time_cost(all): 6:55:26/1 day, 22:57:40, loss=0.555044022715817, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=3.4617916980454377, lr=0.09562823273327532
2023-11-14 20:32:43   INFO  epoch: 3/24, acc_iter=21311, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:38:52, time_cost(all): 6:56:25/1 day, 21:00:48, loss=0.55493308056764, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=3.5543755294858594, lr=0.09558814096053474
2023-11-14 20:33:42   INFO  epoch: 3/24, acc_iter=21361, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:37:03, time_cost(all): 6:57:24/1 day, 18:34:32, loss=0.554822138419463, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.577414206220383, lr=0.09554804918779415
2023-11-14 20:34:41   INFO  epoch: 3/24, acc_iter=21411, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:36:19, time_cost(all): 6:58:23/1 day, 22:05:49, loss=0.554711196271287, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=0.5341824064905014, lr=0.09550795741505355
2023-11-14 20:35:40   INFO  epoch: 3/24, acc_iter=21461, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:34:26, time_cost(all): 6:59:22/1 day, 20:33:42, loss=0.55460025412311, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=2.057801463044372, lr=0.09546786564231297
2023-11-14 20:36:39   INFO  epoch: 3/24, acc_iter=21511, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:32:57, time_cost(all): 7:00:21/1 day, 18:49:07, loss=0.554489311974933, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.102160001327179, lr=0.09542777386957238
2023-11-14 20:37:38   INFO  epoch: 3/24, acc_iter=21561, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:40, time_cost(all): 7:01:20/1 day, 21:04:28, loss=0.554378369826756, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.968481895963742, lr=0.0953876820968318
2023-11-14 20:38:37   INFO  epoch: 3/24, acc_iter=21611, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:30:22, time_cost(all): 7:02:19/1 day, 21:49:59, loss=0.55426742767858, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=0.9823686056709988, lr=0.09534759032409121
2023-11-14 20:39:36   INFO  epoch: 3/24, acc_iter=21661, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:33:08, time_cost(all): 7:03:18/1 day, 19:26:50, loss=0.554156485530403, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=1.2162484674500476, lr=0.09530749855135062
2023-11-14 20:40:35   INFO  epoch: 3/24, acc_iter=21711, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:28:24, time_cost(all): 7:04:17/1 day, 22:49:30, loss=0.554045543382226, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.11(1.03), norm=4.0914603573635, lr=0.09526740677861004
2023-11-14 20:41:34   INFO  epoch: 3/24, acc_iter=21761, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:31:43, time_cost(all): 7:05:16/1 day, 19:56:34, loss=0.553934601234049, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.93(1.03), norm=4.400137604085398, lr=0.09522731500586944
2023-11-14 20:42:33   INFO  epoch: 3/24, acc_iter=21811, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:29:33, time_cost(all): 7:06:15/1 day, 22:38:08, loss=0.553823659085872, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=3.9796490847003394, lr=0.09518722323312885
2023-11-14 20:43:32   INFO  epoch: 3/24, acc_iter=21861, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:24:50, time_cost(all): 7:07:14/1 day, 22:22:29, loss=0.553712716937696, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=2.6127809120239616, lr=0.09514713146038827
2023-11-14 20:44:30   INFO  epoch: 3/24, acc_iter=21911, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:27:07, time_cost(all): 7:08:12/1 day, 18:44:35, loss=0.553601774789519, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.9093337597975966, lr=0.09510703968764768
2023-11-14 20:45:29   INFO  epoch: 3/24, acc_iter=21961, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:23:05, time_cost(all): 7:09:11/1 day, 21:23:02, loss=0.553490832641342, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.8113956275519665, lr=0.0950669479149071
2023-11-14 20:46:28   INFO  epoch: 3/24, acc_iter=22011, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:29:19, time_cost(all): 7:10:10/1 day, 18:27:25, loss=0.553379890493165, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=2.7523565173358873, lr=0.09502685614216651
2023-11-14 20:47:27   INFO  epoch: 3/24, acc_iter=22061, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:25:40, time_cost(all): 7:11:09/1 day, 21:29:27, loss=0.553268948344989, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.7380024182548153, lr=0.09498676436942591
2023-11-14 20:48:26   INFO  epoch: 3/24, acc_iter=22111, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:23:55, time_cost(all): 7:12:08/1 day, 22:04:00, loss=0.553158006196812, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=4.702238461051356, lr=0.09494667259668532
2023-11-14 20:49:25   INFO  epoch: 3/24, acc_iter=22161, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:25:28, time_cost(all): 7:13:07/1 day, 19:20:19, loss=0.553047064048635, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=4.573401759706572, lr=0.09490658082394474
2023-11-14 20:50:24   INFO  epoch: 3/24, acc_iter=22211, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:06, time_cost(all): 7:14:06/1 day, 21:03:35, loss=0.552936121900458, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=0.8201874187972877, lr=0.09486648905120415
2023-11-14 20:51:23   INFO  epoch: 3/24, acc_iter=22261, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:20:15, time_cost(all): 7:15:05/1 day, 20:23:31, loss=0.552825179752282, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=1.496036402269441, lr=0.09482639727846356
2023-11-14 20:52:22   INFO  epoch: 3/24, acc_iter=22311, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:10, time_cost(all): 7:16:04/1 day, 21:36:27, loss=0.552714237604105, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=2.3256462689589736, lr=0.09478630550572298
2023-11-14 20:53:21   INFO  epoch: 3/24, acc_iter=22361, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:49, time_cost(all): 7:17:03/1 day, 19:58:54, loss=0.552603295455928, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=3.5076842542496656, lr=0.09474621373298238
2023-11-14 20:54:20   INFO  epoch: 3/24, acc_iter=22411, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:19:34, time_cost(all): 7:18:02/1 day, 21:00:35, loss=0.552492353307751, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=0.74342846238286, lr=0.0947061219602418
2023-11-14 20:55:19   INFO  epoch: 3/24, acc_iter=22461, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:18:04, time_cost(all): 7:19:01/1 day, 20:41:21, loss=0.552381411159575, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.111994253636788, lr=0.09466603018750121
2023-11-14 20:56:18   INFO  epoch: 3/24, acc_iter=22511, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:36, time_cost(all): 7:20:00/1 day, 20:24:13, loss=0.552270469011398, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.7804260323721048, lr=0.09462593841476062
2023-11-14 20:57:17   INFO  epoch: 3/24, acc_iter=22561, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:17, time_cost(all): 7:20:59/1 day, 19:46:36, loss=0.552159526863221, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.6620693972431115, lr=0.09458584664202004
2023-11-14 20:58:15   INFO  epoch: 3/24, acc_iter=22611, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:12:56, time_cost(all): 7:21:57/1 day, 19:39:25, loss=0.552048584715044, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.16(1.03), norm=1.1302112936747117, lr=0.09454575486927945
2023-11-14 20:59:14   INFO  epoch: 3/24, acc_iter=22661, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:12:11, time_cost(all): 7:22:56/1 day, 19:14:37, loss=0.551937642566868, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=3.901484418776015, lr=0.09450566309653885
2023-11-14 21:00:13   INFO  epoch: 3/24, acc_iter=22711, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:10:19, time_cost(all): 7:23:55/1 day, 22:26:23, loss=0.551826700418691, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=3.6582454571026353, lr=0.09446557132379826
2023-11-14 21:01:12   INFO  epoch: 3/24, acc_iter=22761, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:37, time_cost(all): 7:24:54/1 day, 20:22:30, loss=0.551715758270514, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=2.2133571913789076, lr=0.09442547955105768
2023-11-14 21:02:11   INFO  epoch: 3/24, acc_iter=22811, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:09:06, time_cost(all): 7:25:53/1 day, 20:19:44, loss=0.551604816122337, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=1.2508597200045846, lr=0.09438538777831709
2023-11-14 21:03:10   INFO  epoch: 3/24, acc_iter=22861, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:07:32, time_cost(all): 7:26:52/1 day, 19:01:02, loss=0.551493873974161, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=0.9828995463035897, lr=0.0943452960055765
2023-11-14 21:04:09   INFO  epoch: 3/24, acc_iter=22911, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:05:36, time_cost(all): 7:27:51/1 day, 21:16:13, loss=0.551382931825984, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=2.396792012979551, lr=0.09430520423283592
2023-11-14 21:05:08   INFO  epoch: 3/24, acc_iter=22961, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:06:30, time_cost(all): 7:28:50/1 day, 18:12:15, loss=0.551271989677807, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=1.186411401698668, lr=0.09426511246009533
2023-11-14 21:06:07   INFO  epoch: 3/24, acc_iter=23011, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:06:01, time_cost(all): 7:29:49/1 day, 20:25:05, loss=0.55116104752963, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=0.5687501640013223, lr=0.09422502068735474
2023-11-14 21:07:06   INFO  epoch: 3/24, acc_iter=23061, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:08, time_cost(all): 7:30:48/1 day, 21:19:19, loss=0.551050105381454, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.2291494432920516, lr=0.09418492891461415
2023-11-14 21:08:05   INFO  epoch: 3/24, acc_iter=23111, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:17, time_cost(all): 7:31:47/1 day, 20:49:31, loss=0.550939163233277, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=2.2921739506058225, lr=0.09414483714187356
2023-11-14 21:09:04   INFO  epoch: 3/24, acc_iter=23161, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:29, time_cost(all): 7:32:46/1 day, 19:43:53, loss=0.5508282210851, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=2.565146948809, lr=0.09410474536913298
2023-11-14 21:10:03   INFO  epoch: 3/24, acc_iter=23211, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:10, time_cost(all): 7:33:45/1 day, 19:42:12, loss=0.550717278936923, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.508969774519819, lr=0.09406465359639239
2023-11-14 21:11:02   INFO  epoch: 3/24, acc_iter=23261, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:59:36, time_cost(all): 7:34:44/1 day, 19:42:00, loss=0.550606336788747, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=0.8047495009719876, lr=0.0940245618236518
2023-11-14 21:12:01   INFO  epoch: 3/24, acc_iter=23311, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:59:52, time_cost(all): 7:35:43/1 day, 18:49:30, loss=0.55049539464057, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.97(1.03), norm=1.499974712298167, lr=0.0939844700509112
2023-11-14 21:12:59   INFO  epoch: 3/24, acc_iter=23361, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:57:40, time_cost(all): 7:36:41/1 day, 20:49:41, loss=0.550384452492393, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=3.2521098161883093, lr=0.09394437827817062
2023-11-14 21:13:58   INFO  epoch: 3/24, acc_iter=23411, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/1:00:07, time_cost(all): 7:37:40/1 day, 21:44:13, loss=0.550273510344216, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=4.2200055861294885, lr=0.09390428650543003
2023-11-14 21:14:57   INFO  epoch: 3/24, acc_iter=23461, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:55, time_cost(all): 7:38:39/1 day, 22:12:25, loss=0.55016256819604, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=4.198856679045962, lr=0.09386419473268945
2023-11-14 21:15:56   INFO  epoch: 3/24, acc_iter=23511, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:57:02, time_cost(all): 7:39:38/1 day, 19:53:23, loss=0.550051626047863, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=0.8836930183608779, lr=0.09382410295994886
2023-11-14 21:16:55   INFO  epoch: 3/24, acc_iter=23561, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:40, time_cost(all): 7:40:37/1 day, 19:17:35, loss=0.549940683899686, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=2.387771816633978, lr=0.09378401118720828
2023-11-14 21:17:54   INFO  epoch: 3/24, acc_iter=23611, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:56:10, time_cost(all): 7:41:36/1 day, 18:41:05, loss=0.549829741751509, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=1.4344436006478634, lr=0.09374391941446769
2023-11-14 21:18:53   INFO  epoch: 3/24, acc_iter=23661, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:22, time_cost(all): 7:42:35/1 day, 18:34:51, loss=0.549718799603333, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=4.6532421794461865, lr=0.09370382764172709
2023-11-14 21:19:52   INFO  epoch: 3/24, acc_iter=23711, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:51:02, time_cost(all): 7:43:34/1 day, 19:00:20, loss=0.549607857455156, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=3.652528342528157, lr=0.0936637358689865
2023-11-14 21:20:51   INFO  epoch: 3/24, acc_iter=23761, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:39, time_cost(all): 7:44:33/1 day, 21:19:44, loss=0.549496915306979, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=0.8938140949060507, lr=0.09362364409624592
2023-11-14 21:21:50   INFO  epoch: 3/24, acc_iter=23811, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:52:09, time_cost(all): 7:45:32/1 day, 19:16:59, loss=0.549385973158802, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=0.972157676642158, lr=0.09358355232350533
2023-11-14 21:22:49   INFO  epoch: 3/24, acc_iter=23861, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:50:54, time_cost(all): 7:46:31/1 day, 21:42:00, loss=0.549275031010626, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=1.0274077422892707, lr=0.09354346055076475
2023-11-14 21:23:48   INFO  epoch: 3/24, acc_iter=23911, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:50:12, time_cost(all): 7:47:30/1 day, 21:18:54, loss=0.549164088862449, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=4.7793573150832716, lr=0.09350336877802415
2023-11-14 21:24:47   INFO  epoch: 3/24, acc_iter=23961, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:47:19, time_cost(all): 7:48:29/1 day, 17:44:59, loss=0.549053146714272, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=2.1776365486964404, lr=0.09346327700528356
2023-11-14 21:25:46   INFO  epoch: 3/24, acc_iter=24011, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:32, time_cost(all): 7:49:28/1 day, 21:24:58, loss=0.548942204566095, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.9645568914269935, lr=0.09342318523254298
2023-11-14 21:26:44   INFO  epoch: 3/24, acc_iter=24061, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:44:55, time_cost(all): 7:50:26/1 day, 21:52:12, loss=0.548831262417919, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.2312156216412413, lr=0.09338309345980239
2023-11-14 21:27:43   INFO  epoch: 3/24, acc_iter=24111, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:42:39, time_cost(all): 7:51:25/1 day, 21:12:47, loss=0.548720320269742, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=1.9431429595221124, lr=0.0933430016870618
2023-11-14 21:28:42   INFO  epoch: 3/24, acc_iter=24161, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:06, time_cost(all): 7:52:24/1 day, 19:29:37, loss=0.548609378121565, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=4.4391418405267835, lr=0.09330290991432122
2023-11-14 21:29:41   INFO  epoch: 3/24, acc_iter=24211, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:41:46, time_cost(all): 7:53:23/1 day, 21:41:11, loss=0.548498435973388, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.2231266075622718, lr=0.09326281814158063
2023-11-14 21:30:40   INFO  epoch: 3/24, acc_iter=24261, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:41:15, time_cost(all): 7:54:22/1 day, 20:28:32, loss=0.548387493825212, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=2.4052381060407106, lr=0.09322272636884003
2023-11-14 21:31:39   INFO  epoch: 3/24, acc_iter=24311, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:55, time_cost(all): 7:55:21/1 day, 21:20:36, loss=0.548276551677035, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=1.563697444737492, lr=0.09318263459609945
2023-11-14 21:32:38   INFO  epoch: 3/24, acc_iter=24361, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:07, time_cost(all): 7:56:20/1 day, 20:27:48, loss=0.548165609528858, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=2.7238241628105913, lr=0.09314254282335886
2023-11-14 21:33:37   INFO  epoch: 3/24, acc_iter=24411, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:37, time_cost(all): 7:57:19/1 day, 20:09:02, loss=0.548054667380681, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=1.697379737821718, lr=0.09310245105061828
2023-11-14 21:34:36   INFO  epoch: 3/24, acc_iter=24461, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:35, time_cost(all): 7:58:18/1 day, 21:51:55, loss=0.547943725232505, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=4.372674590908328, lr=0.09306235927787769
2023-11-14 21:35:35   INFO  epoch: 3/24, acc_iter=24511, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:54, time_cost(all): 7:59:17/1 day, 20:58:00, loss=0.547832783084328, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.7689214132111273, lr=0.0930222675051371
2023-11-14 21:36:34   INFO  epoch: 3/24, acc_iter=24561, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:33:58, time_cost(all): 8:00:16/1 day, 20:21:22, loss=0.547721840936151, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.5095796888096418, lr=0.0929821757323965
2023-11-14 21:37:33   INFO  epoch: 3/24, acc_iter=24611, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:35:45, time_cost(all): 8:01:15/1 day, 21:28:05, loss=0.547610898787974, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=4.020825831166459, lr=0.09294208395965592
2023-11-14 21:38:32   INFO  epoch: 3/24, acc_iter=24661, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:59, time_cost(all): 8:02:14/1 day, 20:27:07, loss=0.547499956639798, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=4.421427736210764, lr=0.09290199218691533
2023-11-14 21:39:31   INFO  epoch: 3/24, acc_iter=24711, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:33:15, time_cost(all): 8:03:13/1 day, 17:55:40, loss=0.547389014491621, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=1.4279892144224273, lr=0.09286190041417475
2023-11-14 21:40:29   INFO  epoch: 3/24, acc_iter=24761, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:05, time_cost(all): 8:04:11/1 day, 18:32:09, loss=0.547278072343444, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=2.257563288764402, lr=0.09282180864143416
2023-11-14 21:41:28   INFO  epoch: 3/24, acc_iter=24811, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:38, time_cost(all): 8:05:10/1 day, 18:33:14, loss=0.547167130195267, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=4.015647657181386, lr=0.09278171686869358
2023-11-14 21:42:27   INFO  epoch: 3/24, acc_iter=24861, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:45, time_cost(all): 8:06:09/1 day, 17:48:12, loss=0.547056188047091, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=2.1927414201896402, lr=0.09274162509595298
2023-11-14 21:43:26   INFO  epoch: 3/24, acc_iter=24911, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:42, time_cost(all): 8:07:08/1 day, 18:24:08, loss=0.546945245898914, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=4.479909744060615, lr=0.09270153332321239
2023-11-14 21:44:25   INFO  epoch: 3/24, acc_iter=24961, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:28:04, time_cost(all): 8:08:07/1 day, 21:42:50, loss=0.546834303750737, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.9663149639342554, lr=0.0926614415504718
2023-11-14 21:45:24   INFO  epoch: 3/24, acc_iter=25011, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:27:23, time_cost(all): 8:09:06/1 day, 18:55:09, loss=0.54672336160256, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=3.5154078084639986, lr=0.09262134977773122
2023-11-14 21:46:23   INFO  epoch: 3/24, acc_iter=25061, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:00, time_cost(all): 8:10:05/1 day, 20:04:04, loss=0.546612419454384, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.89(1.03), norm=4.686490386001266, lr=0.09258125800499063
2023-11-14 21:47:22   INFO  epoch: 3/24, acc_iter=25111, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:25, time_cost(all): 8:11:04/1 day, 18:30:57, loss=0.546501477306207, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.404515144075496, lr=0.09254116623225005
2023-11-14 21:48:21   INFO  epoch: 3/24, acc_iter=25161, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:41, time_cost(all): 8:12:03/1 day, 21:33:39, loss=0.54639053515803, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=3.288009074886211, lr=0.09250107445950945
2023-11-14 21:49:20   INFO  epoch: 3/24, acc_iter=25211, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:56, time_cost(all): 8:13:02/1 day, 19:03:20, loss=0.546279593009853, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=2.80865409108043, lr=0.09246098268676886
2023-11-14 21:50:19   INFO  epoch: 3/24, acc_iter=25261, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:28, time_cost(all): 8:14:01/1 day, 19:39:38, loss=0.546168650861677, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=2.320466091236586, lr=0.09242089091402828
2023-11-14 21:51:18   INFO  epoch: 3/24, acc_iter=25311, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:21:08, time_cost(all): 8:15:00/1 day, 18:49:31, loss=0.5460577087135, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=2.4022077151526258, lr=0.09238079914128769
2023-11-14 21:52:17   INFO  epoch: 3/24, acc_iter=25361, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:10, time_cost(all): 8:15:59/1 day, 20:54:11, loss=0.545946766565323, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=4.1556605394653845, lr=0.0923407073685471
2023-11-14 21:53:16   INFO  epoch: 3/24, acc_iter=25411, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:38, time_cost(all): 8:16:58/1 day, 19:08:17, loss=0.545835824417146, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.8414956353391858, lr=0.09230061559580652
2023-11-14 21:54:14   INFO  epoch: 3/24, acc_iter=25461, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:43, time_cost(all): 8:17:56/1 day, 20:16:49, loss=0.54572488226897, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=4.710357432984046, lr=0.09226052382306593
2023-11-14 21:55:13   INFO  epoch: 3/24, acc_iter=25511, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:59, time_cost(all): 8:18:55/1 day, 20:53:13, loss=0.545613940120793, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=2.958727889986509, lr=0.09222043205032533
2023-11-14 21:56:12   INFO  epoch: 3/24, acc_iter=25561, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:16:00, time_cost(all): 8:19:54/1 day, 21:04:24, loss=0.545502997972616, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.448056103380572, lr=0.09218034027758475
2023-11-14 21:57:11   INFO  epoch: 3/24, acc_iter=25611, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:17, time_cost(all): 8:20:53/1 day, 21:05:58, loss=0.545392055824439, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=0.8920828991792733, lr=0.09214024850484416
2023-11-14 21:58:10   INFO  epoch: 3/24, acc_iter=25661, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:54, time_cost(all): 8:21:52/1 day, 17:57:30, loss=0.545281113676263, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.300354741727756, lr=0.09210015673210357
2023-11-14 21:59:09   INFO  epoch: 3/24, acc_iter=25711, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:49, time_cost(all): 8:22:51/1 day, 20:26:05, loss=0.545170171528086, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=4.099324382163839, lr=0.09206006495936299
2023-11-14 22:00:08   INFO  epoch: 3/24, acc_iter=25761, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:16, time_cost(all): 8:23:50/1 day, 21:20:12, loss=0.545059229379909, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=2.4679656080179124, lr=0.0920199731866224
2023-11-14 22:01:07   INFO  epoch: 3/24, acc_iter=25811, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:31, time_cost(all): 8:24:49/1 day, 20:48:33, loss=0.544948287231732, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=4.676173170267697, lr=0.0919798814138818
2023-11-14 22:02:06   INFO  epoch: 3/24, acc_iter=25861, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:24, time_cost(all): 8:25:48/1 day, 17:31:34, loss=0.544837345083556, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.0734330803047318, lr=0.09193978964114122
2023-11-14 22:03:05   INFO  epoch: 3/24, acc_iter=25911, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:22, time_cost(all): 8:26:47/1 day, 20:49:39, loss=0.544726402935379, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=1.8871314682870501, lr=0.09189969786840063
2023-11-14 22:04:04   INFO  epoch: 3/24, acc_iter=25961, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:15, time_cost(all): 8:27:46/1 day, 18:32:58, loss=0.544615460787202, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.4629434050931704, lr=0.09185960609566005
2023-11-14 22:05:03   INFO  epoch: 3/24, acc_iter=26011, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:45, time_cost(all): 8:28:45/1 day, 19:11:31, loss=0.544504518639025, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=2.7713037412081456, lr=0.09181951432291946
2023-11-14 22:06:02   INFO  epoch: 3/24, acc_iter=26061, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:45, time_cost(all): 8:29:44/1 day, 19:32:39, loss=0.544393576490849, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.76170197028555, lr=0.09177942255017887
2023-11-14 22:07:01   INFO  epoch: 3/24, acc_iter=26111, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:40, time_cost(all): 8:30:43/1 day, 19:53:00, loss=0.544282634342672, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=2.5048956597388274, lr=0.09173933077743829
2023-11-14 22:07:59   INFO  epoch: 3/24, acc_iter=26161, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:50, time_cost(all): 8:31:41/1 day, 20:51:06, loss=0.544171692194495, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=4.8773364961173975, lr=0.09169923900469769
2023-11-14 22:08:58   INFO  epoch: 3/24, acc_iter=26211, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:44, time_cost(all): 8:32:40/1 day, 18:29:41, loss=0.544060750046318, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=0.8093712479097934, lr=0.0916591472319571
2023-11-14 22:09:57   INFO  epoch: 3/24, acc_iter=26261, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 8:33:39/1 day, 17:45:04, loss=0.543949807898142, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.50653091805975, lr=0.09161905545921652
2023-11-14 22:10:56   INFO  epoch: 3/24, acc_iter=26311, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:44, time_cost(all): 8:34:38/1 day, 17:49:03, loss=0.543838865749965, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=1.0897806692632468, lr=0.09157896368647593
2023-11-14 22:11:55   INFO  epoch: 4/24, acc_iter=26398, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:07:58, time_cost(all): 8:35:37/1 day, 20:57:44, loss=0.543645826412137, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=3.958476835891721, lr=0.0915092040019073
2023-11-14 22:12:54   INFO  epoch: 4/24, acc_iter=26448, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:11:42, time_cost(all): 8:36:36/1 day, 18:15:18, loss=0.543534884263961, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.1072573624293285, lr=0.09146911222916672
2023-11-14 22:13:53   INFO  epoch: 4/24, acc_iter=26498, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:03:08, time_cost(all): 8:37:35/1 day, 20:16:55, loss=0.543423942115784, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.49213116954771, lr=0.09142902045642613
2023-11-14 22:14:52   INFO  epoch: 4/24, acc_iter=26548, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:04:12, time_cost(all): 8:38:34/1 day, 18:08:13, loss=0.543312999967607, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=2.462262994471907, lr=0.09138892868368555
2023-11-14 22:15:51   INFO  epoch: 4/24, acc_iter=26598, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:02:48, time_cost(all): 8:39:33/1 day, 20:00:32, loss=0.54320205781943, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=1.7608020203152435, lr=0.09134883691094495
2023-11-14 22:16:50   INFO  epoch: 4/24, acc_iter=26648, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:32, time_cost(all): 8:40:32/1 day, 19:04:08, loss=0.543091115671254, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=3.4561782822904137, lr=0.09130874513820436
2023-11-14 22:17:49   INFO  epoch: 4/24, acc_iter=26698, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/1:58:18, time_cost(all): 8:41:31/1 day, 20:10:14, loss=0.542980173523077, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=0.6351961707427152, lr=0.09126865336546378
2023-11-14 22:18:48   INFO  epoch: 4/24, acc_iter=26748, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:56:43, time_cost(all): 8:42:30/1 day, 19:55:23, loss=0.5428692313749, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=4.096335682212162, lr=0.09122856159272319
2023-11-14 22:19:47   INFO  epoch: 4/24, acc_iter=26798, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:06:04, time_cost(all): 8:43:29/1 day, 19:59:11, loss=0.542758289226723, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.9726935470072235, lr=0.0911884698199826
2023-11-14 22:20:46   INFO  epoch: 4/24, acc_iter=26848, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:02:00, time_cost(all): 8:44:28/1 day, 19:03:40, loss=0.542647347078547, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=4.470483500956964, lr=0.09114837804724202
2023-11-14 22:21:44   INFO  epoch: 4/24, acc_iter=26898, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:55:40, time_cost(all): 8:45:26/1 day, 17:07:36, loss=0.54253640493037, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=3.7635807373526284, lr=0.09110828627450143
2023-11-14 22:22:43   INFO  epoch: 4/24, acc_iter=26948, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:54:40, time_cost(all): 8:46:25/1 day, 20:56:47, loss=0.542425462782193, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=3.362382524529392, lr=0.09106819450176083
2023-11-14 22:23:42   INFO  epoch: 4/24, acc_iter=26998, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:52:15, time_cost(all): 8:47:24/1 day, 18:13:31, loss=0.542314520634016, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=3.134119076974771, lr=0.09102810272902025
2023-11-14 22:24:41   INFO  epoch: 4/24, acc_iter=27048, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:53:47, time_cost(all): 8:48:23/1 day, 17:43:32, loss=0.54220357848584, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=0.8513966333608115, lr=0.09098801095627966
2023-11-14 22:25:40   INFO  epoch: 4/24, acc_iter=27098, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:52:44, time_cost(all): 8:49:22/1 day, 18:09:01, loss=0.542092636337663, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.3928332489531807, lr=0.09094791918353907
2023-11-14 22:26:39   INFO  epoch: 4/24, acc_iter=27148, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:52:30, time_cost(all): 8:50:21/1 day, 18:12:57, loss=0.541981694189486, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.4015109774042234, lr=0.09090782741079849
2023-11-14 22:27:38   INFO  epoch: 4/24, acc_iter=27198, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:48:12, time_cost(all): 8:51:20/1 day, 16:49:58, loss=0.541870752041309, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.8771880255463018, lr=0.09086773563805789
2023-11-14 22:28:37   INFO  epoch: 4/24, acc_iter=27248, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:56:26, time_cost(all): 8:52:19/1 day, 19:44:06, loss=0.541759809893133, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=4.326463548479072, lr=0.0908276438653173
2023-11-14 22:29:36   INFO  epoch: 4/24, acc_iter=27298, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:52:30, time_cost(all): 8:53:18/1 day, 18:45:14, loss=0.541648867744956, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=0.5795554806175194, lr=0.09078755209257672
2023-11-14 22:30:35   INFO  epoch: 4/24, acc_iter=27348, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:45:07, time_cost(all): 8:54:17/1 day, 20:43:49, loss=0.541537925596779, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=4.4083995751957445, lr=0.09074746031983613
2023-11-14 22:31:34   INFO  epoch: 4/24, acc_iter=27398, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:50:38, time_cost(all): 8:55:16/1 day, 16:44:05, loss=0.541426983448602, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=3.8416154700182528, lr=0.09070736854709555
2023-11-14 22:32:33   INFO  epoch: 4/24, acc_iter=27448, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:46:25, time_cost(all): 8:56:15/1 day, 17:24:28, loss=0.541316041300426, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=2.7958351694271357, lr=0.09066727677435496
2023-11-14 22:33:32   INFO  epoch: 4/24, acc_iter=27498, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:46:20, time_cost(all): 8:57:14/1 day, 18:07:19, loss=0.541205099152249, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=4.7611084434061866, lr=0.09062718500161437
2023-11-14 22:34:31   INFO  epoch: 4/24, acc_iter=27548, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:42:42, time_cost(all): 8:58:13/1 day, 17:42:46, loss=0.541094157004072, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.2828833730442049, lr=0.09058709322887377
2023-11-14 22:35:29   INFO  epoch: 4/24, acc_iter=27598, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:44:58, time_cost(all): 8:59:11/1 day, 18:44:36, loss=0.540983214855895, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=4.440093191762873, lr=0.09054700145613319
2023-11-14 22:36:28   INFO  epoch: 4/24, acc_iter=27648, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:41:48, time_cost(all): 9:00:10/1 day, 19:29:36, loss=0.540872272707719, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=1.8462901346980525, lr=0.0905069096833926
2023-11-14 22:37:27   INFO  epoch: 4/24, acc_iter=27698, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:43:59, time_cost(all): 9:01:09/1 day, 17:57:21, loss=0.540761330559542, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=4.415384395059085, lr=0.09046681791065202
2023-11-14 22:38:26   INFO  epoch: 4/24, acc_iter=27748, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:46:34, time_cost(all): 9:02:08/1 day, 19:43:33, loss=0.540650388411365, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.8766336997479889, lr=0.09042672613791143
2023-11-14 22:39:25   INFO  epoch: 4/24, acc_iter=27798, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:39:06, time_cost(all): 9:03:07/1 day, 17:23:08, loss=0.540539446263188, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=4.810176946557173, lr=0.09038663436517085
2023-11-14 22:40:24   INFO  epoch: 4/24, acc_iter=27848, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:42:18, time_cost(all): 9:04:06/1 day, 20:09:27, loss=0.540428504115012, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=3.998800475611813, lr=0.09034654259243025
2023-11-14 22:41:23   INFO  epoch: 4/24, acc_iter=27898, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:34:18, time_cost(all): 9:05:05/1 day, 17:27:50, loss=0.540317561966835, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=4.973091628414985, lr=0.09030645081968966
2023-11-14 22:42:22   INFO  epoch: 4/24, acc_iter=27948, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:37:17, time_cost(all): 9:06:04/1 day, 18:42:57, loss=0.540206619818658, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=4.034427054839044, lr=0.09026635904694907
2023-11-14 22:43:21   INFO  epoch: 4/24, acc_iter=27998, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:32:56, time_cost(all): 9:07:03/1 day, 18:02:43, loss=0.540095677670481, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.4255910309134423, lr=0.09022626727420849
2023-11-14 22:44:20   INFO  epoch: 4/24, acc_iter=28048, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:31:41, time_cost(all): 9:08:02/1 day, 18:56:10, loss=0.539984735522305, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.84943419163006, lr=0.0901861755014679
2023-11-14 22:45:19   INFO  epoch: 4/24, acc_iter=28098, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:35:48, time_cost(all): 9:09:01/1 day, 16:32:14, loss=0.539873793374128, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=2.0557561540138174, lr=0.09014608372872732
2023-11-14 22:46:18   INFO  epoch: 4/24, acc_iter=28148, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:29:22, time_cost(all): 9:10:00/1 day, 16:42:27, loss=0.539762851225951, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=0.7877058783020834, lr=0.09010599195598673
2023-11-14 22:47:17   INFO  epoch: 4/24, acc_iter=28198, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:36:54, time_cost(all): 9:10:59/1 day, 17:12:24, loss=0.539651909077774, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=4.989256032810024, lr=0.09006590018324613
2023-11-14 22:48:16   INFO  epoch: 4/24, acc_iter=28248, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:28:59, time_cost(all): 9:11:58/1 day, 17:03:14, loss=0.539540966929598, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=3.734594551049041, lr=0.09002580841050555
2023-11-14 22:49:14   INFO  epoch: 4/24, acc_iter=28298, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:30:06, time_cost(all): 9:12:56/1 day, 19:46:54, loss=0.539430024781421, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=2.12521147227372, lr=0.08998571663776496
2023-11-14 22:50:13   INFO  epoch: 4/24, acc_iter=28348, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:33:48, time_cost(all): 9:13:55/1 day, 18:03:38, loss=0.539319082633244, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.542797846267352, lr=0.08994562486502437
2023-11-14 22:51:12   INFO  epoch: 4/24, acc_iter=28398, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:27:24, time_cost(all): 9:14:54/1 day, 19:12:46, loss=0.539208140485067, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=3.7813603704405843, lr=0.08990553309228379
2023-11-14 22:52:11   INFO  epoch: 4/24, acc_iter=28448, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:32:13, time_cost(all): 9:15:53/1 day, 17:51:59, loss=0.539097198336891, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=4.755704414643819, lr=0.0898654413195432
2023-11-14 22:53:10   INFO  epoch: 4/24, acc_iter=28498, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:29:35, time_cost(all): 9:16:52/1 day, 17:08:42, loss=0.538986256188714, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=3.083020705899365, lr=0.0898253495468026
2023-11-14 22:54:09   INFO  epoch: 4/24, acc_iter=28548, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:48, time_cost(all): 9:17:51/1 day, 16:46:58, loss=0.538875314040537, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=4.164898337728243, lr=0.08978525777406202
2023-11-14 22:55:08   INFO  epoch: 4/24, acc_iter=28598, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:28, time_cost(all): 9:18:50/1 day, 20:00:29, loss=0.53876437189236, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=1.348016862377668, lr=0.08974516600132143
2023-11-14 22:56:07   INFO  epoch: 4/24, acc_iter=28648, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:20:14, time_cost(all): 9:19:49/1 day, 20:11:31, loss=0.538653429744184, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.3937564762597088, lr=0.08970507422858084
2023-11-14 22:57:06   INFO  epoch: 4/24, acc_iter=28698, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:22:07, time_cost(all): 9:20:48/1 day, 18:58:42, loss=0.538542487596007, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=2.8424593726250946, lr=0.08966498245584026
2023-11-14 22:58:05   INFO  epoch: 4/24, acc_iter=28748, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:24:31, time_cost(all): 9:21:47/1 day, 16:19:00, loss=0.53843154544783, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=0.8121914311636049, lr=0.08962489068309967
2023-11-14 22:59:04   INFO  epoch: 4/24, acc_iter=28798, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:22:28, time_cost(all): 9:22:46/1 day, 19:49:53, loss=0.538320603299653, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.731392953457108, lr=0.08958479891035907
2023-11-14 23:00:03   INFO  epoch: 4/24, acc_iter=28848, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:24:14, time_cost(all): 9:23:45/1 day, 20:20:07, loss=0.538209661151477, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.9927258662888656, lr=0.08954470713761849
2023-11-14 23:01:02   INFO  epoch: 4/24, acc_iter=28898, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:17:34, time_cost(all): 9:24:44/1 day, 19:36:15, loss=0.5380987190033, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=2.102602418697478, lr=0.0895046153648779
2023-11-14 23:02:01   INFO  epoch: 4/24, acc_iter=28948, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:24, time_cost(all): 9:25:43/1 day, 18:14:48, loss=0.537987776855123, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=1.502040081946282, lr=0.08946452359213732
2023-11-14 23:02:59   INFO  epoch: 4/24, acc_iter=28998, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:19:17, time_cost(all): 9:26:41/1 day, 19:14:33, loss=0.537876834706946, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.16(1.03), norm=1.9330701206568675, lr=0.08942443181939673
2023-11-14 23:03:58   INFO  epoch: 4/24, acc_iter=29048, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:12:52, time_cost(all): 9:27:40/1 day, 16:48:03, loss=0.537765892558769, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=2.4830632434850886, lr=0.08938434004665614
2023-11-14 23:04:57   INFO  epoch: 4/24, acc_iter=29098, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:17:28, time_cost(all): 9:28:39/1 day, 18:52:53, loss=0.537654950410593, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=1.6458927827386574, lr=0.08934424827391554
2023-11-14 23:05:56   INFO  epoch: 4/24, acc_iter=29148, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:12:09, time_cost(all): 9:29:38/1 day, 16:42:58, loss=0.537544008262416, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.724315996859638, lr=0.08930415650117496
2023-11-14 23:06:55   INFO  epoch: 4/24, acc_iter=29198, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:13:44, time_cost(all): 9:30:37/1 day, 19:32:15, loss=0.537433066114239, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=0.5657903846716019, lr=0.08926406472843437
2023-11-14 23:07:54   INFO  epoch: 4/24, acc_iter=29248, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:51, time_cost(all): 9:31:36/1 day, 19:37:54, loss=0.537322123966062, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=4.5196282132484615, lr=0.08922397295569379
2023-11-14 23:08:53   INFO  epoch: 4/24, acc_iter=29298, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:11:09, time_cost(all): 9:32:35/1 day, 19:16:11, loss=0.537211181817886, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.736351487325634, lr=0.0891838811829532
2023-11-14 23:09:52   INFO  epoch: 4/24, acc_iter=29348, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:07:13, time_cost(all): 9:33:34/1 day, 18:24:24, loss=0.537100239669709, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=3.586936705809706, lr=0.08914378941021261
2023-11-14 23:10:51   INFO  epoch: 4/24, acc_iter=29398, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:15, time_cost(all): 9:34:33/1 day, 17:35:49, loss=0.536989297521532, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=0.8113763461254091, lr=0.08910369763747203
2023-11-14 23:11:50   INFO  epoch: 4/24, acc_iter=29448, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:10:58, time_cost(all): 9:35:32/1 day, 17:55:09, loss=0.536878355373355, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=0.7992100574370079, lr=0.08906360586473143
2023-11-14 23:12:49   INFO  epoch: 4/24, acc_iter=29498, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:07:29, time_cost(all): 9:36:31/1 day, 18:32:16, loss=0.536767413225179, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=0.8386766491542816, lr=0.08902351409199084
2023-11-14 23:13:48   INFO  epoch: 4/24, acc_iter=29548, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:50, time_cost(all): 9:37:30/1 day, 19:03:33, loss=0.536656471077002, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.11(1.03), norm=2.8103703973057783, lr=0.08898342231925026
2023-11-14 23:14:47   INFO  epoch: 4/24, acc_iter=29598, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:33, time_cost(all): 9:38:29/1 day, 18:53:49, loss=0.536545528928825, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=0.9127309749090955, lr=0.08894333054650967
2023-11-14 23:15:46   INFO  epoch: 4/24, acc_iter=29648, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:24, time_cost(all): 9:39:28/1 day, 18:41:06, loss=0.536434586780648, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=4.450024562105315, lr=0.08890323877376909
2023-11-14 23:16:44   INFO  epoch: 4/24, acc_iter=29698, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:05:19, time_cost(all): 9:40:26/1 day, 17:51:16, loss=0.536323644632472, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=4.949850405528872, lr=0.08886314700102849
2023-11-14 23:17:43   INFO  epoch: 4/24, acc_iter=29748, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:02:21, time_cost(all): 9:41:25/1 day, 17:38:16, loss=0.536212702484295, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=4.833812222372971, lr=0.0888230552282879
2023-11-14 23:18:42   INFO  epoch: 4/24, acc_iter=29798, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:00:48, time_cost(all): 9:42:24/1 day, 20:00:38, loss=0.536101760336118, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=3.944687217941018, lr=0.08878296345554731
2023-11-14 23:19:41   INFO  epoch: 4/24, acc_iter=29848, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:57:39, time_cost(all): 9:43:23/1 day, 19:07:49, loss=0.535990818187941, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.767041138007315, lr=0.08874287168280673
2023-11-14 23:20:40   INFO  epoch: 4/24, acc_iter=29898, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:58:57, time_cost(all): 9:44:22/1 day, 16:57:47, loss=0.535879876039765, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=1.9969990542694687, lr=0.08870277991006614
2023-11-14 23:21:39   INFO  epoch: 4/24, acc_iter=29948, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/1:00:13, time_cost(all): 9:45:21/1 day, 17:34:53, loss=0.535768933891588, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=4.811248329334271, lr=0.08866268813732556
2023-11-14 23:22:38   INFO  epoch: 4/24, acc_iter=29998, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:59:48, time_cost(all): 9:46:20/1 day, 19:02:23, loss=0.535657991743411, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.3251745332938705, lr=0.08862259636458497
2023-11-14 23:23:37   INFO  epoch: 4/24, acc_iter=30048, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:53:59, time_cost(all): 9:47:19/1 day, 17:36:15, loss=0.535547049595234, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=1.320618107967881, lr=0.08858250459184439
2023-11-14 23:24:36   INFO  epoch: 4/24, acc_iter=30098, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:53:43, time_cost(all): 9:48:18/1 day, 19:31:21, loss=0.535436107447058, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.294509468429949, lr=0.08854241281910379
2023-11-14 23:25:35   INFO  epoch: 4/24, acc_iter=30148, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:57:02, time_cost(all): 9:49:17/1 day, 19:53:25, loss=0.535325165298881, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=3.2300777819377195, lr=0.0885023210463632
2023-11-14 23:26:34   INFO  epoch: 4/24, acc_iter=30198, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:23, time_cost(all): 9:50:16/1 day, 19:59:14, loss=0.535214223150704, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=1.095621735567333, lr=0.08846222927362261
2023-11-14 23:27:33   INFO  epoch: 4/24, acc_iter=30248, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:07, time_cost(all): 9:51:15/1 day, 17:03:52, loss=0.535103281002527, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=1.1802544827511903, lr=0.08842213750088203
2023-11-14 23:28:32   INFO  epoch: 4/24, acc_iter=30298, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:59, time_cost(all): 9:52:14/1 day, 18:10:43, loss=0.534992338854351, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.7755841746983996, lr=0.08838204572814144
2023-11-14 23:29:31   INFO  epoch: 4/24, acc_iter=30348, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:15, time_cost(all): 9:53:13/1 day, 18:53:05, loss=0.534881396706174, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=1.3741293436390152, lr=0.08834195395540084
2023-11-14 23:30:29   INFO  epoch: 4/24, acc_iter=30398, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:11, time_cost(all): 9:54:11/1 day, 19:40:22, loss=0.534770454557997, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.18(1.03), norm=1.3295955485137132, lr=0.08830186218266026
2023-11-14 23:31:28   INFO  epoch: 4/24, acc_iter=30448, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:47:18, time_cost(all): 9:55:10/1 day, 19:39:25, loss=0.53465951240982, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=1.3242065496996704, lr=0.08826177040991967
2023-11-14 23:32:27   INFO  epoch: 4/24, acc_iter=30498, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:52, time_cost(all): 9:56:09/1 day, 18:41:31, loss=0.534548570261644, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=1.3518608156655458, lr=0.08822167863717909
2023-11-14 23:33:26   INFO  epoch: 4/24, acc_iter=30548, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:45:10, time_cost(all): 9:57:08/1 day, 18:55:20, loss=0.534437628113467, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.23(1.03), norm=4.135300404653277, lr=0.0881815868644385
2023-11-14 23:34:25   INFO  epoch: 4/24, acc_iter=30598, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:54, time_cost(all): 9:58:07/1 day, 16:58:38, loss=0.53432668596529, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=1.8018671051161985, lr=0.08814149509169791
2023-11-14 23:35:24   INFO  epoch: 4/24, acc_iter=30648, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:45:27, time_cost(all): 9:59:06/1 day, 16:26:36, loss=0.534215743817113, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=2.495893228645751, lr=0.08810140331895733
2023-11-14 23:36:23   INFO  epoch: 4/24, acc_iter=30698, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:10, time_cost(all): 10:00:05/1 day, 17:32:13, loss=0.534104801668937, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.7598302171889397, lr=0.08806131154621673
2023-11-14 23:37:22   INFO  epoch: 4/24, acc_iter=30748, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:29, time_cost(all): 10:01:04/1 day, 16:38:25, loss=0.53399385952076, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.550158973899723, lr=0.08802121977347614
2023-11-14 23:38:21   INFO  epoch: 4/24, acc_iter=30798, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:27, time_cost(all): 10:02:03/1 day, 19:42:17, loss=0.533882917372583, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=3.423019214599785, lr=0.08798112800073556
2023-11-14 23:39:20   INFO  epoch: 4/24, acc_iter=30848, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:43:01, time_cost(all): 10:03:02/1 day, 17:27:32, loss=0.533771975224406, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.601243133371392, lr=0.08794103622799497
2023-11-14 23:40:19   INFO  epoch: 4/24, acc_iter=30898, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:44, time_cost(all): 10:04:01/1 day, 17:26:24, loss=0.53366103307623, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=4.568158810982109, lr=0.08790094445525438
2023-11-14 23:41:18   INFO  epoch: 4/24, acc_iter=30948, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:36, time_cost(all): 10:05:00/1 day, 18:45:37, loss=0.533550090928053, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.0437067212202304, lr=0.0878608526825138
2023-11-14 23:42:17   INFO  epoch: 4/24, acc_iter=30998, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:39:21, time_cost(all): 10:05:59/1 day, 19:13:58, loss=0.533439148779876, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=0.8488987630712246, lr=0.0878207609097732
2023-11-14 23:43:16   INFO  epoch: 4/24, acc_iter=31048, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:36:27, time_cost(all): 10:06:58/1 day, 15:36:23, loss=0.533328206631699, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.4194382500918845, lr=0.08778066913703261
2023-11-14 23:44:14   INFO  epoch: 4/24, acc_iter=31098, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:01, time_cost(all): 10:07:56/1 day, 16:47:11, loss=0.533217264483523, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=0.8272128455438743, lr=0.08774057736429203
2023-11-14 23:45:13   INFO  epoch: 4/24, acc_iter=31148, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:27, time_cost(all): 10:08:55/1 day, 18:54:48, loss=0.533106322335346, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=2.5588450404195173, lr=0.08770048559155144
2023-11-14 23:46:12   INFO  epoch: 4/24, acc_iter=31198, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:40, time_cost(all): 10:09:54/1 day, 17:27:23, loss=0.532995380187169, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=1.7633377764694038, lr=0.08766039381881086
2023-11-14 23:47:11   INFO  epoch: 4/24, acc_iter=31248, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:59, time_cost(all): 10:10:53/1 day, 19:12:10, loss=0.532884438038992, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=1.7892579163296687, lr=0.08762030204607027
2023-11-14 23:48:10   INFO  epoch: 4/24, acc_iter=31298, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:33:45, time_cost(all): 10:11:52/1 day, 17:38:08, loss=0.532773495890816, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=2.6691887515986537, lr=0.08758021027332968
2023-11-14 23:49:09   INFO  epoch: 4/24, acc_iter=31348, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:29:37, time_cost(all): 10:12:51/1 day, 18:37:19, loss=0.532662553742639, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=4.384999638864054, lr=0.08754011850058908
2023-11-14 23:50:08   INFO  epoch: 4/24, acc_iter=31398, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:29, time_cost(all): 10:13:50/1 day, 16:45:50, loss=0.532551611594462, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.9124216827992657, lr=0.0875000267278485
2023-11-14 23:51:07   INFO  epoch: 4/24, acc_iter=31448, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:27:56, time_cost(all): 10:14:49/1 day, 17:08:18, loss=0.532440669446285, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.25389078737672, lr=0.08745993495510791
2023-11-14 23:52:06   INFO  epoch: 4/24, acc_iter=31498, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:39, time_cost(all): 10:15:48/1 day, 18:41:39, loss=0.532329727298109, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=2.1257927748420213, lr=0.08741984318236733
2023-11-14 23:53:05   INFO  epoch: 4/24, acc_iter=31548, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:22, time_cost(all): 10:16:47/1 day, 19:14:32, loss=0.532218785149932, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=2.519786072913957, lr=0.08737975140962674
2023-11-14 23:54:04   INFO  epoch: 4/24, acc_iter=31598, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:38, time_cost(all): 10:17:46/1 day, 17:46:33, loss=0.532107843001755, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=1.7702982110317875, lr=0.08733965963688614
2023-11-14 23:55:03   INFO  epoch: 4/24, acc_iter=31648, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:25, time_cost(all): 10:18:45/1 day, 17:09:34, loss=0.531996900853578, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=4.7425775858756225, lr=0.08729956786414556
2023-11-14 23:56:02   INFO  epoch: 4/24, acc_iter=31698, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:00, time_cost(all): 10:19:44/1 day, 15:59:20, loss=0.531885958705402, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=2.02034145911599, lr=0.08725947609140497
2023-11-14 23:57:01   INFO  epoch: 4/24, acc_iter=31748, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:30, time_cost(all): 10:20:43/1 day, 18:15:07, loss=0.531775016557225, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=1.4539551558856405, lr=0.08721938431866438
2023-11-14 23:57:59   INFO  epoch: 4/24, acc_iter=31798, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:16, time_cost(all): 10:21:41/1 day, 15:50:20, loss=0.531664074409048, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=3.2554365168039756, lr=0.0871792925459238
2023-11-14 23:58:58   INFO  epoch: 4/24, acc_iter=31848, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:13, time_cost(all): 10:22:40/1 day, 18:45:47, loss=0.531553132260871, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=0.58409985347457, lr=0.08713920077318321
2023-11-14 23:59:57   INFO  epoch: 4/24, acc_iter=31898, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:09, time_cost(all): 10:23:39/1 day, 16:30:57, loss=0.531442190112695, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=0.7953170990085552, lr=0.08709910900044263
2023-11-15 00:00:56   INFO  epoch: 4/24, acc_iter=31948, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:14, time_cost(all): 10:24:38/1 day, 18:36:39, loss=0.531331247964518, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.2934498155422667, lr=0.08705901722770203
2023-11-15 00:01:55   INFO  epoch: 4/24, acc_iter=31998, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:23, time_cost(all): 10:25:37/1 day, 19:02:34, loss=0.531220305816341, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=0.6376558027683579, lr=0.08701892545496144
2023-11-15 00:02:54   INFO  epoch: 4/24, acc_iter=32048, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:28, time_cost(all): 10:26:36/1 day, 16:07:20, loss=0.531109363668164, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=1.3949980565744826, lr=0.08697883368222085
2023-11-15 00:03:53   INFO  epoch: 4/24, acc_iter=32098, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:17:14, time_cost(all): 10:27:35/1 day, 15:23:54, loss=0.530998421519988, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=3.5142366593871133, lr=0.08693874190948027
2023-11-15 00:04:52   INFO  epoch: 4/24, acc_iter=32148, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:16:10, time_cost(all): 10:28:34/1 day, 17:51:35, loss=0.530887479371811, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=3.53193840078536, lr=0.08689865013673968
2023-11-15 00:05:51   INFO  epoch: 4/24, acc_iter=32198, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:15:08, time_cost(all): 10:29:33/1 day, 16:34:56, loss=0.530776537223634, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=2.4288846743219517, lr=0.0868585583639991
2023-11-15 00:06:50   INFO  epoch: 4/24, acc_iter=32248, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:01, time_cost(all): 10:30:32/1 day, 17:06:48, loss=0.530665595075457, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.2184419077102897, lr=0.0868184665912585
2023-11-15 00:07:49   INFO  epoch: 4/24, acc_iter=32298, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:11:58, time_cost(all): 10:31:31/1 day, 16:29:13, loss=0.530554652927281, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=2.784200299496033, lr=0.08677837481851791
2023-11-15 00:08:48   INFO  epoch: 4/24, acc_iter=32348, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:37, time_cost(all): 10:32:30/1 day, 18:20:52, loss=0.530443710779104, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.518180563224669, lr=0.08673828304577733
2023-11-15 00:09:47   INFO  epoch: 4/24, acc_iter=32398, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:17, time_cost(all): 10:33:29/1 day, 19:10:21, loss=0.530332768630927, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.9748584607824755, lr=0.08669819127303674
2023-11-15 00:10:46   INFO  epoch: 4/24, acc_iter=32448, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:36, time_cost(all): 10:34:28/1 day, 16:39:14, loss=0.53022182648275, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=4.981463791277469, lr=0.08665809950029615
2023-11-15 00:11:44   INFO  epoch: 4/24, acc_iter=32498, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:19, time_cost(all): 10:35:26/1 day, 18:32:03, loss=0.530110884334574, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.616330624067485, lr=0.08661800772755557
2023-11-15 00:12:43   INFO  epoch: 4/24, acc_iter=32548, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:21, time_cost(all): 10:36:25/1 day, 16:39:09, loss=0.529999942186397, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.169711289070432, lr=0.08657791595481498
2023-11-15 00:13:42   INFO  epoch: 4/24, acc_iter=32598, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:23, time_cost(all): 10:37:24/1 day, 17:16:11, loss=0.52988900003822, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=4.076651947513175, lr=0.08653782418207438
2023-11-15 00:14:41   INFO  epoch: 4/24, acc_iter=32648, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:49, time_cost(all): 10:38:23/1 day, 18:29:52, loss=0.529778057890043, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=3.267479983356595, lr=0.0864977324093338
2023-11-15 00:15:40   INFO  epoch: 4/24, acc_iter=32698, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:43, time_cost(all): 10:39:22/1 day, 16:54:48, loss=0.529667115741867, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.19(1.03), norm=2.402598860818684, lr=0.08645764063659321
2023-11-15 00:16:39   INFO  epoch: 4/24, acc_iter=32748, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:45, time_cost(all): 10:40:21/1 day, 16:16:23, loss=0.52955617359369, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=1.70182824286531, lr=0.08641754886385263
2023-11-15 00:17:38   INFO  epoch: 4/24, acc_iter=32798, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:38, time_cost(all): 10:41:20/1 day, 15:39:05, loss=0.529445231445513, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=3.3246264054589063, lr=0.08637745709111204
2023-11-15 00:18:37   INFO  epoch: 4/24, acc_iter=32848, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:45, time_cost(all): 10:42:19/1 day, 18:41:29, loss=0.529334289297336, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.1590128344442725, lr=0.08633736531837144
2023-11-15 00:19:36   INFO  epoch: 4/24, acc_iter=32898, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 10:43:18/1 day, 18:07:50, loss=0.52922334714916, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=1.34885275719463, lr=0.08629727354563085
2023-11-15 00:20:35   INFO  epoch: 5/24, acc_iter=32985, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:10:01, time_cost(all): 10:44:17/1 day, 18:46:28, loss=0.529030307811332, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=1.6196692998805282, lr=0.08622751386106224
2023-11-15 00:21:34   INFO  epoch: 5/24, acc_iter=33035, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:03:06, time_cost(all): 10:45:16/1 day, 16:32:06, loss=0.528919365663155, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.145125412676925, lr=0.08618742208832166
2023-11-15 00:22:33   INFO  epoch: 5/24, acc_iter=33085, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:06:42, time_cost(all): 10:46:15/1 day, 18:53:48, loss=0.528808423514979, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=3.8737683556723974, lr=0.08614733031558106
2023-11-15 00:23:32   INFO  epoch: 5/24, acc_iter=33135, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:05:31, time_cost(all): 10:47:14/1 day, 17:27:26, loss=0.528697481366802, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=1.1549357086343237, lr=0.08610723854284047
2023-11-15 00:24:31   INFO  epoch: 5/24, acc_iter=33185, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:00:17, time_cost(all): 10:48:13/1 day, 15:37:09, loss=0.528586539218625, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=3.162804135949351, lr=0.08606714677009988
2023-11-15 00:25:29   INFO  epoch: 5/24, acc_iter=33235, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:07:26, time_cost(all): 10:49:11/1 day, 15:14:04, loss=0.528475597070448, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.103334190355007, lr=0.0860270549973593
2023-11-15 00:26:28   INFO  epoch: 5/24, acc_iter=33285, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/1:56:53, time_cost(all): 10:50:10/1 day, 16:03:15, loss=0.528364654922272, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=3.6301257839356857, lr=0.08598696322461871
2023-11-15 00:27:27   INFO  epoch: 5/24, acc_iter=33335, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:01:19, time_cost(all): 10:51:09/1 day, 18:12:20, loss=0.528253712774095, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=4.854401580024739, lr=0.08594687145187813
2023-11-15 00:28:26   INFO  epoch: 5/24, acc_iter=33385, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:58:58, time_cost(all): 10:52:08/1 day, 16:50:41, loss=0.528142770625918, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=2.8454875292858994, lr=0.08590677967913753
2023-11-15 00:29:25   INFO  epoch: 5/24, acc_iter=33435, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:58:38, time_cost(all): 10:53:07/1 day, 18:01:41, loss=0.528031828477741, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=1.8962804847401196, lr=0.08586668790639694
2023-11-15 00:30:24   INFO  epoch: 5/24, acc_iter=33485, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:02:01, time_cost(all): 10:54:06/1 day, 15:25:44, loss=0.527920886329565, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.040765997398824, lr=0.08582659613365635
2023-11-15 00:31:23   INFO  epoch: 5/24, acc_iter=33535, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:53:02, time_cost(all): 10:55:05/1 day, 16:38:12, loss=0.527809944181388, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=0.6828033359254734, lr=0.08578650436091577
2023-11-15 00:32:22   INFO  epoch: 5/24, acc_iter=33585, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:57:57, time_cost(all): 10:56:04/1 day, 17:00:05, loss=0.527699002033211, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=3.636226671264506, lr=0.08574641258817518
2023-11-15 00:33:21   INFO  epoch: 5/24, acc_iter=33635, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:51:32, time_cost(all): 10:57:03/1 day, 16:59:11, loss=0.527588059885034, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=3.9099001570406315, lr=0.0857063208154346
2023-11-15 00:34:20   INFO  epoch: 5/24, acc_iter=33685, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:50:22, time_cost(all): 10:58:02/1 day, 18:39:14, loss=0.527477117736858, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=4.585807102415506, lr=0.085666229042694
2023-11-15 00:35:19   INFO  epoch: 5/24, acc_iter=33735, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:55:59, time_cost(all): 10:59:01/1 day, 16:04:53, loss=0.527366175588681, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=3.957181195913148, lr=0.08562613726995341
2023-11-15 00:36:18   INFO  epoch: 5/24, acc_iter=33785, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:55:46, time_cost(all): 11:00:00/1 day, 15:45:14, loss=0.527255233440504, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.832031557691289, lr=0.08558604549721283
2023-11-15 00:37:17   INFO  epoch: 5/24, acc_iter=33835, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:50:51, time_cost(all): 11:00:59/1 day, 16:02:11, loss=0.527144291292327, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=1.3391913038561911, lr=0.08554595372447224
2023-11-15 00:38:16   INFO  epoch: 5/24, acc_iter=33885, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:45:30, time_cost(all): 11:01:58/1 day, 15:20:37, loss=0.527033349144151, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.265890094660387, lr=0.08550586195173165
2023-11-15 00:39:14   INFO  epoch: 5/24, acc_iter=33935, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:47:21, time_cost(all): 11:02:56/1 day, 14:54:09, loss=0.526922406995974, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.202442484759236, lr=0.08546577017899107
2023-11-15 00:40:13   INFO  epoch: 5/24, acc_iter=33985, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:50:16, time_cost(all): 11:03:55/1 day, 16:02:02, loss=0.526811464847797, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.81744973566651, lr=0.08542567840625047
2023-11-15 00:41:12   INFO  epoch: 5/24, acc_iter=34035, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:52, time_cost(all): 11:04:54/1 day, 14:48:33, loss=0.52670052269962, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=3.154777449593926, lr=0.08538558663350988
2023-11-15 00:42:11   INFO  epoch: 5/24, acc_iter=34085, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:46:41, time_cost(all): 11:05:53/1 day, 15:02:24, loss=0.526589580551444, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=0.621093415816564, lr=0.0853454948607693
2023-11-15 00:43:10   INFO  epoch: 5/24, acc_iter=34135, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:49:37, time_cost(all): 11:06:52/1 day, 14:40:54, loss=0.526478638403267, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=4.490087242701798, lr=0.08530540308802871
2023-11-15 00:44:09   INFO  epoch: 5/24, acc_iter=34185, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:43:33, time_cost(all): 11:07:51/1 day, 16:25:13, loss=0.52636769625509, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=4.321410326349417, lr=0.08526531131528813
2023-11-15 00:45:08   INFO  epoch: 5/24, acc_iter=34235, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:40:46, time_cost(all): 11:08:50/1 day, 16:58:52, loss=0.526256754106913, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=3.489237894062697, lr=0.08522521954254754
2023-11-15 00:46:07   INFO  epoch: 5/24, acc_iter=34285, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:44:07, time_cost(all): 11:09:49/1 day, 18:26:37, loss=0.526145811958737, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=1.7589162640254659, lr=0.08518512776980694
2023-11-15 00:47:06   INFO  epoch: 5/24, acc_iter=34335, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:44:05, time_cost(all): 11:10:48/1 day, 17:40:42, loss=0.52603486981056, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=1.8930122428032699, lr=0.08514503599706635
2023-11-15 00:48:05   INFO  epoch: 5/24, acc_iter=34385, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:42:27, time_cost(all): 11:11:47/1 day, 15:53:33, loss=0.525923927662383, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=1.6835912397039856, lr=0.08510494422432577
2023-11-15 00:49:04   INFO  epoch: 5/24, acc_iter=34435, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:37:46, time_cost(all): 11:12:46/1 day, 15:52:19, loss=0.525812985514206, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=0.6283659766591916, lr=0.08506485245158518
2023-11-15 00:50:03   INFO  epoch: 5/24, acc_iter=34485, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:39:27, time_cost(all): 11:13:45/1 day, 17:28:08, loss=0.52570204336603, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=1.6763681687512986, lr=0.0850247606788446
2023-11-15 00:51:02   INFO  epoch: 5/24, acc_iter=34535, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:33:33, time_cost(all): 11:14:44/1 day, 17:40:04, loss=0.525591101217853, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=3.729931553005591, lr=0.08498466890610401
2023-11-15 00:52:01   INFO  epoch: 5/24, acc_iter=34585, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:36:32, time_cost(all): 11:15:43/1 day, 16:26:10, loss=0.525480159069676, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=3.8753850497936755, lr=0.08494457713336342
2023-11-15 00:53:00   INFO  epoch: 5/24, acc_iter=34635, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:37:43, time_cost(all): 11:16:42/1 day, 15:00:24, loss=0.525369216921499, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.8063652690658643, lr=0.08490448536062283
2023-11-15 00:53:58   INFO  epoch: 5/24, acc_iter=34685, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:34:50, time_cost(all): 11:17:40/1 day, 14:47:05, loss=0.525258274773323, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=1.0485171625059602, lr=0.08486439358788224
2023-11-15 00:54:57   INFO  epoch: 5/24, acc_iter=34735, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:33:13, time_cost(all): 11:18:39/1 day, 15:55:36, loss=0.525147332625146, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=2.033060720664749, lr=0.08482430181514165
2023-11-15 00:55:56   INFO  epoch: 5/24, acc_iter=34785, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:28:57, time_cost(all): 11:19:38/1 day, 15:29:48, loss=0.525036390476969, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=4.844846021292712, lr=0.08478421004240107
2023-11-15 00:56:55   INFO  epoch: 5/24, acc_iter=34835, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:30:00, time_cost(all): 11:20:37/1 day, 15:33:03, loss=0.524925448328792, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=4.871179324922117, lr=0.08474411826966048
2023-11-15 00:57:54   INFO  epoch: 5/24, acc_iter=34885, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:27:31, time_cost(all): 11:21:36/1 day, 18:01:15, loss=0.524814506180616, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=2.953801092459322, lr=0.08470402649691988
2023-11-15 00:58:53   INFO  epoch: 5/24, acc_iter=34935, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:26:05, time_cost(all): 11:22:35/1 day, 17:30:38, loss=0.524703564032439, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=4.384020732878312, lr=0.0846639347241793
2023-11-15 00:59:52   INFO  epoch: 5/24, acc_iter=34985, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:32:41, time_cost(all): 11:23:34/1 day, 15:02:12, loss=0.524592621884262, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=4.207057586372296, lr=0.08462384295143871
2023-11-15 01:00:51   INFO  epoch: 5/24, acc_iter=35035, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:31:58, time_cost(all): 11:24:33/1 day, 17:26:12, loss=0.524481679736085, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=4.969943447957309, lr=0.08458375117869812
2023-11-15 01:01:50   INFO  epoch: 5/24, acc_iter=35085, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:24:06, time_cost(all): 11:25:32/1 day, 16:18:38, loss=0.524370737587909, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.847607237649688, lr=0.08454365940595754
2023-11-15 01:02:49   INFO  epoch: 5/24, acc_iter=35135, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:26:08, time_cost(all): 11:26:31/1 day, 18:03:23, loss=0.524259795439732, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.471650767910417, lr=0.08450356763321695
2023-11-15 01:03:48   INFO  epoch: 5/24, acc_iter=35185, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:41, time_cost(all): 11:27:30/1 day, 14:19:59, loss=0.524148853291555, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.87(1.03), norm=4.007180427890725, lr=0.08446347586047637
2023-11-15 01:04:47   INFO  epoch: 5/24, acc_iter=35235, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:23:01, time_cost(all): 11:28:29/1 day, 15:38:17, loss=0.524037911143378, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.343969540054444, lr=0.08442338408773577
2023-11-15 01:05:46   INFO  epoch: 5/24, acc_iter=35285, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:27:04, time_cost(all): 11:29:28/1 day, 18:04:09, loss=0.523926968995202, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=4.097969626786165, lr=0.08438329231499518
2023-11-15 01:06:45   INFO  epoch: 5/24, acc_iter=35335, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:25:48, time_cost(all): 11:30:27/1 day, 15:40:53, loss=0.523816026847025, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=4.550140069830802, lr=0.0843432005422546
2023-11-15 01:07:43   INFO  epoch: 5/24, acc_iter=35385, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:39, time_cost(all): 11:31:25/1 day, 16:21:42, loss=0.523705084698848, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=3.4061449364323697, lr=0.08430310876951401
2023-11-15 01:08:42   INFO  epoch: 5/24, acc_iter=35435, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:38, time_cost(all): 11:32:24/1 day, 15:32:23, loss=0.523594142550671, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=0.9600494253675547, lr=0.08426301699677342
2023-11-15 01:09:41   INFO  epoch: 5/24, acc_iter=35485, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:23:01, time_cost(all): 11:33:23/1 day, 18:09:34, loss=0.523483200402495, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=0.5982036353956959, lr=0.08422292522403284
2023-11-15 01:10:40   INFO  epoch: 5/24, acc_iter=35535, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:16:01, time_cost(all): 11:34:22/1 day, 15:12:05, loss=0.523372258254318, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=1.5238979605435194, lr=0.08418283345129224
2023-11-15 01:11:39   INFO  epoch: 5/24, acc_iter=35585, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:14:22, time_cost(all): 11:35:21/1 day, 15:34:11, loss=0.523261316106141, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.144199139352176, lr=0.08414274167855165
2023-11-15 01:12:38   INFO  epoch: 5/24, acc_iter=35635, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:15:40, time_cost(all): 11:36:20/1 day, 17:27:57, loss=0.523150373957964, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.866811414523381, lr=0.08410264990581107
2023-11-15 01:13:37   INFO  epoch: 5/24, acc_iter=35685, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:15:09, time_cost(all): 11:37:19/1 day, 16:52:45, loss=0.523039431809788, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=1.0189717952819288, lr=0.08406255813307048
2023-11-15 01:14:36   INFO  epoch: 5/24, acc_iter=35735, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:04, time_cost(all): 11:38:18/1 day, 15:58:41, loss=0.522928489661611, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=4.052008206403596, lr=0.0840224663603299
2023-11-15 01:15:35   INFO  epoch: 5/24, acc_iter=35785, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:14:07, time_cost(all): 11:39:17/1 day, 17:37:59, loss=0.522817547513434, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=2.006793638980854, lr=0.08398237458758931
2023-11-15 01:16:34   INFO  epoch: 5/24, acc_iter=35835, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:11:45, time_cost(all): 11:40:16/1 day, 16:45:21, loss=0.522706605365257, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.83(1.03), norm=0.5339678823880579, lr=0.08394228281484872
2023-11-15 01:17:33   INFO  epoch: 5/24, acc_iter=35885, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:42, time_cost(all): 11:41:15/1 day, 17:25:08, loss=0.522595663217081, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=4.6969437036599215, lr=0.08390219104210812
2023-11-15 01:18:32   INFO  epoch: 5/24, acc_iter=35935, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:10:02, time_cost(all): 11:42:14/1 day, 16:35:33, loss=0.522484721068904, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.2167853057179334, lr=0.08386209926936754
2023-11-15 01:19:31   INFO  epoch: 5/24, acc_iter=35985, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:36, time_cost(all): 11:43:13/1 day, 14:47:44, loss=0.522373778920727, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=0.6960254232469083, lr=0.08382200749662695
2023-11-15 01:20:30   INFO  epoch: 5/24, acc_iter=36035, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:48, time_cost(all): 11:44:12/1 day, 17:44:30, loss=0.52226283677255, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.9343861823388195, lr=0.08378191572388637
2023-11-15 01:21:28   INFO  epoch: 5/24, acc_iter=36085, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:09:09, time_cost(all): 11:45:10/1 day, 17:36:20, loss=0.522151894624374, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.228531182317354, lr=0.08374182395114578
2023-11-15 01:22:27   INFO  epoch: 5/24, acc_iter=36135, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:06:17, time_cost(all): 11:46:09/1 day, 15:07:56, loss=0.522040952476197, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=4.095263613440663, lr=0.08370173217840518
2023-11-15 01:23:26   INFO  epoch: 5/24, acc_iter=36185, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:04:38, time_cost(all): 11:47:08/1 day, 16:35:19, loss=0.52193001032802, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=3.2663308398958493, lr=0.0836616404056646
2023-11-15 01:24:25   INFO  epoch: 5/24, acc_iter=36235, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:46, time_cost(all): 11:48:07/1 day, 15:28:52, loss=0.521819068179843, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=4.849419527447267, lr=0.08362154863292401
2023-11-15 01:25:24   INFO  epoch: 5/24, acc_iter=36285, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:06:17, time_cost(all): 11:49:06/1 day, 13:57:01, loss=0.521708126031666, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.5120330274962694, lr=0.08358145686018342
2023-11-15 01:26:23   INFO  epoch: 5/24, acc_iter=36335, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:46, time_cost(all): 11:50:05/1 day, 17:51:56, loss=0.52159718388349, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.17(1.03), norm=2.3571556775063045, lr=0.08354136508744284
2023-11-15 01:27:22   INFO  epoch: 5/24, acc_iter=36385, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:06, time_cost(all): 11:51:04/1 day, 16:42:40, loss=0.521486241735313, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=3.9962295653794437, lr=0.08350127331470225
2023-11-15 01:28:21   INFO  epoch: 5/24, acc_iter=36435, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:03:37, time_cost(all): 11:52:03/1 day, 15:08:00, loss=0.521375299587136, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=2.3660897891415553, lr=0.08346118154196167
2023-11-15 01:29:20   INFO  epoch: 5/24, acc_iter=36485, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:59:20, time_cost(all): 11:53:02/1 day, 14:38:16, loss=0.521264357438959, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=2.687542188770674, lr=0.08342108976922108
2023-11-15 01:30:19   INFO  epoch: 5/24, acc_iter=36535, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:58:22, time_cost(all): 11:54:01/1 day, 14:03:19, loss=0.521153415290783, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=1.370552011384101, lr=0.08338099799648048
2023-11-15 01:31:18   INFO  epoch: 5/24, acc_iter=36585, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:55:02, time_cost(all): 11:55:00/1 day, 14:47:45, loss=0.521042473142606, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=2.857621630612019, lr=0.0833409062237399
2023-11-15 01:32:17   INFO  epoch: 5/24, acc_iter=36635, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:57:33, time_cost(all): 11:55:59/1 day, 17:31:46, loss=0.520931530994429, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=1.8810643796888125, lr=0.08330081445099931
2023-11-15 01:33:16   INFO  epoch: 5/24, acc_iter=36685, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:57:16, time_cost(all): 11:56:58/1 day, 15:48:23, loss=0.520820588846252, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=3.408043607405112, lr=0.08326072267825872
2023-11-15 01:34:15   INFO  epoch: 5/24, acc_iter=36735, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:18, time_cost(all): 11:57:57/1 day, 15:59:32, loss=0.520709646698076, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=2.0706113605209144, lr=0.08322063090551814
2023-11-15 01:35:13   INFO  epoch: 5/24, acc_iter=36785, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:55:18, time_cost(all): 11:58:55/1 day, 16:10:43, loss=0.520598704549899, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.78886335905265, lr=0.08318053913277754
2023-11-15 01:36:12   INFO  epoch: 5/24, acc_iter=36835, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:53:57, time_cost(all): 11:59:54/1 day, 17:06:11, loss=0.520487762401722, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=0.7130149905521571, lr=0.08314044736003695
2023-11-15 01:37:11   INFO  epoch: 5/24, acc_iter=36885, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:50:29, time_cost(all): 12:00:53/1 day, 16:31:38, loss=0.520376820253545, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=0.6982378682047894, lr=0.08310035558729637
2023-11-15 01:38:10   INFO  epoch: 5/24, acc_iter=36935, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:27, time_cost(all): 12:01:52/1 day, 14:14:33, loss=0.520265878105369, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=1.2866156288137036, lr=0.08306026381455578
2023-11-15 01:39:09   INFO  epoch: 5/24, acc_iter=36985, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:51:31, time_cost(all): 12:02:51/1 day, 14:28:20, loss=0.520154935957192, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=2.1037154395192044, lr=0.0830201720418152
2023-11-15 01:40:08   INFO  epoch: 5/24, acc_iter=37035, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:49:20, time_cost(all): 12:03:50/1 day, 15:44:44, loss=0.520043993809015, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=4.232016568455978, lr=0.08298008026907461
2023-11-15 01:41:07   INFO  epoch: 5/24, acc_iter=37085, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:35, time_cost(all): 12:04:49/1 day, 16:56:15, loss=0.519933051660838, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=3.6100340416262555, lr=0.08293998849633402
2023-11-15 01:42:06   INFO  epoch: 5/24, acc_iter=37135, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:13, time_cost(all): 12:05:48/1 day, 14:48:39, loss=0.519822109512662, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=3.8633129212852184, lr=0.08289989672359342
2023-11-15 01:43:05   INFO  epoch: 5/24, acc_iter=37185, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:44, time_cost(all): 12:06:47/1 day, 15:38:30, loss=0.519711167364485, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=3.28780354969877, lr=0.08285980495085284
2023-11-15 01:44:04   INFO  epoch: 5/24, acc_iter=37235, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:45:58, time_cost(all): 12:07:46/1 day, 16:13:33, loss=0.519600225216308, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=3.50865170675775, lr=0.08281971317811225
2023-11-15 01:45:03   INFO  epoch: 5/24, acc_iter=37285, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:37, time_cost(all): 12:08:45/1 day, 15:27:52, loss=0.519489283068131, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=2.822521363359688, lr=0.08277962140537166
2023-11-15 01:46:02   INFO  epoch: 5/24, acc_iter=37335, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:55, time_cost(all): 12:09:44/1 day, 16:52:19, loss=0.519378340919955, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=3.9918170490874645, lr=0.08273952963263108
2023-11-15 01:47:01   INFO  epoch: 5/24, acc_iter=37385, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:17, time_cost(all): 12:10:43/1 day, 14:16:07, loss=0.519267398771778, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.5363303211000257, lr=0.08269943785989049
2023-11-15 01:48:00   INFO  epoch: 5/24, acc_iter=37435, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:41:18, time_cost(all): 12:11:42/1 day, 15:29:52, loss=0.519156456623601, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=4.690226312306393, lr=0.0826593460871499
2023-11-15 01:48:58   INFO  epoch: 5/24, acc_iter=37485, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:28, time_cost(all): 12:12:40/1 day, 16:39:25, loss=0.519045514475424, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=2.528605725390479, lr=0.08261925431440931
2023-11-15 01:49:57   INFO  epoch: 5/24, acc_iter=37535, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:35, time_cost(all): 12:13:39/1 day, 14:40:28, loss=0.518934572327248, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.285688244889858, lr=0.08257916254166872
2023-11-15 01:50:56   INFO  epoch: 5/24, acc_iter=37585, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:31, time_cost(all): 12:14:38/1 day, 14:26:47, loss=0.518823630179071, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=0.6591806463006057, lr=0.08253907076892814
2023-11-15 01:51:55   INFO  epoch: 5/24, acc_iter=37635, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:24, time_cost(all): 12:15:37/1 day, 14:40:24, loss=0.518712688030894, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.83(1.03), norm=1.8124062299759138, lr=0.08249897899618755
2023-11-15 01:52:54   INFO  epoch: 5/24, acc_iter=37685, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:57, time_cost(all): 12:16:36/1 day, 15:46:00, loss=0.518601745882717, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=1.0751732597422097, lr=0.08245888722344696
2023-11-15 01:53:53   INFO  epoch: 5/24, acc_iter=37735, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:36:04, time_cost(all): 12:17:35/1 day, 13:49:29, loss=0.518490803734541, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=1.3496953631609028, lr=0.08241879545070638
2023-11-15 01:54:52   INFO  epoch: 5/24, acc_iter=37785, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:43, time_cost(all): 12:18:34/1 day, 15:31:12, loss=0.518379861586364, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=0.8023131865783463, lr=0.08237870367796578
2023-11-15 01:55:51   INFO  epoch: 5/24, acc_iter=37835, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:34:41, time_cost(all): 12:19:33/1 day, 17:09:11, loss=0.518268919438187, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=1.466377126014426, lr=0.08233861190522519
2023-11-15 01:56:50   INFO  epoch: 5/24, acc_iter=37885, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:09, time_cost(all): 12:20:32/1 day, 16:49:58, loss=0.51815797729001, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.255534820081751, lr=0.0822985201324846
2023-11-15 01:57:49   INFO  epoch: 5/24, acc_iter=37935, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:19, time_cost(all): 12:21:31/1 day, 17:07:45, loss=0.518047035141834, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=2.6693518727026255, lr=0.08225842835974402
2023-11-15 01:58:48   INFO  epoch: 5/24, acc_iter=37985, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:28:44, time_cost(all): 12:22:30/1 day, 14:52:24, loss=0.517936092993657, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=2.7613856715370515, lr=0.08221833658700343
2023-11-15 01:59:47   INFO  epoch: 5/24, acc_iter=38035, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:27:55, time_cost(all): 12:23:29/1 day, 14:09:10, loss=0.51782515084548, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=3.3155624625842215, lr=0.08217824481426284
2023-11-15 02:00:46   INFO  epoch: 5/24, acc_iter=38085, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:09, time_cost(all): 12:24:28/1 day, 15:59:17, loss=0.517714208697303, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=4.524777976654419, lr=0.08213815304152225
2023-11-15 02:01:45   INFO  epoch: 5/24, acc_iter=38135, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:25, time_cost(all): 12:25:27/1 day, 17:06:57, loss=0.517603266549127, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=4.834529535572459, lr=0.08209806126878166
2023-11-15 02:02:43   INFO  epoch: 5/24, acc_iter=38185, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:29, time_cost(all): 12:26:25/1 day, 14:15:26, loss=0.51749232440095, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=3.1889182958493825, lr=0.08205796949604108
2023-11-15 02:03:42   INFO  epoch: 5/24, acc_iter=38235, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:53, time_cost(all): 12:27:24/1 day, 13:24:11, loss=0.517381382252773, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.4115440359375655, lr=0.08201787772330049
2023-11-15 02:04:41   INFO  epoch: 5/24, acc_iter=38285, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:49, time_cost(all): 12:28:23/1 day, 13:28:39, loss=0.517270440104596, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.2988377925535457, lr=0.0819777859505599
2023-11-15 02:05:40   INFO  epoch: 5/24, acc_iter=38335, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:11, time_cost(all): 12:29:22/1 day, 14:58:33, loss=0.51715949795642, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.8900705037647025, lr=0.08193769417781932
2023-11-15 02:06:39   INFO  epoch: 5/24, acc_iter=38385, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:35, time_cost(all): 12:30:21/1 day, 15:33:03, loss=0.517048555808243, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=0.5479408344815562, lr=0.08189760240507873
2023-11-15 02:07:38   INFO  epoch: 5/24, acc_iter=38435, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:06, time_cost(all): 12:31:20/1 day, 14:17:05, loss=0.516937613660066, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=3.159873003481886, lr=0.08185751063233813
2023-11-15 02:08:37   INFO  epoch: 5/24, acc_iter=38485, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:24, time_cost(all): 12:32:19/1 day, 15:06:44, loss=0.516826671511889, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=1.0092056970334895, lr=0.08181741885959755
2023-11-15 02:09:36   INFO  epoch: 5/24, acc_iter=38535, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:59, time_cost(all): 12:33:18/1 day, 13:37:48, loss=0.516715729363713, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.5750561111346388, lr=0.08177732708685696
2023-11-15 02:10:35   INFO  epoch: 5/24, acc_iter=38585, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:20, time_cost(all): 12:34:17/1 day, 15:07:51, loss=0.516604787215536, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=3.7559592204376973, lr=0.08173723531411638
2023-11-15 02:11:34   INFO  epoch: 5/24, acc_iter=38635, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:15, time_cost(all): 12:35:16/1 day, 15:24:09, loss=0.516493845067359, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.9330031954985012, lr=0.08169714354137579
2023-11-15 02:12:33   INFO  epoch: 5/24, acc_iter=38685, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:52, time_cost(all): 12:36:15/1 day, 14:55:46, loss=0.516382902919182, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=2.6050976947759668, lr=0.08165705176863519
2023-11-15 02:13:32   INFO  epoch: 5/24, acc_iter=38735, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:56, time_cost(all): 12:37:14/1 day, 14:47:46, loss=0.516271960771006, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=4.4641080178474395, lr=0.0816169599958946
2023-11-15 02:14:31   INFO  epoch: 5/24, acc_iter=38785, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:41, time_cost(all): 12:38:13/1 day, 16:07:31, loss=0.516161018622829, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=3.484458523969585, lr=0.08157686822315402
2023-11-15 02:15:30   INFO  epoch: 5/24, acc_iter=38835, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:19, time_cost(all): 12:39:12/1 day, 15:30:18, loss=0.516050076474652, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=4.656780925217648, lr=0.08153677645041343
2023-11-15 02:16:28   INFO  epoch: 5/24, acc_iter=38885, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:35, time_cost(all): 12:40:10/1 day, 15:11:30, loss=0.515939134326475, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=3.636858491771375, lr=0.08149668467767285
2023-11-15 02:17:27   INFO  epoch: 5/24, acc_iter=38935, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:12:03, time_cost(all): 12:41:09/1 day, 13:59:58, loss=0.515828192178299, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=1.7571031677573121, lr=0.08145659290493226
2023-11-15 02:18:26   INFO  epoch: 5/24, acc_iter=38985, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:11:01, time_cost(all): 12:42:08/1 day, 14:34:18, loss=0.515717250030122, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.6292529039752566, lr=0.08141650113219168
2023-11-15 02:19:25   INFO  epoch: 5/24, acc_iter=39035, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:30, time_cost(all): 12:43:07/1 day, 13:38:09, loss=0.515606307881945, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.9655403910978935, lr=0.08137640935945108
2023-11-15 02:20:24   INFO  epoch: 5/24, acc_iter=39085, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:59, time_cost(all): 12:44:06/1 day, 14:26:58, loss=0.515495365733768, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=3.2874030498131614, lr=0.08133631758671049
2023-11-15 02:21:23   INFO  epoch: 5/24, acc_iter=39135, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:25, time_cost(all): 12:45:05/1 day, 13:49:16, loss=0.515384423585592, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.232923826505358, lr=0.0812962258139699
2023-11-15 02:22:22   INFO  epoch: 5/24, acc_iter=39185, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:44, time_cost(all): 12:46:04/1 day, 15:42:47, loss=0.515273481437415, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=2.093383345843065, lr=0.08125613404122932
2023-11-15 02:23:21   INFO  epoch: 5/24, acc_iter=39235, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:46, time_cost(all): 12:47:03/1 day, 14:48:01, loss=0.515162539289238, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.456703762999194, lr=0.08121604226848873
2023-11-15 02:24:20   INFO  epoch: 5/24, acc_iter=39285, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:51, time_cost(all): 12:48:02/1 day, 13:35:08, loss=0.515051597141061, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.316382028377829, lr=0.08117595049574813
2023-11-15 02:25:19   INFO  epoch: 5/24, acc_iter=39335, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:45, time_cost(all): 12:49:01/1 day, 14:56:12, loss=0.514940654992885, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=1.1003531699697024, lr=0.08113585872300755
2023-11-15 02:26:18   INFO  epoch: 5/24, acc_iter=39385, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:36, time_cost(all): 12:50:00/1 day, 15:09:42, loss=0.514829712844708, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=2.493244354483455, lr=0.08109576695026696
2023-11-15 02:27:17   INFO  epoch: 5/24, acc_iter=39435, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:40, time_cost(all): 12:50:59/1 day, 15:36:04, loss=0.514718770696531, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.3470604875394847, lr=0.08105567517752638
2023-11-15 02:28:16   INFO  epoch: 5/24, acc_iter=39485, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:44, time_cost(all): 12:51:58/1 day, 16:36:19, loss=0.514607828548354, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=4.434708801211146, lr=0.08101558340478579
2023-11-15 02:29:15   INFO  epoch: 6/24, acc_iter=39572, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:03:47, time_cost(all): 12:52:57/1 day, 15:10:53, loss=0.514414789210527, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.7599977498008714, lr=0.08094582372021716
2023-11-15 02:30:13   INFO  epoch: 6/24, acc_iter=39622, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:47, time_cost(all): 12:53:55/1 day, 15:54:49, loss=0.51430384706235, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=4.036401717525891, lr=0.08090573194747658
2023-11-15 02:31:12   INFO  epoch: 6/24, acc_iter=39672, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:04:03, time_cost(all): 12:54:54/1 day, 14:23:30, loss=0.514192904914173, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.3373360322179746, lr=0.08086564017473599
2023-11-15 02:32:11   INFO  epoch: 6/24, acc_iter=39722, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:03:28, time_cost(all): 12:55:53/1 day, 16:06:54, loss=0.514081962765997, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=4.603359869404723, lr=0.0808255484019954
2023-11-15 02:33:10   INFO  epoch: 6/24, acc_iter=39772, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:09:28, time_cost(all): 12:56:52/1 day, 15:10:53, loss=0.51397102061782, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=4.588717430206734, lr=0.08078545662925482
2023-11-15 02:34:09   INFO  epoch: 6/24, acc_iter=39822, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:04:13, time_cost(all): 12:57:51/1 day, 13:07:07, loss=0.513860078469643, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=3.3150394131751355, lr=0.08074536485651422
2023-11-15 02:35:08   INFO  epoch: 6/24, acc_iter=39872, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/1:59:35, time_cost(all): 12:58:50/1 day, 16:06:58, loss=0.513749136321466, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.1844848107353758, lr=0.08070527308377363
2023-11-15 02:36:07   INFO  epoch: 6/24, acc_iter=39922, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:55:36, time_cost(all): 12:59:49/1 day, 13:11:37, loss=0.51363819417329, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=1.2927871048530273, lr=0.08066518131103305
2023-11-15 02:37:06   INFO  epoch: 6/24, acc_iter=39972, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:57:27, time_cost(all): 13:00:48/1 day, 16:03:43, loss=0.513527252025113, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.7026963119612, lr=0.08062508953829246
2023-11-15 02:38:05   INFO  epoch: 6/24, acc_iter=40022, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:57:14, time_cost(all): 13:01:47/1 day, 16:35:08, loss=0.513416309876936, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=0.8336139040056411, lr=0.08058499776555188
2023-11-15 02:39:04   INFO  epoch: 6/24, acc_iter=40072, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:59:34, time_cost(all): 13:02:46/1 day, 14:06:45, loss=0.513305367728759, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=2.473449210199, lr=0.08054490599281128
2023-11-15 02:40:03   INFO  epoch: 6/24, acc_iter=40122, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:57:34, time_cost(all): 13:03:45/1 day, 15:57:25, loss=0.513194425580583, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=0.7357679365500374, lr=0.08050481422007069
2023-11-15 02:41:02   INFO  epoch: 6/24, acc_iter=40172, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:58:35, time_cost(all): 13:04:44/1 day, 15:37:16, loss=0.513083483432406, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=0.5014900742963179, lr=0.0804647224473301
2023-11-15 02:42:01   INFO  epoch: 6/24, acc_iter=40222, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:55:19, time_cost(all): 13:05:43/1 day, 12:55:39, loss=0.512972541284229, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.6930446023048416, lr=0.08042463067458952
2023-11-15 02:43:00   INFO  epoch: 6/24, acc_iter=40272, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:55:21, time_cost(all): 13:06:42/1 day, 12:54:17, loss=0.512861599136052, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.532363324749241, lr=0.08038453890184893
2023-11-15 02:43:58   INFO  epoch: 6/24, acc_iter=40322, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:54:26, time_cost(all): 13:07:40/1 day, 15:47:54, loss=0.512750656987876, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=4.563522487447557, lr=0.08034444712910835
2023-11-15 02:44:57   INFO  epoch: 6/24, acc_iter=40372, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:58:09, time_cost(all): 13:08:39/1 day, 12:50:49, loss=0.512639714839699, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=1.193266757584876, lr=0.08030435535636776
2023-11-15 02:45:56   INFO  epoch: 6/24, acc_iter=40422, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:48:53, time_cost(all): 13:09:38/1 day, 13:17:14, loss=0.512528772691522, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=1.1932971753756412, lr=0.08026426358362718
2023-11-15 02:46:55   INFO  epoch: 6/24, acc_iter=40472, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:48:46, time_cost(all): 13:10:37/1 day, 12:54:21, loss=0.512417830543345, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=3.249040221679786, lr=0.08022417181088658
2023-11-15 02:47:54   INFO  epoch: 6/24, acc_iter=40522, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:51:23, time_cost(all): 13:11:36/1 day, 15:09:30, loss=0.512306888395169, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=0.909691584678427, lr=0.08018408003814599
2023-11-15 02:48:53   INFO  epoch: 6/24, acc_iter=40572, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:49:58, time_cost(all): 13:12:35/1 day, 13:27:41, loss=0.512195946246992, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=1.1664830683736185, lr=0.0801439882654054
2023-11-15 02:49:52   INFO  epoch: 6/24, acc_iter=40622, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:48:58, time_cost(all): 13:13:34/1 day, 14:30:46, loss=0.512085004098815, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=3.640368335562682, lr=0.08010389649266482
2023-11-15 02:50:51   INFO  epoch: 6/24, acc_iter=40672, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:44:26, time_cost(all): 13:14:33/1 day, 15:25:21, loss=0.511974061950638, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.1927804557540345, lr=0.08006380471992423
2023-11-15 02:51:50   INFO  epoch: 6/24, acc_iter=40722, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:47:33, time_cost(all): 13:15:32/1 day, 13:22:15, loss=0.511863119802462, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=2.9181445835108972, lr=0.08002371294718363
2023-11-15 02:52:49   INFO  epoch: 6/24, acc_iter=40772, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:40:49, time_cost(all): 13:16:31/1 day, 15:11:09, loss=0.511752177654285, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=2.0965749486723544, lr=0.07998362117444305
2023-11-15 02:53:48   INFO  epoch: 6/24, acc_iter=40822, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:46:06, time_cost(all): 13:17:30/1 day, 12:39:46, loss=0.511641235506108, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.93(1.03), norm=0.839575146740627, lr=0.07994352940170246
2023-11-15 02:54:47   INFO  epoch: 6/24, acc_iter=40872, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:42:34, time_cost(all): 13:18:29/1 day, 12:50:26, loss=0.511530293357931, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=1.454956696404952, lr=0.07990343762896188
2023-11-15 02:55:46   INFO  epoch: 6/24, acc_iter=40922, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:44:34, time_cost(all): 13:19:28/1 day, 14:56:24, loss=0.511419351209755, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=0.9778582797965504, lr=0.07986334585622129
2023-11-15 02:56:45   INFO  epoch: 6/24, acc_iter=40972, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:39:36, time_cost(all): 13:20:27/1 day, 15:47:05, loss=0.511308409061578, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=3.0402470162115, lr=0.0798232540834807
2023-11-15 02:57:43   INFO  epoch: 6/24, acc_iter=41022, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:36:48, time_cost(all): 13:21:25/1 day, 13:09:22, loss=0.511197466913401, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=1.6738375590706487, lr=0.07978316231074012
2023-11-15 02:58:42   INFO  epoch: 6/24, acc_iter=41072, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:36:08, time_cost(all): 13:22:24/1 day, 14:14:06, loss=0.511086524765224, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=4.334987318378062, lr=0.07974307053799952
2023-11-15 02:59:41   INFO  epoch: 6/24, acc_iter=41122, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:33:06, time_cost(all): 13:23:23/1 day, 15:09:48, loss=0.510975582617048, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.777839475257602, lr=0.07970297876525893
2023-11-15 03:00:40   INFO  epoch: 6/24, acc_iter=41172, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:36:23, time_cost(all): 13:24:22/1 day, 12:33:26, loss=0.510864640468871, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=2.232653625314696, lr=0.07966288699251835
2023-11-15 03:01:39   INFO  epoch: 6/24, acc_iter=41222, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:39:03, time_cost(all): 13:25:21/1 day, 14:25:45, loss=0.510753698320694, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=2.2073083523695938, lr=0.07962279521977776
2023-11-15 03:02:38   INFO  epoch: 6/24, acc_iter=41272, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:38:56, time_cost(all): 13:26:20/1 day, 14:33:05, loss=0.510642756172517, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=0.6952675519190568, lr=0.07958270344703718
2023-11-15 03:03:37   INFO  epoch: 6/24, acc_iter=41322, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:14, time_cost(all): 13:27:19/1 day, 15:35:04, loss=0.510531814024341, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=2.6057259292991977, lr=0.07954261167429659
2023-11-15 03:04:36   INFO  epoch: 6/24, acc_iter=41372, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:36:38, time_cost(all): 13:28:18/1 day, 12:28:31, loss=0.510420871876164, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=1.399778327526714, lr=0.07950251990155599
2023-11-15 03:05:35   INFO  epoch: 6/24, acc_iter=41422, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:33:37, time_cost(all): 13:29:17/1 day, 14:19:01, loss=0.510309929727987, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.120194814428746, lr=0.0794624281288154
2023-11-15 03:06:34   INFO  epoch: 6/24, acc_iter=41472, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:32:42, time_cost(all): 13:30:16/1 day, 16:07:41, loss=0.51019898757981, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=0.6558767545647979, lr=0.07942233635607482
2023-11-15 03:07:33   INFO  epoch: 6/24, acc_iter=41522, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:29:01, time_cost(all): 13:31:15/1 day, 13:19:46, loss=0.510088045431634, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=3.9173248439546935, lr=0.07938224458333423
2023-11-15 03:08:32   INFO  epoch: 6/24, acc_iter=41572, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:25:14, time_cost(all): 13:32:14/1 day, 16:00:26, loss=0.509977103283457, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.02(1.03), norm=0.6506828627240433, lr=0.07934215281059365
2023-11-15 03:09:31   INFO  epoch: 6/24, acc_iter=41622, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:29:53, time_cost(all): 13:33:13/1 day, 14:40:22, loss=0.50986616113528, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=3.50028557223092, lr=0.07930206103785306
2023-11-15 03:10:30   INFO  epoch: 6/24, acc_iter=41672, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:24:51, time_cost(all): 13:34:12/1 day, 14:06:56, loss=0.509755218987103, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.8647128779626818, lr=0.07926196926511248
2023-11-15 03:11:28   INFO  epoch: 6/24, acc_iter=41722, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:25:09, time_cost(all): 13:35:10/1 day, 13:52:16, loss=0.509644276838927, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=3.521784227609057, lr=0.07922187749237188
2023-11-15 03:12:27   INFO  epoch: 6/24, acc_iter=41772, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:21:15, time_cost(all): 13:36:09/1 day, 12:14:50, loss=0.50953333469075, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.2256501335734455, lr=0.07918178571963129
2023-11-15 03:13:26   INFO  epoch: 6/24, acc_iter=41822, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:24:42, time_cost(all): 13:37:08/1 day, 15:47:56, loss=0.509422392542573, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=2.229470331230252, lr=0.0791416939468907
2023-11-15 03:14:25   INFO  epoch: 6/24, acc_iter=41872, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:23:59, time_cost(all): 13:38:07/1 day, 14:18:31, loss=0.509311450394396, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=4.510031456369263, lr=0.07910160217415012
2023-11-15 03:15:24   INFO  epoch: 6/24, acc_iter=41922, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:20:03, time_cost(all): 13:39:06/1 day, 13:06:30, loss=0.50920050824622, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=4.328121318019621, lr=0.07906151040140953
2023-11-15 03:16:23   INFO  epoch: 6/24, acc_iter=41972, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:23:36, time_cost(all): 13:40:05/1 day, 15:52:57, loss=0.509089566098043, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.5253859838656973, lr=0.07902141862866893
2023-11-15 03:17:22   INFO  epoch: 6/24, acc_iter=42022, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:22, time_cost(all): 13:41:04/1 day, 15:40:18, loss=0.508978623949866, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=1.5334303808247074, lr=0.07898132685592835
2023-11-15 03:18:21   INFO  epoch: 6/24, acc_iter=42072, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:21:47, time_cost(all): 13:42:03/1 day, 15:01:04, loss=0.508867681801689, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=1.6227456531002191, lr=0.07894123508318776
2023-11-15 03:19:20   INFO  epoch: 6/24, acc_iter=42122, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:21:08, time_cost(all): 13:43:02/1 day, 14:54:20, loss=0.508756739653513, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.13(1.03), norm=4.464266800330636, lr=0.07890114331044717
2023-11-15 03:20:19   INFO  epoch: 6/24, acc_iter=42172, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:03, time_cost(all): 13:44:01/1 day, 12:24:47, loss=0.508645797505336, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=2.9440113292415573, lr=0.07886105153770659
2023-11-15 03:21:18   INFO  epoch: 6/24, acc_iter=42222, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:04, time_cost(all): 13:45:00/1 day, 13:16:24, loss=0.508534855357159, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=2.661239290734784, lr=0.078820959764966
2023-11-15 03:22:17   INFO  epoch: 6/24, acc_iter=42272, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:12:51, time_cost(all): 13:45:59/1 day, 13:43:08, loss=0.508423913208982, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=0.5593099254556252, lr=0.07878086799222542
2023-11-15 03:23:16   INFO  epoch: 6/24, acc_iter=42322, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:16:22, time_cost(all): 13:46:58/1 day, 13:16:03, loss=0.508312971060806, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=2.4394704893326753, lr=0.07874077621948483
2023-11-15 03:24:15   INFO  epoch: 6/24, acc_iter=42372, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:43, time_cost(all): 13:47:57/1 day, 12:04:04, loss=0.508202028912629, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=0.9628837105531791, lr=0.07870068444674423
2023-11-15 03:25:13   INFO  epoch: 6/24, acc_iter=42422, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:09:15, time_cost(all): 13:48:55/1 day, 15:02:05, loss=0.508091086764452, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=0.7479630819053251, lr=0.07866059267400365
2023-11-15 03:26:12   INFO  epoch: 6/24, acc_iter=42472, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:14:00, time_cost(all): 13:49:54/1 day, 12:55:06, loss=0.507980144616275, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.961856958803445, lr=0.07862050090126306
2023-11-15 03:27:11   INFO  epoch: 6/24, acc_iter=42522, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:09:18, time_cost(all): 13:50:53/1 day, 13:24:04, loss=0.507869202468099, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.6276082453440237, lr=0.07858040912852247
2023-11-15 03:28:10   INFO  epoch: 6/24, acc_iter=42572, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:12:19, time_cost(all): 13:51:52/1 day, 13:44:22, loss=0.507758260319922, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=3.508367114836577, lr=0.07854031735578187
2023-11-15 03:29:09   INFO  epoch: 6/24, acc_iter=42622, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:09:41, time_cost(all): 13:52:51/1 day, 12:19:59, loss=0.507647318171745, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.9887102635843465, lr=0.07850022558304129
2023-11-15 03:30:08   INFO  epoch: 6/24, acc_iter=42672, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:09:05, time_cost(all): 13:53:50/1 day, 13:22:34, loss=0.507536376023568, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=3.704812222676818, lr=0.0784601338103007
2023-11-15 03:31:07   INFO  epoch: 6/24, acc_iter=42722, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:04:37, time_cost(all): 13:54:49/1 day, 13:38:40, loss=0.507425433875392, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=2.5409898700649713, lr=0.07842004203756012
2023-11-15 03:32:06   INFO  epoch: 6/24, acc_iter=42772, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:05:43, time_cost(all): 13:55:48/1 day, 13:54:36, loss=0.507314491727215, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=2.612666282178627, lr=0.07837995026481953
2023-11-15 03:33:05   INFO  epoch: 6/24, acc_iter=42822, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:52, time_cost(all): 13:56:47/1 day, 14:10:18, loss=0.507203549579038, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.663626197570704, lr=0.07833985849207895
2023-11-15 03:34:04   INFO  epoch: 6/24, acc_iter=42872, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:43, time_cost(all): 13:57:46/1 day, 14:20:29, loss=0.507092607430861, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=0.981193109516163, lr=0.07829976671933836
2023-11-15 03:35:03   INFO  epoch: 6/24, acc_iter=42922, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/0:59:45, time_cost(all): 13:58:45/1 day, 12:17:53, loss=0.506981665282685, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=3.0081457453064386, lr=0.07825967494659777
2023-11-15 03:36:02   INFO  epoch: 6/24, acc_iter=42972, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:12, time_cost(all): 13:59:44/1 day, 14:12:25, loss=0.506870723134508, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=4.065557799644939, lr=0.07821958317385717
2023-11-15 03:37:01   INFO  epoch: 6/24, acc_iter=43022, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:00:18, time_cost(all): 14:00:43/1 day, 15:32:59, loss=0.506759780986331, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=1.1155301891508276, lr=0.07817949140111659
2023-11-15 03:38:00   INFO  epoch: 6/24, acc_iter=43072, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:47, time_cost(all): 14:01:42/1 day, 15:16:45, loss=0.506648838838154, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=2.072076958812107, lr=0.078139399628376
2023-11-15 03:38:58   INFO  epoch: 6/24, acc_iter=43122, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:56:57, time_cost(all): 14:02:40/1 day, 12:41:00, loss=0.506537896689978, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=4.523771090957621, lr=0.07809930785563542
2023-11-15 03:39:57   INFO  epoch: 6/24, acc_iter=43172, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:56:29, time_cost(all): 14:03:39/1 day, 12:47:46, loss=0.506426954541801, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=3.23488741785602, lr=0.07805921608289482
2023-11-15 03:40:56   INFO  epoch: 6/24, acc_iter=43222, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:43, time_cost(all): 14:04:38/1 day, 12:17:55, loss=0.506316012393624, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=1.7500146226355109, lr=0.07801912431015423
2023-11-15 03:41:55   INFO  epoch: 6/24, acc_iter=43272, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:58, time_cost(all): 14:05:37/1 day, 12:24:31, loss=0.506205070245447, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=3.208230210671168, lr=0.07797903253741365
2023-11-15 03:42:54   INFO  epoch: 6/24, acc_iter=43322, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:56:37, time_cost(all): 14:06:36/1 day, 14:59:46, loss=0.506094128097271, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=0.8340182829968608, lr=0.07793894076467306
2023-11-15 03:43:53   INFO  epoch: 6/24, acc_iter=43372, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:52:23, time_cost(all): 14:07:35/1 day, 15:08:04, loss=0.505983185949094, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=2.813102755499787, lr=0.07789884899193247
2023-11-15 03:44:52   INFO  epoch: 6/24, acc_iter=43422, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:22, time_cost(all): 14:08:34/1 day, 12:28:11, loss=0.505872243800917, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=3.944109128676133, lr=0.07785875721919189
2023-11-15 03:45:51   INFO  epoch: 6/24, acc_iter=43472, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:50:22, time_cost(all): 14:09:33/1 day, 11:49:38, loss=0.50576130165274, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=3.6777630769692458, lr=0.0778186654464513
2023-11-15 03:46:50   INFO  epoch: 6/24, acc_iter=43522, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:17, time_cost(all): 14:10:32/1 day, 12:06:49, loss=0.505650359504563, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=3.0426960816984456, lr=0.07777857367371072
2023-11-15 03:47:49   INFO  epoch: 6/24, acc_iter=43572, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:48:01, time_cost(all): 14:11:31/1 day, 13:42:52, loss=0.505539417356387, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=3.2316912833837406, lr=0.07773848190097013
2023-11-15 03:48:48   INFO  epoch: 6/24, acc_iter=43622, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:50:38, time_cost(all): 14:12:30/1 day, 12:38:53, loss=0.50542847520821, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=1.640077478310237, lr=0.07769839012822953
2023-11-15 03:49:47   INFO  epoch: 6/24, acc_iter=43672, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:45:47, time_cost(all): 14:13:29/1 day, 13:32:17, loss=0.505317533060033, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=4.549398793266085, lr=0.07765829835548894
2023-11-15 03:50:46   INFO  epoch: 6/24, acc_iter=43722, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:22, time_cost(all): 14:14:28/1 day, 11:50:14, loss=0.505206590911856, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.21602374207872, lr=0.07761820658274836
2023-11-15 03:51:45   INFO  epoch: 6/24, acc_iter=43772, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:46:54, time_cost(all): 14:15:27/1 day, 12:24:11, loss=0.50509564876368, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=1.4468043511637554, lr=0.07757811481000777
2023-11-15 03:52:43   INFO  epoch: 6/24, acc_iter=43822, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:44:48, time_cost(all): 14:16:25/1 day, 13:42:46, loss=0.504984706615503, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=0.982432912298929, lr=0.07753802303726717
2023-11-15 03:53:42   INFO  epoch: 6/24, acc_iter=43872, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:41:53, time_cost(all): 14:17:24/1 day, 14:25:09, loss=0.504873764467326, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=4.582890748602211, lr=0.07749793126452659
2023-11-15 03:54:41   INFO  epoch: 6/24, acc_iter=43922, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:28, time_cost(all): 14:18:23/1 day, 12:17:00, loss=0.504762822319149, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.420642574107157, lr=0.077457839491786
2023-11-15 03:55:40   INFO  epoch: 6/24, acc_iter=43972, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:40:03, time_cost(all): 14:19:22/1 day, 14:35:24, loss=0.504651880170973, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=2.8099918088309446, lr=0.07741774771904542
2023-11-15 03:56:39   INFO  epoch: 6/24, acc_iter=44022, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:41:00, time_cost(all): 14:20:21/1 day, 13:49:53, loss=0.504540938022796, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=2.4281198285306873, lr=0.07737765594630483
2023-11-15 03:57:38   INFO  epoch: 6/24, acc_iter=44072, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:58, time_cost(all): 14:21:20/1 day, 12:54:25, loss=0.504429995874619, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=3.5890310110726342, lr=0.07733756417356424
2023-11-15 03:58:37   INFO  epoch: 6/24, acc_iter=44122, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:22, time_cost(all): 14:22:19/1 day, 11:40:22, loss=0.504319053726442, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=3.1491788632513216, lr=0.07729747240082366
2023-11-15 03:59:36   INFO  epoch: 6/24, acc_iter=44172, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:06, time_cost(all): 14:23:18/1 day, 13:42:18, loss=0.504208111578266, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.1162386281522134, lr=0.07725738062808307
2023-11-15 04:00:35   INFO  epoch: 6/24, acc_iter=44222, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:25, time_cost(all): 14:24:17/1 day, 14:55:06, loss=0.504097169430089, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=3.4014123508346015, lr=0.07721728885534247
2023-11-15 04:01:34   INFO  epoch: 6/24, acc_iter=44272, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:13, time_cost(all): 14:25:16/1 day, 12:16:56, loss=0.503986227281912, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=2.406002066821558, lr=0.07717719708260189
2023-11-15 04:02:33   INFO  epoch: 6/24, acc_iter=44322, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:18, time_cost(all): 14:26:15/1 day, 12:29:45, loss=0.503875285133735, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.471911994742616, lr=0.0771371053098613
2023-11-15 04:03:32   INFO  epoch: 6/24, acc_iter=44372, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:28, time_cost(all): 14:27:14/1 day, 12:48:00, loss=0.503764342985559, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=1.6966649038622466, lr=0.07709701353712071
2023-11-15 04:04:31   INFO  epoch: 6/24, acc_iter=44422, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:31:55, time_cost(all): 14:28:13/1 day, 12:13:02, loss=0.503653400837382, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=2.555131507673964, lr=0.07705692176438013
2023-11-15 04:05:30   INFO  epoch: 6/24, acc_iter=44472, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:38, time_cost(all): 14:29:12/1 day, 12:12:08, loss=0.503542458689205, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=2.118564399359049, lr=0.07701682999163953
2023-11-15 04:06:28   INFO  epoch: 6/24, acc_iter=44522, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:20, time_cost(all): 14:30:10/1 day, 12:06:41, loss=0.503431516541028, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=4.295050679818446, lr=0.07697673821889894
2023-11-15 04:07:27   INFO  epoch: 6/24, acc_iter=44572, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:29, time_cost(all): 14:31:09/1 day, 11:40:33, loss=0.503320574392852, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=0.8426650458746597, lr=0.07693664644615836
2023-11-15 04:08:26   INFO  epoch: 6/24, acc_iter=44622, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:17, time_cost(all): 14:32:08/1 day, 13:26:16, loss=0.503209632244675, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.13(1.03), norm=0.8740055332515041, lr=0.07689655467341777
2023-11-15 04:09:25   INFO  epoch: 6/24, acc_iter=44672, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:05, time_cost(all): 14:33:07/1 day, 11:44:26, loss=0.503098690096498, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=2.7216107895823094, lr=0.07685646290067719
2023-11-15 04:10:24   INFO  epoch: 6/24, acc_iter=44722, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:33, time_cost(all): 14:34:06/1 day, 12:32:43, loss=0.502987747948321, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=2.3639651549342284, lr=0.0768163711279366
2023-11-15 04:11:23   INFO  epoch: 6/24, acc_iter=44772, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:06, time_cost(all): 14:35:05/1 day, 12:21:44, loss=0.502876805800145, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=0.6350176884352761, lr=0.07677627935519601
2023-11-15 04:12:22   INFO  epoch: 6/24, acc_iter=44822, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:54, time_cost(all): 14:36:04/1 day, 14:58:34, loss=0.502765863651968, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.135845188165028, lr=0.07673618758245543
2023-11-15 04:13:21   INFO  epoch: 6/24, acc_iter=44872, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:11, time_cost(all): 14:37:03/1 day, 13:53:39, loss=0.502654921503791, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=4.97836971634039, lr=0.07669609580971483
2023-11-15 04:14:20   INFO  epoch: 6/24, acc_iter=44922, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:20, time_cost(all): 14:38:02/1 day, 13:29:59, loss=0.502543979355614, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=1.624868289068105, lr=0.07665600403697424
2023-11-15 04:15:19   INFO  epoch: 6/24, acc_iter=44972, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:23:09, time_cost(all): 14:39:01/1 day, 11:43:16, loss=0.502433037207438, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=1.518035201858995, lr=0.07661591226423366
2023-11-15 04:16:18   INFO  epoch: 6/24, acc_iter=45022, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:22:06, time_cost(all): 14:40:00/1 day, 13:36:20, loss=0.502322095059261, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=1.8725412898668308, lr=0.07657582049149306
2023-11-15 04:17:17   INFO  epoch: 6/24, acc_iter=45072, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:24, time_cost(all): 14:40:59/1 day, 13:57:07, loss=0.502211152911084, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=0.6887829824483975, lr=0.07653572871875249
2023-11-15 04:18:16   INFO  epoch: 6/24, acc_iter=45122, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:35, time_cost(all): 14:41:58/1 day, 14:24:30, loss=0.502100210762907, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=3.612281968010349, lr=0.07649563694601189
2023-11-15 04:19:15   INFO  epoch: 6/24, acc_iter=45172, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:19:14, time_cost(all): 14:42:57/1 day, 14:15:08, loss=0.501989268614731, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=1.4490959237438341, lr=0.0764555451732713
2023-11-15 04:20:13   INFO  epoch: 6/24, acc_iter=45222, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:41, time_cost(all): 14:43:55/1 day, 14:49:54, loss=0.501878326466554, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=2.9392156421399633, lr=0.07641545340053071
2023-11-15 04:21:12   INFO  epoch: 6/24, acc_iter=45272, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:02, time_cost(all): 14:44:54/1 day, 11:51:14, loss=0.501767384318377, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=4.595188397454287, lr=0.07637536162779013
2023-11-15 04:22:11   INFO  epoch: 6/24, acc_iter=45322, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:46, time_cost(all): 14:45:53/1 day, 12:02:03, loss=0.5016564421702, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=0.6674855498554573, lr=0.07633526985504954
2023-11-15 04:23:10   INFO  epoch: 6/24, acc_iter=45372, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:55, time_cost(all): 14:46:52/1 day, 14:31:07, loss=0.501545500022024, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=2.6202255933593483, lr=0.07629517808230896
2023-11-15 04:24:09   INFO  epoch: 6/24, acc_iter=45422, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:41, time_cost(all): 14:47:51/1 day, 14:20:26, loss=0.501434557873847, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=3.153305682300763, lr=0.07625508630956837
2023-11-15 04:25:08   INFO  epoch: 6/24, acc_iter=45472, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:02, time_cost(all): 14:48:50/1 day, 12:12:49, loss=0.50132361572567, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=2.0785167013157615, lr=0.07621499453682777
2023-11-15 04:26:07   INFO  epoch: 6/24, acc_iter=45522, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:46, time_cost(all): 14:49:49/1 day, 14:45:23, loss=0.501212673577493, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=1.7789677580098289, lr=0.07617490276408719
2023-11-15 04:27:06   INFO  epoch: 6/24, acc_iter=45572, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:15, time_cost(all): 14:50:48/1 day, 14:25:08, loss=0.501101731429317, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=4.6535510475805495, lr=0.0761348109913466
2023-11-15 04:28:05   INFO  epoch: 6/24, acc_iter=45622, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:44, time_cost(all): 14:51:47/1 day, 13:15:01, loss=0.50099078928114, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=2.4303435466326446, lr=0.07609471921860601
2023-11-15 04:29:04   INFO  epoch: 6/24, acc_iter=45672, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:44, time_cost(all): 14:52:46/1 day, 13:00:19, loss=0.500879847132963, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.4412534942918551, lr=0.07605462744586541
2023-11-15 04:30:03   INFO  epoch: 6/24, acc_iter=45722, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:58, time_cost(all): 14:53:45/1 day, 11:16:35, loss=0.500768904984786, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=4.080675406984081, lr=0.07601453567312484
2023-11-15 04:31:02   INFO  epoch: 6/24, acc_iter=45772, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:23, time_cost(all): 14:54:44/1 day, 12:49:11, loss=0.50065796283661, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=3.338769872652299, lr=0.07597444390038424
2023-11-15 04:32:01   INFO  epoch: 6/24, acc_iter=45822, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:42, time_cost(all): 14:55:43/1 day, 13:25:10, loss=0.500547020688433, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=3.0286955199900465, lr=0.07593435212764366
2023-11-15 04:33:00   INFO  epoch: 6/24, acc_iter=45872, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:32, time_cost(all): 14:56:42/1 day, 13:14:28, loss=0.500436078540256, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=2.70985728854414, lr=0.07589426035490307
2023-11-15 04:33:59   INFO  epoch: 6/24, acc_iter=45922, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:50, time_cost(all): 14:57:41/1 day, 12:11:48, loss=0.500325136392079, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=2.924284882811827, lr=0.07585416858216248
2023-11-15 04:34:57   INFO  epoch: 6/24, acc_iter=45972, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:47, time_cost(all): 14:58:39/1 day, 12:11:55, loss=0.500214194243903, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=1.3798788522404835, lr=0.0758140768094219
2023-11-15 04:35:56   INFO  epoch: 6/24, acc_iter=46022, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:46, time_cost(all): 14:59:38/1 day, 14:31:36, loss=0.500103252095726, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.771816860730155, lr=0.07577398503668131
2023-11-15 04:36:55   INFO  epoch: 6/24, acc_iter=46072, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 15:00:37/1 day, 11:31:22, loss=0.499992309947549, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=2.2890316628971945, lr=0.07573389326394071
2023-11-15 04:37:54   INFO  epoch: 7/24, acc_iter=46159, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:11:49, time_cost(all): 15:01:36/1 day, 13:55:49, loss=0.499799270609722, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=0.9946279781432326, lr=0.0756641335793721
2023-11-15 04:38:53   INFO  epoch: 7/24, acc_iter=46209, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:09:20, time_cost(all): 15:02:35/1 day, 12:34:38, loss=0.499688328461545, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=4.560385976450964, lr=0.07562404180663151
2023-11-15 04:39:52   INFO  epoch: 7/24, acc_iter=46259, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:11:43, time_cost(all): 15:03:34/1 day, 12:36:00, loss=0.499577386313368, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.8127823988082463, lr=0.07558395003389093
2023-11-15 04:40:51   INFO  epoch: 7/24, acc_iter=46309, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:11:31, time_cost(all): 15:04:33/1 day, 13:10:52, loss=0.499466444165191, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=0.9679580623976896, lr=0.07554385826115033
2023-11-15 04:41:50   INFO  epoch: 7/24, acc_iter=46359, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:09:37, time_cost(all): 15:05:32/1 day, 13:13:31, loss=0.499355502017015, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=2.9909232071930734, lr=0.07550376648840974
2023-11-15 04:42:49   INFO  epoch: 7/24, acc_iter=46409, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:59, time_cost(all): 15:06:31/1 day, 14:12:08, loss=0.499244559868838, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.2(1.03), norm=2.9625379611787475, lr=0.07546367471566916
2023-11-15 04:43:48   INFO  epoch: 7/24, acc_iter=46459, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:05:28, time_cost(all): 15:07:30/1 day, 11:48:50, loss=0.499133617720661, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=2.341478464113196, lr=0.07542358294292857
2023-11-15 04:44:47   INFO  epoch: 7/24, acc_iter=46509, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:57:53, time_cost(all): 15:08:29/1 day, 12:39:35, loss=0.499022675572484, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=2.1609420700098516, lr=0.07538349117018797
2023-11-15 04:45:46   INFO  epoch: 7/24, acc_iter=46559, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:57:08, time_cost(all): 15:09:28/1 day, 13:57:14, loss=0.498911733424308, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.08(1.03), norm=0.6106519374040966, lr=0.07534339939744739
2023-11-15 04:46:45   INFO  epoch: 7/24, acc_iter=46609, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:58:40, time_cost(all): 15:10:27/1 day, 11:28:11, loss=0.498800791276131, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=2.250614743516312, lr=0.0753033076247068
2023-11-15 04:47:44   INFO  epoch: 7/24, acc_iter=46659, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:00:04, time_cost(all): 15:11:26/1 day, 10:59:42, loss=0.498689849127954, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=0.9322742413082503, lr=0.07526321585196621
2023-11-15 04:48:42   INFO  epoch: 7/24, acc_iter=46709, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:01:48, time_cost(all): 15:12:24/1 day, 12:06:18, loss=0.498578906979777, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=4.324137674582413, lr=0.07522312407922563
2023-11-15 04:49:41   INFO  epoch: 7/24, acc_iter=46759, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:51:57, time_cost(all): 15:13:23/1 day, 11:52:29, loss=0.498467964831601, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=3.558163623463269, lr=0.07518303230648504
2023-11-15 04:50:40   INFO  epoch: 7/24, acc_iter=46809, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/2:00:47, time_cost(all): 15:14:22/1 day, 11:39:41, loss=0.498357022683424, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=4.412268841228114, lr=0.07514294053374446
2023-11-15 04:51:39   INFO  epoch: 7/24, acc_iter=46859, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:51:02, time_cost(all): 15:15:21/1 day, 10:57:50, loss=0.498246080535247, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=3.411574076194119, lr=0.07510284876100386
2023-11-15 04:52:38   INFO  epoch: 7/24, acc_iter=46909, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:50:47, time_cost(all): 15:16:20/1 day, 12:35:24, loss=0.49813513838707, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=2.653532379878949, lr=0.07506275698826327
2023-11-15 04:53:37   INFO  epoch: 7/24, acc_iter=46959, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:53:54, time_cost(all): 15:17:19/1 day, 11:28:19, loss=0.498024196238894, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=4.0717132464406305, lr=0.07502266521552269
2023-11-15 04:54:36   INFO  epoch: 7/24, acc_iter=47009, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:51:38, time_cost(all): 15:18:18/1 day, 11:49:11, loss=0.497913254090717, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=4.272524331750336, lr=0.0749825734427821
2023-11-15 04:55:35   INFO  epoch: 7/24, acc_iter=47059, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:50:54, time_cost(all): 15:19:17/1 day, 12:58:13, loss=0.49780231194254, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.94(1.03), norm=2.7406918039527413, lr=0.07494248167004151
2023-11-15 04:56:34   INFO  epoch: 7/24, acc_iter=47109, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:52:57, time_cost(all): 15:20:16/1 day, 13:55:53, loss=0.497691369794363, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=1.6608662481466756, lr=0.07490238989730093
2023-11-15 04:57:33   INFO  epoch: 7/24, acc_iter=47159, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:52:32, time_cost(all): 15:21:15/1 day, 13:27:44, loss=0.497580427646187, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.3161580564742237, lr=0.07486229812456033
2023-11-15 04:58:32   INFO  epoch: 7/24, acc_iter=47209, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:45:52, time_cost(all): 15:22:14/1 day, 11:12:40, loss=0.49746948549801, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=4.086190877026497, lr=0.07482220635181974
2023-11-15 04:59:31   INFO  epoch: 7/24, acc_iter=47259, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:42:02, time_cost(all): 15:23:13/1 day, 11:33:58, loss=0.497358543349833, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.094464871994821, lr=0.07478211457907916
2023-11-15 05:00:30   INFO  epoch: 7/24, acc_iter=47309, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:42:33, time_cost(all): 15:24:12/1 day, 10:38:51, loss=0.497247601201656, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=3.80916225848863, lr=0.07474202280633857
2023-11-15 05:01:29   INFO  epoch: 7/24, acc_iter=47359, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:42:05, time_cost(all): 15:25:11/1 day, 11:45:04, loss=0.49713665905348, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.8050880190406433, lr=0.07470193103359798
2023-11-15 05:02:27   INFO  epoch: 7/24, acc_iter=47409, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:45:13, time_cost(all): 15:26:09/1 day, 10:36:09, loss=0.497025716905303, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.245288059572572, lr=0.0746618392608574
2023-11-15 05:03:26   INFO  epoch: 7/24, acc_iter=47459, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:42:56, time_cost(all): 15:27:08/1 day, 14:02:34, loss=0.496914774757126, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=3.0043420058381187, lr=0.07462174748811681
2023-11-15 05:04:25   INFO  epoch: 7/24, acc_iter=47509, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:38:48, time_cost(all): 15:28:07/1 day, 12:55:38, loss=0.496803832608949, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=3.879158442048186, lr=0.07458165571537621
2023-11-15 05:05:24   INFO  epoch: 7/24, acc_iter=47559, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:40:17, time_cost(all): 15:29:06/1 day, 13:22:42, loss=0.496692890460773, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=0.7379221606336743, lr=0.07454156394263563
2023-11-15 05:06:23   INFO  epoch: 7/24, acc_iter=47609, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:40:27, time_cost(all): 15:30:05/1 day, 13:45:30, loss=0.496581948312596, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=3.9068722181949798, lr=0.07450147216989504
2023-11-15 05:07:22   INFO  epoch: 7/24, acc_iter=47659, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:43:39, time_cost(all): 15:31:04/1 day, 11:52:22, loss=0.496471006164419, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=1.5205633921142427, lr=0.07446138039715446
2023-11-15 05:08:21   INFO  epoch: 7/24, acc_iter=47709, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:37:05, time_cost(all): 15:32:03/1 day, 12:25:56, loss=0.496360064016242, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.654029046792855, lr=0.07442128862441386
2023-11-15 05:09:20   INFO  epoch: 7/24, acc_iter=47759, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:37:32, time_cost(all): 15:33:02/1 day, 11:18:40, loss=0.496249121868066, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=1.0809479897443386, lr=0.07438119685167328
2023-11-15 05:10:19   INFO  epoch: 7/24, acc_iter=47809, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:31:59, time_cost(all): 15:34:01/1 day, 13:23:54, loss=0.496138179719889, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=0.6723160620919012, lr=0.07434110507893268
2023-11-15 05:11:18   INFO  epoch: 7/24, acc_iter=47859, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:37:26, time_cost(all): 15:35:00/1 day, 12:20:20, loss=0.496027237571712, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=0.5862546191920448, lr=0.0743010133061921
2023-11-15 05:12:17   INFO  epoch: 7/24, acc_iter=47909, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:28, time_cost(all): 15:35:59/1 day, 10:51:08, loss=0.495916295423535, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=2.3529716662245908, lr=0.07426092153345151
2023-11-15 05:13:16   INFO  epoch: 7/24, acc_iter=47959, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:35:37, time_cost(all): 15:36:58/1 day, 12:04:11, loss=0.495805353275359, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.177954351560296, lr=0.07422082976071093
2023-11-15 05:14:15   INFO  epoch: 7/24, acc_iter=48009, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:33:18, time_cost(all): 15:37:57/1 day, 13:19:03, loss=0.495694411127182, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.439723373552544, lr=0.07418073798797034
2023-11-15 05:15:14   INFO  epoch: 7/24, acc_iter=48059, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:35:06, time_cost(all): 15:38:56/1 day, 12:58:06, loss=0.495583468979005, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.0313531622674903, lr=0.07414064621522976
2023-11-15 05:16:12   INFO  epoch: 7/24, acc_iter=48109, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:29:02, time_cost(all): 15:39:54/1 day, 12:27:05, loss=0.495472526830828, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=4.721188885339991, lr=0.07410055444248917
2023-11-15 05:17:11   INFO  epoch: 7/24, acc_iter=48159, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:32:16, time_cost(all): 15:40:53/1 day, 13:09:05, loss=0.495361584682652, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=2.8952306024191388, lr=0.07406046266974857
2023-11-15 05:18:10   INFO  epoch: 7/24, acc_iter=48209, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:26:13, time_cost(all): 15:41:52/1 day, 11:48:12, loss=0.495250642534475, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=0.5387094231009348, lr=0.07402037089700798
2023-11-15 05:19:09   INFO  epoch: 7/24, acc_iter=48259, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:29:50, time_cost(all): 15:42:51/1 day, 12:54:10, loss=0.495139700386298, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=3.129488592850101, lr=0.0739802791242674
2023-11-15 05:20:08   INFO  epoch: 7/24, acc_iter=48309, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:22:42, time_cost(all): 15:43:50/1 day, 13:34:20, loss=0.495028758238121, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=4.140371336791048, lr=0.07394018735152681
2023-11-15 05:21:07   INFO  epoch: 7/24, acc_iter=48359, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:25:43, time_cost(all): 15:44:49/1 day, 13:23:13, loss=0.494917816089945, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.117129038388979, lr=0.07390009557878621
2023-11-15 05:22:06   INFO  epoch: 7/24, acc_iter=48409, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:21:52, time_cost(all): 15:45:48/1 day, 13:45:08, loss=0.494806873941768, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=4.524874333271113, lr=0.07386000380604563
2023-11-15 05:23:05   INFO  epoch: 7/24, acc_iter=48459, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:20:47, time_cost(all): 15:46:47/1 day, 12:01:48, loss=0.494695931793591, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=3.572219014231141, lr=0.07381991203330504
2023-11-15 05:24:04   INFO  epoch: 7/24, acc_iter=48509, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:21:40, time_cost(all): 15:47:46/1 day, 11:11:40, loss=0.494584989645414, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=4.479640250378439, lr=0.07377982026056445
2023-11-15 05:25:03   INFO  epoch: 7/24, acc_iter=48559, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:22:16, time_cost(all): 15:48:45/1 day, 13:34:06, loss=0.494474047497238, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.0795951540166433, lr=0.07373972848782387
2023-11-15 05:26:02   INFO  epoch: 7/24, acc_iter=48609, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:20:41, time_cost(all): 15:49:44/1 day, 13:07:33, loss=0.494363105349061, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.094944075393908, lr=0.07369963671508328
2023-11-15 05:27:01   INFO  epoch: 7/24, acc_iter=48659, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:59, time_cost(all): 15:50:43/1 day, 13:00:05, loss=0.494252163200884, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=3.0227375244368044, lr=0.0736595449423427
2023-11-15 05:28:00   INFO  epoch: 7/24, acc_iter=48709, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:15:40, time_cost(all): 15:51:42/1 day, 13:30:54, loss=0.494141221052707, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.2589587425692301, lr=0.07361945316960211
2023-11-15 05:28:59   INFO  epoch: 7/24, acc_iter=48759, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:16:38, time_cost(all): 15:52:41/1 day, 11:57:29, loss=0.494030278904531, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=3.509005016142758, lr=0.07357936139686151
2023-11-15 05:29:57   INFO  epoch: 7/24, acc_iter=48809, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:37, time_cost(all): 15:53:39/1 day, 11:27:14, loss=0.493919336756354, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=2.9941062685686686, lr=0.07353926962412093
2023-11-15 05:30:56   INFO  epoch: 7/24, acc_iter=48859, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:17:26, time_cost(all): 15:54:38/1 day, 11:24:22, loss=0.493808394608177, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=2.3150559891450104, lr=0.07349917785138034
2023-11-15 05:31:55   INFO  epoch: 7/24, acc_iter=48909, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:13, time_cost(all): 15:55:37/1 day, 10:14:15, loss=0.49369745246, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=4.340438167013459, lr=0.07345908607863975
2023-11-15 05:32:54   INFO  epoch: 7/24, acc_iter=48959, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:58, time_cost(all): 15:56:36/1 day, 11:47:12, loss=0.493586510311823, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=0.7411097976201064, lr=0.07341899430589917
2023-11-15 05:33:53   INFO  epoch: 7/24, acc_iter=49009, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:13:19, time_cost(all): 15:57:35/1 day, 11:52:02, loss=0.493475568163647, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=4.101574830593416, lr=0.07337890253315857
2023-11-15 05:34:52   INFO  epoch: 7/24, acc_iter=49059, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:09:57, time_cost(all): 15:58:34/1 day, 10:49:58, loss=0.49336462601547, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=2.797351718669227, lr=0.07333881076041798
2023-11-15 05:35:51   INFO  epoch: 7/24, acc_iter=49109, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:14, time_cost(all): 15:59:33/1 day, 13:26:32, loss=0.493253683867293, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=4.492986334928421, lr=0.0732987189876774
2023-11-15 05:36:50   INFO  epoch: 7/24, acc_iter=49159, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:10:41, time_cost(all): 16:00:32/1 day, 10:21:15, loss=0.493142741719116, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=2.5980791540932398, lr=0.07325862721493681
2023-11-15 05:37:49   INFO  epoch: 7/24, acc_iter=49209, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:31, time_cost(all): 16:01:31/1 day, 10:06:25, loss=0.49303179957094, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=2.497347369826378, lr=0.07321853544219623
2023-11-15 05:38:48   INFO  epoch: 7/24, acc_iter=49259, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:05:11, time_cost(all): 16:02:30/1 day, 11:01:31, loss=0.492920857422763, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=0.9381977496678614, lr=0.07317844366945564
2023-11-15 05:39:47   INFO  epoch: 7/24, acc_iter=49309, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:22, time_cost(all): 16:03:29/1 day, 10:03:15, loss=0.492809915274586, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=2.3603941059041906, lr=0.07313835189671505
2023-11-15 05:40:46   INFO  epoch: 7/24, acc_iter=49359, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:02:53, time_cost(all): 16:04:28/1 day, 10:06:25, loss=0.492698973126409, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=3.940977870955913, lr=0.07309826012397447
2023-11-15 05:41:45   INFO  epoch: 7/24, acc_iter=49409, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:06:59, time_cost(all): 16:05:27/1 day, 10:24:30, loss=0.492588030978233, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=4.705043246794631, lr=0.07305816835123387
2023-11-15 05:42:44   INFO  epoch: 7/24, acc_iter=49459, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:03:12, time_cost(all): 16:06:26/1 day, 13:17:34, loss=0.492477088830056, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=2.334514481452645, lr=0.07301807657849328
2023-11-15 05:43:42   INFO  epoch: 7/24, acc_iter=49509, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:02:05, time_cost(all): 16:07:24/1 day, 11:09:41, loss=0.492366146681879, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=4.428931819173749, lr=0.0729779848057527
2023-11-15 05:44:41   INFO  epoch: 7/24, acc_iter=49559, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:03:23, time_cost(all): 16:08:23/1 day, 11:56:46, loss=0.492255204533703, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=4.0430258195581725, lr=0.07293789303301211
2023-11-15 05:45:40   INFO  epoch: 7/24, acc_iter=49609, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:01:16, time_cost(all): 16:09:22/1 day, 10:36:16, loss=0.492144262385526, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=4.945101122617858, lr=0.07289780126027152
2023-11-15 05:46:39   INFO  epoch: 7/24, acc_iter=49659, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:00:41, time_cost(all): 16:10:21/1 day, 12:02:44, loss=0.492033320237349, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=4.383493432813188, lr=0.07285770948753093
2023-11-15 05:47:38   INFO  epoch: 7/24, acc_iter=49709, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:57:03, time_cost(all): 16:11:20/1 day, 10:01:46, loss=0.491922378089172, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=3.310700314543567, lr=0.07281761771479034
2023-11-15 05:48:37   INFO  epoch: 7/24, acc_iter=49759, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:56:51, time_cost(all): 16:12:19/1 day, 12:57:30, loss=0.491811435940996, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=1.683875092903592, lr=0.07277752594204975
2023-11-15 05:49:36   INFO  epoch: 7/24, acc_iter=49809, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:32, time_cost(all): 16:13:18/1 day, 12:47:58, loss=0.491700493792819, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=1.6222734011342144, lr=0.07273743416930917
2023-11-15 05:50:35   INFO  epoch: 7/24, acc_iter=49859, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:16, time_cost(all): 16:14:17/1 day, 12:18:04, loss=0.491589551644642, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.646085825562786, lr=0.07269734239656858
2023-11-15 05:51:34   INFO  epoch: 7/24, acc_iter=49909, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:58, time_cost(all): 16:15:16/1 day, 10:50:13, loss=0.491478609496465, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=0.5738881156250959, lr=0.072657250623828
2023-11-15 05:52:33   INFO  epoch: 7/24, acc_iter=49959, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:53:17, time_cost(all): 16:16:15/1 day, 10:03:21, loss=0.491367667348288, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=3.455915625909585, lr=0.0726171588510874
2023-11-15 05:53:32   INFO  epoch: 7/24, acc_iter=50009, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:50:33, time_cost(all): 16:17:14/1 day, 11:37:39, loss=0.491256725200112, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=4.734415683211953, lr=0.07257706707834682
2023-11-15 05:54:31   INFO  epoch: 7/24, acc_iter=50059, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:01, time_cost(all): 16:18:13/1 day, 10:20:39, loss=0.491145783051935, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=4.858140712746397, lr=0.07253697530560622
2023-11-15 05:55:30   INFO  epoch: 7/24, acc_iter=50109, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:15, time_cost(all): 16:19:12/1 day, 11:00:18, loss=0.491034840903758, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=3.9670184675790945, lr=0.07249688353286564
2023-11-15 05:56:29   INFO  epoch: 7/24, acc_iter=50159, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:49:53, time_cost(all): 16:20:11/1 day, 10:48:24, loss=0.490923898755581, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=2.143018431573652, lr=0.07245679176012505
2023-11-15 05:57:27   INFO  epoch: 7/24, acc_iter=50209, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:50:59, time_cost(all): 16:21:09/1 day, 10:17:27, loss=0.490812956607405, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=3.545189103483094, lr=0.07241669998738445
2023-11-15 05:58:26   INFO  epoch: 7/24, acc_iter=50259, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:48, time_cost(all): 16:22:08/1 day, 11:31:58, loss=0.490702014459228, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=4.7507507110864955, lr=0.07237660821464388
2023-11-15 05:59:25   INFO  epoch: 7/24, acc_iter=50309, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:23, time_cost(all): 16:23:07/1 day, 12:10:03, loss=0.490591072311051, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=3.1365116393928223, lr=0.07233651644190328
2023-11-15 06:00:24   INFO  epoch: 7/24, acc_iter=50359, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:56, time_cost(all): 16:24:06/1 day, 11:47:30, loss=0.490480130162874, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.857367322261988, lr=0.0722964246691627
2023-11-15 06:01:23   INFO  epoch: 7/24, acc_iter=50409, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:45:09, time_cost(all): 16:25:05/1 day, 12:47:11, loss=0.490369188014698, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=4.040898317535641, lr=0.07225633289642211
2023-11-15 06:02:22   INFO  epoch: 7/24, acc_iter=50459, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:42:41, time_cost(all): 16:26:04/1 day, 12:26:22, loss=0.490258245866521, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=1.4540436735212363, lr=0.07221624112368152
2023-11-15 06:03:21   INFO  epoch: 7/24, acc_iter=50509, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:40:50, time_cost(all): 16:27:03/1 day, 12:46:15, loss=0.490147303718344, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=1.9711810395012404, lr=0.07217614935094094
2023-11-15 06:04:20   INFO  epoch: 7/24, acc_iter=50559, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:26, time_cost(all): 16:28:02/1 day, 12:37:27, loss=0.490036361570168, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=4.3398608385332995, lr=0.07213605757820035
2023-11-15 06:05:19   INFO  epoch: 7/24, acc_iter=50609, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:24, time_cost(all): 16:29:01/1 day, 11:46:43, loss=0.489925419421991, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.243858643673232, lr=0.07209596580545975
2023-11-15 06:06:18   INFO  epoch: 7/24, acc_iter=50659, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:41:10, time_cost(all): 16:30:00/1 day, 10:30:23, loss=0.489814477273814, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=4.160425142161735, lr=0.07205587403271917
2023-11-15 06:07:17   INFO  epoch: 7/24, acc_iter=50709, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:01, time_cost(all): 16:30:59/1 day, 10:43:26, loss=0.489703535125637, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=3.9407939424346847, lr=0.07201578225997858
2023-11-15 06:08:16   INFO  epoch: 7/24, acc_iter=50759, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:39:46, time_cost(all): 16:31:58/1 day, 11:43:26, loss=0.48959259297746, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.94(1.03), norm=0.5347561195687356, lr=0.071975690487238
2023-11-15 06:09:15   INFO  epoch: 7/24, acc_iter=50809, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:17, time_cost(all): 16:32:57/1 day, 11:47:22, loss=0.489481650829284, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=4.128765309197689, lr=0.07193559871449741
2023-11-15 06:10:14   INFO  epoch: 7/24, acc_iter=50859, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:40, time_cost(all): 16:33:56/1 day, 10:00:03, loss=0.489370708681107, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.5819526384655376, lr=0.07189550694175681
2023-11-15 06:11:12   INFO  epoch: 7/24, acc_iter=50909, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:36:10, time_cost(all): 16:34:54/1 day, 9:44:52, loss=0.48925976653293, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=3.434436298683489, lr=0.07185541516901622
2023-11-15 06:12:11   INFO  epoch: 7/24, acc_iter=50959, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:15, time_cost(all): 16:35:53/1 day, 12:08:24, loss=0.489148824384753, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.84(1.03), norm=2.961145808327896, lr=0.07181532339627564
2023-11-15 06:13:10   INFO  epoch: 7/24, acc_iter=51009, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:20, time_cost(all): 16:36:52/1 day, 11:32:11, loss=0.489037882236577, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=4.27404470465013, lr=0.07177523162353505
2023-11-15 06:14:09   INFO  epoch: 7/24, acc_iter=51059, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:34, time_cost(all): 16:37:51/1 day, 12:01:02, loss=0.4889269400884, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=1.018401443489872, lr=0.07173513985079447
2023-11-15 06:15:08   INFO  epoch: 7/24, acc_iter=51109, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:22, time_cost(all): 16:38:50/1 day, 11:12:59, loss=0.488815997940223, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=1.6447918658113696, lr=0.07169504807805388
2023-11-15 06:16:07   INFO  epoch: 7/24, acc_iter=51159, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:29, time_cost(all): 16:39:49/1 day, 11:37:20, loss=0.488705055792046, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=4.486597258353195, lr=0.0716549563053133
2023-11-15 06:17:06   INFO  epoch: 7/24, acc_iter=51209, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:30:17, time_cost(all): 16:40:48/1 day, 12:23:51, loss=0.48859411364387, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.0340195263945313, lr=0.07161486453257271
2023-11-15 06:18:05   INFO  epoch: 7/24, acc_iter=51259, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:36, time_cost(all): 16:41:47/1 day, 10:25:14, loss=0.488483171495693, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=0.6503549688552726, lr=0.07157477275983211
2023-11-15 06:19:04   INFO  epoch: 7/24, acc_iter=51309, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:45, time_cost(all): 16:42:46/1 day, 11:16:55, loss=0.488372229347516, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.91(1.03), norm=1.0686560433838257, lr=0.07153468098709152
2023-11-15 06:20:03   INFO  epoch: 7/24, acc_iter=51359, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:55, time_cost(all): 16:43:45/1 day, 11:10:46, loss=0.488261287199339, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.911699719071183, lr=0.07149458921435094
2023-11-15 06:21:02   INFO  epoch: 7/24, acc_iter=51409, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:14, time_cost(all): 16:44:44/1 day, 11:43:52, loss=0.488150345051163, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.886860947085205, lr=0.07145449744161035
2023-11-15 06:22:01   INFO  epoch: 7/24, acc_iter=51459, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:55, time_cost(all): 16:45:43/1 day, 9:53:27, loss=0.488039402902986, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=3.178883589398961, lr=0.07141440566886977
2023-11-15 06:23:00   INFO  epoch: 7/24, acc_iter=51509, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:25, time_cost(all): 16:46:42/1 day, 11:16:10, loss=0.487928460754809, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.507468167164718, lr=0.07137431389612917
2023-11-15 06:23:59   INFO  epoch: 7/24, acc_iter=51559, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:32, time_cost(all): 16:47:41/1 day, 12:18:03, loss=0.487817518606632, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=2.899799502465898, lr=0.07133422212338858
2023-11-15 06:24:57   INFO  epoch: 7/24, acc_iter=51609, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:22:01, time_cost(all): 16:48:39/1 day, 10:18:57, loss=0.487706576458456, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.874614153437363, lr=0.071294130350648
2023-11-15 06:25:56   INFO  epoch: 7/24, acc_iter=51659, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:55, time_cost(all): 16:49:38/1 day, 11:28:44, loss=0.487595634310279, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=0.5752633015129969, lr=0.07125403857790741
2023-11-15 06:26:55   INFO  epoch: 7/24, acc_iter=51709, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:47, time_cost(all): 16:50:37/1 day, 10:23:48, loss=0.487484692162102, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=2.916457696925504, lr=0.07121394680516682
2023-11-15 06:27:54   INFO  epoch: 7/24, acc_iter=51759, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:32, time_cost(all): 16:51:36/1 day, 11:36:02, loss=0.487373750013925, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.4436974986846027, lr=0.07117385503242624
2023-11-15 06:28:53   INFO  epoch: 7/24, acc_iter=51809, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:12, time_cost(all): 16:52:35/1 day, 11:56:31, loss=0.487262807865749, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.02(1.03), norm=2.2404253474625047, lr=0.07113376325968565
2023-11-15 06:29:52   INFO  epoch: 7/24, acc_iter=51859, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:49, time_cost(all): 16:53:34/1 day, 10:42:07, loss=0.487151865717572, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=4.250118751396626, lr=0.07109367148694506
2023-11-15 06:30:51   INFO  epoch: 7/24, acc_iter=51909, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:23, time_cost(all): 16:54:33/1 day, 9:34:24, loss=0.487040923569395, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=4.839288518641844, lr=0.07105357971420447
2023-11-15 06:31:50   INFO  epoch: 7/24, acc_iter=51959, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:54, time_cost(all): 16:55:32/1 day, 11:27:41, loss=0.486929981421218, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=2.2118601028414355, lr=0.07101348794146388
2023-11-15 06:32:49   INFO  epoch: 7/24, acc_iter=52009, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:12:54, time_cost(all): 16:56:31/1 day, 12:10:57, loss=0.486819039273042, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=4.450617489919878, lr=0.0709733961687233
2023-11-15 06:33:48   INFO  epoch: 7/24, acc_iter=52059, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:27, time_cost(all): 16:57:30/1 day, 12:31:23, loss=0.486708097124865, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=2.807169554728358, lr=0.07093330439598271
2023-11-15 06:34:47   INFO  epoch: 7/24, acc_iter=52109, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:28, time_cost(all): 16:58:29/1 day, 11:18:37, loss=0.486597154976688, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.4194945408723583, lr=0.07089321262324212
2023-11-15 06:35:46   INFO  epoch: 7/24, acc_iter=52159, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:15, time_cost(all): 16:59:28/1 day, 10:52:38, loss=0.486486212828511, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=2.4451223241500406, lr=0.07085312085050152
2023-11-15 06:36:45   INFO  epoch: 7/24, acc_iter=52209, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:50, time_cost(all): 17:00:27/1 day, 12:20:22, loss=0.486375270680335, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=2.9205255499343936, lr=0.07081302907776094
2023-11-15 06:37:44   INFO  epoch: 7/24, acc_iter=52259, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:42, time_cost(all): 17:01:26/1 day, 10:48:58, loss=0.486264328532158, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.229501484682349, lr=0.07077293730502035
2023-11-15 06:38:42   INFO  epoch: 7/24, acc_iter=52309, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:15, time_cost(all): 17:02:24/1 day, 10:25:04, loss=0.486153386383981, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.811220444276301, lr=0.07073284553227976
2023-11-15 06:39:41   INFO  epoch: 7/24, acc_iter=52359, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:29, time_cost(all): 17:03:23/1 day, 10:01:52, loss=0.486042444235804, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=0.9077887871786579, lr=0.07069275375953918
2023-11-15 06:40:40   INFO  epoch: 7/24, acc_iter=52409, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:46, time_cost(all): 17:04:22/1 day, 9:15:18, loss=0.485931502087628, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=1.7728599684786543, lr=0.07065266198679859
2023-11-15 06:41:39   INFO  epoch: 7/24, acc_iter=52459, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:48, time_cost(all): 17:05:21/1 day, 11:38:00, loss=0.485820559939451, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.575985319166607, lr=0.070612570214058
2023-11-15 06:42:38   INFO  epoch: 7/24, acc_iter=52509, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:48, time_cost(all): 17:06:20/1 day, 10:31:06, loss=0.485709617791274, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=0.5061091962828123, lr=0.07057247844131742
2023-11-15 06:43:37   INFO  epoch: 7/24, acc_iter=52559, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:47, time_cost(all): 17:07:19/1 day, 10:17:34, loss=0.485598675643097, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=3.0739003137070418, lr=0.07053238666857682
2023-11-15 06:44:36   INFO  epoch: 7/24, acc_iter=52609, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 17:08:18/1 day, 11:31:28, loss=0.485487733494921, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=1.2397195579501101, lr=0.07049229489583624
2023-11-15 06:45:35   INFO  epoch: 7/24, acc_iter=52659, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 17:09:17/1 day, 10:35:31, loss=0.485376791346744, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=3.7026014900692967, lr=0.07045220312309565
2023-11-15 06:46:34   INFO  epoch: 8/24, acc_iter=52746, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:12:36, time_cost(all): 17:10:16/1 day, 10:38:22, loss=0.485183752008916, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=2.4306017098247334, lr=0.07038244343852702
2023-11-15 06:47:33   INFO  epoch: 8/24, acc_iter=52796, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:02:53, time_cost(all): 17:11:15/1 day, 10:45:32, loss=0.48507280986074, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.463404453880858, lr=0.07034235166578644
2023-11-15 06:48:32   INFO  epoch: 8/24, acc_iter=52846, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:10:46, time_cost(all): 17:12:14/1 day, 10:26:05, loss=0.484961867712563, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.7600573134450146, lr=0.07030225989304585
2023-11-15 06:49:31   INFO  epoch: 8/24, acc_iter=52896, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/1:59:41, time_cost(all): 17:13:13/1 day, 11:04:16, loss=0.484850925564386, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=1.0228385006632832, lr=0.07026216812030525
2023-11-15 06:50:30   INFO  epoch: 8/24, acc_iter=52946, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:06:08, time_cost(all): 17:14:12/1 day, 8:59:28, loss=0.484739983416209, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=1.9518690502646188, lr=0.07022207634756468
2023-11-15 06:51:29   INFO  epoch: 8/24, acc_iter=52996, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:00:50, time_cost(all): 17:15:11/1 day, 11:42:35, loss=0.484629041268033, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=2.6239419012400274, lr=0.07018198457482408
2023-11-15 06:52:27   INFO  epoch: 8/24, acc_iter=53046, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:07:32, time_cost(all): 17:16:09/1 day, 10:03:30, loss=0.484518099119856, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=0.8930523399914436, lr=0.0701418928020835
2023-11-15 06:53:26   INFO  epoch: 8/24, acc_iter=53096, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:06:13, time_cost(all): 17:17:08/1 day, 9:33:28, loss=0.484407156971679, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=2.089996038302658, lr=0.07010180102934291
2023-11-15 06:54:25   INFO  epoch: 8/24, acc_iter=53146, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:54:44, time_cost(all): 17:18:07/1 day, 9:27:40, loss=0.484296214823502, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=2.6643248421144685, lr=0.07006170925660232
2023-11-15 06:55:24   INFO  epoch: 8/24, acc_iter=53196, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:04:35, time_cost(all): 17:19:06/1 day, 8:55:05, loss=0.484185272675326, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=1.1844744302188404, lr=0.07002161748386174
2023-11-15 06:56:23   INFO  epoch: 8/24, acc_iter=53246, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:59:01, time_cost(all): 17:20:05/1 day, 10:48:00, loss=0.484074330527149, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=2.6936863284688126, lr=0.06998152571112115
2023-11-15 06:57:22   INFO  epoch: 8/24, acc_iter=53296, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:52:38, time_cost(all): 17:21:04/1 day, 9:22:21, loss=0.483963388378972, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=0.7575878664797069, lr=0.06994143393838055
2023-11-15 06:58:21   INFO  epoch: 8/24, acc_iter=53346, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:55:01, time_cost(all): 17:22:03/1 day, 10:37:11, loss=0.483852446230795, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=3.0949270886548774, lr=0.06990134216563997
2023-11-15 06:59:20   INFO  epoch: 8/24, acc_iter=53396, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:52:29, time_cost(all): 17:23:02/1 day, 11:29:51, loss=0.483741504082619, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=1.0729717636231757, lr=0.06986125039289938
2023-11-15 07:00:19   INFO  epoch: 8/24, acc_iter=53446, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:58:05, time_cost(all): 17:24:01/1 day, 8:48:48, loss=0.483630561934442, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=0.6138746555402289, lr=0.0698211586201588
2023-11-15 07:01:18   INFO  epoch: 8/24, acc_iter=53496, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:52:32, time_cost(all): 17:25:00/1 day, 11:30:37, loss=0.483519619786265, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.7269108230441876, lr=0.06978106684741821
2023-11-15 07:02:17   INFO  epoch: 8/24, acc_iter=53546, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:51:37, time_cost(all): 17:25:59/1 day, 10:27:25, loss=0.483408677638088, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=3.308947113735338, lr=0.06974097507467761
2023-11-15 07:03:16   INFO  epoch: 8/24, acc_iter=53596, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:50:33, time_cost(all): 17:26:58/1 day, 10:58:00, loss=0.483297735489912, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=2.355564844488935, lr=0.06970088330193702
2023-11-15 07:04:15   INFO  epoch: 8/24, acc_iter=53646, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:49:32, time_cost(all): 17:27:57/1 day, 11:08:58, loss=0.483186793341735, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=3.427279179031347, lr=0.06966079152919644
2023-11-15 07:05:14   INFO  epoch: 8/24, acc_iter=53696, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:45:10, time_cost(all): 17:28:56/1 day, 10:52:41, loss=0.483075851193558, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=3.1946199241997864, lr=0.06962069975645585
2023-11-15 07:06:12   INFO  epoch: 8/24, acc_iter=53746, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:05, time_cost(all): 17:29:54/1 day, 11:41:51, loss=0.482964909045381, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=0.5721967302214844, lr=0.06958060798371526
2023-11-15 07:07:11   INFO  epoch: 8/24, acc_iter=53796, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:52:07, time_cost(all): 17:30:53/1 day, 8:31:41, loss=0.482853966897205, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.7590611314840543, lr=0.06954051621097468
2023-11-15 07:08:10   INFO  epoch: 8/24, acc_iter=53846, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:45:38, time_cost(all): 17:31:52/1 day, 10:23:12, loss=0.482743024749028, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=2.8058132085920575, lr=0.06950042443823409
2023-11-15 07:09:09   INFO  epoch: 8/24, acc_iter=53896, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:35, time_cost(all): 17:32:51/1 day, 10:00:15, loss=0.482632082600851, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=1.762939694888442, lr=0.06946033266549351
2023-11-15 07:10:08   INFO  epoch: 8/24, acc_iter=53946, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:48:56, time_cost(all): 17:33:50/1 day, 10:35:50, loss=0.482521140452674, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=1.2150369299175179, lr=0.06942024089275291
2023-11-15 07:11:07   INFO  epoch: 8/24, acc_iter=53996, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:49:01, time_cost(all): 17:34:49/1 day, 9:11:20, loss=0.482410198304498, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.9628269936807947, lr=0.06938014912001232
2023-11-15 07:12:06   INFO  epoch: 8/24, acc_iter=54046, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:45:28, time_cost(all): 17:35:48/1 day, 8:51:12, loss=0.482299256156321, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=0.7059430981899932, lr=0.06934005734727174
2023-11-15 07:13:05   INFO  epoch: 8/24, acc_iter=54096, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:46:10, time_cost(all): 17:36:47/1 day, 10:21:18, loss=0.482188314008144, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=4.402575210181974, lr=0.06929996557453115
2023-11-15 07:14:04   INFO  epoch: 8/24, acc_iter=54146, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:41:59, time_cost(all): 17:37:46/1 day, 11:18:15, loss=0.482077371859967, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=0.7165950184281511, lr=0.06925987380179056
2023-11-15 07:15:03   INFO  epoch: 8/24, acc_iter=54196, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:39:37, time_cost(all): 17:38:45/1 day, 10:05:13, loss=0.481966429711791, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=0.5647248583116803, lr=0.06921978202904996
2023-11-15 07:16:02   INFO  epoch: 8/24, acc_iter=54246, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:37:30, time_cost(all): 17:39:44/1 day, 10:51:41, loss=0.481855487563614, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=4.187194903703634, lr=0.06917969025630938
2023-11-15 07:17:01   INFO  epoch: 8/24, acc_iter=54296, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:41:16, time_cost(all): 17:40:43/1 day, 10:12:13, loss=0.481744545415437, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=3.9316083717937973, lr=0.06913959848356879
2023-11-15 07:18:00   INFO  epoch: 8/24, acc_iter=54346, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:32:29, time_cost(all): 17:41:42/1 day, 9:54:20, loss=0.48163360326726, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=4.276188781996385, lr=0.0690995067108282
2023-11-15 07:18:59   INFO  epoch: 8/24, acc_iter=54396, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:36:53, time_cost(all): 17:42:41/1 day, 10:10:16, loss=0.481522661119084, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.504457222375282, lr=0.06905941493808762
2023-11-15 07:19:57   INFO  epoch: 8/24, acc_iter=54446, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:35:30, time_cost(all): 17:43:39/1 day, 10:48:32, loss=0.481411718970907, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=1.4038120911388339, lr=0.06901932316534704
2023-11-15 07:20:56   INFO  epoch: 8/24, acc_iter=54496, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:32:02, time_cost(all): 17:44:38/1 day, 11:30:29, loss=0.48130077682273, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.62106991452239, lr=0.06897923139260645
2023-11-15 07:21:55   INFO  epoch: 8/24, acc_iter=54546, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:30:21, time_cost(all): 17:45:37/1 day, 11:36:12, loss=0.481189834674553, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.16(1.03), norm=2.400550169887719, lr=0.06893913961986586
2023-11-15 07:22:54   INFO  epoch: 8/24, acc_iter=54596, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:30:06, time_cost(all): 17:46:36/1 day, 10:20:38, loss=0.481078892526377, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=1.6852326044121857, lr=0.06889904784712526
2023-11-15 07:23:53   INFO  epoch: 8/24, acc_iter=54646, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:31:40, time_cost(all): 17:47:35/1 day, 11:01:44, loss=0.4809679503782, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.293805090524584, lr=0.06885895607438468
2023-11-15 07:24:52   INFO  epoch: 8/24, acc_iter=54696, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:28:30, time_cost(all): 17:48:34/1 day, 9:54:31, loss=0.480857008230023, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=4.974403405817495, lr=0.06881886430164409
2023-11-15 07:25:51   INFO  epoch: 8/24, acc_iter=54746, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:25:04, time_cost(all): 17:49:33/1 day, 8:39:29, loss=0.480746066081846, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=0.912230557191855, lr=0.0687787725289035
2023-11-15 07:26:50   INFO  epoch: 8/24, acc_iter=54796, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:24:15, time_cost(all): 17:50:32/1 day, 10:30:24, loss=0.48063512393367, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=1.541958169968279, lr=0.0687386807561629
2023-11-15 07:27:49   INFO  epoch: 8/24, acc_iter=54846, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:27:34, time_cost(all): 17:51:31/1 day, 9:42:18, loss=0.480524181785493, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=0.7373733857149531, lr=0.06869858898342232
2023-11-15 07:28:48   INFO  epoch: 8/24, acc_iter=54896, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:30:22, time_cost(all): 17:52:30/1 day, 9:10:24, loss=0.480413239637316, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=3.3178568132582003, lr=0.06865849721068173
2023-11-15 07:29:47   INFO  epoch: 8/24, acc_iter=54946, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:28:14, time_cost(all): 17:53:29/1 day, 8:11:33, loss=0.480302297489139, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=2.313630233596011, lr=0.06861840543794115
2023-11-15 07:30:46   INFO  epoch: 8/24, acc_iter=54996, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:20:00, time_cost(all): 17:54:28/1 day, 10:49:39, loss=0.480191355340963, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.8911507915608747, lr=0.06857831366520056
2023-11-15 07:31:45   INFO  epoch: 8/24, acc_iter=55046, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:27:22, time_cost(all): 17:55:27/1 day, 8:26:35, loss=0.480080413192786, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=2.7290163445102915, lr=0.06853822189245998
2023-11-15 07:32:44   INFO  epoch: 8/24, acc_iter=55096, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:22:00, time_cost(all): 17:56:26/1 day, 11:26:13, loss=0.479969471044609, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.07(1.03), norm=1.4791353923662012, lr=0.06849813011971939
2023-11-15 07:33:42   INFO  epoch: 8/24, acc_iter=55146, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:19:10, time_cost(all): 17:57:24/1 day, 9:15:47, loss=0.479858528896432, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=0.6959352170430737, lr=0.0684580383469788
2023-11-15 07:34:41   INFO  epoch: 8/24, acc_iter=55196, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:58, time_cost(all): 17:58:23/1 day, 8:30:00, loss=0.479747586748256, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=4.955248239898062, lr=0.06841794657423822
2023-11-15 07:35:40   INFO  epoch: 8/24, acc_iter=55246, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:22:35, time_cost(all): 17:59:22/1 day, 9:04:22, loss=0.479636644600079, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=3.7844842399110528, lr=0.06837785480149762
2023-11-15 07:36:39   INFO  epoch: 8/24, acc_iter=55296, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:21:47, time_cost(all): 18:00:21/1 day, 8:43:59, loss=0.479525702451902, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.233010448534754, lr=0.06833776302875703
2023-11-15 07:37:38   INFO  epoch: 8/24, acc_iter=55346, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:16:32, time_cost(all): 18:01:20/1 day, 11:10:24, loss=0.479414760303725, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=2.7907381014004917, lr=0.06829767125601643
2023-11-15 07:38:37   INFO  epoch: 8/24, acc_iter=55396, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:14:36, time_cost(all): 18:02:19/1 day, 10:23:45, loss=0.479303818155549, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.832213368997695, lr=0.06825757948327586
2023-11-15 07:39:36   INFO  epoch: 8/24, acc_iter=55446, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:11:40, time_cost(all): 18:03:18/1 day, 11:01:24, loss=0.479192876007372, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=1.2219883485761565, lr=0.06821748771053526
2023-11-15 07:40:35   INFO  epoch: 8/24, acc_iter=55496, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:14:23, time_cost(all): 18:04:17/1 day, 8:16:20, loss=0.479081933859195, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=4.906220925625495, lr=0.06817739593779468
2023-11-15 07:41:34   INFO  epoch: 8/24, acc_iter=55546, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:11:44, time_cost(all): 18:05:16/1 day, 10:34:38, loss=0.478970991711018, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=4.644237383506702, lr=0.06813730416505409
2023-11-15 07:42:33   INFO  epoch: 8/24, acc_iter=55596, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:09:36, time_cost(all): 18:06:15/1 day, 10:32:29, loss=0.478860049562842, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.406177419738672, lr=0.0680972123923135
2023-11-15 07:43:32   INFO  epoch: 8/24, acc_iter=55646, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:13:06, time_cost(all): 18:07:14/1 day, 11:06:02, loss=0.478749107414665, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=4.845684194467702, lr=0.06805712061957292
2023-11-15 07:44:31   INFO  epoch: 8/24, acc_iter=55696, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:23, time_cost(all): 18:08:13/1 day, 8:53:48, loss=0.478638165266488, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=2.155390600925048, lr=0.06801702884683233
2023-11-15 07:45:30   INFO  epoch: 8/24, acc_iter=55746, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:48, time_cost(all): 18:09:12/1 day, 9:45:56, loss=0.478527223118311, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=1.990961834808557, lr=0.06797693707409175
2023-11-15 07:46:29   INFO  epoch: 8/24, acc_iter=55796, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:07:05, time_cost(all): 18:10:11/1 day, 9:32:42, loss=0.478416280970134, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=1.1211374057475245, lr=0.06793684530135115
2023-11-15 07:47:27   INFO  epoch: 8/24, acc_iter=55846, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:10:14, time_cost(all): 18:11:09/1 day, 8:39:35, loss=0.478305338821958, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=2.365056925318325, lr=0.06789675352861058
2023-11-15 07:48:26   INFO  epoch: 8/24, acc_iter=55896, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:07:19, time_cost(all): 18:12:08/1 day, 10:24:32, loss=0.478194396673781, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=4.708480299681091, lr=0.06785666175586998
2023-11-15 07:49:25   INFO  epoch: 8/24, acc_iter=55946, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:07:01, time_cost(all): 18:13:07/1 day, 10:21:46, loss=0.478083454525604, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=3.761490925864728, lr=0.06781656998312939
2023-11-15 07:50:24   INFO  epoch: 8/24, acc_iter=55996, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:06:34, time_cost(all): 18:14:06/1 day, 10:39:57, loss=0.477972512377427, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=0.7338920919762313, lr=0.06777647821038879
2023-11-15 07:51:23   INFO  epoch: 8/24, acc_iter=56046, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:45, time_cost(all): 18:15:05/1 day, 10:07:41, loss=0.477861570229251, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=1.0685625133975034, lr=0.0677363864376482
2023-11-15 07:52:22   INFO  epoch: 8/24, acc_iter=56096, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:05:10, time_cost(all): 18:16:04/1 day, 8:38:33, loss=0.477750628081074, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=2.946433354963043, lr=0.06769629466490762
2023-11-15 07:53:21   INFO  epoch: 8/24, acc_iter=56146, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:45, time_cost(all): 18:17:03/1 day, 9:08:36, loss=0.477639685932897, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=2.274173453604428, lr=0.06765620289216703
2023-11-15 07:54:20   INFO  epoch: 8/24, acc_iter=56196, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:59:39, time_cost(all): 18:18:02/1 day, 10:37:08, loss=0.47752874378472, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=3.992755602971989, lr=0.06761611111942645
2023-11-15 07:55:19   INFO  epoch: 8/24, acc_iter=56246, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:58:43, time_cost(all): 18:19:01/1 day, 9:05:53, loss=0.477417801636544, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.1231644176241695, lr=0.06757601934668586
2023-11-15 07:56:18   INFO  epoch: 8/24, acc_iter=56296, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/1:01:17, time_cost(all): 18:20:00/1 day, 10:28:23, loss=0.477306859488367, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.3626097007172735, lr=0.06753592757394528
2023-11-15 07:57:17   INFO  epoch: 8/24, acc_iter=56346, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/1:00:11, time_cost(all): 18:20:59/1 day, 9:39:56, loss=0.47719591734019, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=1.7895627053120189, lr=0.06749583580120469
2023-11-15 07:58:16   INFO  epoch: 8/24, acc_iter=56396, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:58:56, time_cost(all): 18:21:58/1 day, 7:54:41, loss=0.477084975192014, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=1.5114690964398545, lr=0.0674557440284641
2023-11-15 07:59:15   INFO  epoch: 8/24, acc_iter=56446, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:59, time_cost(all): 18:22:57/1 day, 8:36:50, loss=0.476974033043837, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=0.6101008490087967, lr=0.0674156522557235
2023-11-15 08:00:14   INFO  epoch: 8/24, acc_iter=56496, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:11, time_cost(all): 18:23:56/1 day, 10:19:09, loss=0.47686309089566, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=4.68286996557763, lr=0.06737556048298292
2023-11-15 08:01:12   INFO  epoch: 8/24, acc_iter=56546, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:46, time_cost(all): 18:24:54/1 day, 9:48:24, loss=0.476752148747483, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=2.1954685886449634, lr=0.06733546871024233
2023-11-15 08:02:11   INFO  epoch: 8/24, acc_iter=56596, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:52:59, time_cost(all): 18:25:53/1 day, 9:59:40, loss=0.476641206599306, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=1.1590635225041583, lr=0.06729537693750175
2023-11-15 08:03:10   INFO  epoch: 8/24, acc_iter=56646, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:58, time_cost(all): 18:26:52/1 day, 8:57:49, loss=0.47653026445113, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=2.6856767782127844, lr=0.06725528516476115
2023-11-15 08:04:09   INFO  epoch: 8/24, acc_iter=56696, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:10, time_cost(all): 18:27:51/1 day, 9:34:40, loss=0.476419322302953, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=3.4387294671301714, lr=0.06721519339202056
2023-11-15 08:05:08   INFO  epoch: 8/24, acc_iter=56746, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:48:24, time_cost(all): 18:28:50/1 day, 10:23:57, loss=0.476308380154776, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=2.152713640567134, lr=0.06717510161927998
2023-11-15 08:06:07   INFO  epoch: 8/24, acc_iter=56796, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:11, time_cost(all): 18:29:49/1 day, 7:37:38, loss=0.476197438006599, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=3.3652517910631063, lr=0.06713500984653939
2023-11-15 08:07:06   INFO  epoch: 8/24, acc_iter=56846, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:49:57, time_cost(all): 18:30:48/1 day, 7:50:35, loss=0.476086495858423, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=1.8407673415780597, lr=0.0670949180737988
2023-11-15 08:08:05   INFO  epoch: 8/24, acc_iter=56896, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:44:39, time_cost(all): 18:31:47/1 day, 9:04:40, loss=0.475975553710246, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=1.2421414247214067, lr=0.06705482630105822
2023-11-15 08:09:04   INFO  epoch: 8/24, acc_iter=56946, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:47:07, time_cost(all): 18:32:46/1 day, 8:30:18, loss=0.475864611562069, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.0(1.03), norm=4.633570769469001, lr=0.06701473452831763
2023-11-15 08:10:03   INFO  epoch: 8/24, acc_iter=56996, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:09, time_cost(all): 18:33:45/1 day, 7:52:19, loss=0.475753669413892, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=3.619388713012266, lr=0.06697464275557705
2023-11-15 08:11:02   INFO  epoch: 8/24, acc_iter=57046, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:46, time_cost(all): 18:34:44/1 day, 8:16:50, loss=0.475642727265716, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=0.7655386000350985, lr=0.06693455098283646
2023-11-15 08:12:01   INFO  epoch: 8/24, acc_iter=57096, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:41:00, time_cost(all): 18:35:43/1 day, 10:05:29, loss=0.475531785117539, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=4.258341399281964, lr=0.06689445921009586
2023-11-15 08:13:00   INFO  epoch: 8/24, acc_iter=57146, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:00, time_cost(all): 18:36:42/1 day, 10:08:31, loss=0.475420842969362, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=1.871080848172138, lr=0.06685436743735527
2023-11-15 08:13:59   INFO  epoch: 8/24, acc_iter=57196, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:48, time_cost(all): 18:37:41/1 day, 8:21:16, loss=0.475309900821185, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=3.8645528029710197, lr=0.06681427566461469
2023-11-15 08:14:58   INFO  epoch: 8/24, acc_iter=57246, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:11, time_cost(all): 18:38:40/1 day, 8:30:03, loss=0.475198958673009, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=1.7425673777285398, lr=0.0667741838918741
2023-11-15 08:15:56   INFO  epoch: 8/24, acc_iter=57296, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:29, time_cost(all): 18:39:38/1 day, 10:00:34, loss=0.475088016524832, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=1.743914621651022, lr=0.0667340921191335
2023-11-15 08:16:55   INFO  epoch: 8/24, acc_iter=57346, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:35, time_cost(all): 18:40:37/1 day, 10:33:17, loss=0.474977074376655, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.23(1.03), norm=1.272330620527757, lr=0.06669400034639292
2023-11-15 08:17:54   INFO  epoch: 8/24, acc_iter=57396, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:37:17, time_cost(all): 18:41:36/1 day, 10:03:23, loss=0.474866132228479, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=1.1486921467312308, lr=0.06665390857365233
2023-11-15 08:18:53   INFO  epoch: 8/24, acc_iter=57446, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:36, time_cost(all): 18:42:35/1 day, 8:48:46, loss=0.474755190080302, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=2.798252155934295, lr=0.06661381680091175
2023-11-15 08:19:52   INFO  epoch: 8/24, acc_iter=57496, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:36:48, time_cost(all): 18:43:34/1 day, 9:06:42, loss=0.474644247932125, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=1.426471492973097, lr=0.06657372502817116
2023-11-15 08:20:51   INFO  epoch: 8/24, acc_iter=57546, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:47, time_cost(all): 18:44:33/1 day, 10:22:16, loss=0.474533305783948, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=4.879981268185259, lr=0.06653363325543057
2023-11-15 08:21:50   INFO  epoch: 8/24, acc_iter=57596, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:34:20, time_cost(all): 18:45:32/1 day, 10:02:15, loss=0.474422363635771, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.83(1.03), norm=3.2712599525443826, lr=0.06649354148268999
2023-11-15 08:22:49   INFO  epoch: 8/24, acc_iter=57646, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:29, time_cost(all): 18:46:31/1 day, 9:17:30, loss=0.474311421487595, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=2.091824113441292, lr=0.0664534497099494
2023-11-15 08:23:48   INFO  epoch: 8/24, acc_iter=57696, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:13, time_cost(all): 18:47:30/1 day, 9:00:09, loss=0.474200479339418, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=1.879947318206884, lr=0.06641335793720882
2023-11-15 08:24:47   INFO  epoch: 8/24, acc_iter=57746, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:39, time_cost(all): 18:48:29/1 day, 9:37:12, loss=0.474089537191241, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=0.5204497296968666, lr=0.06637326616446822
2023-11-15 08:25:46   INFO  epoch: 8/24, acc_iter=57796, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:19, time_cost(all): 18:49:28/1 day, 7:30:38, loss=0.473978595043064, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=2.5889201408705556, lr=0.06633317439172763
2023-11-15 08:26:45   INFO  epoch: 8/24, acc_iter=57846, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:59, time_cost(all): 18:50:27/1 day, 7:36:07, loss=0.473867652894888, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.4498611278670586, lr=0.06629308261898705
2023-11-15 08:27:44   INFO  epoch: 8/24, acc_iter=57896, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:16, time_cost(all): 18:51:26/1 day, 10:21:02, loss=0.473756710746711, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=4.141972087250256, lr=0.06625299084624646
2023-11-15 08:28:43   INFO  epoch: 8/24, acc_iter=57946, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:01, time_cost(all): 18:52:25/1 day, 9:15:10, loss=0.473645768598534, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=2.011800665439803, lr=0.06621289907350586
2023-11-15 08:29:41   INFO  epoch: 8/24, acc_iter=57996, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:33, time_cost(all): 18:53:23/1 day, 10:05:24, loss=0.473534826450357, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=3.967149920279629, lr=0.06617280730076527
2023-11-15 08:30:40   INFO  epoch: 8/24, acc_iter=58046, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:01, time_cost(all): 18:54:22/1 day, 10:20:42, loss=0.473423884302181, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=4.294650317609419, lr=0.06613271552802469
2023-11-15 08:31:39   INFO  epoch: 8/24, acc_iter=58096, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:22, time_cost(all): 18:55:21/1 day, 10:16:41, loss=0.473312942154004, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=1.42141973111622, lr=0.0660926237552841
2023-11-15 08:32:38   INFO  epoch: 8/24, acc_iter=58146, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:25, time_cost(all): 18:56:20/1 day, 7:56:54, loss=0.473202000005827, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=1.6777269343396162, lr=0.06605253198254352
2023-11-15 08:33:37   INFO  epoch: 8/24, acc_iter=58196, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:46, time_cost(all): 18:57:19/1 day, 8:37:45, loss=0.47309105785765, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=0.5732423572943499, lr=0.06601244020980293
2023-11-15 08:34:36   INFO  epoch: 8/24, acc_iter=58246, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:21:21, time_cost(all): 18:58:18/1 day, 7:12:14, loss=0.472980115709474, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=4.261889856422634, lr=0.06597234843706234
2023-11-15 08:35:35   INFO  epoch: 8/24, acc_iter=58296, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:19, time_cost(all): 18:59:17/1 day, 10:24:17, loss=0.472869173561297, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.0362725631159675, lr=0.06593225666432176
2023-11-15 08:36:34   INFO  epoch: 8/24, acc_iter=58346, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:19:07, time_cost(all): 19:00:16/1 day, 9:16:10, loss=0.47275823141312, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=3.274908176079752, lr=0.06589216489158117
2023-11-15 08:37:33   INFO  epoch: 8/24, acc_iter=58396, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:57, time_cost(all): 19:01:15/1 day, 8:07:26, loss=0.472647289264943, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.227844756688298, lr=0.06585207311884057
2023-11-15 08:38:32   INFO  epoch: 8/24, acc_iter=58446, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:28, time_cost(all): 19:02:14/1 day, 7:40:18, loss=0.472536347116767, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=0.6953108633846248, lr=0.06581198134609999
2023-11-15 08:39:31   INFO  epoch: 8/24, acc_iter=58496, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:16:01, time_cost(all): 19:03:13/1 day, 9:20:45, loss=0.47242540496859, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=3.302392820362172, lr=0.06577188957335939
2023-11-15 08:40:30   INFO  epoch: 8/24, acc_iter=58546, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:43, time_cost(all): 19:04:12/1 day, 8:05:26, loss=0.472314462820413, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=0.6006360764241381, lr=0.0657317978006188
2023-11-15 08:41:29   INFO  epoch: 8/24, acc_iter=58596, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:47, time_cost(all): 19:05:11/1 day, 9:01:27, loss=0.472203520672236, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.4627064209748277, lr=0.06569170602787822
2023-11-15 08:42:28   INFO  epoch: 8/24, acc_iter=58646, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:58, time_cost(all): 19:06:10/1 day, 8:38:39, loss=0.47209257852406, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=1.227973354450881, lr=0.06565161425513763
2023-11-15 08:43:26   INFO  epoch: 8/24, acc_iter=58696, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:46, time_cost(all): 19:07:08/1 day, 9:37:41, loss=0.471981636375883, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=1.4166709639397057, lr=0.06561152248239704
2023-11-15 08:44:25   INFO  epoch: 8/24, acc_iter=58746, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:52, time_cost(all): 19:08:07/1 day, 7:39:12, loss=0.471870694227706, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=0.8401445367761753, lr=0.06557143070965646
2023-11-15 08:45:24   INFO  epoch: 8/24, acc_iter=58796, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:41, time_cost(all): 19:09:06/1 day, 9:19:47, loss=0.471759752079529, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=1.6398643127646588, lr=0.06553133893691587
2023-11-15 08:46:23   INFO  epoch: 8/24, acc_iter=58846, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:27, time_cost(all): 19:10:05/1 day, 7:22:48, loss=0.471648809931353, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=3.3939445571592053, lr=0.06549124716417529
2023-11-15 08:47:22   INFO  epoch: 8/24, acc_iter=58896, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:25, time_cost(all): 19:11:04/1 day, 8:02:15, loss=0.471537867783176, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=2.1889123245582978, lr=0.0654511553914347
2023-11-15 08:48:21   INFO  epoch: 8/24, acc_iter=58946, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:45, time_cost(all): 19:12:03/1 day, 8:50:26, loss=0.471426925634999, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=3.1747498098942706, lr=0.0654110636186941
2023-11-15 08:49:20   INFO  epoch: 8/24, acc_iter=58996, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:48, time_cost(all): 19:13:02/1 day, 7:25:20, loss=0.471315983486822, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=4.42027589727724, lr=0.06537097184595353
2023-11-15 08:50:19   INFO  epoch: 8/24, acc_iter=59046, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:46, time_cost(all): 19:14:01/1 day, 9:48:08, loss=0.471205041338646, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.768729308534938, lr=0.06533088007321293
2023-11-15 08:51:18   INFO  epoch: 8/24, acc_iter=59096, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:49, time_cost(all): 19:15:00/1 day, 8:49:59, loss=0.471094099190469, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=4.668021702066532, lr=0.06529078830047234
2023-11-15 08:52:17   INFO  epoch: 8/24, acc_iter=59146, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:42, time_cost(all): 19:15:59/1 day, 7:57:56, loss=0.470983157042292, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=3.458455164315392, lr=0.06525069652773174
2023-11-15 08:53:16   INFO  epoch: 8/24, acc_iter=59196, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:37, time_cost(all): 19:16:58/1 day, 9:16:43, loss=0.470872214894115, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=4.31614798675349, lr=0.06521060475499116
2023-11-15 08:54:15   INFO  epoch: 8/24, acc_iter=59246, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:43, time_cost(all): 19:17:57/1 day, 8:50:10, loss=0.470761272745939, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=1.4237678829558107, lr=0.06517051298225057
2023-11-15 08:55:14   INFO  epoch: 9/24, acc_iter=59333, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:12:52, time_cost(all): 19:18:56/1 day, 7:23:45, loss=0.470568233408111, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.1693949613798853, lr=0.06510075329768195
2023-11-15 08:56:13   INFO  epoch: 9/24, acc_iter=59383, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:56, time_cost(all): 19:19:55/1 day, 9:27:01, loss=0.470457291259934, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=2.51511809076787, lr=0.06506066152494136
2023-11-15 08:57:11   INFO  epoch: 9/24, acc_iter=59433, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:11:43, time_cost(all): 19:20:53/1 day, 8:20:48, loss=0.470346349111758, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=1.5050796688970736, lr=0.06502056975220077
2023-11-15 08:58:10   INFO  epoch: 9/24, acc_iter=59483, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:01:04, time_cost(all): 19:21:52/1 day, 7:00:59, loss=0.470235406963581, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=3.3475648567318497, lr=0.06498047797946019
2023-11-15 08:59:09   INFO  epoch: 9/24, acc_iter=59533, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:09:04, time_cost(all): 19:22:51/1 day, 9:15:31, loss=0.470124464815404, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=1.0799806076127068, lr=0.0649403862067196
2023-11-15 09:00:08   INFO  epoch: 9/24, acc_iter=59583, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:23, time_cost(all): 19:23:50/1 day, 9:32:27, loss=0.470013522667227, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.3228291401525216, lr=0.06490029443397902
2023-11-15 09:01:07   INFO  epoch: 9/24, acc_iter=59633, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:00:55, time_cost(all): 19:24:49/1 day, 7:42:30, loss=0.469902580519051, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=1.3349359405870749, lr=0.06486020266123843
2023-11-15 09:02:06   INFO  epoch: 9/24, acc_iter=59683, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:07:08, time_cost(all): 19:25:48/1 day, 9:35:41, loss=0.469791638370874, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=1.3341862936459576, lr=0.06482011088849784
2023-11-15 09:03:05   INFO  epoch: 9/24, acc_iter=59733, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:57:07, time_cost(all): 19:26:47/1 day, 8:46:51, loss=0.469680696222697, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=0.5702269498776438, lr=0.06478001911575726
2023-11-15 09:04:04   INFO  epoch: 9/24, acc_iter=59783, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:00:22, time_cost(all): 19:27:46/1 day, 9:32:20, loss=0.46956975407452, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=4.993195461896705, lr=0.06473992734301666
2023-11-15 09:05:03   INFO  epoch: 9/24, acc_iter=59833, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:57:47, time_cost(all): 19:28:45/1 day, 7:09:21, loss=0.469458811926344, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=1.7768385658445187, lr=0.06469983557027607
2023-11-15 09:06:02   INFO  epoch: 9/24, acc_iter=59883, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:52:36, time_cost(all): 19:29:44/1 day, 7:24:44, loss=0.469347869778167, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=1.1151383334668425, lr=0.06465974379753549
2023-11-15 09:07:01   INFO  epoch: 9/24, acc_iter=59933, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/2:01:27, time_cost(all): 19:30:43/1 day, 8:31:58, loss=0.46923692762999, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=3.0931385003904546, lr=0.0646196520247949
2023-11-15 09:08:00   INFO  epoch: 9/24, acc_iter=59983, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/2:00:50, time_cost(all): 19:31:42/1 day, 9:02:17, loss=0.469125985481813, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=1.5164502470380024, lr=0.0645795602520543
2023-11-15 09:08:59   INFO  epoch: 9/24, acc_iter=60033, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:58:10, time_cost(all): 19:32:41/1 day, 7:47:09, loss=0.469015043333637, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.059772522326592, lr=0.06453946847931372
2023-11-15 09:09:58   INFO  epoch: 9/24, acc_iter=60083, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:52:46, time_cost(all): 19:33:40/1 day, 8:07:10, loss=0.46890410118546, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=3.732527087741751, lr=0.06449937670657313
2023-11-15 09:10:56   INFO  epoch: 9/24, acc_iter=60133, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:52:53, time_cost(all): 19:34:38/1 day, 8:29:59, loss=0.468793159037283, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=3.931062486873359, lr=0.06445928493383254
2023-11-15 09:11:55   INFO  epoch: 9/24, acc_iter=60183, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:55:37, time_cost(all): 19:35:37/1 day, 8:50:39, loss=0.468682216889106, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=2.077289404280515, lr=0.06441919316109196
2023-11-15 09:12:54   INFO  epoch: 9/24, acc_iter=60233, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:49:45, time_cost(all): 19:36:36/1 day, 7:36:36, loss=0.46857127474093, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=1.39421934090409, lr=0.06437910138835137
2023-11-15 09:13:53   INFO  epoch: 9/24, acc_iter=60283, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:46:49, time_cost(all): 19:37:35/1 day, 7:32:03, loss=0.468460332592753, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=0.8515341849755699, lr=0.06433900961561079
2023-11-15 09:14:52   INFO  epoch: 9/24, acc_iter=60333, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:48:18, time_cost(all): 19:38:34/1 day, 7:21:43, loss=0.468349390444576, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=3.6598025326848846, lr=0.0642989178428702
2023-11-15 09:15:51   INFO  epoch: 9/24, acc_iter=60383, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:43:44, time_cost(all): 19:39:33/1 day, 6:50:07, loss=0.468238448296399, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=4.075395362696323, lr=0.06425882607012962
2023-11-15 09:16:50   INFO  epoch: 9/24, acc_iter=60433, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:44:14, time_cost(all): 19:40:32/1 day, 7:24:56, loss=0.468127506148223, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=0.7797511045286671, lr=0.06421873429738902
2023-11-15 09:17:49   INFO  epoch: 9/24, acc_iter=60483, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:40:40, time_cost(all): 19:41:31/1 day, 9:26:04, loss=0.468016564000046, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=2.9075517797775112, lr=0.06417864252464843
2023-11-15 09:18:48   INFO  epoch: 9/24, acc_iter=60533, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:41:11, time_cost(all): 19:42:30/1 day, 8:10:12, loss=0.467905621851869, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.14(1.03), norm=3.283030549922555, lr=0.06413855075190783
2023-11-15 09:19:47   INFO  epoch: 9/24, acc_iter=60583, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:43:09, time_cost(all): 19:43:29/1 day, 7:03:41, loss=0.467794679703692, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=3.997341789038772, lr=0.06409845897916724
2023-11-15 09:20:46   INFO  epoch: 9/24, acc_iter=60633, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:45:13, time_cost(all): 19:44:28/1 day, 7:07:58, loss=0.467683737555516, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=1.5142201530183101, lr=0.06405836720642666
2023-11-15 09:21:45   INFO  epoch: 9/24, acc_iter=60683, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:46:42, time_cost(all): 19:45:27/1 day, 8:18:21, loss=0.467572795407339, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.863983588979208, lr=0.06401827543368607
2023-11-15 09:22:44   INFO  epoch: 9/24, acc_iter=60733, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:41:42, time_cost(all): 19:46:26/1 day, 7:33:14, loss=0.467461853259162, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=1.284295886213107, lr=0.06397818366094549
2023-11-15 09:23:43   INFO  epoch: 9/24, acc_iter=60783, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:43:05, time_cost(all): 19:47:25/1 day, 7:03:49, loss=0.467350911110985, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.6177461043299637, lr=0.0639380918882049
2023-11-15 09:24:41   INFO  epoch: 9/24, acc_iter=60833, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:37:01, time_cost(all): 19:48:23/1 day, 7:43:37, loss=0.467239968962809, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=4.438599140894997, lr=0.06389800011546432
2023-11-15 09:25:40   INFO  epoch: 9/24, acc_iter=60883, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:36:16, time_cost(all): 19:49:22/1 day, 8:06:48, loss=0.467129026814632, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.14(1.03), norm=1.5343868461956731, lr=0.06385790834272373
2023-11-15 09:26:39   INFO  epoch: 9/24, acc_iter=60933, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:23, time_cost(all): 19:50:21/1 day, 8:54:50, loss=0.467018084666455, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=4.00889259641675, lr=0.06381781656998314
2023-11-15 09:27:38   INFO  epoch: 9/24, acc_iter=60983, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:35:02, time_cost(all): 19:51:20/1 day, 8:59:29, loss=0.466907142518278, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=3.6913338059508107, lr=0.06377772479724254
2023-11-15 09:28:37   INFO  epoch: 9/24, acc_iter=61033, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:37:54, time_cost(all): 19:52:19/1 day, 9:26:17, loss=0.466796200370102, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.767966857399539, lr=0.06373763302450197
2023-11-15 09:29:36   INFO  epoch: 9/24, acc_iter=61083, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:35:11, time_cost(all): 19:53:18/1 day, 6:54:40, loss=0.466685258221925, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=1.8110358813410479, lr=0.06369754125176137
2023-11-15 09:30:35   INFO  epoch: 9/24, acc_iter=61133, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:36:47, time_cost(all): 19:54:17/1 day, 7:35:39, loss=0.466574316073748, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=1.6297706567684573, lr=0.06365744947902079
2023-11-15 09:31:34   INFO  epoch: 9/24, acc_iter=61183, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:32:39, time_cost(all): 19:55:16/1 day, 8:06:54, loss=0.466463373925571, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.568854082438303, lr=0.06361735770628019
2023-11-15 09:32:33   INFO  epoch: 9/24, acc_iter=61233, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:33:18, time_cost(all): 19:56:15/1 day, 8:31:10, loss=0.466352431777395, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.038709412836847, lr=0.0635772659335396
2023-11-15 09:33:32   INFO  epoch: 9/24, acc_iter=61283, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:26:51, time_cost(all): 19:57:14/1 day, 9:00:44, loss=0.466241489629218, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=2.1529858701712716, lr=0.06353717416079901
2023-11-15 09:34:31   INFO  epoch: 9/24, acc_iter=61333, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:31:01, time_cost(all): 19:58:13/1 day, 9:13:46, loss=0.466130547481041, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=0.7890099412945719, lr=0.06349708238805843
2023-11-15 09:35:30   INFO  epoch: 9/24, acc_iter=61383, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:26:04, time_cost(all): 19:59:12/1 day, 8:58:54, loss=0.466019605332864, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=4.510494737069725, lr=0.06345699061531784
2023-11-15 09:36:29   INFO  epoch: 9/24, acc_iter=61433, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:29:41, time_cost(all): 20:00:11/1 day, 8:02:14, loss=0.465908663184688, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=0.884020236663061, lr=0.06341689884257726
2023-11-15 09:37:28   INFO  epoch: 9/24, acc_iter=61483, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:22:19, time_cost(all): 20:01:10/1 day, 7:39:10, loss=0.465797721036511, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=1.9690121444535857, lr=0.06337680706983667
2023-11-15 09:38:26   INFO  epoch: 9/24, acc_iter=61533, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:25, time_cost(all): 20:02:08/1 day, 9:03:58, loss=0.465686778888334, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=3.195484993623234, lr=0.06333671529709609
2023-11-15 09:39:25   INFO  epoch: 9/24, acc_iter=61583, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:23:45, time_cost(all): 20:03:07/1 day, 7:23:16, loss=0.465575836740157, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=4.438102602815646, lr=0.0632966235243555
2023-11-15 09:40:24   INFO  epoch: 9/24, acc_iter=61633, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:24:06, time_cost(all): 20:04:06/1 day, 6:54:40, loss=0.465464894591981, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=1.5410174384695985, lr=0.0632565317516149
2023-11-15 09:41:23   INFO  epoch: 9/24, acc_iter=61683, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:19:00, time_cost(all): 20:05:05/1 day, 8:54:11, loss=0.465353952443804, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.9010725467809357, lr=0.06321643997887431
2023-11-15 09:42:22   INFO  epoch: 9/24, acc_iter=61733, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:17, time_cost(all): 20:06:04/1 day, 6:44:41, loss=0.465243010295627, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.87(1.03), norm=0.6015327530161843, lr=0.06317634820613373
2023-11-15 09:43:21   INFO  epoch: 9/24, acc_iter=61783, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:33, time_cost(all): 20:07:03/1 day, 6:45:23, loss=0.46513206814745, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=4.652304948157541, lr=0.06313625643339314
2023-11-15 09:44:20   INFO  epoch: 9/24, acc_iter=61833, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:20:09, time_cost(all): 20:08:02/1 day, 8:46:03, loss=0.465021125999274, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=1.4163285936251377, lr=0.06309616466065254
2023-11-15 09:45:19   INFO  epoch: 9/24, acc_iter=61883, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:19:41, time_cost(all): 20:09:01/1 day, 6:37:11, loss=0.464910183851097, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=1.8058422786463277, lr=0.06305607288791196
2023-11-15 09:46:18   INFO  epoch: 9/24, acc_iter=61933, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:15:58, time_cost(all): 20:10:00/1 day, 6:26:55, loss=0.46479924170292, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=3.5688305347510676, lr=0.06301598111517137
2023-11-15 09:47:17   INFO  epoch: 9/24, acc_iter=61983, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:08, time_cost(all): 20:10:59/1 day, 8:53:25, loss=0.464688299554743, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=3.1074134231226833, lr=0.06297588934243079
2023-11-15 09:48:16   INFO  epoch: 9/24, acc_iter=62033, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:12:28, time_cost(all): 20:11:58/1 day, 8:53:55, loss=0.464577357406567, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.5874668546069373, lr=0.0629357975696902
2023-11-15 09:49:15   INFO  epoch: 9/24, acc_iter=62083, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:23, time_cost(all): 20:12:57/1 day, 7:16:48, loss=0.46446641525839, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=0.8816306493672111, lr=0.06289570579694961
2023-11-15 09:50:14   INFO  epoch: 9/24, acc_iter=62133, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:29, time_cost(all): 20:13:56/1 day, 8:51:54, loss=0.464355473110213, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.924205525942008, lr=0.06285561402420903
2023-11-15 09:51:13   INFO  epoch: 9/24, acc_iter=62183, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:11:58, time_cost(all): 20:14:55/1 day, 8:02:27, loss=0.464244530962036, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=4.4118354894551075, lr=0.06281552225146844
2023-11-15 09:52:11   INFO  epoch: 9/24, acc_iter=62233, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:10:26, time_cost(all): 20:15:53/1 day, 6:22:54, loss=0.46413358881386, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=4.931001710290486, lr=0.06277543047872786
2023-11-15 09:53:10   INFO  epoch: 9/24, acc_iter=62283, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:08:15, time_cost(all): 20:16:52/1 day, 7:30:50, loss=0.464022646665683, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.029541188225463, lr=0.06273533870598726
2023-11-15 09:54:09   INFO  epoch: 9/24, acc_iter=62333, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:08:18, time_cost(all): 20:17:51/1 day, 7:10:19, loss=0.463911704517506, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=4.220448711710271, lr=0.06269524693324667
2023-11-15 09:55:08   INFO  epoch: 9/24, acc_iter=62383, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:23, time_cost(all): 20:18:50/1 day, 7:36:55, loss=0.463800762369329, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.580385211035028, lr=0.06265515516050608
2023-11-15 09:56:07   INFO  epoch: 9/24, acc_iter=62433, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:07:42, time_cost(all): 20:19:49/1 day, 8:53:32, loss=0.463689820221153, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=3.672109770371309, lr=0.06261506338776548
2023-11-15 09:57:06   INFO  epoch: 9/24, acc_iter=62483, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:08:32, time_cost(all): 20:20:48/1 day, 6:04:48, loss=0.463578878072976, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.848031808190106, lr=0.0625749716150249
2023-11-15 09:58:05   INFO  epoch: 9/24, acc_iter=62533, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:08:34, time_cost(all): 20:21:47/1 day, 7:07:03, loss=0.463467935924799, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.6150511904016622, lr=0.06253487984228431
2023-11-15 09:59:04   INFO  epoch: 9/24, acc_iter=62583, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:23, time_cost(all): 20:22:46/1 day, 6:28:04, loss=0.463356993776622, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=2.2476179809084638, lr=0.062494788069543734
2023-11-15 10:00:03   INFO  epoch: 9/24, acc_iter=62633, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:40, time_cost(all): 20:23:45/1 day, 7:36:57, loss=0.463246051628446, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=4.041331371728891, lr=0.06245469629680314
2023-11-15 10:01:02   INFO  epoch: 9/24, acc_iter=62683, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:03:34, time_cost(all): 20:24:44/1 day, 6:41:36, loss=0.463135109480269, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=2.694918915238924, lr=0.062414604524062556
2023-11-15 10:02:01   INFO  epoch: 9/24, acc_iter=62733, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:00:00, time_cost(all): 20:25:43/1 day, 8:21:21, loss=0.463024167332092, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=0.7690088981213219, lr=0.06237451275132196
2023-11-15 10:03:00   INFO  epoch: 9/24, acc_iter=62783, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:02:01, time_cost(all): 20:26:42/1 day, 5:57:18, loss=0.462913225183915, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.91(1.03), norm=4.233406173501656, lr=0.062334420978581384
2023-11-15 10:03:59   INFO  epoch: 9/24, acc_iter=62833, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:20, time_cost(all): 20:27:41/1 day, 7:03:14, loss=0.462802283035739, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=4.464105449182397, lr=0.06229432920584079
2023-11-15 10:04:58   INFO  epoch: 9/24, acc_iter=62883, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/1:00:13, time_cost(all): 20:28:40/1 day, 6:47:36, loss=0.462691340887562, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=3.9849801757394028, lr=0.062254237433100205
2023-11-15 10:05:56   INFO  epoch: 9/24, acc_iter=62933, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:16, time_cost(all): 20:29:38/1 day, 7:22:25, loss=0.462580398739385, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=4.0370913034297455, lr=0.06221414566035961
2023-11-15 10:06:55   INFO  epoch: 9/24, acc_iter=62983, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:29, time_cost(all): 20:30:37/1 day, 8:35:50, loss=0.462469456591208, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.5162574518647634, lr=0.06217405388761903
2023-11-15 10:07:54   INFO  epoch: 9/24, acc_iter=63033, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:05, time_cost(all): 20:31:36/1 day, 6:14:03, loss=0.462358514443032, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=3.915294328761153, lr=0.06213396211487844
2023-11-15 10:08:53   INFO  epoch: 9/24, acc_iter=63083, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:38, time_cost(all): 20:32:35/1 day, 7:56:35, loss=0.462247572294855, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.87(1.03), norm=2.4241253170623818, lr=0.06209387034213785
2023-11-15 10:09:52   INFO  epoch: 9/24, acc_iter=63133, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:54, time_cost(all): 20:33:34/1 day, 7:51:40, loss=0.462136630146678, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=1.108806696151991, lr=0.06205377856939726
2023-11-15 10:10:51   INFO  epoch: 9/24, acc_iter=63183, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:52:42, time_cost(all): 20:34:33/1 day, 6:50:06, loss=0.462025687998501, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=4.326112281583611, lr=0.06201368679665667
2023-11-15 10:11:50   INFO  epoch: 9/24, acc_iter=63233, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:50:29, time_cost(all): 20:35:32/1 day, 6:05:52, loss=0.461914745850324, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.21724934348248, lr=0.06197359502391609
2023-11-15 10:12:49   INFO  epoch: 9/24, acc_iter=63283, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:50, time_cost(all): 20:36:31/1 day, 8:25:35, loss=0.461803803702148, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=4.688537593061185, lr=0.0619335032511755
2023-11-15 10:13:48   INFO  epoch: 9/24, acc_iter=63333, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:47, time_cost(all): 20:37:30/1 day, 8:33:50, loss=0.461692861553971, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.6059529179748169, lr=0.06189341147843491
2023-11-15 10:14:47   INFO  epoch: 9/24, acc_iter=63383, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:46:36, time_cost(all): 20:38:29/1 day, 8:02:04, loss=0.461581919405794, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=2.8035828565540126, lr=0.06185331970569432
2023-11-15 10:15:46   INFO  epoch: 9/24, acc_iter=63433, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:49:38, time_cost(all): 20:39:28/1 day, 7:06:27, loss=0.461470977257617, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.7810911494868593, lr=0.06181322793295373
2023-11-15 10:16:45   INFO  epoch: 9/24, acc_iter=63483, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:47:21, time_cost(all): 20:40:27/1 day, 8:05:57, loss=0.461360035109441, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=2.5906607018805485, lr=0.06177313616021315
2023-11-15 10:17:44   INFO  epoch: 9/24, acc_iter=63533, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:46:41, time_cost(all): 20:41:26/1 day, 8:04:17, loss=0.461249092961264, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.9(1.03), norm=2.56897941736914, lr=0.061733044387472555
2023-11-15 10:18:43   INFO  epoch: 9/24, acc_iter=63583, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:32, time_cost(all): 20:42:25/1 day, 7:16:59, loss=0.461138150813087, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=3.378938824792176, lr=0.06169295261473197
2023-11-15 10:19:41   INFO  epoch: 9/24, acc_iter=63633, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:43, time_cost(all): 20:43:23/1 day, 7:17:49, loss=0.46102720866491, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.0718140096719684, lr=0.06165286084199138
2023-11-15 10:20:40   INFO  epoch: 9/24, acc_iter=63683, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:34, time_cost(all): 20:44:22/1 day, 5:38:35, loss=0.460916266516734, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=2.248716142361132, lr=0.0616127690692508
2023-11-15 10:21:39   INFO  epoch: 9/24, acc_iter=63733, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:04, time_cost(all): 20:45:21/1 day, 8:01:45, loss=0.460805324368557, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.03(1.03), norm=0.9894058007854017, lr=0.061572677296510205
2023-11-15 10:22:38   INFO  epoch: 9/24, acc_iter=63783, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:42:40, time_cost(all): 20:46:20/1 day, 6:36:44, loss=0.46069438222038, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.999444833818174, lr=0.06153258552376962
2023-11-15 10:23:37   INFO  epoch: 9/24, acc_iter=63833, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:15, time_cost(all): 20:47:19/1 day, 8:27:12, loss=0.460583440072203, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=4.057317721128907, lr=0.061492493751029026
2023-11-15 10:24:36   INFO  epoch: 9/24, acc_iter=63883, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:26, time_cost(all): 20:48:18/1 day, 8:11:13, loss=0.460472497924027, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=2.165668996536669, lr=0.06145240197828844
2023-11-15 10:25:35   INFO  epoch: 9/24, acc_iter=63933, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:33, time_cost(all): 20:49:17/1 day, 6:56:22, loss=0.46036155577585, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.8110123368433246, lr=0.061412310205547854
2023-11-15 10:26:34   INFO  epoch: 9/24, acc_iter=63983, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:27, time_cost(all): 20:50:16/1 day, 6:48:44, loss=0.460250613627673, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.209719228521872, lr=0.06137221843280727
2023-11-15 10:27:33   INFO  epoch: 9/24, acc_iter=64033, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:27, time_cost(all): 20:51:15/1 day, 7:43:41, loss=0.460139671479497, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=4.040340641994092, lr=0.061332126660066676
2023-11-15 10:28:32   INFO  epoch: 9/24, acc_iter=64083, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:24, time_cost(all): 20:52:14/1 day, 6:30:13, loss=0.46002872933132, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=1.8569039900898865, lr=0.06129203488732609
2023-11-15 10:29:31   INFO  epoch: 9/24, acc_iter=64133, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:44, time_cost(all): 20:53:13/1 day, 6:18:04, loss=0.459917787183143, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=1.538560244397563, lr=0.061251943114585504
2023-11-15 10:30:30   INFO  epoch: 9/24, acc_iter=64183, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:27, time_cost(all): 20:54:12/1 day, 5:24:35, loss=0.459806845034966, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=1.3496272015999173, lr=0.06121185134184491
2023-11-15 10:31:29   INFO  epoch: 9/24, acc_iter=64233, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:33:00, time_cost(all): 20:55:11/1 day, 7:24:47, loss=0.459695902886789, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=2.282190197626294, lr=0.061171759569104325
2023-11-15 10:32:28   INFO  epoch: 9/24, acc_iter=64283, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:04, time_cost(all): 20:56:10/1 day, 8:07:15, loss=0.459584960738613, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=1.7686818795980401, lr=0.06113166779636373
2023-11-15 10:33:26   INFO  epoch: 9/24, acc_iter=64333, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:28:50, time_cost(all): 20:57:08/1 day, 5:31:51, loss=0.459474018590436, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=2.5454160793408294, lr=0.06109157602362315
2023-11-15 10:34:25   INFO  epoch: 9/24, acc_iter=64383, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:30:03, time_cost(all): 20:58:07/1 day, 7:41:29, loss=0.459363076442259, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=0.5000058810799985, lr=0.06105148425088256
2023-11-15 10:35:24   INFO  epoch: 9/24, acc_iter=64433, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:54, time_cost(all): 20:59:06/1 day, 6:36:41, loss=0.459252134294082, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=2.3275376193991733, lr=0.061011392478141975
2023-11-15 10:36:23   INFO  epoch: 9/24, acc_iter=64483, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:28:34, time_cost(all): 21:00:05/1 day, 6:48:05, loss=0.459141192145906, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.6330279087253078, lr=0.06097130070540138
2023-11-15 10:37:22   INFO  epoch: 9/24, acc_iter=64533, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:21, time_cost(all): 21:01:04/1 day, 8:02:00, loss=0.459030249997729, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.546750308311803, lr=0.060931208932660796
2023-11-15 10:38:21   INFO  epoch: 9/24, acc_iter=64583, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:58, time_cost(all): 21:02:03/1 day, 7:22:16, loss=0.458919307849552, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=4.500864701441406, lr=0.06089111715992021
2023-11-15 10:39:20   INFO  epoch: 9/24, acc_iter=64633, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:45, time_cost(all): 21:03:02/1 day, 7:07:31, loss=0.458808365701375, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=2.244883329222006, lr=0.06085102538717962
2023-11-15 10:40:19   INFO  epoch: 9/24, acc_iter=64683, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:12, time_cost(all): 21:04:01/1 day, 6:41:15, loss=0.458697423553199, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=3.6701742420001566, lr=0.06081093361443903
2023-11-15 10:41:18   INFO  epoch: 9/24, acc_iter=64733, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:45, time_cost(all): 21:05:00/1 day, 7:03:05, loss=0.458586481405022, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=1.6019764915133223, lr=0.060770841841698446
2023-11-15 10:42:17   INFO  epoch: 9/24, acc_iter=64783, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:31, time_cost(all): 21:05:59/1 day, 5:46:53, loss=0.458475539256845, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=3.0496977093501725, lr=0.06073075006895785
2023-11-15 10:43:16   INFO  epoch: 9/24, acc_iter=64833, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:03, time_cost(all): 21:06:58/1 day, 6:32:50, loss=0.458364597108668, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=0.6446121092714836, lr=0.06069065829621727
2023-11-15 10:44:15   INFO  epoch: 9/24, acc_iter=64883, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:45, time_cost(all): 21:07:57/1 day, 7:26:39, loss=0.458253654960492, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=3.8565399602366552, lr=0.06065056652347668
2023-11-15 10:45:14   INFO  epoch: 9/24, acc_iter=64933, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:42, time_cost(all): 21:08:56/1 day, 8:07:38, loss=0.458142712812315, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.4844219783716879, lr=0.06061047475073609
2023-11-15 10:46:13   INFO  epoch: 9/24, acc_iter=64983, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:38, time_cost(all): 21:09:55/1 day, 7:19:16, loss=0.458031770664138, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=2.752141072047751, lr=0.0605703829779955
2023-11-15 10:47:11   INFO  epoch: 9/24, acc_iter=65033, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:23, time_cost(all): 21:10:53/1 day, 5:29:58, loss=0.457920828515961, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=4.592644507402543, lr=0.06053029120525492
2023-11-15 10:48:10   INFO  epoch: 9/24, acc_iter=65083, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:47, time_cost(all): 21:11:52/1 day, 6:35:56, loss=0.457809886367785, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=4.627097732790748, lr=0.06049019943251433
2023-11-15 10:49:09   INFO  epoch: 9/24, acc_iter=65133, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:15:06, time_cost(all): 21:12:51/1 day, 5:33:05, loss=0.457698944219608, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=4.13839511908938, lr=0.06045010765977374
2023-11-15 10:50:08   INFO  epoch: 9/24, acc_iter=65183, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:37, time_cost(all): 21:13:50/1 day, 6:19:36, loss=0.457588002071431, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=3.12515196306044, lr=0.06041001588703315
2023-11-15 10:51:07   INFO  epoch: 9/24, acc_iter=65233, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:07, time_cost(all): 21:14:49/1 day, 5:58:05, loss=0.457477059923254, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.343205542320538, lr=0.06036992411429256
2023-11-15 10:52:06   INFO  epoch: 9/24, acc_iter=65283, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:55, time_cost(all): 21:15:48/1 day, 7:31:23, loss=0.457366117775078, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=2.1666164736411693, lr=0.060329832341551974
2023-11-15 10:53:05   INFO  epoch: 9/24, acc_iter=65333, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:04, time_cost(all): 21:16:47/1 day, 5:09:47, loss=0.457255175626901, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=2.824008563479732, lr=0.06028974056881139
2023-11-15 10:54:04   INFO  epoch: 9/24, acc_iter=65383, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:14, time_cost(all): 21:17:46/1 day, 6:58:06, loss=0.457144233478724, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.475850931109611, lr=0.060249648796070795
2023-11-15 10:55:03   INFO  epoch: 9/24, acc_iter=65433, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:36, time_cost(all): 21:18:45/1 day, 6:36:50, loss=0.457033291330547, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=4.877356926561157, lr=0.06020955702333021
2023-11-15 10:56:02   INFO  epoch: 9/24, acc_iter=65483, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:54, time_cost(all): 21:19:44/1 day, 7:15:19, loss=0.456922349182371, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=1.5616440528442883, lr=0.060169465250589624
2023-11-15 10:57:01   INFO  epoch: 9/24, acc_iter=65533, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:41, time_cost(all): 21:20:43/1 day, 6:15:19, loss=0.456811407034194, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.1554239251337006, lr=0.06012937347784904
2023-11-15 10:58:00   INFO  epoch: 9/24, acc_iter=65583, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:43, time_cost(all): 21:21:42/1 day, 6:41:56, loss=0.456700464886017, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=2.6394732517390116, lr=0.060089281705108445
2023-11-15 10:58:59   INFO  epoch: 9/24, acc_iter=65633, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:30, time_cost(all): 21:22:41/1 day, 5:11:43, loss=0.45658952273784, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.4969287280458534, lr=0.06004918993236786
2023-11-15 10:59:58   INFO  epoch: 9/24, acc_iter=65683, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:50, time_cost(all): 21:23:40/1 day, 7:28:38, loss=0.456478580589664, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=1.8841703710870532, lr=0.060009098159627274
2023-11-15 11:00:56   INFO  epoch: 9/24, acc_iter=65733, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:40, time_cost(all): 21:24:38/1 day, 6:00:30, loss=0.456367638441487, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=4.68537394865122, lr=0.05996900638688669
2023-11-15 11:01:55   INFO  epoch: 9/24, acc_iter=65783, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:44, time_cost(all): 21:25:37/1 day, 5:55:27, loss=0.45625669629331, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.2125467525631493, lr=0.059928914614146095
2023-11-15 11:02:54   INFO  epoch: 9/24, acc_iter=65833, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 21:26:36/1 day, 5:31:46, loss=0.456145754145133, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=1.777547176697516, lr=0.05988882284140551
2023-11-15 11:03:53   INFO  epoch: 10/24, acc_iter=65920, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:14:33, time_cost(all): 21:27:35/1 day, 5:41:38, loss=0.455952714807306, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=4.2354521179753615, lr=0.05981906315683688
2023-11-15 11:04:52   INFO  epoch: 10/24, acc_iter=65970, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:06:50, time_cost(all): 21:28:34/1 day, 4:59:33, loss=0.455841772659129, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=2.7475641181425376, lr=0.0597789713840963
2023-11-15 11:05:51   INFO  epoch: 10/24, acc_iter=66020, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:04:11, time_cost(all): 21:29:33/1 day, 7:21:11, loss=0.455730830510952, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.878303270603204, lr=0.05973887961135571
2023-11-15 11:06:50   INFO  epoch: 10/24, acc_iter=66070, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:10:39, time_cost(all): 21:30:32/1 day, 5:43:48, loss=0.455619888362776, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=0.8404987738123001, lr=0.05969878783861512
2023-11-15 11:07:49   INFO  epoch: 10/24, acc_iter=66120, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:06:50, time_cost(all): 21:31:31/1 day, 6:53:25, loss=0.455508946214599, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=3.3311884908279725, lr=0.05965869606587453
2023-11-15 11:08:48   INFO  epoch: 10/24, acc_iter=66170, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:05:58, time_cost(all): 21:32:30/1 day, 7:02:08, loss=0.455398004066422, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=1.2848802383035673, lr=0.059618604293133946
2023-11-15 11:09:47   INFO  epoch: 10/24, acc_iter=66220, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:02:28, time_cost(all): 21:33:29/1 day, 6:00:39, loss=0.455287061918245, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=3.0181743625562696, lr=0.059578512520393354
2023-11-15 11:10:46   INFO  epoch: 10/24, acc_iter=66270, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:06:06, time_cost(all): 21:34:28/1 day, 5:43:21, loss=0.455176119770069, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=4.771503136047836, lr=0.05953842074765277
2023-11-15 11:11:45   INFO  epoch: 10/24, acc_iter=66320, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:54:31, time_cost(all): 21:35:27/1 day, 5:29:22, loss=0.455065177621892, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.269247873350752, lr=0.059498328974912175
2023-11-15 11:12:44   INFO  epoch: 10/24, acc_iter=66370, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:59:55, time_cost(all): 21:36:26/1 day, 6:23:36, loss=0.454954235473715, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.94(1.03), norm=0.712973777126984, lr=0.05945823720217159
2023-11-15 11:13:43   INFO  epoch: 10/24, acc_iter=66420, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:53:00, time_cost(all): 21:37:25/1 day, 7:27:06, loss=0.454843293325538, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=4.59111782438245, lr=0.059418145429431
2023-11-15 11:14:41   INFO  epoch: 10/24, acc_iter=66470, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:00:00, time_cost(all): 21:38:23/1 day, 6:30:47, loss=0.454732351177362, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=3.0782045845689088, lr=0.05937805365669042
2023-11-15 11:15:40   INFO  epoch: 10/24, acc_iter=66520, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:51:53, time_cost(all): 21:39:22/1 day, 5:47:17, loss=0.454621409029185, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=1.146448908995516, lr=0.059337961883949825
2023-11-15 11:16:39   INFO  epoch: 10/24, acc_iter=66570, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:57:59, time_cost(all): 21:40:21/1 day, 5:06:43, loss=0.454510466881008, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=2.912844711736948, lr=0.05929787011120924
2023-11-15 11:17:38   INFO  epoch: 10/24, acc_iter=66620, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:53:34, time_cost(all): 21:41:20/1 day, 6:45:40, loss=0.454399524732831, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=1.196895082554927, lr=0.05925777833846865
2023-11-15 11:18:37   INFO  epoch: 10/24, acc_iter=66670, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:57:24, time_cost(all): 21:42:19/1 day, 6:20:54, loss=0.454288582584655, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=3.612909904317856, lr=0.05921768656572807
2023-11-15 11:19:36   INFO  epoch: 10/24, acc_iter=66720, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:47:13, time_cost(all): 21:43:18/1 day, 7:00:29, loss=0.454177640436478, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.2802415780427188, lr=0.059177594792987474
2023-11-15 11:20:35   INFO  epoch: 10/24, acc_iter=66770, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:46:23, time_cost(all): 21:44:17/1 day, 7:16:41, loss=0.454066698288301, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=0.5035269802818986, lr=0.05913750302024689
2023-11-15 11:21:34   INFO  epoch: 10/24, acc_iter=66820, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:55:25, time_cost(all): 21:45:16/1 day, 5:56:00, loss=0.453955756140124, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=1.6329989926404382, lr=0.059097411247506296
2023-11-15 11:22:33   INFO  epoch: 10/24, acc_iter=66870, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:53:50, time_cost(all): 21:46:15/1 day, 5:56:32, loss=0.453844813991948, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.782131927273809, lr=0.05905731947476571
2023-11-15 11:23:32   INFO  epoch: 10/24, acc_iter=66920, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:51:54, time_cost(all): 21:47:14/1 day, 6:15:24, loss=0.453733871843771, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=4.934677552538028, lr=0.059017227702025124
2023-11-15 11:24:31   INFO  epoch: 10/24, acc_iter=66970, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:52:43, time_cost(all): 21:48:13/1 day, 7:13:49, loss=0.453622929695594, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.906251230957664, lr=0.05897713592928453
2023-11-15 11:25:30   INFO  epoch: 10/24, acc_iter=67020, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:46:04, time_cost(all): 21:49:12/1 day, 6:26:32, loss=0.453511987547417, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=4.215640665845363, lr=0.058937044156543945
2023-11-15 11:26:29   INFO  epoch: 10/24, acc_iter=67070, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:41:12, time_cost(all): 21:50:11/1 day, 4:39:50, loss=0.453401045399241, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=2.315072604169372, lr=0.05889695238380336
2023-11-15 11:27:28   INFO  epoch: 10/24, acc_iter=67120, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:42:50, time_cost(all): 21:51:10/1 day, 6:15:13, loss=0.453290103251064, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=3.112244815532769, lr=0.058856860611062774
2023-11-15 11:28:26   INFO  epoch: 10/24, acc_iter=67170, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:39:57, time_cost(all): 21:52:08/1 day, 6:37:26, loss=0.453179161102887, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=3.385978767287269, lr=0.05881676883832218
2023-11-15 11:29:25   INFO  epoch: 10/24, acc_iter=67220, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:43:09, time_cost(all): 21:53:07/1 day, 6:26:19, loss=0.45306821895471, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.5843462897042198, lr=0.058776677065581595
2023-11-15 11:30:24   INFO  epoch: 10/24, acc_iter=67270, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:39:53, time_cost(all): 21:54:06/1 day, 4:44:24, loss=0.452957276806534, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=3.495874205275909, lr=0.058736585292841
2023-11-15 11:31:23   INFO  epoch: 10/24, acc_iter=67320, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:41:13, time_cost(all): 21:55:05/1 day, 6:57:54, loss=0.452846334658357, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.747130956416072, lr=0.058696493520100416
2023-11-15 11:32:22   INFO  epoch: 10/24, acc_iter=67370, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:36:50, time_cost(all): 21:56:04/1 day, 7:15:43, loss=0.45273539251018, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=1.780297126002117, lr=0.05865640174735983
2023-11-15 11:33:21   INFO  epoch: 10/24, acc_iter=67420, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:38:13, time_cost(all): 21:57:03/1 day, 4:50:33, loss=0.452624450362003, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=0.6613107700243943, lr=0.058616309974619245
2023-11-15 11:34:20   INFO  epoch: 10/24, acc_iter=67470, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:38:09, time_cost(all): 21:58:02/1 day, 4:19:32, loss=0.452513508213827, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=2.1039436212399796, lr=0.05857621820187865
2023-11-15 11:35:19   INFO  epoch: 10/24, acc_iter=67520, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:36:28, time_cost(all): 21:59:01/1 day, 6:18:30, loss=0.45240256606565, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=2.97542654450063, lr=0.058536126429138066
2023-11-15 11:36:18   INFO  epoch: 10/24, acc_iter=67570, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:33:13, time_cost(all): 22:00:00/1 day, 4:43:16, loss=0.452291623917473, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.5424692036757395, lr=0.05849603465639748
2023-11-15 11:37:17   INFO  epoch: 10/24, acc_iter=67620, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:30:20, time_cost(all): 22:00:59/1 day, 4:45:05, loss=0.452180681769296, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=3.548009975713923, lr=0.05845594288365689
2023-11-15 11:38:16   INFO  epoch: 10/24, acc_iter=67670, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:30:12, time_cost(all): 22:01:58/1 day, 6:19:58, loss=0.45206973962112, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.02(1.03), norm=1.687417940156577, lr=0.0584158511109163
2023-11-15 11:39:15   INFO  epoch: 10/24, acc_iter=67720, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:28:26, time_cost(all): 22:02:57/1 day, 5:35:32, loss=0.451958797472943, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=0.5422869849010572, lr=0.05837575933817571
2023-11-15 11:40:14   INFO  epoch: 10/24, acc_iter=67770, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:34:43, time_cost(all): 22:03:56/1 day, 6:17:51, loss=0.451847855324766, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.908635328502272, lr=0.05833566756543513
2023-11-15 11:41:13   INFO  epoch: 10/24, acc_iter=67820, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:33:18, time_cost(all): 22:04:55/1 day, 6:28:37, loss=0.451736913176589, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.759104435538959, lr=0.05829557579269454
2023-11-15 11:42:11   INFO  epoch: 10/24, acc_iter=67870, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:25:48, time_cost(all): 22:05:53/1 day, 6:17:48, loss=0.451625971028413, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=4.611143497437034, lr=0.05825548401995395
2023-11-15 11:43:10   INFO  epoch: 10/24, acc_iter=67920, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:27:26, time_cost(all): 22:06:52/1 day, 4:16:30, loss=0.451515028880236, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=3.7745679077705385, lr=0.05821539224721336
2023-11-15 11:44:09   INFO  epoch: 10/24, acc_iter=67970, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:30:25, time_cost(all): 22:07:51/1 day, 4:33:16, loss=0.451404086732059, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=3.5600533212041277, lr=0.05817530047447277
2023-11-15 11:45:08   INFO  epoch: 10/24, acc_iter=68020, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:30:15, time_cost(all): 22:08:50/1 day, 5:27:53, loss=0.451293144583882, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=2.9071612118976637, lr=0.05813520870173219
2023-11-15 11:46:07   INFO  epoch: 10/24, acc_iter=68070, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:25:33, time_cost(all): 22:09:49/1 day, 4:21:04, loss=0.451182202435706, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=4.341345829121968, lr=0.058095116928991594
2023-11-15 11:47:06   INFO  epoch: 10/24, acc_iter=68120, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:26:29, time_cost(all): 22:10:48/1 day, 6:59:40, loss=0.451071260287529, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.18(1.03), norm=0.9664654211478789, lr=0.05805502515625101
2023-11-15 11:48:05   INFO  epoch: 10/24, acc_iter=68170, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:21:24, time_cost(all): 22:11:47/1 day, 6:21:18, loss=0.450960318139352, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.420181101778262, lr=0.058014933383510416
2023-11-15 11:49:04   INFO  epoch: 10/24, acc_iter=68220, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:25:56, time_cost(all): 22:12:46/1 day, 5:30:46, loss=0.450849375991175, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=4.58860651411399, lr=0.05797484161076984
2023-11-15 11:50:03   INFO  epoch: 10/24, acc_iter=68270, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:21:37, time_cost(all): 22:13:45/1 day, 4:12:27, loss=0.450738433842999, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=3.42650429729228, lr=0.057934749838029244
2023-11-15 11:51:02   INFO  epoch: 10/24, acc_iter=68320, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:17:19, time_cost(all): 22:14:44/1 day, 4:58:37, loss=0.450627491694822, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.2957373395384406, lr=0.05789465806528866
2023-11-15 11:52:01   INFO  epoch: 10/24, acc_iter=68370, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:17:18, time_cost(all): 22:15:43/1 day, 5:52:04, loss=0.450516549546645, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=2.72187942394114, lr=0.057854566292548065
2023-11-15 11:53:00   INFO  epoch: 10/24, acc_iter=68420, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:21:35, time_cost(all): 22:16:42/1 day, 5:47:31, loss=0.450405607398468, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.7821304335997534, lr=0.05781447451980748
2023-11-15 11:53:59   INFO  epoch: 10/24, acc_iter=68470, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:29, time_cost(all): 22:17:41/1 day, 4:00:36, loss=0.450294665250292, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=3.96133762316285, lr=0.057774382747066894
2023-11-15 11:54:58   INFO  epoch: 10/24, acc_iter=68520, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:20:59, time_cost(all): 22:18:40/1 day, 5:21:44, loss=0.450183723102115, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=3.572530959629061, lr=0.05773429097432631
2023-11-15 11:55:57   INFO  epoch: 10/24, acc_iter=68570, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:13:54, time_cost(all): 22:19:39/1 day, 5:21:50, loss=0.450072780953938, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.1(1.03), norm=2.5018257663684347, lr=0.057694199201585715
2023-11-15 11:56:55   INFO  epoch: 10/24, acc_iter=68620, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:12:41, time_cost(all): 22:20:37/1 day, 6:37:09, loss=0.449961838805761, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=0.6417375708591418, lr=0.05765410742884513
2023-11-15 11:57:54   INFO  epoch: 10/24, acc_iter=68670, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:36, time_cost(all): 22:21:36/1 day, 4:28:17, loss=0.449850896657585, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=2.960224728087173, lr=0.05761401565610454
2023-11-15 11:58:53   INFO  epoch: 10/24, acc_iter=68720, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:13:17, time_cost(all): 22:22:35/1 day, 4:49:03, loss=0.449739954509408, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=4.847487055073333, lr=0.05757392388336395
2023-11-15 11:59:52   INFO  epoch: 10/24, acc_iter=68770, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:10:21, time_cost(all): 22:23:34/1 day, 5:40:04, loss=0.449629012361231, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=4.361226903860534, lr=0.057533832110623365
2023-11-15 12:00:51   INFO  epoch: 10/24, acc_iter=68820, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:09:48, time_cost(all): 22:24:33/1 day, 5:38:56, loss=0.449518070213054, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=4.389529663852511, lr=0.05749374033788277
2023-11-15 12:01:50   INFO  epoch: 10/24, acc_iter=68870, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:08:29, time_cost(all): 22:25:32/1 day, 5:19:01, loss=0.449407128064878, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.3994504815592146, lr=0.05745364856514219
2023-11-15 12:02:49   INFO  epoch: 10/24, acc_iter=68920, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:09:49, time_cost(all): 22:26:31/1 day, 5:21:41, loss=0.449296185916701, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.1141595081053226, lr=0.0574135567924016
2023-11-15 12:03:48   INFO  epoch: 10/24, acc_iter=68970, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:38, time_cost(all): 22:27:30/1 day, 5:46:06, loss=0.449185243768524, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=1.8945384810642205, lr=0.057373465019661014
2023-11-15 12:04:47   INFO  epoch: 10/24, acc_iter=69020, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:05:07, time_cost(all): 22:28:29/1 day, 6:05:03, loss=0.449074301620347, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=1.5211198137809128, lr=0.05733337324692042
2023-11-15 12:05:46   INFO  epoch: 10/24, acc_iter=69070, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:06:02, time_cost(all): 22:29:28/1 day, 5:52:28, loss=0.448963359472171, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=1.8846435950696394, lr=0.057293281474179836
2023-11-15 12:06:45   INFO  epoch: 10/24, acc_iter=69120, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:02:39, time_cost(all): 22:30:27/1 day, 4:31:41, loss=0.448852417323994, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.01(1.03), norm=4.19009836953882, lr=0.05725318970143925
2023-11-15 12:07:44   INFO  epoch: 10/24, acc_iter=69170, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:07:28, time_cost(all): 22:31:26/1 day, 4:13:32, loss=0.448741475175817, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=4.407190318166325, lr=0.05721309792869866
2023-11-15 12:08:43   INFO  epoch: 10/24, acc_iter=69220, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:05, time_cost(all): 22:32:25/1 day, 4:06:57, loss=0.44863053302764, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=1.1176411091889031, lr=0.05717300615595807
2023-11-15 12:09:42   INFO  epoch: 10/24, acc_iter=69270, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:28, time_cost(all): 22:33:24/1 day, 4:04:05, loss=0.448519590879464, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.09(1.03), norm=1.1573355588950696, lr=0.05713291438321748
2023-11-15 12:10:40   INFO  epoch: 10/24, acc_iter=69320, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:04:07, time_cost(all): 22:34:22/1 day, 4:58:04, loss=0.448408648731287, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.7399914485622636, lr=0.0570928226104769
2023-11-15 12:11:39   INFO  epoch: 10/24, acc_iter=69370, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:02:09, time_cost(all): 22:35:21/1 day, 4:19:56, loss=0.44829770658311, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=4.224371714853698, lr=0.05705273083773631
2023-11-15 12:12:38   INFO  epoch: 10/24, acc_iter=69420, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:59:46, time_cost(all): 22:36:20/1 day, 5:16:47, loss=0.448186764434933, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=3.9369929047237795, lr=0.05701263906499572
2023-11-15 12:13:37   INFO  epoch: 10/24, acc_iter=69470, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:57:57, time_cost(all): 22:37:19/1 day, 4:17:13, loss=0.448075822286757, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=2.8036923766701567, lr=0.05697254729225513
2023-11-15 12:14:36   INFO  epoch: 10/24, acc_iter=69520, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/1:00:29, time_cost(all): 22:38:18/1 day, 4:04:14, loss=0.44796488013858, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=2.276934723340923, lr=0.05693245551951455
2023-11-15 12:15:35   INFO  epoch: 10/24, acc_iter=69570, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:59:12, time_cost(all): 22:39:17/1 day, 3:55:36, loss=0.447853937990403, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.8691483770772015, lr=0.056892363746773957
2023-11-15 12:16:34   INFO  epoch: 10/24, acc_iter=69620, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:54:04, time_cost(all): 22:40:16/1 day, 4:11:17, loss=0.447742995842226, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=4.704965047289061, lr=0.05685227197403337
2023-11-15 12:17:33   INFO  epoch: 10/24, acc_iter=69670, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:02, time_cost(all): 22:41:15/1 day, 5:31:10, loss=0.44763205369405, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.08(1.03), norm=4.259861798387598, lr=0.05681218020129278
2023-11-15 12:18:32   INFO  epoch: 10/24, acc_iter=69720, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:36, time_cost(all): 22:42:14/1 day, 3:51:19, loss=0.447521111545873, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.2984849230026496, lr=0.05677208842855219
2023-11-15 12:19:31   INFO  epoch: 10/24, acc_iter=69770, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:50:25, time_cost(all): 22:43:13/1 day, 5:22:16, loss=0.447410169397696, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=4.230286516895033, lr=0.056731996655811606
2023-11-15 12:20:30   INFO  epoch: 10/24, acc_iter=69820, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:50:59, time_cost(all): 22:44:12/1 day, 5:14:26, loss=0.447299227249519, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.9876953305472815, lr=0.05669190488307101
2023-11-15 12:21:29   INFO  epoch: 10/24, acc_iter=69870, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:53:16, time_cost(all): 22:45:11/1 day, 4:25:40, loss=0.447188285101343, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=4.363856519758378, lr=0.05665181311033043
2023-11-15 12:22:28   INFO  epoch: 10/24, acc_iter=69920, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:29, time_cost(all): 22:46:10/1 day, 4:39:20, loss=0.447077342953166, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=4.614946749767022, lr=0.056611721337589835
2023-11-15 12:23:27   INFO  epoch: 10/24, acc_iter=69970, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:40, time_cost(all): 22:47:09/1 day, 3:43:18, loss=0.446966400804989, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=4.825633886347755, lr=0.056571629564849256
2023-11-15 12:24:25   INFO  epoch: 10/24, acc_iter=70020, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:21, time_cost(all): 22:48:07/1 day, 4:26:05, loss=0.446855458656812, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=2.7618446701245434, lr=0.05653153779210866
2023-11-15 12:25:24   INFO  epoch: 10/24, acc_iter=70070, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:59, time_cost(all): 22:49:06/1 day, 3:33:04, loss=0.446744516508636, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=4.380975647874764, lr=0.05649144601936808
2023-11-15 12:26:23   INFO  epoch: 10/24, acc_iter=70120, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:47:53, time_cost(all): 22:50:05/1 day, 5:38:36, loss=0.446633574360459, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=1.4540469260016309, lr=0.056451354246627485
2023-11-15 12:27:22   INFO  epoch: 10/24, acc_iter=70170, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:52, time_cost(all): 22:51:04/1 day, 5:32:55, loss=0.446522632212282, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=1.8630735482855423, lr=0.0564112624738869
2023-11-15 12:28:21   INFO  epoch: 10/24, acc_iter=70220, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:41:56, time_cost(all): 22:52:03/1 day, 5:29:31, loss=0.446411690064105, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=2.2426949005617334, lr=0.05637117070114631
2023-11-15 12:29:20   INFO  epoch: 10/24, acc_iter=70270, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:39, time_cost(all): 22:53:02/1 day, 4:11:44, loss=0.446300747915928, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=2.5532183099067867, lr=0.05633107892840572
2023-11-15 12:30:19   INFO  epoch: 10/24, acc_iter=70320, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:42:29, time_cost(all): 22:54:01/1 day, 4:35:33, loss=0.446189805767752, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=3.547740804976625, lr=0.056290987155665134
2023-11-15 12:31:18   INFO  epoch: 10/24, acc_iter=70370, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:01, time_cost(all): 22:55:00/1 day, 3:40:45, loss=0.446078863619575, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=2.068852812541789, lr=0.05625089538292455
2023-11-15 12:32:17   INFO  epoch: 10/24, acc_iter=70420, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:04, time_cost(all): 22:55:59/1 day, 6:13:15, loss=0.445967921471398, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=1.4876057021962323, lr=0.05621080361018396
2023-11-15 12:33:16   INFO  epoch: 10/24, acc_iter=70470, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:37:17, time_cost(all): 22:56:58/1 day, 5:21:45, loss=0.445856979323221, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=1.5585728677058717, lr=0.05617071183744337
2023-11-15 12:34:15   INFO  epoch: 10/24, acc_iter=70520, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:20, time_cost(all): 22:57:57/1 day, 5:40:02, loss=0.445746037175045, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=1.8689220688360877, lr=0.056130620064702784
2023-11-15 12:35:14   INFO  epoch: 10/24, acc_iter=70570, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:37:53, time_cost(all): 22:58:56/1 day, 5:31:27, loss=0.445635095026868, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=2.631784420954144, lr=0.05609052829196219
2023-11-15 12:36:13   INFO  epoch: 10/24, acc_iter=70620, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:48, time_cost(all): 22:59:55/1 day, 5:41:50, loss=0.445524152878691, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=4.572608773300703, lr=0.056050436519221605
2023-11-15 12:37:12   INFO  epoch: 10/24, acc_iter=70670, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:52, time_cost(all): 23:00:54/1 day, 4:13:16, loss=0.445413210730514, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=3.872029552345479, lr=0.05601034474648102
2023-11-15 12:38:10   INFO  epoch: 10/24, acc_iter=70720, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:17, time_cost(all): 23:01:52/1 day, 5:51:01, loss=0.445302268582338, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=3.0597992854487477, lr=0.055970252973740434
2023-11-15 12:39:09   INFO  epoch: 10/24, acc_iter=70770, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:26, time_cost(all): 23:02:51/1 day, 4:33:37, loss=0.445191326434161, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.3652105542911426, lr=0.05593016120099984
2023-11-15 12:40:08   INFO  epoch: 10/24, acc_iter=70820, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:48, time_cost(all): 23:03:50/1 day, 3:39:25, loss=0.445080384285984, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.6190948640617924, lr=0.055890069428259255
2023-11-15 12:41:07   INFO  epoch: 10/24, acc_iter=70870, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:26, time_cost(all): 23:04:49/1 day, 5:27:25, loss=0.444969442137807, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=1.93146457713049, lr=0.05584997765551867
2023-11-15 12:42:06   INFO  epoch: 10/24, acc_iter=70920, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:29, time_cost(all): 23:05:48/1 day, 4:43:02, loss=0.444858499989631, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=4.26319013972697, lr=0.055809885882778076
2023-11-15 12:43:05   INFO  epoch: 10/24, acc_iter=70970, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:27:50, time_cost(all): 23:06:47/1 day, 4:49:09, loss=0.444747557841454, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=2.126926173667562, lr=0.05576979411003749
2023-11-15 12:44:04   INFO  epoch: 10/24, acc_iter=71020, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:51, time_cost(all): 23:07:46/1 day, 5:37:13, loss=0.444636615693277, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=2.912798177372224, lr=0.0557297023372969
2023-11-15 12:45:03   INFO  epoch: 10/24, acc_iter=71070, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:15, time_cost(all): 23:08:45/1 day, 3:16:33, loss=0.4445256735451, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=3.165242782027379, lr=0.05568961056455631
2023-11-15 12:46:02   INFO  epoch: 10/24, acc_iter=71120, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:59, time_cost(all): 23:09:44/1 day, 4:53:53, loss=0.444414731396924, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=0.8292445968321056, lr=0.055649518791815726
2023-11-15 12:47:01   INFO  epoch: 10/24, acc_iter=71170, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:26:28, time_cost(all): 23:10:43/1 day, 5:10:22, loss=0.444303789248747, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.33048869586178, lr=0.05560942701907514
2023-11-15 12:48:00   INFO  epoch: 10/24, acc_iter=71220, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:13, time_cost(all): 23:11:42/1 day, 4:35:29, loss=0.44419284710057, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=1.241261684135687, lr=0.05556933524633455
2023-11-15 12:48:59   INFO  epoch: 10/24, acc_iter=71270, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:18, time_cost(all): 23:12:41/1 day, 4:33:04, loss=0.444081904952393, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=0.9082924041176879, lr=0.05552924347359396
2023-11-15 12:49:58   INFO  epoch: 10/24, acc_iter=71320, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:42, time_cost(all): 23:13:40/1 day, 4:50:03, loss=0.443970962804217, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=2.6240118153663246, lr=0.055489151700853376
2023-11-15 12:50:57   INFO  epoch: 10/24, acc_iter=71370, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:38, time_cost(all): 23:14:39/1 day, 4:04:14, loss=0.44386002065604, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=1.9198041054536314, lr=0.05544905992811278
2023-11-15 12:51:55   INFO  epoch: 10/24, acc_iter=71420, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:43, time_cost(all): 23:15:37/1 day, 4:13:25, loss=0.443749078507863, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=1.6887592763805488, lr=0.0554089681553722
2023-11-15 12:52:54   INFO  epoch: 10/24, acc_iter=71470, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:06, time_cost(all): 23:16:36/1 day, 4:26:15, loss=0.443638136359686, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=1.6334187977786743, lr=0.05536887638263161
2023-11-15 12:53:53   INFO  epoch: 10/24, acc_iter=71520, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:54, time_cost(all): 23:17:35/1 day, 3:32:04, loss=0.44352719421151, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.280151238052842, lr=0.05532878460989102
2023-11-15 12:54:52   INFO  epoch: 10/24, acc_iter=71570, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:58, time_cost(all): 23:18:34/1 day, 5:09:36, loss=0.443416252063333, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=3.28394450286465, lr=0.05528869283715043
2023-11-15 12:55:51   INFO  epoch: 10/24, acc_iter=71620, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:56, time_cost(all): 23:19:33/1 day, 4:31:36, loss=0.443305309915156, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=3.9651788194901347, lr=0.05524860106440985
2023-11-15 12:56:50   INFO  epoch: 10/24, acc_iter=71670, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:05, time_cost(all): 23:20:32/1 day, 4:30:06, loss=0.443194367766979, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=4.265753950311778, lr=0.055208509291669254
2023-11-15 12:57:49   INFO  epoch: 10/24, acc_iter=71720, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:56, time_cost(all): 23:21:31/1 day, 4:18:31, loss=0.443083425618803, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=4.776888970156016, lr=0.05516841751892867
2023-11-15 12:58:48   INFO  epoch: 10/24, acc_iter=71770, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:10, time_cost(all): 23:22:30/1 day, 3:43:41, loss=0.442972483470626, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=1.5604203048533416, lr=0.05512832574618808
2023-11-15 12:59:47   INFO  epoch: 10/24, acc_iter=71820, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:46, time_cost(all): 23:23:29/1 day, 4:21:54, loss=0.442861541322449, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=3.9671254598929906, lr=0.0550882339734475
2023-11-15 13:00:46   INFO  epoch: 10/24, acc_iter=71870, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:10:59, time_cost(all): 23:24:28/1 day, 5:15:30, loss=0.442750599174272, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.01(1.03), norm=1.6026026832756248, lr=0.055048142200706904
2023-11-15 13:01:45   INFO  epoch: 10/24, acc_iter=71920, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:05, time_cost(all): 23:25:27/1 day, 5:13:29, loss=0.442639657026096, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=0.8340064316747462, lr=0.05500805042796632
2023-11-15 13:02:44   INFO  epoch: 10/24, acc_iter=71970, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:28, time_cost(all): 23:26:26/1 day, 5:35:00, loss=0.442528714877919, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=4.866847323042244, lr=0.054967958655225725
2023-11-15 13:03:43   INFO  epoch: 10/24, acc_iter=72020, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:12, time_cost(all): 23:27:25/1 day, 4:54:11, loss=0.442417772729742, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=4.501934564138555, lr=0.05492786688248514
2023-11-15 13:04:42   INFO  epoch: 10/24, acc_iter=72070, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:55, time_cost(all): 23:28:24/1 day, 4:10:29, loss=0.442306830581565, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=0.6437763657650462, lr=0.05488777510974455
2023-11-15 13:05:40   INFO  epoch: 10/24, acc_iter=72120, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:21, time_cost(all): 23:29:22/1 day, 4:04:46, loss=0.442195888433389, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=2.3331077754713743, lr=0.05484768333700397
2023-11-15 13:06:39   INFO  epoch: 10/24, acc_iter=72170, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:52, time_cost(all): 23:30:21/1 day, 3:26:57, loss=0.442084946285212, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=3.734303626474774, lr=0.05480759156426338
2023-11-15 13:07:38   INFO  epoch: 10/24, acc_iter=72220, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:50, time_cost(all): 23:31:20/1 day, 2:52:41, loss=0.441974004137035, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=0.5339285750161975, lr=0.05476749979152279
2023-11-15 13:08:37   INFO  epoch: 10/24, acc_iter=72270, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:37, time_cost(all): 23:32:19/1 day, 4:38:01, loss=0.441863061988858, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=3.255102801967511, lr=0.0547274080187822
2023-11-15 13:09:36   INFO  epoch: 10/24, acc_iter=72320, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:46, time_cost(all): 23:33:18/1 day, 4:02:24, loss=0.441752119840682, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=2.8335662421150785, lr=0.05468731624604161
2023-11-15 13:10:35   INFO  epoch: 10/24, acc_iter=72370, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:47, time_cost(all): 23:34:17/1 day, 4:25:52, loss=0.441641177692505, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.98(1.03), norm=3.394282566381176, lr=0.054647224473301025
2023-11-15 13:11:34   INFO  epoch: 10/24, acc_iter=72420, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 23:35:16/1 day, 5:19:04, loss=0.441530235544328, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.8345837484993042, lr=0.05460713270056043
2023-11-15 13:12:33   INFO  epoch: 11/24, acc_iter=72507, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:04:14, time_cost(all): 23:36:15/1 day, 5:27:08, loss=0.441337196206501, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=2.5566517486823765, lr=0.05453737301599181
2023-11-15 13:13:32   INFO  epoch: 11/24, acc_iter=72557, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:11:40, time_cost(all): 23:37:14/1 day, 3:34:03, loss=0.441226254058324, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.8353820195202926, lr=0.05449728124325122
2023-11-15 13:14:31   INFO  epoch: 11/24, acc_iter=72607, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:11:25, time_cost(all): 23:38:13/1 day, 2:46:55, loss=0.441115311910147, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=2.646217837877274, lr=0.05445718947051064
2023-11-15 13:15:30   INFO  epoch: 11/24, acc_iter=72657, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:02:54, time_cost(all): 23:39:12/1 day, 4:50:19, loss=0.44100436976197, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=4.78039792994502, lr=0.054417097697770055
2023-11-15 13:16:29   INFO  epoch: 11/24, acc_iter=72707, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:01:03, time_cost(all): 23:40:11/1 day, 3:07:52, loss=0.440893427613794, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.18(1.03), norm=3.7933242333997463, lr=0.05437700592502946
2023-11-15 13:17:28   INFO  epoch: 11/24, acc_iter=72757, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:05:59, time_cost(all): 23:41:10/1 day, 4:22:39, loss=0.440782485465617, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=2.6968146207277925, lr=0.054336914152288876
2023-11-15 13:18:27   INFO  epoch: 11/24, acc_iter=72807, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:01:56, time_cost(all): 23:42:09/1 day, 5:06:30, loss=0.44067154331744, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.2247874579850995, lr=0.05429682237954828
2023-11-15 13:19:25   INFO  epoch: 11/24, acc_iter=72857, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:04:46, time_cost(all): 23:43:07/1 day, 4:10:37, loss=0.440560601169263, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=0.9345010983554624, lr=0.0542567306068077
2023-11-15 13:20:24   INFO  epoch: 11/24, acc_iter=72907, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:55:40, time_cost(all): 23:44:06/1 day, 4:31:43, loss=0.440449659021087, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=1.7660039975857653, lr=0.054216638834067105
2023-11-15 13:21:23   INFO  epoch: 11/24, acc_iter=72957, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:58:39, time_cost(all): 23:45:05/1 day, 3:23:55, loss=0.44033871687291, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=2.652830294177177, lr=0.05417654706132652
2023-11-15 13:22:22   INFO  epoch: 11/24, acc_iter=73007, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:54:05, time_cost(all): 23:46:04/1 day, 4:20:02, loss=0.440227774724733, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=2.3481171755949575, lr=0.05413645528858594
2023-11-15 13:23:21   INFO  epoch: 11/24, acc_iter=73057, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:54:31, time_cost(all): 23:47:03/1 day, 2:53:56, loss=0.440116832576556, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.965050004136619, lr=0.05409636351584535
2023-11-15 13:24:20   INFO  epoch: 11/24, acc_iter=73107, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/2:02:18, time_cost(all): 23:48:02/1 day, 3:20:56, loss=0.44000589042838, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=1.5226852921294118, lr=0.05405627174310476
2023-11-15 13:25:19   INFO  epoch: 11/24, acc_iter=73157, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:50:11, time_cost(all): 23:49:01/1 day, 3:14:23, loss=0.439894948280203, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=1.5536638068668365, lr=0.05401617997036417
2023-11-15 13:26:18   INFO  epoch: 11/24, acc_iter=73207, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:51:43, time_cost(all): 23:50:00/1 day, 4:57:34, loss=0.439784006132026, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.6290483748084372, lr=0.05397608819762358
2023-11-15 13:27:17   INFO  epoch: 11/24, acc_iter=73257, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:52:58, time_cost(all): 23:50:59/1 day, 4:05:00, loss=0.439673063983849, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=3.9373701752275228, lr=0.05393599642488299
2023-11-15 13:28:16   INFO  epoch: 11/24, acc_iter=73307, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:56:51, time_cost(all): 23:51:58/1 day, 4:49:12, loss=0.439562121835673, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=3.548578005309871, lr=0.053895904652142404
2023-11-15 13:29:15   INFO  epoch: 11/24, acc_iter=73357, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:51:03, time_cost(all): 23:52:57/1 day, 4:56:46, loss=0.439451179687496, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=3.3908556806241354, lr=0.05385581287940181
2023-11-15 13:30:14   INFO  epoch: 11/24, acc_iter=73407, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:46:38, time_cost(all): 23:53:56/1 day, 2:54:07, loss=0.439340237539319, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.8843790131739957, lr=0.053815721106661225
2023-11-15 13:31:13   INFO  epoch: 11/24, acc_iter=73457, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:52:21, time_cost(all): 23:54:55/1 day, 3:16:11, loss=0.439229295391142, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=1.1146469790030133, lr=0.053775629333920646
2023-11-15 13:32:12   INFO  epoch: 11/24, acc_iter=73507, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:53:40, time_cost(all): 23:55:54/1 day, 4:13:07, loss=0.439118353242966, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=4.622380783757015, lr=0.053735537561180054
2023-11-15 13:33:10   INFO  epoch: 11/24, acc_iter=73557, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:48:02, time_cost(all): 23:56:52/1 day, 3:31:02, loss=0.439007411094789, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.233925588854989, lr=0.05369544578843947
2023-11-15 13:34:09   INFO  epoch: 11/24, acc_iter=73607, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:45:34, time_cost(all): 23:57:51/1 day, 3:14:21, loss=0.438896468946612, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=2.6407548429118326, lr=0.053655354015698875
2023-11-15 13:35:08   INFO  epoch: 11/24, acc_iter=73657, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:51:03, time_cost(all): 23:58:50/1 day, 4:21:11, loss=0.438785526798435, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.8777760570985285, lr=0.05361526224295829
2023-11-15 13:36:07   INFO  epoch: 11/24, acc_iter=73707, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:40:28, time_cost(all): 23:59:49/1 day, 4:53:03, loss=0.438674584650259, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=2.8072254082676884, lr=0.053575170470217696
2023-11-15 13:37:06   INFO  epoch: 11/24, acc_iter=73757, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:43:28, time_cost(all): 1 day, 0:00:48/1 day, 2:28:59, loss=0.438563642502082, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=3.9167965009877728, lr=0.05353507869747711
2023-11-15 13:38:05   INFO  epoch: 11/24, acc_iter=73807, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:43:03, time_cost(all): 1 day, 0:01:47/1 day, 4:56:22, loss=0.438452700353905, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=2.5047556347073243, lr=0.05349498692473652
2023-11-15 13:39:04   INFO  epoch: 11/24, acc_iter=73857, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:43:19, time_cost(all): 1 day, 0:02:46/1 day, 4:14:10, loss=0.438341758205728, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=4.30923861329156, lr=0.05345489515199593
2023-11-15 13:40:03   INFO  epoch: 11/24, acc_iter=73907, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:41:33, time_cost(all): 1 day, 0:03:45/1 day, 2:28:16, loss=0.438230816057552, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=1.2993875596594455, lr=0.05341480337925535
2023-11-15 13:41:02   INFO  epoch: 11/24, acc_iter=73957, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:35:30, time_cost(all): 1 day, 0:04:44/1 day, 2:18:18, loss=0.438119873909375, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=0.9958098360231831, lr=0.05337471160651476
2023-11-15 13:42:01   INFO  epoch: 11/24, acc_iter=74007, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:34:37, time_cost(all): 1 day, 0:05:43/1 day, 4:14:04, loss=0.438008931761198, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.09(1.03), norm=1.704996531791629, lr=0.053334619833774174
2023-11-15 13:43:00   INFO  epoch: 11/24, acc_iter=74057, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:40:31, time_cost(all): 1 day, 0:06:42/1 day, 4:22:46, loss=0.437897989613021, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=1.7022388357345632, lr=0.05329452806103358
2023-11-15 13:43:59   INFO  epoch: 11/24, acc_iter=74107, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:36:30, time_cost(all): 1 day, 0:07:41/1 day, 3:16:27, loss=0.437787047464845, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=1.8851747054313832, lr=0.053254436288292996
2023-11-15 13:44:58   INFO  epoch: 11/24, acc_iter=74157, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:34:24, time_cost(all): 1 day, 0:08:40/1 day, 2:26:43, loss=0.437676105316668, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.134846195548413, lr=0.0532143445155524
2023-11-15 13:45:57   INFO  epoch: 11/24, acc_iter=74207, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:38:20, time_cost(all): 1 day, 0:09:39/1 day, 3:28:28, loss=0.437565163168491, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=0.7159106585808734, lr=0.05317425274281182
2023-11-15 13:46:55   INFO  epoch: 11/24, acc_iter=74257, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:20, time_cost(all): 1 day, 0:10:37/1 day, 3:43:42, loss=0.437454221020314, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=1.944352326778768, lr=0.05313416097007123
2023-11-15 13:47:54   INFO  epoch: 11/24, acc_iter=74307, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:31:08, time_cost(all): 1 day, 0:11:36/1 day, 3:13:13, loss=0.437343278872138, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=0.9288819278426165, lr=0.05309406919733064
2023-11-15 13:48:53   INFO  epoch: 11/24, acc_iter=74357, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:29:56, time_cost(all): 1 day, 0:12:35/1 day, 4:04:45, loss=0.437232336723961, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.2274040975866527, lr=0.05305397742459006
2023-11-15 13:49:52   INFO  epoch: 11/24, acc_iter=74407, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:27:39, time_cost(all): 1 day, 0:13:34/1 day, 4:17:51, loss=0.437121394575784, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.3275727060577738, lr=0.053013885651849474
2023-11-15 13:50:51   INFO  epoch: 11/24, acc_iter=74457, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:28:17, time_cost(all): 1 day, 0:14:33/1 day, 2:20:47, loss=0.437010452427607, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=4.404242448599572, lr=0.05297379387910888
2023-11-15 13:51:50   INFO  epoch: 11/24, acc_iter=74507, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:29:27, time_cost(all): 1 day, 0:15:32/1 day, 3:15:43, loss=0.436899510279431, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.22(1.03), norm=3.1362114429428, lr=0.052933702106368295
2023-11-15 13:52:49   INFO  epoch: 11/24, acc_iter=74557, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:32:08, time_cost(all): 1 day, 0:16:31/1 day, 2:23:24, loss=0.436788568131254, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.17(1.03), norm=3.536155794220727, lr=0.0528936103336277
2023-11-15 13:53:48   INFO  epoch: 11/24, acc_iter=74607, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:26:28, time_cost(all): 1 day, 0:17:30/1 day, 4:21:30, loss=0.436677625983077, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=3.9410136367451756, lr=0.05285351856088712
2023-11-15 13:54:47   INFO  epoch: 11/24, acc_iter=74657, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:29:25, time_cost(all): 1 day, 0:18:29/1 day, 2:26:48, loss=0.4365666838349, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=3.979056452523749, lr=0.052813426788146524
2023-11-15 13:55:46   INFO  epoch: 11/24, acc_iter=74707, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:22:10, time_cost(all): 1 day, 0:19:28/1 day, 2:41:47, loss=0.436455741686724, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=4.1724412816815875, lr=0.05277333501540594
2023-11-15 13:56:45   INFO  epoch: 11/24, acc_iter=74757, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:27:18, time_cost(all): 1 day, 0:20:27/1 day, 2:50:07, loss=0.436344799538547, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=2.46993612685003, lr=0.052733243242665345
2023-11-15 13:57:44   INFO  epoch: 11/24, acc_iter=74807, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:22:46, time_cost(all): 1 day, 0:21:26/1 day, 3:48:34, loss=0.43623385739037, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=1.1171118896415593, lr=0.052693151469924766
2023-11-15 13:58:43   INFO  epoch: 11/24, acc_iter=74857, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:20:46, time_cost(all): 1 day, 0:22:25/1 day, 4:09:45, loss=0.436122915242193, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=3.9347568011865666, lr=0.05265305969718418
2023-11-15 13:59:42   INFO  epoch: 11/24, acc_iter=74907, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:22:28, time_cost(all): 1 day, 0:23:24/1 day, 4:00:19, loss=0.436011973094017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.190681865957011, lr=0.05261296792444359
2023-11-15 14:00:40   INFO  epoch: 11/24, acc_iter=74957, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:18:34, time_cost(all): 1 day, 0:24:22/1 day, 2:29:02, loss=0.43590103094584, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=4.699981893686707, lr=0.052572876151703
2023-11-15 14:01:39   INFO  epoch: 11/24, acc_iter=75007, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:03, time_cost(all): 1 day, 0:25:21/1 day, 4:34:15, loss=0.435790088797663, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=4.289671572146925, lr=0.05253278437896241
2023-11-15 14:02:38   INFO  epoch: 11/24, acc_iter=75057, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:19:13, time_cost(all): 1 day, 0:26:20/1 day, 2:34:58, loss=0.435679146649486, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.9422714932800695, lr=0.05249269260622182
2023-11-15 14:03:37   INFO  epoch: 11/24, acc_iter=75107, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:19:28, time_cost(all): 1 day, 0:27:19/1 day, 3:53:31, loss=0.43556820450131, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.9625281388023905, lr=0.05245260083348123
2023-11-15 14:04:36   INFO  epoch: 11/24, acc_iter=75157, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:09, time_cost(all): 1 day, 0:28:18/1 day, 3:54:50, loss=0.435457262353133, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=2.5369431273855465, lr=0.052412509060740645
2023-11-15 14:05:35   INFO  epoch: 11/24, acc_iter=75207, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:17:18, time_cost(all): 1 day, 0:29:17/1 day, 3:23:21, loss=0.435346320204956, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=2.4571248917031703, lr=0.05237241728800005
2023-11-15 14:06:34   INFO  epoch: 11/24, acc_iter=75257, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:10, time_cost(all): 1 day, 0:30:16/1 day, 4:23:48, loss=0.435235378056779, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=2.041415703625234, lr=0.05233232551525947
2023-11-15 14:07:33   INFO  epoch: 11/24, acc_iter=75307, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:02, time_cost(all): 1 day, 0:31:15/1 day, 3:07:54, loss=0.435124435908603, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=4.704746056101776, lr=0.05229223374251889
2023-11-15 14:08:32   INFO  epoch: 11/24, acc_iter=75357, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:13:12, time_cost(all): 1 day, 0:32:14/1 day, 4:08:39, loss=0.435013493760426, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=2.924237041285715, lr=0.052252141969778294
2023-11-15 14:09:31   INFO  epoch: 11/24, acc_iter=75407, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:13:59, time_cost(all): 1 day, 0:33:13/1 day, 2:31:00, loss=0.434902551612249, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=2.1411608463027267, lr=0.05221205019703771
2023-11-15 14:10:30   INFO  epoch: 11/24, acc_iter=75457, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:13:48, time_cost(all): 1 day, 0:34:12/1 day, 3:22:25, loss=0.434791609464072, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=0.9313265277357149, lr=0.052171958424297116
2023-11-15 14:11:29   INFO  epoch: 11/24, acc_iter=75507, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:06:20, time_cost(all): 1 day, 0:35:11/1 day, 4:06:32, loss=0.434680667315896, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=1.5602440697213784, lr=0.05213186665155653
2023-11-15 14:12:28   INFO  epoch: 11/24, acc_iter=75557, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:49, time_cost(all): 1 day, 0:36:10/1 day, 3:53:57, loss=0.434569725167719, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=3.4854955882536394, lr=0.05209177487881594
2023-11-15 14:13:27   INFO  epoch: 11/24, acc_iter=75607, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:06:19, time_cost(all): 1 day, 0:37:09/1 day, 3:19:20, loss=0.434458783019542, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=4.062009765653512, lr=0.05205168310607535
2023-11-15 14:14:25   INFO  epoch: 11/24, acc_iter=75657, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:09:04, time_cost(all): 1 day, 0:38:07/1 day, 3:07:51, loss=0.434347840871365, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=3.6083684267553835, lr=0.05201159133333476
2023-11-15 14:15:24   INFO  epoch: 11/24, acc_iter=75707, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:23, time_cost(all): 1 day, 0:39:06/1 day, 4:24:08, loss=0.434236898723189, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.04(1.03), norm=4.454122852373744, lr=0.05197149956059418
2023-11-15 14:16:23   INFO  epoch: 11/24, acc_iter=75757, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:26, time_cost(all): 1 day, 0:40:05/1 day, 2:21:08, loss=0.434125956575012, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=1.2469456697299157, lr=0.051931407787853594
2023-11-15 14:17:22   INFO  epoch: 11/24, acc_iter=75807, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:03:31, time_cost(all): 1 day, 0:41:04/1 day, 2:03:33, loss=0.434015014426835, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=0.6737291300241093, lr=0.051891316015113
2023-11-15 14:18:21   INFO  epoch: 11/24, acc_iter=75857, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:48, time_cost(all): 1 day, 0:42:03/1 day, 2:38:25, loss=0.433904072278658, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=2.290707993634242, lr=0.051851224242372415
2023-11-15 14:19:20   INFO  epoch: 11/24, acc_iter=75907, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:03:00, time_cost(all): 1 day, 0:43:02/1 day, 2:43:00, loss=0.433793130130482, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=3.5833177368679814, lr=0.05181113246963182
2023-11-15 14:20:19   INFO  epoch: 11/24, acc_iter=75957, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:57:58, time_cost(all): 1 day, 0:44:01/1 day, 4:10:56, loss=0.433682187982305, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.94(1.03), norm=1.8664072219992753, lr=0.051771040696891236
2023-11-15 14:21:18   INFO  epoch: 11/24, acc_iter=76007, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:00:04, time_cost(all): 1 day, 0:45:00/1 day, 2:19:01, loss=0.433571245834128, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=3.981326722521668, lr=0.051730948924150644
2023-11-15 14:22:17   INFO  epoch: 11/24, acc_iter=76057, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:59:27, time_cost(all): 1 day, 0:45:59/1 day, 2:53:01, loss=0.433460303685951, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=4.328508140361883, lr=0.05169085715141006
2023-11-15 14:23:16   INFO  epoch: 11/24, acc_iter=76107, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:08, time_cost(all): 1 day, 0:46:58/1 day, 2:47:41, loss=0.433349361537775, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=2.121724356222085, lr=0.05165076537866947
2023-11-15 14:24:15   INFO  epoch: 11/24, acc_iter=76157, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:40, time_cost(all): 1 day, 0:47:57/1 day, 2:06:23, loss=0.433238419389598, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.0173936613007248, lr=0.051610673605928886
2023-11-15 14:25:14   INFO  epoch: 11/24, acc_iter=76207, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:57:21, time_cost(all): 1 day, 0:48:56/1 day, 3:15:45, loss=0.433127477241421, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=3.16605048963711, lr=0.0515705818331883
2023-11-15 14:26:13   INFO  epoch: 11/24, acc_iter=76257, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:48, time_cost(all): 1 day, 0:49:55/1 day, 2:12:17, loss=0.433016535093244, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=0.8437470259824486, lr=0.051530490060447715
2023-11-15 14:27:12   INFO  epoch: 11/24, acc_iter=76307, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:54:01, time_cost(all): 1 day, 0:50:54/1 day, 4:07:15, loss=0.432905592945068, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=2.3400257937317566, lr=0.05149039828770712
2023-11-15 14:28:10   INFO  epoch: 11/24, acc_iter=76357, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:53:40, time_cost(all): 1 day, 0:51:52/1 day, 1:45:11, loss=0.432794650796891, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.05(1.03), norm=3.6086814722994816, lr=0.051450306514966536
2023-11-15 14:29:09   INFO  epoch: 11/24, acc_iter=76407, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:12, time_cost(all): 1 day, 0:52:51/1 day, 4:06:05, loss=0.432683708648714, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=0.959696557117358, lr=0.05141021474222594
2023-11-15 14:30:08   INFO  epoch: 11/24, acc_iter=76457, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:52:38, time_cost(all): 1 day, 0:53:50/1 day, 3:15:18, loss=0.432572766500537, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.99(1.03), norm=4.002829700250631, lr=0.05137012296948536
2023-11-15 14:31:07   INFO  epoch: 11/24, acc_iter=76507, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:24, time_cost(all): 1 day, 0:54:49/1 day, 3:50:30, loss=0.432461824352361, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=1.7956396156979122, lr=0.051330031196744764
2023-11-15 14:32:06   INFO  epoch: 11/24, acc_iter=76557, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:50, time_cost(all): 1 day, 0:55:48/1 day, 3:44:16, loss=0.432350882204184, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=3.1364559544401422, lr=0.05128993942400418
2023-11-15 14:33:05   INFO  epoch: 11/24, acc_iter=76607, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:23, time_cost(all): 1 day, 0:56:47/1 day, 3:28:26, loss=0.432239940056007, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=1.7909649041528763, lr=0.0512498476512636
2023-11-15 14:34:04   INFO  epoch: 11/24, acc_iter=76657, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:47:50, time_cost(all): 1 day, 0:57:46/1 day, 2:00:02, loss=0.43212899790783, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=1.0180294845685554, lr=0.05120975587852301
2023-11-15 14:35:03   INFO  epoch: 11/24, acc_iter=76707, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:48:04, time_cost(all): 1 day, 0:58:45/1 day, 1:27:51, loss=0.432018055759654, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=2.273048673435424, lr=0.05116966410578242
2023-11-15 14:36:02   INFO  epoch: 11/24, acc_iter=76757, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:19, time_cost(all): 1 day, 0:59:44/1 day, 2:55:06, loss=0.431907113611477, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=2.8282414660361685, lr=0.05112957233304183
2023-11-15 14:37:01   INFO  epoch: 11/24, acc_iter=76807, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:07, time_cost(all): 1 day, 1:00:43/1 day, 1:45:47, loss=0.4317961714633, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=3.400963658896459, lr=0.05108948056030124
2023-11-15 14:38:00   INFO  epoch: 11/24, acc_iter=76857, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:18, time_cost(all): 1 day, 1:01:42/1 day, 2:49:38, loss=0.431685229315123, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.472498120477678, lr=0.05104938878756065
2023-11-15 14:38:59   INFO  epoch: 11/24, acc_iter=76907, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:40:31, time_cost(all): 1 day, 1:02:41/1 day, 1:55:32, loss=0.431574287166947, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=4.826335340659797, lr=0.051009297014820064
2023-11-15 14:39:58   INFO  epoch: 11/24, acc_iter=76957, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:40:52, time_cost(all): 1 day, 1:03:40/1 day, 2:13:13, loss=0.43146334501877, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=3.9795805696637725, lr=0.05096920524207947
2023-11-15 14:40:57   INFO  epoch: 11/24, acc_iter=77007, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:40, time_cost(all): 1 day, 1:04:39/1 day, 2:05:33, loss=0.431352402870593, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=4.083431197032995, lr=0.050929113469338885
2023-11-15 14:41:55   INFO  epoch: 11/24, acc_iter=77057, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:03, time_cost(all): 1 day, 1:05:37/1 day, 2:36:25, loss=0.431241460722416, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=4.960257475384903, lr=0.050889021696598306
2023-11-15 14:42:54   INFO  epoch: 11/24, acc_iter=77107, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:05, time_cost(all): 1 day, 1:06:36/1 day, 2:04:42, loss=0.43113051857424, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=1.6870753387052388, lr=0.050848929923857714
2023-11-15 14:43:53   INFO  epoch: 11/24, acc_iter=77157, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:15, time_cost(all): 1 day, 1:07:35/1 day, 2:04:22, loss=0.431019576426063, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=1.0087514108275792, lr=0.05080883815111713
2023-11-15 14:44:52   INFO  epoch: 11/24, acc_iter=77207, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:36, time_cost(all): 1 day, 1:08:34/1 day, 3:05:48, loss=0.430908634277886, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=4.590952142885988, lr=0.050768746378376535
2023-11-15 14:45:51   INFO  epoch: 11/24, acc_iter=77257, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:26, time_cost(all): 1 day, 1:09:33/1 day, 1:40:25, loss=0.430797692129709, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.869051792546478, lr=0.05072865460563595
2023-11-15 14:46:50   INFO  epoch: 11/24, acc_iter=77307, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:35:48, time_cost(all): 1 day, 1:10:32/1 day, 2:34:51, loss=0.430686749981533, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=1.6002427531063934, lr=0.050688562832895356
2023-11-15 14:47:49   INFO  epoch: 11/24, acc_iter=77357, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:28, time_cost(all): 1 day, 1:11:31/1 day, 2:07:19, loss=0.430575807833356, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.20881781392682, lr=0.05064847106015477
2023-11-15 14:48:48   INFO  epoch: 11/24, acc_iter=77407, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:45, time_cost(all): 1 day, 1:12:30/1 day, 1:30:27, loss=0.430464865685179, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=2.2932723684874436, lr=0.05060837928741418
2023-11-15 14:49:47   INFO  epoch: 11/24, acc_iter=77457, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:07, time_cost(all): 1 day, 1:13:29/1 day, 2:02:34, loss=0.430353923537002, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=3.0241779735554988, lr=0.05056828751467359
2023-11-15 14:50:46   INFO  epoch: 11/24, acc_iter=77507, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:28:51, time_cost(all): 1 day, 1:14:28/1 day, 3:47:45, loss=0.430242981388825, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=2.1810967623065007, lr=0.05052819574193301
2023-11-15 14:51:45   INFO  epoch: 11/24, acc_iter=77557, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:27:49, time_cost(all): 1 day, 1:15:27/1 day, 1:52:25, loss=0.430132039240649, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=2.306901807340603, lr=0.05048810396919242
2023-11-15 14:52:44   INFO  epoch: 11/24, acc_iter=77607, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:47, time_cost(all): 1 day, 1:16:26/1 day, 1:25:37, loss=0.430021097092472, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=2.179880295558068, lr=0.050448012196451834
2023-11-15 14:53:43   INFO  epoch: 11/24, acc_iter=77657, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:15, time_cost(all): 1 day, 1:17:25/1 day, 3:39:34, loss=0.429910154944295, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.992462407262814, lr=0.05040792042371124
2023-11-15 14:54:42   INFO  epoch: 11/24, acc_iter=77707, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:27:26, time_cost(all): 1 day, 1:18:24/1 day, 1:13:02, loss=0.429799212796118, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=0.9948322959086325, lr=0.050367828650970656
2023-11-15 14:55:40   INFO  epoch: 11/24, acc_iter=77757, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:16, time_cost(all): 1 day, 1:19:22/1 day, 3:09:11, loss=0.429688270647942, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.143290052112133, lr=0.05032773687823006
2023-11-15 14:56:39   INFO  epoch: 11/24, acc_iter=77807, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:12, time_cost(all): 1 day, 1:20:21/1 day, 1:34:28, loss=0.429577328499765, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=4.467398630299456, lr=0.05028764510548948
2023-11-15 14:57:38   INFO  epoch: 11/24, acc_iter=77857, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:55, time_cost(all): 1 day, 1:21:20/1 day, 1:46:21, loss=0.429466386351588, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.3702060581899005, lr=0.050247553332748884
2023-11-15 14:58:37   INFO  epoch: 11/24, acc_iter=77907, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:35, time_cost(all): 1 day, 1:22:19/1 day, 1:59:35, loss=0.429355444203411, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=1.7041280982096971, lr=0.0502074615600083
2023-11-15 14:59:36   INFO  epoch: 11/24, acc_iter=77957, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:26, time_cost(all): 1 day, 1:23:18/1 day, 3:24:29, loss=0.429244502055235, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=4.506683937527269, lr=0.05016736978726772
2023-11-15 15:00:35   INFO  epoch: 11/24, acc_iter=78007, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:39, time_cost(all): 1 day, 1:24:17/1 day, 1:09:28, loss=0.429133559907058, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=4.5000802141064336, lr=0.05012727801452713
2023-11-15 15:01:34   INFO  epoch: 11/24, acc_iter=78057, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:08, time_cost(all): 1 day, 1:25:16/1 day, 3:23:15, loss=0.429022617758881, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=4.341763138267513, lr=0.05008718624178654
2023-11-15 15:02:33   INFO  epoch: 11/24, acc_iter=78107, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:41, time_cost(all): 1 day, 1:26:15/1 day, 2:32:05, loss=0.428911675610704, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.09076892941488, lr=0.05004709446904595
2023-11-15 15:03:32   INFO  epoch: 11/24, acc_iter=78157, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:50, time_cost(all): 1 day, 1:27:14/1 day, 3:24:51, loss=0.428800733462528, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=3.045341613925834, lr=0.05000700269630536
2023-11-15 15:04:31   INFO  epoch: 11/24, acc_iter=78207, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:00, time_cost(all): 1 day, 1:28:13/1 day, 2:45:14, loss=0.428689791314351, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=1.0846326428104571, lr=0.04996691092356478
2023-11-15 15:05:30   INFO  epoch: 11/24, acc_iter=78257, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:50, time_cost(all): 1 day, 1:29:12/1 day, 1:31:52, loss=0.428578849166174, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=2.1723762348962765, lr=0.049926819150824184
2023-11-15 15:06:29   INFO  epoch: 11/24, acc_iter=78307, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:02, time_cost(all): 1 day, 1:30:11/1 day, 3:21:42, loss=0.428467907017997, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=4.029164593924463, lr=0.0498867273780836
2023-11-15 15:07:28   INFO  epoch: 11/24, acc_iter=78357, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:57, time_cost(all): 1 day, 1:31:10/1 day, 3:23:27, loss=0.428356964869821, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.49303054117693, lr=0.049846635605343005
2023-11-15 15:08:27   INFO  epoch: 11/24, acc_iter=78407, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:42, time_cost(all): 1 day, 1:32:09/1 day, 2:45:18, loss=0.428246022721644, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.98(1.03), norm=3.840453035915704, lr=0.049806543832602426
2023-11-15 15:09:25   INFO  epoch: 11/24, acc_iter=78457, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:59, time_cost(all): 1 day, 1:33:07/1 day, 1:19:41, loss=0.428135080573467, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=0.73937873363333, lr=0.04976645205986184
2023-11-15 15:10:24   INFO  epoch: 11/24, acc_iter=78507, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:04, time_cost(all): 1 day, 1:34:06/1 day, 1:49:19, loss=0.42802413842529, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=2.8830043764203603, lr=0.04972636028712125
2023-11-15 15:11:23   INFO  epoch: 11/24, acc_iter=78557, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:46, time_cost(all): 1 day, 1:35:05/1 day, 3:18:25, loss=0.427913196277114, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.07(1.03), norm=1.394221086319032, lr=0.04968626851438066
2023-11-15 15:12:22   INFO  epoch: 11/24, acc_iter=78607, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:45, time_cost(all): 1 day, 1:36:04/1 day, 2:46:17, loss=0.427802254128937, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.09(1.03), norm=1.532817796577072, lr=0.04964617674164007
2023-11-15 15:13:21   INFO  epoch: 11/24, acc_iter=78657, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:42, time_cost(all): 1 day, 1:37:03/1 day, 3:22:50, loss=0.42769131198076, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=4.971110434588752, lr=0.04960608496889948
2023-11-15 15:14:20   INFO  epoch: 11/24, acc_iter=78707, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:47, time_cost(all): 1 day, 1:38:02/1 day, 1:03:03, loss=0.427580369832583, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=0.7216495334126869, lr=0.04956599319615889
2023-11-15 15:15:19   INFO  epoch: 11/24, acc_iter=78757, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:44, time_cost(all): 1 day, 1:39:01/1 day, 2:02:13, loss=0.427469427684407, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=1.1915118769049702, lr=0.049525901423418305
2023-11-15 15:16:18   INFO  epoch: 11/24, acc_iter=78807, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:38, time_cost(all): 1 day, 1:40:00/1 day, 3:01:18, loss=0.42735848553623, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=3.2498002070885548, lr=0.04948580965067771
2023-11-15 15:17:17   INFO  epoch: 11/24, acc_iter=78857, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:31, time_cost(all): 1 day, 1:40:59/1 day, 3:19:36, loss=0.427247543388053, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=3.2991951911804245, lr=0.04944571787793713
2023-11-15 15:18:16   INFO  epoch: 11/24, acc_iter=78907, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:47, time_cost(all): 1 day, 1:41:58/1 day, 1:13:06, loss=0.427136601239876, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.1733622644652533, lr=0.04940562610519655
2023-11-15 15:19:15   INFO  epoch: 11/24, acc_iter=78957, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:38, time_cost(all): 1 day, 1:42:57/1 day, 1:54:08, loss=0.4270256590917, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=1.4644557782138168, lr=0.049365534332455954
2023-11-15 15:20:14   INFO  epoch: 11/24, acc_iter=79007, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:44, time_cost(all): 1 day, 1:43:56/1 day, 2:04:22, loss=0.426914716943523, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.6929688773758236, lr=0.04932544255971537
2023-11-15 15:21:13   INFO  epoch: 12/24, acc_iter=79094, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:04:47, time_cost(all): 1 day, 1:44:55/1 day, 3:13:15, loss=0.426721677605695, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=0.7039333473702385, lr=0.04925568287514674
2023-11-15 15:22:12   INFO  epoch: 12/24, acc_iter=79144, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:29, time_cost(all): 1 day, 1:45:54/1 day, 2:13:18, loss=0.426610735457519, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=3.0408355977426504, lr=0.049215591102406156
2023-11-15 15:23:10   INFO  epoch: 12/24, acc_iter=79194, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:08:48, time_cost(all): 1 day, 1:46:52/1 day, 0:44:46, loss=0.426499793309342, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=4.639876096162917, lr=0.04917549932966556
2023-11-15 15:24:09   INFO  epoch: 12/24, acc_iter=79244, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:07:55, time_cost(all): 1 day, 1:47:51/1 day, 1:37:07, loss=0.426388851161165, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=4.670016309974052, lr=0.04913540755692498
2023-11-15 15:25:08   INFO  epoch: 12/24, acc_iter=79294, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/1:59:26, time_cost(all): 1 day, 1:48:50/1 day, 3:08:24, loss=0.426277909012988, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=1.835831578204296, lr=0.0490953157841844
2023-11-15 15:26:07   INFO  epoch: 12/24, acc_iter=79344, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:06:07, time_cost(all): 1 day, 1:49:49/1 day, 2:45:57, loss=0.426166966864812, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=4.513748467001648, lr=0.049055224011443806
2023-11-15 15:27:06   INFO  epoch: 12/24, acc_iter=79394, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:04:38, time_cost(all): 1 day, 1:50:48/1 day, 2:14:50, loss=0.426056024716635, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=3.4855077303379605, lr=0.04901513223870322
2023-11-15 15:28:05   INFO  epoch: 12/24, acc_iter=79444, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:03:32, time_cost(all): 1 day, 1:51:47/1 day, 0:49:39, loss=0.425945082568458, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=1.8916531381147963, lr=0.04897504046596263
2023-11-15 15:29:04   INFO  epoch: 12/24, acc_iter=79494, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:04:35, time_cost(all): 1 day, 1:52:46/1 day, 1:27:40, loss=0.425834140420281, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=3.70078404582673, lr=0.04893494869322204
2023-11-15 15:30:03   INFO  epoch: 12/24, acc_iter=79544, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:56:08, time_cost(all): 1 day, 1:53:45/1 day, 2:57:08, loss=0.425723198272105, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=3.3332045667821926, lr=0.04889485692048145
2023-11-15 15:31:02   INFO  epoch: 12/24, acc_iter=79594, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:54:15, time_cost(all): 1 day, 1:54:44/1 day, 1:56:33, loss=0.425612256123928, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.1699461327001301, lr=0.04885476514774086
2023-11-15 15:32:01   INFO  epoch: 12/24, acc_iter=79644, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:57:12, time_cost(all): 1 day, 1:55:43/1 day, 1:22:27, loss=0.425501313975751, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=3.022523374265997, lr=0.04881467337500027
2023-11-15 15:33:00   INFO  epoch: 12/24, acc_iter=79694, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:51:21, time_cost(all): 1 day, 1:56:42/1 day, 2:41:16, loss=0.425390371827574, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=4.032866847723375, lr=0.048774581602259684
2023-11-15 15:33:59   INFO  epoch: 12/24, acc_iter=79744, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:50:50, time_cost(all): 1 day, 1:57:41/1 day, 2:20:12, loss=0.425279429679398, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.94(1.03), norm=3.4899344274019377, lr=0.048734489829519105
2023-11-15 15:34:58   INFO  epoch: 12/24, acc_iter=79794, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:58:08, time_cost(all): 1 day, 1:58:40/1 day, 1:07:47, loss=0.425168487531221, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=3.988390704542765, lr=0.04869439805677851
2023-11-15 15:35:57   INFO  epoch: 12/24, acc_iter=79844, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:50:12, time_cost(all): 1 day, 1:59:39/1 day, 1:29:46, loss=0.425057545383044, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=3.205654131540693, lr=0.048654306284037926
2023-11-15 15:36:56   INFO  epoch: 12/24, acc_iter=79894, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:54:29, time_cost(all): 1 day, 2:00:38/1 day, 2:35:25, loss=0.424946603234867, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=1.014610843061628, lr=0.048614214511297334
2023-11-15 15:37:54   INFO  epoch: 12/24, acc_iter=79944, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:46:33, time_cost(all): 1 day, 2:01:36/1 day, 0:38:24, loss=0.424835661086691, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=4.22101475857922, lr=0.04857412273855675
2023-11-15 15:38:53   INFO  epoch: 12/24, acc_iter=79994, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:45:41, time_cost(all): 1 day, 2:02:35/1 day, 2:40:39, loss=0.424724718938514, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=3.6729499992216206, lr=0.048534030965816155
2023-11-15 15:39:52   INFO  epoch: 12/24, acc_iter=80044, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:53:56, time_cost(all): 1 day, 2:03:34/1 day, 0:36:40, loss=0.424613776790337, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=3.4018205052720174, lr=0.04849393919307557
2023-11-15 15:40:51   INFO  epoch: 12/24, acc_iter=80094, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:46:17, time_cost(all): 1 day, 2:04:33/1 day, 2:27:05, loss=0.42450283464216, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=3.056928480068145, lr=0.048453847420334976
2023-11-15 15:41:50   INFO  epoch: 12/24, acc_iter=80144, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:46:24, time_cost(all): 1 day, 2:05:32/1 day, 1:32:57, loss=0.424391892493984, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=0.8879082025214569, lr=0.04841375564759439
2023-11-15 15:42:49   INFO  epoch: 12/24, acc_iter=80194, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:49:45, time_cost(all): 1 day, 2:06:31/1 day, 1:43:37, loss=0.424280950345807, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.346410298239338, lr=0.04837366387485381
2023-11-15 15:43:48   INFO  epoch: 12/24, acc_iter=80244, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:47:51, time_cost(all): 1 day, 2:07:30/1 day, 2:13:25, loss=0.42417000819763, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=1.6543890079951935, lr=0.04833357210211322
2023-11-15 15:44:47   INFO  epoch: 12/24, acc_iter=80294, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:44:52, time_cost(all): 1 day, 2:08:29/1 day, 1:36:05, loss=0.424059066049453, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.3468790958929224, lr=0.04829348032937263
2023-11-15 15:45:46   INFO  epoch: 12/24, acc_iter=80344, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:47:20, time_cost(all): 1 day, 2:09:28/1 day, 1:32:43, loss=0.423948123901277, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=1.9139068037642022, lr=0.04825338855663204
2023-11-15 15:46:45   INFO  epoch: 12/24, acc_iter=80394, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:47:29, time_cost(all): 1 day, 2:10:27/1 day, 0:42:28, loss=0.4238371817531, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.97(1.03), norm=1.7653273295259653, lr=0.048213296783891454
2023-11-15 15:47:44   INFO  epoch: 12/24, acc_iter=80444, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:45:58, time_cost(all): 1 day, 2:11:26/1 day, 0:57:34, loss=0.423726239604923, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.07(1.03), norm=2.043602393347149, lr=0.04817320501115086
2023-11-15 15:48:43   INFO  epoch: 12/24, acc_iter=80494, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:37:04, time_cost(all): 1 day, 2:12:25/1 day, 2:38:39, loss=0.423615297456746, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=4.405833925069485, lr=0.048133113238410276
2023-11-15 15:49:42   INFO  epoch: 12/24, acc_iter=80544, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:36:21, time_cost(all): 1 day, 2:13:24/1 day, 2:03:14, loss=0.42350435530857, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=4.616175628080979, lr=0.04809302146566968
2023-11-15 15:50:41   INFO  epoch: 12/24, acc_iter=80594, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:42:07, time_cost(all): 1 day, 2:14:23/1 day, 1:49:16, loss=0.423393413160393, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=3.2594087012361737, lr=0.0480529296929291
2023-11-15 15:51:39   INFO  epoch: 12/24, acc_iter=80644, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:42:09, time_cost(all): 1 day, 2:15:21/1 day, 2:07:43, loss=0.423282471012216, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=2.7395418930870905, lr=0.04801283792018852
2023-11-15 15:52:38   INFO  epoch: 12/24, acc_iter=80694, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:32:58, time_cost(all): 1 day, 2:16:20/1 day, 0:48:09, loss=0.423171528864039, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=1.5762008909876677, lr=0.047972746147447926
2023-11-15 15:53:37   INFO  epoch: 12/24, acc_iter=80744, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:40:44, time_cost(all): 1 day, 2:17:19/1 day, 2:07:36, loss=0.423060586715863, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=2.4011062667518317, lr=0.04793265437470734
2023-11-15 15:54:36   INFO  epoch: 12/24, acc_iter=80794, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:39:26, time_cost(all): 1 day, 2:18:18/1 day, 0:55:44, loss=0.422949644567686, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=2.4057708867141265, lr=0.04789256260196675
2023-11-15 15:55:35   INFO  epoch: 12/24, acc_iter=80844, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:05, time_cost(all): 1 day, 2:19:17/1 day, 1:48:12, loss=0.422838702419509, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=4.770537348909678, lr=0.04785247082922616
2023-11-15 15:56:34   INFO  epoch: 12/24, acc_iter=80894, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:36:27, time_cost(all): 1 day, 2:20:16/1 day, 0:35:17, loss=0.422727760271332, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.726433606332181, lr=0.04781237905648557
2023-11-15 15:57:33   INFO  epoch: 12/24, acc_iter=80944, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:34:49, time_cost(all): 1 day, 2:21:15/1 day, 0:55:27, loss=0.422616818123156, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=3.2541001628542814, lr=0.04777228728374498
2023-11-15 15:58:32   INFO  epoch: 12/24, acc_iter=80994, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:33:36, time_cost(all): 1 day, 2:22:14/1 day, 0:21:56, loss=0.422505875974979, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=0.6717155981424869, lr=0.0477321955110044
2023-11-15 15:59:31   INFO  epoch: 12/24, acc_iter=81044, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:34:10, time_cost(all): 1 day, 2:23:13/1 day, 0:16:18, loss=0.422394933826802, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=3.1491859373180056, lr=0.047692103738263804
2023-11-15 16:00:30   INFO  epoch: 12/24, acc_iter=81094, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:31:29, time_cost(all): 1 day, 2:24:12/1 day, 1:42:53, loss=0.422283991678625, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=4.100299924869729, lr=0.047652011965523225
2023-11-15 16:01:29   INFO  epoch: 12/24, acc_iter=81144, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:25:08, time_cost(all): 1 day, 2:25:11/1 day, 1:10:16, loss=0.422173049530449, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=3.5635982002146025, lr=0.04761192019278264
2023-11-15 16:02:28   INFO  epoch: 12/24, acc_iter=81194, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:26:06, time_cost(all): 1 day, 2:26:10/1 day, 2:03:11, loss=0.422062107382272, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=4.276775465769628, lr=0.047571828420042046
2023-11-15 16:03:27   INFO  epoch: 12/24, acc_iter=81244, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:28:34, time_cost(all): 1 day, 2:27:09/1 day, 0:08:54, loss=0.421951165234095, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=0.9323592348350338, lr=0.04753173664730146
2023-11-15 16:04:26   INFO  epoch: 12/24, acc_iter=81294, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:23:00, time_cost(all): 1 day, 2:28:08/1 day, 0:46:09, loss=0.421840223085918, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=0.831365995980071, lr=0.04749164487456087
2023-11-15 16:05:24   INFO  epoch: 12/24, acc_iter=81344, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:23:57, time_cost(all): 1 day, 2:29:06/1 day, 0:32:54, loss=0.421729280937742, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=2.7809683629739532, lr=0.04745155310182028
2023-11-15 16:06:23   INFO  epoch: 12/24, acc_iter=81394, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:20:30, time_cost(all): 1 day, 2:30:05/1 day, 1:34:00, loss=0.421618338789565, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=3.4077048076066156, lr=0.04741146132907969
2023-11-15 16:07:22   INFO  epoch: 12/24, acc_iter=81444, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:25:13, time_cost(all): 1 day, 2:31:04/1 day, 1:12:03, loss=0.421507396641388, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=1.1262737676576724, lr=0.0473713695563391
2023-11-15 16:08:21   INFO  epoch: 12/24, acc_iter=81494, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:51, time_cost(all): 1 day, 2:32:03/1 day, 0:47:25, loss=0.421396454493211, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=0.5824519001724275, lr=0.04733127778359851
2023-11-15 16:09:20   INFO  epoch: 12/24, acc_iter=81544, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:18:54, time_cost(all): 1 day, 2:33:02/1 day, 2:06:35, loss=0.421285512345035, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.133249460717419, lr=0.04729118601085793
2023-11-15 16:10:19   INFO  epoch: 12/24, acc_iter=81594, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:20:18, time_cost(all): 1 day, 2:34:01/1 day, 1:58:24, loss=0.421174570196858, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=0.9088035468584437, lr=0.047251094238117346
2023-11-15 16:11:18   INFO  epoch: 12/24, acc_iter=81644, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:14:46, time_cost(all): 1 day, 2:35:00/1 day, 0:25:00, loss=0.421063628048681, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=4.201894823938323, lr=0.04721100246537675
2023-11-15 16:12:17   INFO  epoch: 12/24, acc_iter=81694, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:08, time_cost(all): 1 day, 2:35:59/1 day, 2:10:57, loss=0.420952685900504, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=2.163111802224975, lr=0.04717091069263617
2023-11-15 16:13:16   INFO  epoch: 12/24, acc_iter=81744, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:14:08, time_cost(all): 1 day, 2:36:58/1 day, 1:06:32, loss=0.420841743752328, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=3.1094370738124417, lr=0.047130818919895574
2023-11-15 16:14:15   INFO  epoch: 12/24, acc_iter=81794, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:19:07, time_cost(all): 1 day, 2:37:57/1 day, 2:10:02, loss=0.420730801604151, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=4.205410005048006, lr=0.04709072714715499
2023-11-15 16:15:14   INFO  epoch: 12/24, acc_iter=81844, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:30, time_cost(all): 1 day, 2:38:56/1 day, 2:00:53, loss=0.420619859455974, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.86(1.03), norm=2.8459409352249327, lr=0.047050635374414396
2023-11-15 16:16:13   INFO  epoch: 12/24, acc_iter=81894, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:09:46, time_cost(all): 1 day, 2:39:55/1 day, 0:58:14, loss=0.420508917307797, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=0.6206548879099218, lr=0.04701054360167381
2023-11-15 16:17:12   INFO  epoch: 12/24, acc_iter=81944, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:09:52, time_cost(all): 1 day, 2:40:54/1 day, 0:37:24, loss=0.420397975159621, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=4.972092830828704, lr=0.04697045182893322
2023-11-15 16:18:11   INFO  epoch: 12/24, acc_iter=81994, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:08:45, time_cost(all): 1 day, 2:41:53/1 day, 2:00:59, loss=0.420287033011444, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=1.2873407405575195, lr=0.04693036005619264
2023-11-15 16:19:09   INFO  epoch: 12/24, acc_iter=82044, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:07:58, time_cost(all): 1 day, 2:42:51/1 day, 0:49:12, loss=0.420176090863267, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.4331601072290994, lr=0.04689026828345205
2023-11-15 16:20:08   INFO  epoch: 12/24, acc_iter=82094, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:06:24, time_cost(all): 1 day, 2:43:50/1 day, 1:52:09, loss=0.42006514871509, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=2.7780018345752873, lr=0.04685017651071146
2023-11-15 16:21:07   INFO  epoch: 12/24, acc_iter=82144, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:09, time_cost(all): 1 day, 2:44:49/1 day, 1:19:18, loss=0.419954206566914, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=4.840991360985152, lr=0.046810084737970874
2023-11-15 16:22:06   INFO  epoch: 12/24, acc_iter=82194, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:09:09, time_cost(all): 1 day, 2:45:48/23:51:12, loss=0.419843264418737, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.8638758501016932, lr=0.04676999296523028
2023-11-15 16:23:05   INFO  epoch: 12/24, acc_iter=82244, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:03:33, time_cost(all): 1 day, 2:46:47/1 day, 0:20:00, loss=0.41973232227056, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=1.5978849198771439, lr=0.046729901192489695
2023-11-15 16:24:04   INFO  epoch: 12/24, acc_iter=82294, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:07:50, time_cost(all): 1 day, 2:47:46/1 day, 0:49:02, loss=0.419621380122383, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=3.0622729462933655, lr=0.0466898094197491
2023-11-15 16:25:03   INFO  epoch: 12/24, acc_iter=82344, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:03:46, time_cost(all): 1 day, 2:48:45/1 day, 1:33:39, loss=0.419510437974207, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=2.9062238285790536, lr=0.046649717647008516
2023-11-15 16:26:02   INFO  epoch: 12/24, acc_iter=82394, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:06:32, time_cost(all): 1 day, 2:49:44/1 day, 0:53:45, loss=0.41939949582603, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=3.16192957870949, lr=0.046609625874267924
2023-11-15 16:27:01   INFO  epoch: 12/24, acc_iter=82444, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:05:33, time_cost(all): 1 day, 2:50:43/1 day, 0:57:50, loss=0.419288553677853, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=4.472930416205485, lr=0.046569534101527345
2023-11-15 16:28:00   INFO  epoch: 12/24, acc_iter=82494, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:03:58, time_cost(all): 1 day, 2:51:42/1 day, 0:54:43, loss=0.419177611529676, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.08(1.03), norm=1.8494897536941477, lr=0.04652944232878676
2023-11-15 16:28:59   INFO  epoch: 12/24, acc_iter=82544, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:02:23, time_cost(all): 1 day, 2:52:41/23:53:07, loss=0.4190666693815, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=2.1676511537165783, lr=0.046489350556046166
2023-11-15 16:29:58   INFO  epoch: 12/24, acc_iter=82594, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:58:24, time_cost(all): 1 day, 2:53:40/1 day, 0:24:38, loss=0.418955727233323, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=1.204441826675053, lr=0.04644925878330558
2023-11-15 16:30:57   INFO  epoch: 12/24, acc_iter=82644, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:56:15, time_cost(all): 1 day, 2:54:39/1 day, 1:57:00, loss=0.418844785085146, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=2.8497797149635886, lr=0.04640916701056499
2023-11-15 16:31:56   INFO  epoch: 12/24, acc_iter=82694, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:55:53, time_cost(all): 1 day, 2:55:38/1 day, 1:19:42, loss=0.418733842936969, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=4.375599934882215, lr=0.0463690752378244
2023-11-15 16:32:54   INFO  epoch: 12/24, acc_iter=82744, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:15, time_cost(all): 1 day, 2:56:36/1 day, 0:07:51, loss=0.418622900788792, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=4.444848909534667, lr=0.04632898346508381
2023-11-15 16:33:53   INFO  epoch: 12/24, acc_iter=82794, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:54:22, time_cost(all): 1 day, 2:57:35/1 day, 0:34:32, loss=0.418511958640616, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=4.836120105026662, lr=0.04628889169234322
2023-11-15 16:34:52   INFO  epoch: 12/24, acc_iter=82844, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:52:42, time_cost(all): 1 day, 2:58:34/1 day, 0:16:09, loss=0.418401016492439, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.219545161022447, lr=0.04624879991960263
2023-11-15 16:35:51   INFO  epoch: 12/24, acc_iter=82894, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:54:26, time_cost(all): 1 day, 2:59:33/23:45:46, loss=0.418290074344262, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=3.265830958067535, lr=0.04620870814686205
2023-11-15 16:36:50   INFO  epoch: 12/24, acc_iter=82944, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:20, time_cost(all): 1 day, 3:00:32/1 day, 1:20:04, loss=0.418179132196085, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=0.7407118442089242, lr=0.046168616374121466
2023-11-15 16:37:49   INFO  epoch: 12/24, acc_iter=82994, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:53:58, time_cost(all): 1 day, 3:01:31/1 day, 0:12:50, loss=0.418068190047909, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=3.9800786004805375, lr=0.04612852460138087
2023-11-15 16:38:48   INFO  epoch: 12/24, acc_iter=83044, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:11, time_cost(all): 1 day, 3:02:30/1 day, 1:01:48, loss=0.417957247899732, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=0.7128113048214277, lr=0.04608843282864029
2023-11-15 16:39:47   INFO  epoch: 12/24, acc_iter=83094, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:49:38, time_cost(all): 1 day, 3:03:29/23:58:34, loss=0.417846305751555, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=3.1726819145373435, lr=0.0460483410558997
2023-11-15 16:40:46   INFO  epoch: 12/24, acc_iter=83144, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:49:41, time_cost(all): 1 day, 3:04:28/1 day, 0:22:22, loss=0.417735363603379, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=4.887971053467118, lr=0.04600824928315911
2023-11-15 16:41:45   INFO  epoch: 12/24, acc_iter=83194, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:49:40, time_cost(all): 1 day, 3:05:27/23:54:59, loss=0.417624421455202, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=2.9178020269574874, lr=0.04596815751041852
2023-11-15 16:42:44   INFO  epoch: 12/24, acc_iter=83244, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:48:42, time_cost(all): 1 day, 3:06:26/1 day, 0:05:58, loss=0.417513479307025, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=0.6205100498705962, lr=0.04592806573767793
2023-11-15 16:43:43   INFO  epoch: 12/24, acc_iter=83294, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:47:13, time_cost(all): 1 day, 3:07:25/23:27:35, loss=0.417402537158848, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=2.1410689014092537, lr=0.045887973964937344
2023-11-15 16:44:42   INFO  epoch: 12/24, acc_iter=83344, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:22, time_cost(all): 1 day, 3:08:24/1 day, 1:32:37, loss=0.417291595010672, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=0.7209412226645686, lr=0.045847882192196765
2023-11-15 16:45:41   INFO  epoch: 12/24, acc_iter=83394, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:41:50, time_cost(all): 1 day, 3:09:23/23:40:47, loss=0.417180652862495, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=4.197930015760553, lr=0.04580779041945617
2023-11-15 16:46:39   INFO  epoch: 12/24, acc_iter=83444, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:07, time_cost(all): 1 day, 3:10:21/23:42:05, loss=0.417069710714318, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=2.384641410185577, lr=0.045767698646715586
2023-11-15 16:47:38   INFO  epoch: 12/24, acc_iter=83494, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:00, time_cost(all): 1 day, 3:11:20/1 day, 0:21:00, loss=0.416958768566141, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.3091345143617175, lr=0.045727606873974994
2023-11-15 16:48:37   INFO  epoch: 12/24, acc_iter=83544, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:09, time_cost(all): 1 day, 3:12:19/1 day, 1:32:42, loss=0.416847826417965, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=2.50414031710957, lr=0.04568751510123441
2023-11-15 16:49:36   INFO  epoch: 12/24, acc_iter=83594, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:57, time_cost(all): 1 day, 3:13:18/1 day, 1:18:47, loss=0.416736884269788, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=4.347320059503879, lr=0.045647423328493815
2023-11-15 16:50:35   INFO  epoch: 12/24, acc_iter=83644, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:22, time_cost(all): 1 day, 3:14:17/1 day, 0:11:10, loss=0.416625942121611, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.3778448642325172, lr=0.04560733155575323
2023-11-15 16:51:34   INFO  epoch: 12/24, acc_iter=83694, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:58, time_cost(all): 1 day, 3:15:16/23:33:16, loss=0.416514999973434, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=3.262084365176984, lr=0.045567239783012636
2023-11-15 16:52:33   INFO  epoch: 12/24, acc_iter=83744, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:38, time_cost(all): 1 day, 3:16:15/1 day, 0:34:35, loss=0.416404057825258, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.543352084005205, lr=0.04552714801027205
2023-11-15 16:53:32   INFO  epoch: 12/24, acc_iter=83794, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:53, time_cost(all): 1 day, 3:17:14/23:33:16, loss=0.416293115677081, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=2.1707892630372307, lr=0.04548705623753147
2023-11-15 16:54:31   INFO  epoch: 12/24, acc_iter=83844, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:36, time_cost(all): 1 day, 3:18:13/1 day, 1:15:37, loss=0.416182173528904, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=3.4314907956563716, lr=0.04544696446479088
2023-11-15 16:55:30   INFO  epoch: 12/24, acc_iter=83894, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:35:23, time_cost(all): 1 day, 3:19:12/23:50:21, loss=0.416071231380727, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.22(1.03), norm=1.9013835134659862, lr=0.04540687269205029
2023-11-15 16:56:29   INFO  epoch: 12/24, acc_iter=83944, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:26, time_cost(all): 1 day, 3:20:11/1 day, 0:02:47, loss=0.415960289232551, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=3.427491212335057, lr=0.0453667809193097
2023-11-15 16:57:28   INFO  epoch: 12/24, acc_iter=83994, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:30:54, time_cost(all): 1 day, 3:21:10/1 day, 0:10:22, loss=0.415849347084374, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=2.120118059237022, lr=0.045326689146569114
2023-11-15 16:58:27   INFO  epoch: 12/24, acc_iter=84044, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:30:22, time_cost(all): 1 day, 3:22:09/1 day, 1:22:13, loss=0.415738404936197, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.2324598718925035, lr=0.04528659737382852
2023-11-15 16:59:26   INFO  epoch: 12/24, acc_iter=84094, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:31, time_cost(all): 1 day, 3:23:08/1 day, 0:49:11, loss=0.41562746278802, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.03(1.03), norm=2.8919610614391185, lr=0.045246505601087936
2023-11-15 17:00:24   INFO  epoch: 12/24, acc_iter=84144, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:14, time_cost(all): 1 day, 3:24:06/1 day, 0:25:04, loss=0.415516520639844, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=3.503191718416284, lr=0.04520641382834734
2023-11-15 17:01:23   INFO  epoch: 12/24, acc_iter=84194, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:08, time_cost(all): 1 day, 3:25:05/1 day, 0:03:09, loss=0.415405578491667, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.23(1.03), norm=4.234575416681116, lr=0.04516632205560676
2023-11-15 17:02:22   INFO  epoch: 12/24, acc_iter=84244, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:25:58, time_cost(all): 1 day, 3:26:04/23:11:06, loss=0.41529463634349, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=3.109044877100127, lr=0.04512623028286618
2023-11-15 17:03:21   INFO  epoch: 12/24, acc_iter=84294, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:57, time_cost(all): 1 day, 3:27:03/23:48:17, loss=0.415183694195313, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=2.9997609467723216, lr=0.045086138510125585
2023-11-15 17:04:20   INFO  epoch: 12/24, acc_iter=84344, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:26:05, time_cost(all): 1 day, 3:28:02/23:12:50, loss=0.415072752047137, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=1.1957332975792367, lr=0.045046046737385
2023-11-15 17:05:19   INFO  epoch: 12/24, acc_iter=84394, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:12, time_cost(all): 1 day, 3:29:01/1 day, 0:45:53, loss=0.41496180989896, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=2.6524222841436065, lr=0.04500595496464441
2023-11-15 17:06:18   INFO  epoch: 12/24, acc_iter=84444, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:09, time_cost(all): 1 day, 3:30:00/23:26:55, loss=0.414850867750783, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.936969970469569, lr=0.04496586319190382
2023-11-15 17:07:17   INFO  epoch: 12/24, acc_iter=84494, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:06, time_cost(all): 1 day, 3:30:59/1 day, 0:13:00, loss=0.414739925602606, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=4.042685779809414, lr=0.04492577141916323
2023-11-15 17:08:16   INFO  epoch: 12/24, acc_iter=84544, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:35, time_cost(all): 1 day, 3:31:58/1 day, 0:31:30, loss=0.414628983454429, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=4.174514358245012, lr=0.04488567964642264
2023-11-15 17:09:15   INFO  epoch: 12/24, acc_iter=84594, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:55, time_cost(all): 1 day, 3:32:57/23:24:05, loss=0.414518041306253, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=1.870487639676218, lr=0.04484558787368205
2023-11-15 17:10:14   INFO  epoch: 12/24, acc_iter=84644, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:16, time_cost(all): 1 day, 3:33:56/23:22:56, loss=0.414407099158076, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=1.5717227258158422, lr=0.044805496100941464
2023-11-15 17:11:13   INFO  epoch: 12/24, acc_iter=84694, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:59, time_cost(all): 1 day, 3:34:55/23:49:24, loss=0.414296157009899, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.893556797215822, lr=0.044765404328200885
2023-11-15 17:12:12   INFO  epoch: 12/24, acc_iter=84744, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:18:06, time_cost(all): 1 day, 3:35:54/1 day, 1:04:56, loss=0.414185214861722, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=2.7705480024293108, lr=0.04472531255546029
2023-11-15 17:13:11   INFO  epoch: 12/24, acc_iter=84794, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:43, time_cost(all): 1 day, 3:36:53/23:27:36, loss=0.414074272713546, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=0.8579926473555934, lr=0.044685220782719706
2023-11-15 17:14:09   INFO  epoch: 12/24, acc_iter=84844, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:58, time_cost(all): 1 day, 3:37:51/23:13:57, loss=0.413963330565369, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=3.2469807554215513, lr=0.04464512900997911
2023-11-15 17:15:08   INFO  epoch: 12/24, acc_iter=84894, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:59, time_cost(all): 1 day, 3:38:50/23:02:29, loss=0.413852388417192, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.330186724466788, lr=0.04460503723723853
2023-11-15 17:16:07   INFO  epoch: 12/24, acc_iter=84944, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:08, time_cost(all): 1 day, 3:39:49/1 day, 1:15:36, loss=0.413741446269015, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=4.996155575078854, lr=0.04456494546449794
2023-11-15 17:17:06   INFO  epoch: 12/24, acc_iter=84994, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:36, time_cost(all): 1 day, 3:40:48/1 day, 0:19:56, loss=0.413630504120839, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=3.1443843131028415, lr=0.04452485369175735
2023-11-15 17:18:05   INFO  epoch: 12/24, acc_iter=85044, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:42, time_cost(all): 1 day, 3:41:47/23:44:02, loss=0.413519561972662, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.06(1.03), norm=0.9437914540156855, lr=0.04448476191901676
2023-11-15 17:19:04   INFO  epoch: 12/24, acc_iter=85094, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:39, time_cost(all): 1 day, 3:42:46/23:05:21, loss=0.413408619824485, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.88(1.03), norm=1.8249516509155777, lr=0.044444670146276184
2023-11-15 17:20:03   INFO  epoch: 12/24, acc_iter=85144, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:55, time_cost(all): 1 day, 3:43:45/1 day, 0:19:42, loss=0.413297677676308, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.115920009921831, lr=0.04440457837353559
2023-11-15 17:21:02   INFO  epoch: 12/24, acc_iter=85194, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:39, time_cost(all): 1 day, 3:44:44/1 day, 0:57:20, loss=0.413186735528132, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.279413317504417, lr=0.044364486600795006
2023-11-15 17:22:01   INFO  epoch: 12/24, acc_iter=85244, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:18, time_cost(all): 1 day, 3:45:43/23:33:22, loss=0.413075793379955, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=3.640544150067971, lr=0.04432439482805441
2023-11-15 17:23:00   INFO  epoch: 12/24, acc_iter=85294, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:22, time_cost(all): 1 day, 3:46:42/23:32:52, loss=0.412964851231778, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=1.6778575365029347, lr=0.04428430305531383
2023-11-15 17:23:59   INFO  epoch: 12/24, acc_iter=85344, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:33, time_cost(all): 1 day, 3:47:41/23:41:09, loss=0.412853909083601, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=1.853083180908523, lr=0.044244211282573234
2023-11-15 17:24:58   INFO  epoch: 12/24, acc_iter=85394, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:36, time_cost(all): 1 day, 3:48:40/1 day, 0:34:03, loss=0.412742966935425, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=0.7204446972712553, lr=0.04420411950983265
2023-11-15 17:25:57   INFO  epoch: 12/24, acc_iter=85444, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:47, time_cost(all): 1 day, 3:49:39/23:20:45, loss=0.412632024787248, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.717865007974743, lr=0.044164027737092056
2023-11-15 17:26:56   INFO  epoch: 12/24, acc_iter=85494, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:45, time_cost(all): 1 day, 3:50:38/23:16:34, loss=0.412521082639071, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=1.06604964657599, lr=0.04412393596435147
2023-11-15 17:27:54   INFO  epoch: 12/24, acc_iter=85544, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:44, time_cost(all): 1 day, 3:51:36/1 day, 0:55:02, loss=0.412410140490894, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=4.235614952727612, lr=0.04408384419161089
2023-11-15 17:28:53   INFO  epoch: 12/24, acc_iter=85594, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 1 day, 3:52:35/23:31:42, loss=0.412299198342718, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.4116210652964196, lr=0.0440437524188703
2023-11-15 17:29:52   INFO  epoch: 13/24, acc_iter=85681, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:10:11, time_cost(all): 1 day, 3:53:34/23:20:14, loss=0.41210615900489, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.07(1.03), norm=1.8851991830663932, lr=0.04397399273430167
2023-11-15 17:30:51   INFO  epoch: 13/24, acc_iter=85731, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:09:21, time_cost(all): 1 day, 3:54:33/23:31:49, loss=0.411995216856713, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.655946481634819, lr=0.043933900961561086
2023-11-15 17:31:50   INFO  epoch: 13/24, acc_iter=85781, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:01:33, time_cost(all): 1 day, 3:55:32/1 day, 0:01:49, loss=0.411884274708537, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=0.646336537306665, lr=0.04389380918882049
2023-11-15 17:32:49   INFO  epoch: 13/24, acc_iter=85831, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:10:20, time_cost(all): 1 day, 3:56:31/1 day, 0:47:25, loss=0.41177333256036, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=0.5019900322301026, lr=0.04385371741607991
2023-11-15 17:33:48   INFO  epoch: 13/24, acc_iter=85881, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:04:08, time_cost(all): 1 day, 3:57:30/23:19:07, loss=0.411662390412183, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=4.228960008765172, lr=0.04381362564333932
2023-11-15 17:34:47   INFO  epoch: 13/24, acc_iter=85931, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:57:50, time_cost(all): 1 day, 3:58:29/23:59:22, loss=0.411551448264006, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=1.0340304189101859, lr=0.04377353387059873
2023-11-15 17:35:46   INFO  epoch: 13/24, acc_iter=85981, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:03:19, time_cost(all): 1 day, 3:59:28/1 day, 0:50:55, loss=0.41144050611583, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=2.0190068642921313, lr=0.04373344209785814
2023-11-15 17:36:45   INFO  epoch: 13/24, acc_iter=86031, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:03:19, time_cost(all): 1 day, 4:00:27/23:29:38, loss=0.411329563967653, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.5696095901602845, lr=0.043693350325117564
2023-11-15 17:37:44   INFO  epoch: 13/24, acc_iter=86081, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:56:47, time_cost(all): 1 day, 4:01:26/1 day, 0:50:33, loss=0.411218621819476, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.214024223975215, lr=0.04365325855237697
2023-11-15 17:38:43   INFO  epoch: 13/24, acc_iter=86131, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:56:51, time_cost(all): 1 day, 4:02:25/23:42:39, loss=0.411107679671299, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=1.8566394210979524, lr=0.043613166779636385
2023-11-15 17:39:42   INFO  epoch: 13/24, acc_iter=86181, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:57:20, time_cost(all): 1 day, 4:03:24/1 day, 0:32:52, loss=0.410996737523123, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=2.4441475785815197, lr=0.04357307500689579
2023-11-15 17:40:41   INFO  epoch: 13/24, acc_iter=86231, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:59:29, time_cost(all): 1 day, 4:04:23/1 day, 0:00:24, loss=0.410885795374946, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=4.879825326836526, lr=0.043532983234155206
2023-11-15 17:41:39   INFO  epoch: 13/24, acc_iter=86281, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:58:00, time_cost(all): 1 day, 4:05:21/22:38:31, loss=0.410774853226769, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=3.6884888994816327, lr=0.043492891461414614
2023-11-15 17:42:38   INFO  epoch: 13/24, acc_iter=86331, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:54:11, time_cost(all): 1 day, 4:06:20/22:52:15, loss=0.410663911078592, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=1.5759531803827256, lr=0.04345279968867403
2023-11-15 17:43:37   INFO  epoch: 13/24, acc_iter=86381, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:57:47, time_cost(all): 1 day, 4:07:19/1 day, 0:03:02, loss=0.410552968930416, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=1.793659697507217, lr=0.043412707915933435
2023-11-15 17:44:36   INFO  epoch: 13/24, acc_iter=86431, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:49:49, time_cost(all): 1 day, 4:08:18/1 day, 0:06:17, loss=0.410442026782239, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.95(1.03), norm=4.2500127258792215, lr=0.04337261614319285
2023-11-15 17:45:35   INFO  epoch: 13/24, acc_iter=86481, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:56:55, time_cost(all): 1 day, 4:09:17/22:25:27, loss=0.410331084634062, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.4394802036779437, lr=0.04333252437045227
2023-11-15 17:46:34   INFO  epoch: 13/24, acc_iter=86531, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:47:02, time_cost(all): 1 day, 4:10:16/1 day, 0:05:05, loss=0.410220142485885, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=1.807455737974915, lr=0.04329243259771168
2023-11-15 17:47:33   INFO  epoch: 13/24, acc_iter=86581, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:48:37, time_cost(all): 1 day, 4:11:15/1 day, 0:37:23, loss=0.410109200337709, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=2.924932331664375, lr=0.04325234082497109
2023-11-15 17:48:32   INFO  epoch: 13/24, acc_iter=86631, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:44:42, time_cost(all): 1 day, 4:12:14/23:14:30, loss=0.409998258189532, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.782194262637059, lr=0.0432122490522305
2023-11-15 17:49:31   INFO  epoch: 13/24, acc_iter=86681, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:46:38, time_cost(all): 1 day, 4:13:13/23:46:41, loss=0.409887316041355, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=4.732824618291181, lr=0.04317215727948991
2023-11-15 17:50:30   INFO  epoch: 13/24, acc_iter=86731, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:25, time_cost(all): 1 day, 4:14:12/1 day, 0:26:03, loss=0.409776373893178, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.9(1.03), norm=1.8505586823010105, lr=0.04313206550674932
2023-11-15 17:51:29   INFO  epoch: 13/24, acc_iter=86781, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:45:50, time_cost(all): 1 day, 4:15:11/23:44:06, loss=0.409665431745002, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=3.7106916223360598, lr=0.043091973734008734
2023-11-15 17:52:28   INFO  epoch: 13/24, acc_iter=86831, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:41:40, time_cost(all): 1 day, 4:16:10/1 day, 0:37:57, loss=0.409554489596825, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=2.03002765731128, lr=0.04305188196126814
2023-11-15 17:53:27   INFO  epoch: 13/24, acc_iter=86881, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:41:02, time_cost(all): 1 day, 4:17:09/22:48:36, loss=0.409443547448648, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=1.9069410384356784, lr=0.043011790188527556
2023-11-15 17:54:26   INFO  epoch: 13/24, acc_iter=86931, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:44:36, time_cost(all): 1 day, 4:18:08/23:59:20, loss=0.409332605300471, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=1.5421537697675085, lr=0.04297169841578698
2023-11-15 17:55:24   INFO  epoch: 13/24, acc_iter=86981, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:46:14, time_cost(all): 1 day, 4:19:06/23:16:11, loss=0.409221663152295, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=2.035521286594636, lr=0.042931606643046384
2023-11-15 17:56:23   INFO  epoch: 13/24, acc_iter=87031, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:36:58, time_cost(all): 1 day, 4:20:05/23:01:27, loss=0.409110721004118, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=3.849875108415124, lr=0.0428915148703058
2023-11-15 17:57:22   INFO  epoch: 13/24, acc_iter=87081, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:38:55, time_cost(all): 1 day, 4:21:04/23:40:49, loss=0.408999778855941, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=1.4907670275730698, lr=0.042851423097565206
2023-11-15 17:58:21   INFO  epoch: 13/24, acc_iter=87131, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:44:41, time_cost(all): 1 day, 4:22:03/23:54:09, loss=0.408888836707764, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=3.6574515837505337, lr=0.04281133132482462
2023-11-15 17:59:20   INFO  epoch: 13/24, acc_iter=87181, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:39:11, time_cost(all): 1 day, 4:23:02/22:22:47, loss=0.408777894559588, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=4.172825190642868, lr=0.04277123955208403
2023-11-15 18:00:19   INFO  epoch: 13/24, acc_iter=87231, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:36:08, time_cost(all): 1 day, 4:24:01/23:41:37, loss=0.408666952411411, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.027718176767025, lr=0.04273114777934344
2023-11-15 18:01:18   INFO  epoch: 13/24, acc_iter=87281, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:32:15, time_cost(all): 1 day, 4:25:00/23:06:00, loss=0.408556010263234, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.18(1.03), norm=4.249626199772644, lr=0.04269105600660285
2023-11-15 18:02:17   INFO  epoch: 13/24, acc_iter=87331, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:39:53, time_cost(all): 1 day, 4:25:59/22:13:20, loss=0.408445068115057, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=4.164153923804057, lr=0.04265096423386226
2023-11-15 18:03:16   INFO  epoch: 13/24, acc_iter=87381, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:38:31, time_cost(all): 1 day, 4:26:58/22:38:56, loss=0.408334125966881, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=2.030148196619772, lr=0.042610872461121684
2023-11-15 18:04:15   INFO  epoch: 13/24, acc_iter=87431, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:37:02, time_cost(all): 1 day, 4:27:57/22:50:11, loss=0.408223183818704, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=2.445069662615259, lr=0.04257078068838109
2023-11-15 18:05:14   INFO  epoch: 13/24, acc_iter=87481, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:30:02, time_cost(all): 1 day, 4:28:56/23:52:31, loss=0.408112241670527, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=2.4481316805270255, lr=0.042530688915640505
2023-11-15 18:06:13   INFO  epoch: 13/24, acc_iter=87531, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:30:17, time_cost(all): 1 day, 4:29:55/23:25:44, loss=0.40800129952235, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.11(1.03), norm=4.302589114622102, lr=0.04249059714289991
2023-11-15 18:07:12   INFO  epoch: 13/24, acc_iter=87581, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:30:08, time_cost(all): 1 day, 4:30:54/1 day, 0:01:24, loss=0.407890357374174, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=1.066021014469442, lr=0.042450505370159326
2023-11-15 18:08:11   INFO  epoch: 13/24, acc_iter=87631, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:33:52, time_cost(all): 1 day, 4:31:53/23:56:01, loss=0.407779415225997, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.173589420278056, lr=0.042410413597418734
2023-11-15 18:09:09   INFO  epoch: 13/24, acc_iter=87681, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:25:17, time_cost(all): 1 day, 4:32:51/23:56:24, loss=0.40766847307782, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=0.8278086550728141, lr=0.04237032182467815
2023-11-15 18:10:08   INFO  epoch: 13/24, acc_iter=87731, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:24:39, time_cost(all): 1 day, 4:33:50/23:50:58, loss=0.407557530929643, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.1277713902082525, lr=0.04233023005193756
2023-11-15 18:11:07   INFO  epoch: 13/24, acc_iter=87781, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:25:44, time_cost(all): 1 day, 4:34:49/23:21:24, loss=0.407446588781467, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=2.4138943243253093, lr=0.04229013827919697
2023-11-15 18:12:06   INFO  epoch: 13/24, acc_iter=87831, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:26:55, time_cost(all): 1 day, 4:35:48/22:21:15, loss=0.40733564663329, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=2.2056665061107736, lr=0.04225004650645639
2023-11-15 18:13:05   INFO  epoch: 13/24, acc_iter=87881, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:28, time_cost(all): 1 day, 4:36:47/22:42:53, loss=0.407224704485113, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.11(1.03), norm=0.7723866954444397, lr=0.042209954733715804
2023-11-15 18:14:04   INFO  epoch: 13/24, acc_iter=87931, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:27:22, time_cost(all): 1 day, 4:37:46/22:03:29, loss=0.407113762336936, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=2.248026485806836, lr=0.04216986296097521
2023-11-15 18:15:03   INFO  epoch: 13/24, acc_iter=87981, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:20:45, time_cost(all): 1 day, 4:38:45/22:29:14, loss=0.40700282018876, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.889231455229641, lr=0.042129771188234626
2023-11-15 18:16:02   INFO  epoch: 13/24, acc_iter=88031, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:19:15, time_cost(all): 1 day, 4:39:44/22:15:10, loss=0.406891878040583, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=2.3306061914061713, lr=0.04208967941549403
2023-11-15 18:17:01   INFO  epoch: 13/24, acc_iter=88081, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:25:03, time_cost(all): 1 day, 4:40:43/23:37:53, loss=0.406780935892406, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=3.455879892653327, lr=0.04204958764275345
2023-11-15 18:18:00   INFO  epoch: 13/24, acc_iter=88131, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:45, time_cost(all): 1 day, 4:41:42/22:11:22, loss=0.406669993744229, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=1.9467195128953467, lr=0.042009495870012854
2023-11-15 18:18:59   INFO  epoch: 13/24, acc_iter=88181, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:17:05, time_cost(all): 1 day, 4:42:41/1 day, 0:05:24, loss=0.406559051596053, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=0.9229487994837522, lr=0.04196940409727227
2023-11-15 18:19:58   INFO  epoch: 13/24, acc_iter=88231, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:14:34, time_cost(all): 1 day, 4:43:40/22:33:58, loss=0.406448109447876, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=2.459032582409615, lr=0.041929312324531676
2023-11-15 18:20:57   INFO  epoch: 13/24, acc_iter=88281, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:20:44, time_cost(all): 1 day, 4:44:39/23:00:27, loss=0.406337167299699, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=4.581299682818639, lr=0.0418892205517911
2023-11-15 18:21:56   INFO  epoch: 13/24, acc_iter=88331, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:14:54, time_cost(all): 1 day, 4:45:38/23:07:08, loss=0.406226225151522, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.086486957610664, lr=0.04184912877905051
2023-11-15 18:22:54   INFO  epoch: 13/24, acc_iter=88381, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:12:39, time_cost(all): 1 day, 4:46:36/23:41:01, loss=0.406115283003346, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=1.1826178139992936, lr=0.04180903700630992
2023-11-15 18:23:53   INFO  epoch: 13/24, acc_iter=88431, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:37, time_cost(all): 1 day, 4:47:35/22:51:07, loss=0.406004340855169, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=0.6688949311167982, lr=0.04176894523356933
2023-11-15 18:24:52   INFO  epoch: 13/24, acc_iter=88481, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:16:29, time_cost(all): 1 day, 4:48:34/22:12:55, loss=0.405893398706992, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=2.352748806015416, lr=0.04172885346082874
2023-11-15 18:25:51   INFO  epoch: 13/24, acc_iter=88531, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:13, time_cost(all): 1 day, 4:49:33/23:28:42, loss=0.405782456558815, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=3.646141013424689, lr=0.041688761688088154
2023-11-15 18:26:50   INFO  epoch: 13/24, acc_iter=88581, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:09:14, time_cost(all): 1 day, 4:50:32/23:51:05, loss=0.405671514410639, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=3.9849014768241835, lr=0.04164866991534756
2023-11-15 18:27:49   INFO  epoch: 13/24, acc_iter=88631, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:11:47, time_cost(all): 1 day, 4:51:31/22:58:56, loss=0.405560572262462, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.92(1.03), norm=2.133200967796248, lr=0.041608578142606975
2023-11-15 18:28:48   INFO  epoch: 13/24, acc_iter=88681, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:06:55, time_cost(all): 1 day, 4:52:30/23:16:25, loss=0.405449630114285, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=3.461810277071092, lr=0.04156848636986638
2023-11-15 18:29:47   INFO  epoch: 13/24, acc_iter=88731, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:10:26, time_cost(all): 1 day, 4:53:29/23:40:31, loss=0.405338687966108, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.83(1.03), norm=0.8360850445968098, lr=0.0415283945971258
2023-11-15 18:30:46   INFO  epoch: 13/24, acc_iter=88781, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:07:46, time_cost(all): 1 day, 4:54:28/22:52:28, loss=0.405227745817932, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=2.93054250667635, lr=0.04148830282438522
2023-11-15 18:31:45   INFO  epoch: 13/24, acc_iter=88831, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:17, time_cost(all): 1 day, 4:55:27/22:36:37, loss=0.405116803669755, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=3.7951530762526406, lr=0.041448211051644625
2023-11-15 18:32:44   INFO  epoch: 13/24, acc_iter=88881, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:01, time_cost(all): 1 day, 4:56:26/23:19:34, loss=0.405005861521578, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=4.868640305458754, lr=0.04140811927890404
2023-11-15 18:33:43   INFO  epoch: 13/24, acc_iter=88931, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:07:36, time_cost(all): 1 day, 4:57:25/22:02:06, loss=0.404894919373401, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=1.3360410427570573, lr=0.041368027506163446
2023-11-15 18:34:42   INFO  epoch: 13/24, acc_iter=88981, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:03:06, time_cost(all): 1 day, 4:58:24/21:47:57, loss=0.404783977225225, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.0580044228023404, lr=0.04132793573342286
2023-11-15 18:35:41   INFO  epoch: 13/24, acc_iter=89031, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:40, time_cost(all): 1 day, 4:59:23/23:01:06, loss=0.404673035077048, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.5133289277460158, lr=0.04128784396068227
2023-11-15 18:36:39   INFO  epoch: 13/24, acc_iter=89081, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:04:19, time_cost(all): 1 day, 5:00:21/21:53:08, loss=0.404562092928871, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.7309312123230183, lr=0.04124775218794168
2023-11-15 18:37:38   INFO  epoch: 13/24, acc_iter=89131, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:58:55, time_cost(all): 1 day, 5:01:20/23:23:24, loss=0.404451150780694, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=4.781473307547428, lr=0.04120766041520109
2023-11-15 18:38:37   INFO  epoch: 13/24, acc_iter=89181, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:57:54, time_cost(all): 1 day, 5:02:19/23:10:46, loss=0.404340208632518, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=0.527929074278145, lr=0.04116756864246051
2023-11-15 18:39:36   INFO  epoch: 13/24, acc_iter=89231, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:56:46, time_cost(all): 1 day, 5:03:18/23:09:46, loss=0.404229266484341, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=2.265636984885322, lr=0.041127476869719924
2023-11-15 18:40:35   INFO  epoch: 13/24, acc_iter=89281, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:56:31, time_cost(all): 1 day, 5:04:17/23:14:31, loss=0.404118324336164, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=2.602241528840589, lr=0.04108738509697933
2023-11-15 18:41:34   INFO  epoch: 13/24, acc_iter=89331, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:54:13, time_cost(all): 1 day, 5:05:16/23:25:40, loss=0.404007382187987, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.106317264921699, lr=0.041047293324238746
2023-11-15 18:42:33   INFO  epoch: 13/24, acc_iter=89381, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:56:10, time_cost(all): 1 day, 5:06:15/21:54:34, loss=0.403896440039811, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=1.0444650883761522, lr=0.04100720155149815
2023-11-15 18:43:32   INFO  epoch: 13/24, acc_iter=89431, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:17, time_cost(all): 1 day, 5:07:14/22:07:40, loss=0.403785497891634, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=3.506893266967532, lr=0.04096710977875757
2023-11-15 18:44:31   INFO  epoch: 13/24, acc_iter=89481, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:52:41, time_cost(all): 1 day, 5:08:13/22:56:52, loss=0.403674555743457, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=4.4212032513948785, lr=0.040927018006016974
2023-11-15 18:45:30   INFO  epoch: 13/24, acc_iter=89531, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:52:48, time_cost(all): 1 day, 5:09:12/23:14:01, loss=0.40356361359528, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=4.8148803464661425, lr=0.04088692623327639
2023-11-15 18:46:29   INFO  epoch: 13/24, acc_iter=89581, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:54:11, time_cost(all): 1 day, 5:10:11/21:58:32, loss=0.403452671447104, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=4.69928507027223, lr=0.040846834460535796
2023-11-15 18:47:28   INFO  epoch: 13/24, acc_iter=89631, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:52:16, time_cost(all): 1 day, 5:11:10/21:52:30, loss=0.403341729298927, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=3.7064398752504766, lr=0.04080674268779522
2023-11-15 18:48:27   INFO  epoch: 13/24, acc_iter=89681, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:49:41, time_cost(all): 1 day, 5:12:09/22:23:01, loss=0.40323078715075, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=1.3666056960606985, lr=0.04076665091505463
2023-11-15 18:49:26   INFO  epoch: 13/24, acc_iter=89731, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:51:09, time_cost(all): 1 day, 5:13:08/21:54:32, loss=0.403119845002573, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=3.7235489933144357, lr=0.04072655914231404
2023-11-15 18:50:24   INFO  epoch: 13/24, acc_iter=89781, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:30, time_cost(all): 1 day, 5:14:06/23:36:27, loss=0.403008902854397, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=1.742920190364807, lr=0.04068646736957345
2023-11-15 18:51:23   INFO  epoch: 13/24, acc_iter=89831, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:45:42, time_cost(all): 1 day, 5:15:05/22:20:31, loss=0.40289796070622, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=2.0874945553531505, lr=0.040646375596832866
2023-11-15 18:52:22   INFO  epoch: 13/24, acc_iter=89881, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:46:33, time_cost(all): 1 day, 5:16:04/22:17:49, loss=0.402787018558043, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.276594269641374, lr=0.040606283824092274
2023-11-15 18:53:21   INFO  epoch: 13/24, acc_iter=89931, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:20, time_cost(all): 1 day, 5:17:03/22:45:05, loss=0.402676076409866, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=4.521628736333945, lr=0.04056619205135169
2023-11-15 18:54:20   INFO  epoch: 13/24, acc_iter=89981, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:27, time_cost(all): 1 day, 5:18:02/22:29:39, loss=0.40256513426169, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.7255177435310474, lr=0.040526100278611095
2023-11-15 18:55:19   INFO  epoch: 13/24, acc_iter=90031, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:03, time_cost(all): 1 day, 5:19:01/22:21:45, loss=0.402454192113513, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=1.5043399590969855, lr=0.04048600850587051
2023-11-15 18:56:18   INFO  epoch: 13/24, acc_iter=90081, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:40:48, time_cost(all): 1 day, 5:20:00/22:14:14, loss=0.402343249965336, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=2.3121615169334158, lr=0.04044591673312993
2023-11-15 18:57:17   INFO  epoch: 13/24, acc_iter=90131, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:15, time_cost(all): 1 day, 5:20:59/22:19:31, loss=0.402232307817159, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=2.742314409146197, lr=0.04040582496038934
2023-11-15 18:58:16   INFO  epoch: 13/24, acc_iter=90181, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:37, time_cost(all): 1 day, 5:21:58/22:40:48, loss=0.402121365668982, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=0.9470589374536487, lr=0.04036573318764875
2023-11-15 18:59:15   INFO  epoch: 13/24, acc_iter=90231, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:48, time_cost(all): 1 day, 5:22:57/23:08:35, loss=0.402010423520806, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=3.639297064095828, lr=0.04032564141490816
2023-11-15 19:00:14   INFO  epoch: 13/24, acc_iter=90281, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:34, time_cost(all): 1 day, 5:23:56/22:46:27, loss=0.401899481372629, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=1.8807669247023584, lr=0.04028554964216757
2023-11-15 19:01:13   INFO  epoch: 13/24, acc_iter=90331, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:15, time_cost(all): 1 day, 5:24:55/22:13:49, loss=0.401788539224452, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=2.0526078504762904, lr=0.04024545786942698
2023-11-15 19:02:12   INFO  epoch: 13/24, acc_iter=90381, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:01, time_cost(all): 1 day, 5:25:54/21:50:15, loss=0.401677597076275, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=0.8420968586853061, lr=0.040205366096686394
2023-11-15 19:03:11   INFO  epoch: 13/24, acc_iter=90431, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:52, time_cost(all): 1 day, 5:26:53/21:57:45, loss=0.401566654928099, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.87(1.03), norm=2.425313254129445, lr=0.0401652743239458
2023-11-15 19:04:09   INFO  epoch: 13/24, acc_iter=90481, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:07, time_cost(all): 1 day, 5:27:51/23:13:30, loss=0.401455712779922, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=3.808597468227237, lr=0.040125182551205216
2023-11-15 19:05:08   INFO  epoch: 13/24, acc_iter=90531, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:33, time_cost(all): 1 day, 5:28:50/21:51:51, loss=0.401344770631745, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=4.522136255269434, lr=0.04008509077846464
2023-11-15 19:06:07   INFO  epoch: 13/24, acc_iter=90581, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:06, time_cost(all): 1 day, 5:29:49/22:10:33, loss=0.401233828483568, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=3.008842177027708, lr=0.040044999005724044
2023-11-15 19:07:06   INFO  epoch: 13/24, acc_iter=90631, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:38, time_cost(all): 1 day, 5:30:48/21:56:42, loss=0.401122886335392, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=4.542448705462706, lr=0.04000490723298346
2023-11-15 19:08:05   INFO  epoch: 13/24, acc_iter=90681, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:18, time_cost(all): 1 day, 5:31:47/22:43:14, loss=0.401011944187215, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=0.9857202818887105, lr=0.039964815460242865
2023-11-15 19:09:04   INFO  epoch: 13/24, acc_iter=90731, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:23, time_cost(all): 1 day, 5:32:46/22:53:22, loss=0.400901002039038, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=1.5522770198121756, lr=0.03992472368750228
2023-11-15 19:10:03   INFO  epoch: 13/24, acc_iter=90781, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:29:05, time_cost(all): 1 day, 5:33:45/23:06:02, loss=0.400790059890861, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=3.1382467411632406, lr=0.03988463191476169
2023-11-15 19:11:02   INFO  epoch: 13/24, acc_iter=90831, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:22, time_cost(all): 1 day, 5:34:44/21:14:52, loss=0.400679117742685, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=4.8107155273450015, lr=0.0398445401420211
2023-11-15 19:12:01   INFO  epoch: 13/24, acc_iter=90881, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:42, time_cost(all): 1 day, 5:35:43/21:32:32, loss=0.400568175594508, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.273202773209733, lr=0.03980444836928051
2023-11-15 19:13:00   INFO  epoch: 13/24, acc_iter=90931, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:26:10, time_cost(all): 1 day, 5:36:42/22:38:46, loss=0.400457233446331, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=3.824879125298364, lr=0.03976435659653992
2023-11-15 19:13:59   INFO  epoch: 13/24, acc_iter=90981, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:03, time_cost(all): 1 day, 5:37:41/22:49:32, loss=0.400346291298155, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=4.5909811983497715, lr=0.03972426482379934
2023-11-15 19:14:58   INFO  epoch: 13/24, acc_iter=91031, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:57, time_cost(all): 1 day, 5:38:40/22:15:21, loss=0.400235349149978, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=4.0147465213763205, lr=0.03968417305105875
2023-11-15 19:15:57   INFO  epoch: 13/24, acc_iter=91081, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:26, time_cost(all): 1 day, 5:39:39/22:31:59, loss=0.400124407001801, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=2.595535396013202, lr=0.039644081278318165
2023-11-15 19:16:56   INFO  epoch: 13/24, acc_iter=91131, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:50, time_cost(all): 1 day, 5:40:38/21:09:18, loss=0.400013464853624, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=0.5352896783222614, lr=0.03960398950557757
2023-11-15 19:17:55   INFO  epoch: 13/24, acc_iter=91181, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:41, time_cost(all): 1 day, 5:41:37/22:10:33, loss=0.399902522705448, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.605300074994806, lr=0.039563897732836986
2023-11-15 19:18:53   INFO  epoch: 13/24, acc_iter=91231, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:33, time_cost(all): 1 day, 5:42:35/22:49:31, loss=0.399791580557271, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=2.8498981606918297, lr=0.03952380596009639
2023-11-15 19:19:52   INFO  epoch: 13/24, acc_iter=91281, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:38, time_cost(all): 1 day, 5:43:34/22:01:07, loss=0.399680638409094, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.479482228253498, lr=0.03948371418735581
2023-11-15 19:20:51   INFO  epoch: 13/24, acc_iter=91331, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:48, time_cost(all): 1 day, 5:44:33/21:07:21, loss=0.399569696260917, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=3.8398417158837757, lr=0.039443622414615215
2023-11-15 19:21:50   INFO  epoch: 13/24, acc_iter=91381, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:04, time_cost(all): 1 day, 5:45:32/22:08:59, loss=0.399458754112741, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.86(1.03), norm=4.456529480684166, lr=0.039403530641874636
2023-11-15 19:22:49   INFO  epoch: 13/24, acc_iter=91431, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:16:03, time_cost(all): 1 day, 5:46:31/22:38:53, loss=0.399347811964564, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=2.7800284825509105, lr=0.03936343886913405
2023-11-15 19:23:48   INFO  epoch: 13/24, acc_iter=91481, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:15:03, time_cost(all): 1 day, 5:47:30/22:21:45, loss=0.399236869816387, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=3.3278829821065092, lr=0.03932334709639346
2023-11-15 19:24:47   INFO  epoch: 13/24, acc_iter=91531, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:13, time_cost(all): 1 day, 5:48:29/22:05:56, loss=0.39912592766821, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=1.3973566979395295, lr=0.03928325532365287
2023-11-15 19:25:46   INFO  epoch: 13/24, acc_iter=91581, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:45, time_cost(all): 1 day, 5:49:28/22:58:50, loss=0.399014985520034, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=3.3104664188031867, lr=0.03924316355091228
2023-11-15 19:26:45   INFO  epoch: 13/24, acc_iter=91631, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:29, time_cost(all): 1 day, 5:50:27/22:39:51, loss=0.398904043371857, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=3.356343573539326, lr=0.03920307177817169
2023-11-15 19:27:44   INFO  epoch: 13/24, acc_iter=91681, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:23, time_cost(all): 1 day, 5:51:26/21:47:13, loss=0.39879310122368, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=0.9936184425293604, lr=0.0391629800054311
2023-11-15 19:28:43   INFO  epoch: 13/24, acc_iter=91731, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:41, time_cost(all): 1 day, 5:52:25/22:11:55, loss=0.398682159075503, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=3.6553720056496344, lr=0.039122888232690514
2023-11-15 19:29:42   INFO  epoch: 13/24, acc_iter=91781, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:24, time_cost(all): 1 day, 5:53:24/22:11:42, loss=0.398571216927326, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=0.8188905806328916, lr=0.03908279645994993
2023-11-15 19:30:41   INFO  epoch: 13/24, acc_iter=91831, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:32, time_cost(all): 1 day, 5:54:23/21:23:46, loss=0.39846027477915, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=3.712072562408035, lr=0.03904270468720934
2023-11-15 19:31:40   INFO  epoch: 13/24, acc_iter=91881, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:49, time_cost(all): 1 day, 5:55:22/22:11:38, loss=0.398349332630973, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=4.610703950502979, lr=0.03900261291446876
2023-11-15 19:32:38   INFO  epoch: 13/24, acc_iter=91931, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:24, time_cost(all): 1 day, 5:56:20/21:46:30, loss=0.398238390482796, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=3.07256207739728, lr=0.03896252114172817
2023-11-15 19:33:37   INFO  epoch: 13/24, acc_iter=91981, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:36, time_cost(all): 1 day, 5:57:19/20:43:13, loss=0.398127448334619, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=1.6038893095025368, lr=0.03892242936898758
2023-11-15 19:34:36   INFO  epoch: 13/24, acc_iter=92031, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:37, time_cost(all): 1 day, 5:58:18/22:15:04, loss=0.398016506186443, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=2.938235652836287, lr=0.03888233759624699
2023-11-15 19:35:35   INFO  epoch: 13/24, acc_iter=92081, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:46, time_cost(all): 1 day, 5:59:17/22:43:53, loss=0.397905564038266, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=0.5660070922484233, lr=0.0388422458235064
2023-11-15 19:36:34   INFO  epoch: 13/24, acc_iter=92131, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:46, time_cost(all): 1 day, 6:00:16/22:45:38, loss=0.397794621890089, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=4.469963696200747, lr=0.038802154050765814
2023-11-15 19:37:33   INFO  epoch: 13/24, acc_iter=92181, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 1 day, 6:01:15/20:47:21, loss=0.397683679741912, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.16(1.03), norm=2.684716178952331, lr=0.03876206227802522
2023-11-15 19:38:32   INFO  epoch: 14/24, acc_iter=92268, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:07:14, time_cost(all): 1 day, 6:02:14/22:12:51, loss=0.397490640404085, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.98(1.03), norm=1.284558778339237, lr=0.038692302593456594
2023-11-15 19:39:31   INFO  epoch: 14/24, acc_iter=92318, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:05:21, time_cost(all): 1 day, 6:03:13/21:27:06, loss=0.397379698255908, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.04(1.03), norm=3.131276010713469, lr=0.038652210820716015
2023-11-15 19:40:30   INFO  epoch: 14/24, acc_iter=92368, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:10:07, time_cost(all): 1 day, 6:04:12/20:52:33, loss=0.397268756107731, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=3.087030016831383, lr=0.03861211904797543
2023-11-15 19:41:29   INFO  epoch: 14/24, acc_iter=92418, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:09:43, time_cost(all): 1 day, 6:05:11/21:03:52, loss=0.397157813959555, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.88(1.03), norm=2.1568937759895848, lr=0.03857202727523484
2023-11-15 19:42:28   INFO  epoch: 14/24, acc_iter=92468, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:00:23, time_cost(all): 1 day, 6:06:10/20:45:37, loss=0.397046871811378, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=3.820536547846808, lr=0.03853193550249425
2023-11-15 19:43:27   INFO  epoch: 14/24, acc_iter=92518, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:03:48, time_cost(all): 1 day, 6:07:09/21:52:45, loss=0.396935929663201, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=4.021559050992741, lr=0.03849184372975366
2023-11-15 19:44:26   INFO  epoch: 14/24, acc_iter=92568, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/1:57:38, time_cost(all): 1 day, 6:08:08/22:17:29, loss=0.396824987515024, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=3.3864164675873196, lr=0.03845175195701307
2023-11-15 19:45:25   INFO  epoch: 14/24, acc_iter=92618, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:59:00, time_cost(all): 1 day, 6:09:07/20:44:44, loss=0.396714045366848, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.15(1.03), norm=0.8072765624785432, lr=0.038411660184272486
2023-11-15 19:46:23   INFO  epoch: 14/24, acc_iter=92668, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:56:03, time_cost(all): 1 day, 6:10:05/22:02:11, loss=0.396603103218671, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.382327413819719, lr=0.038371568411531894
2023-11-15 19:47:22   INFO  epoch: 14/24, acc_iter=92718, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:04:04, time_cost(all): 1 day, 6:11:04/22:04:57, loss=0.396492161070494, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.1(1.03), norm=0.8478364566060899, lr=0.03833147663879131
2023-11-15 19:48:21   INFO  epoch: 14/24, acc_iter=92768, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:53:26, time_cost(all): 1 day, 6:12:03/20:48:22, loss=0.396381218922317, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.84(1.03), norm=4.662173041808685, lr=0.03829138486605073
2023-11-15 19:49:20   INFO  epoch: 14/24, acc_iter=92818, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:52:40, time_cost(all): 1 day, 6:13:02/21:51:23, loss=0.396270276774141, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=0.8394846413518331, lr=0.038251293093310136
2023-11-15 19:50:19   INFO  epoch: 14/24, acc_iter=92868, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/2:01:25, time_cost(all): 1 day, 6:14:01/21:45:27, loss=0.396159334625964, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=0.9009689795599575, lr=0.03821120132056955
2023-11-15 19:51:18   INFO  epoch: 14/24, acc_iter=92918, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:59:41, time_cost(all): 1 day, 6:15:00/20:45:46, loss=0.396048392477787, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=3.5539533839365145, lr=0.03817110954782896
2023-11-15 19:52:17   INFO  epoch: 14/24, acc_iter=92968, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:59:52, time_cost(all): 1 day, 6:15:59/21:04:45, loss=0.39593745032961, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=1.611883013777695, lr=0.03813101777508837
2023-11-15 19:53:16   INFO  epoch: 14/24, acc_iter=93018, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:55:01, time_cost(all): 1 day, 6:16:58/20:26:02, loss=0.395826508181434, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=1.2999185176826602, lr=0.03809092600234778
2023-11-15 19:54:15   INFO  epoch: 14/24, acc_iter=93068, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:47:42, time_cost(all): 1 day, 6:17:57/20:41:38, loss=0.395715566033257, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=1.8975886582243957, lr=0.03805083422960719
2023-11-15 19:55:14   INFO  epoch: 14/24, acc_iter=93118, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:56:51, time_cost(all): 1 day, 6:18:56/22:03:45, loss=0.39560462388508, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=4.38344190860712, lr=0.0380107424568666
2023-11-15 19:56:13   INFO  epoch: 14/24, acc_iter=93168, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:45:17, time_cost(all): 1 day, 6:19:55/22:20:46, loss=0.395493681736903, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=4.9136639156839665, lr=0.037970650684126014
2023-11-15 19:57:12   INFO  epoch: 14/24, acc_iter=93218, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:49:58, time_cost(all): 1 day, 6:20:54/20:40:08, loss=0.395382739588727, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=3.8869774931811127, lr=0.037930558911385436
2023-11-15 19:58:11   INFO  epoch: 14/24, acc_iter=93268, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:56, time_cost(all): 1 day, 6:21:53/21:39:17, loss=0.39527179744055, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=0.792859801372668, lr=0.03789046713864484
2023-11-15 19:59:10   INFO  epoch: 14/24, acc_iter=93318, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:49:12, time_cost(all): 1 day, 6:22:52/21:00:50, loss=0.395160855292373, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.905059885845622, lr=0.03785037536590426
2023-11-15 20:00:08   INFO  epoch: 14/24, acc_iter=93368, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:46:56, time_cost(all): 1 day, 6:23:50/20:20:41, loss=0.395049913144196, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=0.9619858652543916, lr=0.037810283593163664
2023-11-15 20:01:07   INFO  epoch: 14/24, acc_iter=93418, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:44:04, time_cost(all): 1 day, 6:24:49/20:28:57, loss=0.39493897099602, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=2.59656914589186, lr=0.03777019182042308
2023-11-15 20:02:06   INFO  epoch: 14/24, acc_iter=93468, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:43:50, time_cost(all): 1 day, 6:25:48/21:10:23, loss=0.394828028847843, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=3.4265758473156467, lr=0.037730100047682485
2023-11-15 20:03:05   INFO  epoch: 14/24, acc_iter=93518, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:40:55, time_cost(all): 1 day, 6:26:47/21:00:47, loss=0.394717086699666, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.9(1.03), norm=2.1303073561494665, lr=0.0376900082749419
2023-11-15 20:04:04   INFO  epoch: 14/24, acc_iter=93568, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:43:26, time_cost(all): 1 day, 6:27:46/20:22:58, loss=0.394606144551489, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=3.6441869687266104, lr=0.03764991650220131
2023-11-15 20:05:03   INFO  epoch: 14/24, acc_iter=93618, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:43:58, time_cost(all): 1 day, 6:28:45/21:39:16, loss=0.394495202403313, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=4.8280943506502325, lr=0.03760982472946072
2023-11-15 20:06:02   INFO  epoch: 14/24, acc_iter=93668, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:40:05, time_cost(all): 1 day, 6:29:44/20:18:39, loss=0.394384260255136, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.8765680778348517, lr=0.03756973295672014
2023-11-15 20:07:01   INFO  epoch: 14/24, acc_iter=93718, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:36:45, time_cost(all): 1 day, 6:30:43/21:26:18, loss=0.394273318106959, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=3.1008794346325796, lr=0.03752964118397955
2023-11-15 20:08:00   INFO  epoch: 14/24, acc_iter=93768, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:41:09, time_cost(all): 1 day, 6:31:42/20:32:57, loss=0.394162375958782, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=4.113939646474386, lr=0.03748954941123896
2023-11-15 20:08:59   INFO  epoch: 14/24, acc_iter=93818, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:40:45, time_cost(all): 1 day, 6:32:41/20:57:52, loss=0.394051433810606, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.3256955236802024, lr=0.03744945763849837
2023-11-15 20:09:58   INFO  epoch: 14/24, acc_iter=93868, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:37:26, time_cost(all): 1 day, 6:33:40/21:44:02, loss=0.393940491662429, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=2.6643694648713847, lr=0.037409365865757785
2023-11-15 20:10:57   INFO  epoch: 14/24, acc_iter=93918, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:37:10, time_cost(all): 1 day, 6:34:39/21:06:10, loss=0.393829549514252, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=3.033404598233071, lr=0.0373692740930172
2023-11-15 20:11:56   INFO  epoch: 14/24, acc_iter=93968, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:33:59, time_cost(all): 1 day, 6:35:38/21:27:24, loss=0.393718607366075, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=3.329170510241694, lr=0.0373291823202766
2023-11-15 20:12:55   INFO  epoch: 14/24, acc_iter=94018, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:30:02, time_cost(all): 1 day, 6:36:37/21:20:23, loss=0.393607665217899, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.8631129325382085, lr=0.037289090547536013
2023-11-15 20:13:53   INFO  epoch: 14/24, acc_iter=94068, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:28:31, time_cost(all): 1 day, 6:37:35/21:50:48, loss=0.393496723069722, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=2.0660176149433926, lr=0.03724899877479543
2023-11-15 20:14:52   INFO  epoch: 14/24, acc_iter=94118, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:32:42, time_cost(all): 1 day, 6:38:34/20:52:54, loss=0.393385780921545, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=0.9840678214094474, lr=0.03720890700205484
2023-11-15 20:15:51   INFO  epoch: 14/24, acc_iter=94168, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:27:12, time_cost(all): 1 day, 6:39:33/21:27:46, loss=0.393274838773368, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=2.5388644446637936, lr=0.037168815229314256
2023-11-15 20:16:50   INFO  epoch: 14/24, acc_iter=94218, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:28:18, time_cost(all): 1 day, 6:40:32/21:16:50, loss=0.393163896625192, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=2.2097858175757956, lr=0.03712872345657367
2023-11-15 20:17:49   INFO  epoch: 14/24, acc_iter=94268, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:25:50, time_cost(all): 1 day, 6:41:31/21:47:17, loss=0.393052954477015, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.506083770998863, lr=0.037088631683833084
2023-11-15 20:18:48   INFO  epoch: 14/24, acc_iter=94318, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:25:56, time_cost(all): 1 day, 6:42:30/21:09:12, loss=0.392942012328838, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=3.2500538070910276, lr=0.037048539911092485
2023-11-15 20:19:47   INFO  epoch: 14/24, acc_iter=94368, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:23:24, time_cost(all): 1 day, 6:43:29/20:26:55, loss=0.392831070180661, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.858833461471407, lr=0.0370084481383519
2023-11-15 20:20:46   INFO  epoch: 14/24, acc_iter=94418, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:55, time_cost(all): 1 day, 6:44:28/21:59:22, loss=0.392720128032485, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=2.3652822489189624, lr=0.03696835636561131
2023-11-15 20:21:45   INFO  epoch: 14/24, acc_iter=94468, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:23:52, time_cost(all): 1 day, 6:45:27/21:44:34, loss=0.392609185884308, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=3.923252643373311, lr=0.03692826459287073
2023-11-15 20:22:44   INFO  epoch: 14/24, acc_iter=94518, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:28:18, time_cost(all): 1 day, 6:46:26/21:44:56, loss=0.392498243736131, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=4.6190344048679455, lr=0.03688817282013014
2023-11-15 20:23:43   INFO  epoch: 14/24, acc_iter=94568, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:23:07, time_cost(all): 1 day, 6:47:25/20:18:50, loss=0.392387301587954, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=2.589197293792947, lr=0.036848081047389555
2023-11-15 20:24:42   INFO  epoch: 14/24, acc_iter=94618, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:25:13, time_cost(all): 1 day, 6:48:24/21:29:43, loss=0.392276359439778, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=1.8948983924185272, lr=0.03680798927464897
2023-11-15 20:25:41   INFO  epoch: 14/24, acc_iter=94668, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:17:16, time_cost(all): 1 day, 6:49:23/21:55:32, loss=0.392165417291601, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.16(1.03), norm=0.8431191660375461, lr=0.036767897501908384
2023-11-15 20:26:40   INFO  epoch: 14/24, acc_iter=94718, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:17:45, time_cost(all): 1 day, 6:50:22/20:07:58, loss=0.392054475143424, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=2.004316074587575, lr=0.036727805729167784
2023-11-15 20:27:38   INFO  epoch: 14/24, acc_iter=94768, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:55, time_cost(all): 1 day, 6:51:20/20:01:19, loss=0.391943532995247, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=3.2499852027162524, lr=0.0366877139564272
2023-11-15 20:28:37   INFO  epoch: 14/24, acc_iter=94818, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:17:29, time_cost(all): 1 day, 6:52:19/21:45:05, loss=0.391832590847071, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=2.277484916949373, lr=0.03664762218368661
2023-11-15 20:29:36   INFO  epoch: 14/24, acc_iter=94868, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:15:40, time_cost(all): 1 day, 6:53:18/21:14:34, loss=0.391721648698894, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=3.3152062294363596, lr=0.036607530410946026
2023-11-15 20:30:35   INFO  epoch: 14/24, acc_iter=94918, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:19:02, time_cost(all): 1 day, 6:54:17/21:39:02, loss=0.391610706550717, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=4.831141590184262, lr=0.03656743863820543
2023-11-15 20:31:34   INFO  epoch: 14/24, acc_iter=94968, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:51, time_cost(all): 1 day, 6:55:16/20:16:24, loss=0.39149976440254, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=4.980979365304809, lr=0.03652734686546484
2023-11-15 20:32:33   INFO  epoch: 14/24, acc_iter=95018, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:13:08, time_cost(all): 1 day, 6:56:15/21:23:01, loss=0.391388822254364, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=0.8881653705996729, lr=0.03648725509272427
2023-11-15 20:33:32   INFO  epoch: 14/24, acc_iter=95068, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:16:25, time_cost(all): 1 day, 6:57:14/21:34:12, loss=0.391277880106187, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.14(1.03), norm=2.860666190214755, lr=0.03644716331998367
2023-11-15 20:34:31   INFO  epoch: 14/24, acc_iter=95118, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:11:15, time_cost(all): 1 day, 6:58:13/20:34:43, loss=0.39116693795801, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=3.842963649007377, lr=0.03640707154724308
2023-11-15 20:35:30   INFO  epoch: 14/24, acc_iter=95168, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:09:00, time_cost(all): 1 day, 6:59:12/21:44:35, loss=0.391055995809833, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=4.151899564856726, lr=0.0363669797745025
2023-11-15 20:36:29   INFO  epoch: 14/24, acc_iter=95218, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:08:54, time_cost(all): 1 day, 7:00:11/20:05:28, loss=0.390945053661657, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=1.4011235121933028, lr=0.03632688800176191
2023-11-15 20:37:28   INFO  epoch: 14/24, acc_iter=95268, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:11:18, time_cost(all): 1 day, 7:01:10/21:00:48, loss=0.39083411151348, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=1.302765881398551, lr=0.03628679622902131
2023-11-15 20:38:27   INFO  epoch: 14/24, acc_iter=95318, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:09:44, time_cost(all): 1 day, 7:02:09/19:52:42, loss=0.390723169365303, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=2.1969897622902685, lr=0.036246704456280726
2023-11-15 20:39:26   INFO  epoch: 14/24, acc_iter=95368, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:10:32, time_cost(all): 1 day, 7:03:08/20:55:26, loss=0.390612227217126, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=4.3105569400921215, lr=0.03620661268354014
2023-11-15 20:40:25   INFO  epoch: 14/24, acc_iter=95418, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:06:40, time_cost(all): 1 day, 7:04:07/19:54:13, loss=0.39050128506895, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=3.764183270606416, lr=0.036166520910799554
2023-11-15 20:41:23   INFO  epoch: 14/24, acc_iter=95468, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:21, time_cost(all): 1 day, 7:05:05/19:50:52, loss=0.390390342920773, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=0.6344040837024983, lr=0.03612642913805897
2023-11-15 20:42:22   INFO  epoch: 14/24, acc_iter=95518, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:45, time_cost(all): 1 day, 7:06:04/21:06:57, loss=0.390279400772596, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=2.4073889493827663, lr=0.03608633736531838
2023-11-15 20:43:21   INFO  epoch: 14/24, acc_iter=95568, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:03:33, time_cost(all): 1 day, 7:07:03/21:10:13, loss=0.390168458624419, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=2.2300185472159404, lr=0.0360462455925778
2023-11-15 20:44:20   INFO  epoch: 14/24, acc_iter=95618, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:54, time_cost(all): 1 day, 7:08:02/21:22:53, loss=0.390057516476243, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.218143840875201, lr=0.0360061538198372
2023-11-15 20:45:19   INFO  epoch: 14/24, acc_iter=95668, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:00:11, time_cost(all): 1 day, 7:09:01/20:11:04, loss=0.389946574328066, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.3749037508812223, lr=0.03596606204709661
2023-11-15 20:46:18   INFO  epoch: 14/24, acc_iter=95718, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:00:47, time_cost(all): 1 day, 7:10:00/21:30:39, loss=0.389835632179889, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=2.808697998921863, lr=0.035925970274356026
2023-11-15 20:47:17   INFO  epoch: 14/24, acc_iter=95768, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:02:12, time_cost(all): 1 day, 7:10:59/20:29:54, loss=0.389724690031712, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=1.3502994573603353, lr=0.03588587850161544
2023-11-15 20:48:16   INFO  epoch: 14/24, acc_iter=95818, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:59:05, time_cost(all): 1 day, 7:11:58/19:56:09, loss=0.389613747883536, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=0.5736266226923892, lr=0.03584578672887484
2023-11-15 20:49:15   INFO  epoch: 14/24, acc_iter=95868, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:59:53, time_cost(all): 1 day, 7:12:57/21:00:58, loss=0.389502805735359, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.119475737309981, lr=0.035805694956134254
2023-11-15 20:50:14   INFO  epoch: 14/24, acc_iter=95918, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:25, time_cost(all): 1 day, 7:13:56/21:12:34, loss=0.389391863587182, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=1.3281225742619882, lr=0.03576560318339368
2023-11-15 20:51:13   INFO  epoch: 14/24, acc_iter=95968, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:53:20, time_cost(all): 1 day, 7:14:55/20:06:00, loss=0.389280921439005, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=1.6586560140099635, lr=0.03572551141065308
2023-11-15 20:52:12   INFO  epoch: 14/24, acc_iter=96018, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:29, time_cost(all): 1 day, 7:15:54/21:06:14, loss=0.389169979290829, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=3.4507272757653142, lr=0.0356854196379125
2023-11-15 20:53:11   INFO  epoch: 14/24, acc_iter=96068, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:51:19, time_cost(all): 1 day, 7:16:53/21:21:19, loss=0.389059037142652, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=4.860600913458022, lr=0.03564532786517191
2023-11-15 20:54:10   INFO  epoch: 14/24, acc_iter=96118, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:24, time_cost(all): 1 day, 7:17:52/21:06:59, loss=0.388948094994475, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=1.7731007982539908, lr=0.035605236092431325
2023-11-15 20:55:08   INFO  epoch: 14/24, acc_iter=96168, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:51:25, time_cost(all): 1 day, 7:18:50/20:34:56, loss=0.388837152846298, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=3.1738759953989826, lr=0.035565144319690725
2023-11-15 20:56:07   INFO  epoch: 14/24, acc_iter=96218, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:49, time_cost(all): 1 day, 7:19:49/19:45:46, loss=0.388726210698122, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=2.1098844520564963, lr=0.03552505254695014
2023-11-15 20:57:06   INFO  epoch: 14/24, acc_iter=96268, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:57, time_cost(all): 1 day, 7:20:48/19:57:26, loss=0.388615268549945, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.19(1.03), norm=3.36478430201933, lr=0.035484960774209554
2023-11-15 20:58:05   INFO  epoch: 14/24, acc_iter=96318, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:29, time_cost(all): 1 day, 7:21:47/21:16:32, loss=0.388504326401768, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=2.4521605965529103, lr=0.03544486900146897
2023-11-15 20:59:04   INFO  epoch: 14/24, acc_iter=96368, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:49:27, time_cost(all): 1 day, 7:22:46/21:14:59, loss=0.388393384253591, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=4.417139853939732, lr=0.03540477722872838
2023-11-15 21:00:03   INFO  epoch: 14/24, acc_iter=96418, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:41, time_cost(all): 1 day, 7:23:45/19:42:59, loss=0.388282442105415, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.522849921316797, lr=0.035364685455987796
2023-11-15 21:01:02   INFO  epoch: 14/24, acc_iter=96468, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:06, time_cost(all): 1 day, 7:24:44/20:09:11, loss=0.388171499957238, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=2.5527452825138095, lr=0.03532459368324721
2023-11-15 21:02:01   INFO  epoch: 14/24, acc_iter=96518, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:39, time_cost(all): 1 day, 7:25:43/20:01:05, loss=0.388060557809061, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.04(1.03), norm=2.9698721463596165, lr=0.03528450191050661
2023-11-15 21:03:00   INFO  epoch: 14/24, acc_iter=96568, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:21, time_cost(all): 1 day, 7:26:42/19:25:21, loss=0.387949615660884, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=2.5670821866793823, lr=0.035244410137766025
2023-11-15 21:03:59   INFO  epoch: 14/24, acc_iter=96618, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:26, time_cost(all): 1 day, 7:27:41/20:37:11, loss=0.387838673512708, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=3.918700562195005, lr=0.03520431836502544
2023-11-15 21:04:58   INFO  epoch: 14/24, acc_iter=96668, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:43:05, time_cost(all): 1 day, 7:28:40/19:42:04, loss=0.387727731364531, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=0.7088170210366076, lr=0.03516422659228485
2023-11-15 21:05:57   INFO  epoch: 14/24, acc_iter=96718, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:40:14, time_cost(all): 1 day, 7:29:39/20:02:06, loss=0.387616789216354, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=2.784769996525926, lr=0.03512413481954427
2023-11-15 21:06:56   INFO  epoch: 14/24, acc_iter=96768, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:41:03, time_cost(all): 1 day, 7:30:38/20:20:58, loss=0.387505847068177, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=0.6736614055757494, lr=0.03508404304680367
2023-11-15 21:07:55   INFO  epoch: 14/24, acc_iter=96818, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:58, time_cost(all): 1 day, 7:31:37/19:25:36, loss=0.387394904920001, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=3.449126639138629, lr=0.035043951274063095
2023-11-15 21:08:53   INFO  epoch: 14/24, acc_iter=96868, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:10, time_cost(all): 1 day, 7:32:35/19:48:38, loss=0.387283962771824, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=1.1076496972544139, lr=0.03500385950132251
2023-11-15 21:09:52   INFO  epoch: 14/24, acc_iter=96918, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:36:49, time_cost(all): 1 day, 7:33:34/21:02:15, loss=0.387173020623647, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.459249951623736, lr=0.03496376772858191
2023-11-15 21:10:51   INFO  epoch: 14/24, acc_iter=96968, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:20, time_cost(all): 1 day, 7:34:33/19:23:10, loss=0.38706207847547, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.589634866229585, lr=0.034923675955841324
2023-11-15 21:11:50   INFO  epoch: 14/24, acc_iter=97018, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:39, time_cost(all): 1 day, 7:35:32/20:21:44, loss=0.386951136327294, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=0.6004505445781996, lr=0.03488358418310074
2023-11-15 21:12:49   INFO  epoch: 14/24, acc_iter=97068, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:42, time_cost(all): 1 day, 7:36:31/20:42:55, loss=0.386840194179117, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=0.7182419093507499, lr=0.03484349241036015
2023-11-15 21:13:48   INFO  epoch: 14/24, acc_iter=97118, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:33, time_cost(all): 1 day, 7:37:30/19:19:19, loss=0.38672925203094, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.9(1.03), norm=3.8426680224307983, lr=0.03480340063761955
2023-11-15 21:14:47   INFO  epoch: 14/24, acc_iter=97168, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:04, time_cost(all): 1 day, 7:38:29/20:55:27, loss=0.386618309882763, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.5611259285160093, lr=0.03476330886487897
2023-11-15 21:15:46   INFO  epoch: 14/24, acc_iter=97218, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:29:56, time_cost(all): 1 day, 7:39:28/19:13:07, loss=0.386507367734587, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=0.9968327570446704, lr=0.034723217092138395
2023-11-15 21:16:45   INFO  epoch: 14/24, acc_iter=97268, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:29:47, time_cost(all): 1 day, 7:40:27/19:19:47, loss=0.38639642558641, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.0618554139538423, lr=0.034683125319397795
2023-11-15 21:17:44   INFO  epoch: 14/24, acc_iter=97318, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:34, time_cost(all): 1 day, 7:41:26/19:31:03, loss=0.386285483438233, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=1.5762674556813732, lr=0.03464303354665721
2023-11-15 21:18:43   INFO  epoch: 14/24, acc_iter=97368, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:09, time_cost(all): 1 day, 7:42:25/20:13:23, loss=0.386174541290056, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=1.8149801787208097, lr=0.03460294177391662
2023-11-15 21:19:42   INFO  epoch: 14/24, acc_iter=97418, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:40, time_cost(all): 1 day, 7:43:24/19:29:16, loss=0.38606359914188, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.6372257126066625, lr=0.03456285000117604
2023-11-15 21:20:41   INFO  epoch: 14/24, acc_iter=97468, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:14, time_cost(all): 1 day, 7:44:23/20:43:30, loss=0.385952656993703, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=2.97867421140925, lr=0.03452275822843544
2023-11-15 21:21:40   INFO  epoch: 14/24, acc_iter=97518, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:59, time_cost(all): 1 day, 7:45:22/20:53:47, loss=0.385841714845526, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=1.340600594087557, lr=0.03448266645569485
2023-11-15 21:22:38   INFO  epoch: 14/24, acc_iter=97568, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:49, time_cost(all): 1 day, 7:46:20/20:35:20, loss=0.385730772697349, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=1.830130835326102, lr=0.034442574682954266
2023-11-15 21:23:37   INFO  epoch: 14/24, acc_iter=97618, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:24:04, time_cost(all): 1 day, 7:47:19/20:26:49, loss=0.385619830549173, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=0.5734432699092099, lr=0.03440248291021368
2023-11-15 21:24:36   INFO  epoch: 14/24, acc_iter=97668, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:21:25, time_cost(all): 1 day, 7:48:18/20:16:02, loss=0.385508888400996, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.680751572313789, lr=0.034362391137473094
2023-11-15 21:25:35   INFO  epoch: 14/24, acc_iter=97718, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:55, time_cost(all): 1 day, 7:49:17/19:25:21, loss=0.385397946252819, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=2.835511265237087, lr=0.03432229936473251
2023-11-15 21:26:34   INFO  epoch: 14/24, acc_iter=97768, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:59, time_cost(all): 1 day, 7:50:16/19:15:06, loss=0.385287004104642, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=4.940770236246198, lr=0.03428220759199192
2023-11-15 21:27:33   INFO  epoch: 14/24, acc_iter=97818, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:26, time_cost(all): 1 day, 7:51:15/20:09:00, loss=0.385176061956465, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.499672385079605, lr=0.03424211581925132
2023-11-15 21:28:32   INFO  epoch: 14/24, acc_iter=97868, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:19, time_cost(all): 1 day, 7:52:14/20:06:42, loss=0.385065119808289, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=4.958715573862662, lr=0.03420202404651074
2023-11-15 21:29:31   INFO  epoch: 14/24, acc_iter=97918, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:01, time_cost(all): 1 day, 7:53:13/19:35:55, loss=0.384954177660112, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=3.883738218324725, lr=0.03416193227377015
2023-11-15 21:30:30   INFO  epoch: 14/24, acc_iter=97968, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:43, time_cost(all): 1 day, 7:54:12/20:43:31, loss=0.384843235511935, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.2413094170778913, lr=0.034121840501029566
2023-11-15 21:31:29   INFO  epoch: 14/24, acc_iter=98018, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:50, time_cost(all): 1 day, 7:55:11/19:08:31, loss=0.384732293363758, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.1924247286648146, lr=0.034081748728288966
2023-11-15 21:32:28   INFO  epoch: 14/24, acc_iter=98068, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:37, time_cost(all): 1 day, 7:56:10/20:48:09, loss=0.384621351215582, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.382484940499925, lr=0.03404165695554838
2023-11-15 21:33:27   INFO  epoch: 14/24, acc_iter=98118, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:35, time_cost(all): 1 day, 7:57:09/20:38:23, loss=0.384510409067405, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=4.219875464101033, lr=0.03400156518280781
2023-11-15 21:34:26   INFO  epoch: 14/24, acc_iter=98168, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:24, time_cost(all): 1 day, 7:58:08/19:33:35, loss=0.384399466919228, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.0204844948366258, lr=0.03396147341006721
2023-11-15 21:35:25   INFO  epoch: 14/24, acc_iter=98218, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:56, time_cost(all): 1 day, 7:59:07/20:07:04, loss=0.384288524771051, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=1.1907820569420082, lr=0.03392138163732662
2023-11-15 21:36:23   INFO  epoch: 14/24, acc_iter=98268, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:11:04, time_cost(all): 1 day, 8:00:05/20:22:12, loss=0.384177582622875, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=3.6907116691804944, lr=0.03388128986458604
2023-11-15 21:37:22   INFO  epoch: 14/24, acc_iter=98318, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:15, time_cost(all): 1 day, 8:01:04/20:41:15, loss=0.384066640474698, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=4.315628207633363, lr=0.03384119809184545
2023-11-15 21:38:21   INFO  epoch: 14/24, acc_iter=98368, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:53, time_cost(all): 1 day, 8:02:03/19:05:52, loss=0.383955698326521, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.2867343786656686, lr=0.03380110631910485
2023-11-15 21:39:20   INFO  epoch: 14/24, acc_iter=98418, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:32, time_cost(all): 1 day, 8:03:02/19:55:36, loss=0.383844756178344, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=0.504047807229987, lr=0.033761014546364265
2023-11-15 21:40:19   INFO  epoch: 14/24, acc_iter=98468, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:22, time_cost(all): 1 day, 8:04:01/19:37:41, loss=0.383733814030168, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=4.246945820087362, lr=0.03372092277362368
2023-11-15 21:41:18   INFO  epoch: 14/24, acc_iter=98518, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:54, time_cost(all): 1 day, 8:05:00/19:20:17, loss=0.383622871881991, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=0.625039035777484, lr=0.033680831000883094
2023-11-15 21:42:17   INFO  epoch: 14/24, acc_iter=98568, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:51, time_cost(all): 1 day, 8:05:59/20:22:25, loss=0.383511929733814, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=4.757051358151865, lr=0.03364073922814251
2023-11-15 21:43:16   INFO  epoch: 14/24, acc_iter=98618, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:37, time_cost(all): 1 day, 8:06:58/19:19:39, loss=0.383400987585637, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=0.9841577091252081, lr=0.03360064745540192
2023-11-15 21:44:15   INFO  epoch: 14/24, acc_iter=98668, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:37, time_cost(all): 1 day, 8:07:57/19:24:51, loss=0.383290045437461, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.14(1.03), norm=2.3293695912673495, lr=0.033560555682661336
2023-11-15 21:45:14   INFO  epoch: 14/24, acc_iter=98718, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:40, time_cost(all): 1 day, 8:08:56/20:10:37, loss=0.383179103289284, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=3.215603800301159, lr=0.03352046390992075
2023-11-15 21:46:13   INFO  epoch: 14/24, acc_iter=98768, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:44, time_cost(all): 1 day, 8:09:55/18:54:46, loss=0.383068161141107, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=3.311590900633538, lr=0.03348037213718015
2023-11-15 21:47:12   INFO  epoch: 15/24, acc_iter=98855, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:04:41, time_cost(all): 1 day, 8:10:54/20:22:21, loss=0.38287512180328, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=3.382065108274595, lr=0.033410612452611524
2023-11-15 21:48:11   INFO  epoch: 15/24, acc_iter=98905, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:09:12, time_cost(all): 1 day, 8:11:53/19:40:30, loss=0.382764179655103, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=4.385953507127097, lr=0.03337052067987094
2023-11-15 21:49:10   INFO  epoch: 15/24, acc_iter=98955, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:04:43, time_cost(all): 1 day, 8:12:52/18:48:32, loss=0.382653237506926, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=4.316768996487592, lr=0.03333042890713035
2023-11-15 21:50:08   INFO  epoch: 15/24, acc_iter=99005, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:09:34, time_cost(all): 1 day, 8:13:50/19:34:52, loss=0.382542295358749, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.98(1.03), norm=2.049119493906166, lr=0.033290337134389766
2023-11-15 21:51:07   INFO  epoch: 15/24, acc_iter=99055, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:00:40, time_cost(all): 1 day, 8:14:49/20:03:13, loss=0.382431353210573, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=0.698954000010114, lr=0.03325024536164918
2023-11-15 21:52:06   INFO  epoch: 15/24, acc_iter=99105, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:00:24, time_cost(all): 1 day, 8:15:48/19:03:16, loss=0.382320411062396, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=1.7148497614789786, lr=0.033210153588908595
2023-11-15 21:53:05   INFO  epoch: 15/24, acc_iter=99155, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:01:38, time_cost(all): 1 day, 8:16:47/18:58:38, loss=0.382209468914219, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=4.599961205823621, lr=0.03317006181616801
2023-11-15 21:54:04   INFO  epoch: 15/24, acc_iter=99205, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:00:58, time_cost(all): 1 day, 8:17:46/19:31:43, loss=0.382098526766042, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=2.2166262120712683, lr=0.03312997004342741
2023-11-15 21:55:03   INFO  epoch: 15/24, acc_iter=99255, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:04:29, time_cost(all): 1 day, 8:18:45/18:33:31, loss=0.381987584617866, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=2.054406916583529, lr=0.03308987827068682
2023-11-15 21:56:02   INFO  epoch: 15/24, acc_iter=99305, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:05:01, time_cost(all): 1 day, 8:19:44/19:35:21, loss=0.381876642469689, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=0.6861514770966659, lr=0.03304978649794624
2023-11-15 21:57:01   INFO  epoch: 15/24, acc_iter=99355, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:57:07, time_cost(all): 1 day, 8:20:43/18:58:52, loss=0.381765700321512, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.1120981184406613, lr=0.03300969472520565
2023-11-15 21:58:00   INFO  epoch: 15/24, acc_iter=99405, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:52:08, time_cost(all): 1 day, 8:21:42/19:47:36, loss=0.381654758173335, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=1.6973816840183218, lr=0.032969602952465066
2023-11-15 21:58:59   INFO  epoch: 15/24, acc_iter=99455, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:56:44, time_cost(all): 1 day, 8:22:41/18:46:16, loss=0.381543816025159, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=3.4679682706064603, lr=0.032929511179724466
2023-11-15 21:59:58   INFO  epoch: 15/24, acc_iter=99505, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:59:44, time_cost(all): 1 day, 8:23:40/18:57:43, loss=0.381432873876982, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=3.837928329700094, lr=0.032889419406983894
2023-11-15 22:00:57   INFO  epoch: 15/24, acc_iter=99555, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:52:22, time_cost(all): 1 day, 8:24:39/19:12:41, loss=0.381321931728805, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=3.6177633967806657, lr=0.03284932763424331
2023-11-15 22:01:56   INFO  epoch: 15/24, acc_iter=99605, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:55:19, time_cost(all): 1 day, 8:25:38/19:57:57, loss=0.381210989580628, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.1779213662127532, lr=0.03280923586150271
2023-11-15 22:02:55   INFO  epoch: 15/24, acc_iter=99655, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:48:54, time_cost(all): 1 day, 8:26:37/18:26:18, loss=0.381100047432452, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=3.9861421393207848, lr=0.03276914408876212
2023-11-15 22:03:53   INFO  epoch: 15/24, acc_iter=99705, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:56:11, time_cost(all): 1 day, 8:27:35/18:48:29, loss=0.380989105284275, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=3.148122344527774, lr=0.03272905231602154
2023-11-15 22:04:52   INFO  epoch: 15/24, acc_iter=99755, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:45:19, time_cost(all): 1 day, 8:28:34/19:29:19, loss=0.380878163136098, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=3.6284280626731684, lr=0.03268896054328095
2023-11-15 22:05:51   INFO  epoch: 15/24, acc_iter=99805, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:45:41, time_cost(all): 1 day, 8:29:33/19:15:35, loss=0.380767220987921, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=0.9356086265020311, lr=0.03264886877054035
2023-11-15 22:06:50   INFO  epoch: 15/24, acc_iter=99855, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:03, time_cost(all): 1 day, 8:30:32/19:11:23, loss=0.380656278839745, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=4.640908663854643, lr=0.032608776997799765
2023-11-15 22:07:49   INFO  epoch: 15/24, acc_iter=99905, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:51:50, time_cost(all): 1 day, 8:31:31/19:46:12, loss=0.380545336691568, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.7046636595838882, lr=0.03256868522505918
2023-11-15 22:08:48   INFO  epoch: 15/24, acc_iter=99955, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:45:11, time_cost(all): 1 day, 8:32:30/19:38:13, loss=0.380434394543391, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=2.6808753663549028, lr=0.032528593452318594
2023-11-15 22:09:47   INFO  epoch: 15/24, acc_iter=100005, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:41:15, time_cost(all): 1 day, 8:33:29/18:32:33, loss=0.380323452395214, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.421189016363031, lr=0.03248850167957801
2023-11-15 22:10:46   INFO  epoch: 15/24, acc_iter=100055, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:47:50, time_cost(all): 1 day, 8:34:28/19:46:57, loss=0.380212510247038, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=4.5272126892651245, lr=0.03244840990683742
2023-11-15 22:11:45   INFO  epoch: 15/24, acc_iter=100105, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:40:37, time_cost(all): 1 day, 8:35:27/19:46:25, loss=0.380101568098861, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.271725652554183, lr=0.032408318134096836
2023-11-15 22:12:44   INFO  epoch: 15/24, acc_iter=100155, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:41:42, time_cost(all): 1 day, 8:36:26/19:05:22, loss=0.379990625950684, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=3.084398887090848, lr=0.03236822636135624
2023-11-15 22:13:43   INFO  epoch: 15/24, acc_iter=100205, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:46:34, time_cost(all): 1 day, 8:37:25/19:02:45, loss=0.379879683802507, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=3.642296036417379, lr=0.03232813458861565
2023-11-15 22:14:42   INFO  epoch: 15/24, acc_iter=100255, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:41:00, time_cost(all): 1 day, 8:38:24/19:31:05, loss=0.379768741654331, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=2.4147136137698304, lr=0.032288042815875065
2023-11-15 22:15:41   INFO  epoch: 15/24, acc_iter=100305, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:41:09, time_cost(all): 1 day, 8:39:23/18:23:21, loss=0.379657799506154, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=3.004049737947909, lr=0.03224795104313448
2023-11-15 22:16:40   INFO  epoch: 15/24, acc_iter=100355, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:38:20, time_cost(all): 1 day, 8:40:22/18:14:46, loss=0.379546857357977, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=4.005787842918, lr=0.03220785927039388
2023-11-15 22:17:38   INFO  epoch: 15/24, acc_iter=100405, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:37:41, time_cost(all): 1 day, 8:41:20/19:55:48, loss=0.3794359152098, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=3.2175102016282837, lr=0.03216776749765331
2023-11-15 22:18:37   INFO  epoch: 15/24, acc_iter=100455, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:52, time_cost(all): 1 day, 8:42:19/18:52:35, loss=0.379324973061624, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.8519908145648496, lr=0.03212767572491272
2023-11-15 22:19:36   INFO  epoch: 15/24, acc_iter=100505, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:32:25, time_cost(all): 1 day, 8:43:18/19:16:17, loss=0.379214030913447, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=1.6671695790842602, lr=0.03208758395217212
2023-11-15 22:20:35   INFO  epoch: 15/24, acc_iter=100555, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:37:58, time_cost(all): 1 day, 8:44:17/19:50:41, loss=0.37910308876527, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.1941437650216633, lr=0.032047492179431536
2023-11-15 22:21:34   INFO  epoch: 15/24, acc_iter=100605, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:31:53, time_cost(all): 1 day, 8:45:16/19:48:59, loss=0.378992146617093, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=2.0879606717946713, lr=0.03200740040669095
2023-11-15 22:22:33   INFO  epoch: 15/24, acc_iter=100655, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:34:34, time_cost(all): 1 day, 8:46:15/18:54:38, loss=0.378881204468917, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=2.5688583774292137, lr=0.031967308633950364
2023-11-15 22:23:32   INFO  epoch: 15/24, acc_iter=100705, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:34:37, time_cost(all): 1 day, 8:47:14/19:29:16, loss=0.37877026232074, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.473597330523848, lr=0.031927216861209765
2023-11-15 22:24:31   INFO  epoch: 15/24, acc_iter=100755, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:32:53, time_cost(all): 1 day, 8:48:13/19:40:36, loss=0.378659320172563, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.2716508512376965, lr=0.03188712508846918
2023-11-15 22:25:30   INFO  epoch: 15/24, acc_iter=100805, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:34:02, time_cost(all): 1 day, 8:49:12/18:49:27, loss=0.378548378024386, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.15(1.03), norm=1.3132433051705448, lr=0.03184703331572859
2023-11-15 22:26:29   INFO  epoch: 15/24, acc_iter=100855, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:24:46, time_cost(all): 1 day, 8:50:11/18:21:40, loss=0.37843743587621, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=4.145814996848339, lr=0.03180694154298801
2023-11-15 22:27:28   INFO  epoch: 15/24, acc_iter=100905, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:27:33, time_cost(all): 1 day, 8:51:10/18:37:51, loss=0.378326493728033, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.2633709376474846, lr=0.03176684977024742
2023-11-15 22:28:27   INFO  epoch: 15/24, acc_iter=100955, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:26:44, time_cost(all): 1 day, 8:52:09/19:43:02, loss=0.378215551579856, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=3.756840170974712, lr=0.031726757997506835
2023-11-15 22:29:26   INFO  epoch: 15/24, acc_iter=101005, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:29:21, time_cost(all): 1 day, 8:53:08/19:34:26, loss=0.378104609431679, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=2.8176448079582834, lr=0.03168666622476625
2023-11-15 22:30:25   INFO  epoch: 15/24, acc_iter=101055, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:26:32, time_cost(all): 1 day, 8:54:07/18:35:35, loss=0.377993667283503, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=4.608445741880378, lr=0.03164657445202565
2023-11-15 22:31:23   INFO  epoch: 15/24, acc_iter=101105, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:20:43, time_cost(all): 1 day, 8:55:05/18:04:44, loss=0.377882725135326, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=3.6224511249996527, lr=0.031606482679285064
2023-11-15 22:32:22   INFO  epoch: 15/24, acc_iter=101155, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:22:54, time_cost(all): 1 day, 8:56:04/18:34:15, loss=0.377771782987149, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=0.9101915003801895, lr=0.03156639090654448
2023-11-15 22:33:21   INFO  epoch: 15/24, acc_iter=101205, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:19:14, time_cost(all): 1 day, 8:57:03/19:18:10, loss=0.377660840838972, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.05(1.03), norm=2.340390956787195, lr=0.03152629913380389
2023-11-15 22:34:20   INFO  epoch: 15/24, acc_iter=101255, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:54, time_cost(all): 1 day, 8:58:02/18:12:22, loss=0.377549898690796, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=3.2829188076802867, lr=0.031486207361063306
2023-11-15 22:35:19   INFO  epoch: 15/24, acc_iter=101305, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:20:08, time_cost(all): 1 day, 8:59:01/19:28:46, loss=0.377438956542619, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=3.364287589142475, lr=0.03144611558832272
2023-11-15 22:36:18   INFO  epoch: 15/24, acc_iter=101355, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:18:20, time_cost(all): 1 day, 9:00:00/17:53:40, loss=0.377328014394442, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=1.6745971898259953, lr=0.031406023815582135
2023-11-15 22:37:17   INFO  epoch: 15/24, acc_iter=101405, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:16:24, time_cost(all): 1 day, 9:00:59/18:57:52, loss=0.377217072246265, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=0.7343263961347777, lr=0.03136593204284155
2023-11-15 22:38:16   INFO  epoch: 15/24, acc_iter=101455, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:14:51, time_cost(all): 1 day, 9:01:58/18:17:46, loss=0.377106130098089, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=3.24817551451177, lr=0.03132584027010095
2023-11-15 22:39:15   INFO  epoch: 15/24, acc_iter=101505, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:18:34, time_cost(all): 1 day, 9:02:57/18:47:35, loss=0.376995187949912, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.0547530718158984, lr=0.03128574849736036
2023-11-15 22:40:14   INFO  epoch: 15/24, acc_iter=101555, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:13:24, time_cost(all): 1 day, 9:03:56/18:43:42, loss=0.376884245801735, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=0.5661396061634318, lr=0.031245656724619778
2023-11-15 22:41:13   INFO  epoch: 15/24, acc_iter=101605, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:38, time_cost(all): 1 day, 9:04:55/18:17:47, loss=0.376773303653558, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.07(1.03), norm=1.5258202052668848, lr=0.03120556495187919
2023-11-15 22:42:12   INFO  epoch: 15/24, acc_iter=101655, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:09, time_cost(all): 1 day, 9:05:54/19:11:54, loss=0.376662361505382, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=1.3142777495589977, lr=0.031165473179138592
2023-11-15 22:43:11   INFO  epoch: 15/24, acc_iter=101705, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:12:03, time_cost(all): 1 day, 9:06:53/18:00:37, loss=0.376551419357205, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=4.313115194087703, lr=0.031125381406398006
2023-11-15 22:44:10   INFO  epoch: 15/24, acc_iter=101755, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:13:47, time_cost(all): 1 day, 9:07:52/19:15:21, loss=0.376440477209028, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=3.5838152866308195, lr=0.031085289633657434
2023-11-15 22:45:08   INFO  epoch: 15/24, acc_iter=101805, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:09:09, time_cost(all): 1 day, 9:08:50/18:39:27, loss=0.376329535060851, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.894499527897568, lr=0.031045197860916834
2023-11-15 22:46:07   INFO  epoch: 15/24, acc_iter=101855, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:06:53, time_cost(all): 1 day, 9:09:49/18:52:09, loss=0.376218592912675, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=0.7591625622897595, lr=0.03100510608817625
2023-11-15 22:47:06   INFO  epoch: 15/24, acc_iter=101905, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:48, time_cost(all): 1 day, 9:10:48/18:51:55, loss=0.376107650764498, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=0.9378515916984613, lr=0.030965014315435663
2023-11-15 22:48:05   INFO  epoch: 15/24, acc_iter=101955, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:04:13, time_cost(all): 1 day, 9:11:47/18:51:32, loss=0.375996708616321, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=1.9045705954498935, lr=0.030924922542695077
2023-11-15 22:49:04   INFO  epoch: 15/24, acc_iter=102005, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:05, time_cost(all): 1 day, 9:12:46/18:48:31, loss=0.375885766468144, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=4.216027586561021, lr=0.030884830769954477
2023-11-15 22:50:03   INFO  epoch: 15/24, acc_iter=102055, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:05:06, time_cost(all): 1 day, 9:13:45/17:45:19, loss=0.375774824319968, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=1.4215805818460812, lr=0.03084473899721389
2023-11-15 22:51:02   INFO  epoch: 15/24, acc_iter=102105, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:43, time_cost(all): 1 day, 9:14:44/19:16:09, loss=0.375663882171791, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=3.5415573134244225, lr=0.030804647224473306
2023-11-15 22:52:01   INFO  epoch: 15/24, acc_iter=102155, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:02:21, time_cost(all): 1 day, 9:15:43/18:19:49, loss=0.375552940023614, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=4.186480853297326, lr=0.03076455545173272
2023-11-15 22:53:00   INFO  epoch: 15/24, acc_iter=102205, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:03:13, time_cost(all): 1 day, 9:16:42/18:07:52, loss=0.375441997875437, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=4.7658864472126865, lr=0.030724463678992134
2023-11-15 22:53:59   INFO  epoch: 15/24, acc_iter=102255, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:03:00, time_cost(all): 1 day, 9:17:41/18:44:26, loss=0.375331055727261, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.05(1.03), norm=1.2658179697821916, lr=0.030684371906251548
2023-11-15 22:54:58   INFO  epoch: 15/24, acc_iter=102305, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:00:19, time_cost(all): 1 day, 9:18:40/18:23:07, loss=0.375220113579084, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=3.2073455120473264, lr=0.030644280133510962
2023-11-15 22:55:57   INFO  epoch: 15/24, acc_iter=102355, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:58:14, time_cost(all): 1 day, 9:19:39/17:56:19, loss=0.375109171430907, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=2.497895158054696, lr=0.030604188360770362
2023-11-15 22:56:56   INFO  epoch: 15/24, acc_iter=102405, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:59:14, time_cost(all): 1 day, 9:20:38/19:09:05, loss=0.37499822928273, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.83(1.03), norm=2.106572002428166, lr=0.030564096588029777
2023-11-15 22:57:55   INFO  epoch: 15/24, acc_iter=102455, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:57:05, time_cost(all): 1 day, 9:21:37/18:59:14, loss=0.374887287134554, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=0.6590186836705916, lr=0.03052400481528919
2023-11-15 22:58:54   INFO  epoch: 15/24, acc_iter=102505, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:59:26, time_cost(all): 1 day, 9:22:36/18:19:32, loss=0.374776344986377, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=1.503606413951592, lr=0.030483913042548605
2023-11-15 22:59:52   INFO  epoch: 15/24, acc_iter=102555, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:57:52, time_cost(all): 1 day, 9:23:34/18:01:52, loss=0.3746654028382, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=2.5892922855194938, lr=0.030443821269808005
2023-11-15 23:00:51   INFO  epoch: 15/24, acc_iter=102605, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:56:30, time_cost(all): 1 day, 9:24:33/18:44:03, loss=0.374554460690023, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=1.5724258449750057, lr=0.03040372949706742
2023-11-15 23:01:50   INFO  epoch: 15/24, acc_iter=102655, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:52:32, time_cost(all): 1 day, 9:25:32/18:11:25, loss=0.374443518541847, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=4.211212374160664, lr=0.030363637724326847
2023-11-15 23:02:49   INFO  epoch: 15/24, acc_iter=102705, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:39, time_cost(all): 1 day, 9:26:31/18:07:24, loss=0.37433257639367, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.94(1.03), norm=1.8485229947697561, lr=0.030323545951586248
2023-11-15 23:03:48   INFO  epoch: 15/24, acc_iter=102755, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:51:27, time_cost(all): 1 day, 9:27:30/18:25:44, loss=0.374221634245493, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=2.5655352464597345, lr=0.030283454178845662
2023-11-15 23:04:47   INFO  epoch: 15/24, acc_iter=102805, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:49:34, time_cost(all): 1 day, 9:28:29/17:41:11, loss=0.374110692097316, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.304556508067385, lr=0.030243362406105076
2023-11-15 23:05:46   INFO  epoch: 15/24, acc_iter=102855, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:48:01, time_cost(all): 1 day, 9:29:28/18:53:59, loss=0.37399974994914, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=4.1945167299438975, lr=0.03020327063336449
2023-11-15 23:06:45   INFO  epoch: 15/24, acc_iter=102905, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:56, time_cost(all): 1 day, 9:30:27/17:45:57, loss=0.373888807800963, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=4.4943294031491146, lr=0.03016317886062389
2023-11-15 23:07:44   INFO  epoch: 15/24, acc_iter=102955, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:46:27, time_cost(all): 1 day, 9:31:26/18:36:32, loss=0.373777865652786, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=1.4193472808314238, lr=0.030123087087883305
2023-11-15 23:08:43   INFO  epoch: 15/24, acc_iter=103005, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:48:31, time_cost(all): 1 day, 9:32:25/18:35:33, loss=0.373666923504609, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.07771173445277, lr=0.03008299531514272
2023-11-15 23:09:42   INFO  epoch: 15/24, acc_iter=103055, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:46:22, time_cost(all): 1 day, 9:33:24/18:23:42, loss=0.373555981356433, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=4.760824663813102, lr=0.030042903542402133
2023-11-15 23:10:41   INFO  epoch: 15/24, acc_iter=103105, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:44:02, time_cost(all): 1 day, 9:34:23/18:00:16, loss=0.373445039208256, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.95(1.03), norm=2.545173786034104, lr=0.030002811769661547
2023-11-15 23:11:40   INFO  epoch: 15/24, acc_iter=103155, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:02, time_cost(all): 1 day, 9:35:22/17:52:43, loss=0.373334097060079, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=2.1835792059970363, lr=0.02996271999692096
2023-11-15 23:12:39   INFO  epoch: 15/24, acc_iter=103205, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:24, time_cost(all): 1 day, 9:36:21/17:47:22, loss=0.373223154911902, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=3.5993528348390567, lr=0.029922628224180375
2023-11-15 23:13:37   INFO  epoch: 15/24, acc_iter=103255, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:41:57, time_cost(all): 1 day, 9:37:19/17:27:12, loss=0.373112212763726, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=3.6697126791257992, lr=0.029882536451439776
2023-11-15 23:14:36   INFO  epoch: 15/24, acc_iter=103305, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:52, time_cost(all): 1 day, 9:38:18/18:56:15, loss=0.373001270615549, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=3.8599314878901643, lr=0.02984244467869919
2023-11-15 23:15:35   INFO  epoch: 15/24, acc_iter=103355, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:52, time_cost(all): 1 day, 9:39:17/17:12:48, loss=0.372890328467372, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.21(1.03), norm=1.481710033413341, lr=0.029802352905958604
2023-11-15 23:16:34   INFO  epoch: 15/24, acc_iter=103405, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:51, time_cost(all): 1 day, 9:40:16/18:50:05, loss=0.372779386319195, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=3.808787209597318, lr=0.029762261133218018
2023-11-15 23:17:33   INFO  epoch: 15/24, acc_iter=103455, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:45, time_cost(all): 1 day, 9:41:15/17:37:53, loss=0.372668444171019, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=0.6833867254060779, lr=0.029722169360477432
2023-11-15 23:18:32   INFO  epoch: 15/24, acc_iter=103505, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:40, time_cost(all): 1 day, 9:42:14/18:39:07, loss=0.372557502022842, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.6130368515786584, lr=0.029682077587736846
2023-11-15 23:19:31   INFO  epoch: 15/24, acc_iter=103555, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:00, time_cost(all): 1 day, 9:43:13/17:56:46, loss=0.372446559874665, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.04(1.03), norm=0.7412322730852032, lr=0.02964198581499626
2023-11-15 23:20:30   INFO  epoch: 15/24, acc_iter=103605, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:11, time_cost(all): 1 day, 9:44:12/18:34:11, loss=0.372335617726488, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=3.86017429434017, lr=0.029601894042255675
2023-11-15 23:21:29   INFO  epoch: 15/24, acc_iter=103655, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:01, time_cost(all): 1 day, 9:45:11/17:18:48, loss=0.372224675578312, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=0.5112567526035571, lr=0.029561802269515075
2023-11-15 23:22:28   INFO  epoch: 15/24, acc_iter=103705, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:41, time_cost(all): 1 day, 9:46:10/17:26:35, loss=0.372113733430135, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=4.004355724443533, lr=0.02952171049677449
2023-11-15 23:23:27   INFO  epoch: 15/24, acc_iter=103755, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:10, time_cost(all): 1 day, 9:47:09/17:54:29, loss=0.372002791281958, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.22(1.03), norm=2.8750309141273296, lr=0.029481618724033903
2023-11-15 23:24:26   INFO  epoch: 15/24, acc_iter=103805, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:30:08, time_cost(all): 1 day, 9:48:08/18:39:19, loss=0.371891849133781, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.91(1.03), norm=4.98563490402431, lr=0.029441526951293318
2023-11-15 23:25:25   INFO  epoch: 15/24, acc_iter=103855, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:28:51, time_cost(all): 1 day, 9:49:07/18:40:22, loss=0.371780906985605, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=2.6334112671886967, lr=0.029401435178552718
2023-11-15 23:26:24   INFO  epoch: 15/24, acc_iter=103905, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:39, time_cost(all): 1 day, 9:50:06/17:22:51, loss=0.371669964837428, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.93(1.03), norm=3.4344921723263475, lr=0.029361343405812132
2023-11-15 23:27:22   INFO  epoch: 15/24, acc_iter=103955, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:26:59, time_cost(all): 1 day, 9:51:04/17:38:46, loss=0.371559022689251, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.5375148075692255, lr=0.02932125163307156
2023-11-15 23:28:21   INFO  epoch: 15/24, acc_iter=104005, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:25, time_cost(all): 1 day, 9:52:03/18:39:50, loss=0.371448080541074, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=3.8823881544168026, lr=0.02928115986033096
2023-11-15 23:29:20   INFO  epoch: 15/24, acc_iter=104055, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:47, time_cost(all): 1 day, 9:53:02/17:43:10, loss=0.371337138392898, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=2.64238587274361, lr=0.029241068087590374
2023-11-15 23:30:19   INFO  epoch: 15/24, acc_iter=104105, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:51, time_cost(all): 1 day, 9:54:01/18:13:55, loss=0.371226196244721, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=4.8525280053205755, lr=0.02920097631484979
2023-11-15 23:31:18   INFO  epoch: 15/24, acc_iter=104155, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:07, time_cost(all): 1 day, 9:55:00/17:25:37, loss=0.371115254096544, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.3048278525879, lr=0.029160884542109203
2023-11-15 23:32:17   INFO  epoch: 15/24, acc_iter=104205, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:30, time_cost(all): 1 day, 9:55:59/17:37:31, loss=0.371004311948367, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=3.5504602120045137, lr=0.029120792769368603
2023-11-15 23:33:16   INFO  epoch: 15/24, acc_iter=104255, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:24, time_cost(all): 1 day, 9:56:58/17:47:27, loss=0.37089336980019, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.08(1.03), norm=3.458382081908823, lr=0.029080700996628017
2023-11-15 23:34:15   INFO  epoch: 15/24, acc_iter=104305, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:43, time_cost(all): 1 day, 9:57:57/17:52:27, loss=0.370782427652014, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.01(1.03), norm=4.6866502207308045, lr=0.02904060922388743
2023-11-15 23:35:14   INFO  epoch: 15/24, acc_iter=104355, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:55, time_cost(all): 1 day, 9:58:56/17:56:26, loss=0.370671485503837, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=3.2893032555453168, lr=0.029000517451146846
2023-11-15 23:36:13   INFO  epoch: 15/24, acc_iter=104405, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:18, time_cost(all): 1 day, 9:59:55/16:58:31, loss=0.37056054335566, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=2.8930257648094244, lr=0.02896042567840626
2023-11-15 23:37:12   INFO  epoch: 15/24, acc_iter=104455, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:58, time_cost(all): 1 day, 10:00:54/18:05:27, loss=0.370449601207483, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=0.636856335232808, lr=0.028920333905665674
2023-11-15 23:38:11   INFO  epoch: 15/24, acc_iter=104505, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:08, time_cost(all): 1 day, 10:01:53/18:05:00, loss=0.370338659059307, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=0.9827898259084806, lr=0.028880242132925088
2023-11-15 23:39:10   INFO  epoch: 15/24, acc_iter=104555, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:12, time_cost(all): 1 day, 10:02:52/18:11:34, loss=0.37022771691113, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=3.308560362116522, lr=0.02884015036018449
2023-11-15 23:40:09   INFO  epoch: 15/24, acc_iter=104605, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:45, time_cost(all): 1 day, 10:03:51/17:22:04, loss=0.370116774762953, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=3.014717577681211, lr=0.028800058587443902
2023-11-15 23:41:07   INFO  epoch: 15/24, acc_iter=104655, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:13, time_cost(all): 1 day, 10:04:49/17:24:04, loss=0.370005832614776, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=0.849199783448908, lr=0.028759966814703317
2023-11-15 23:42:06   INFO  epoch: 15/24, acc_iter=104705, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:10, time_cost(all): 1 day, 10:05:48/18:06:11, loss=0.3698948904666, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=1.711170720154306, lr=0.02871987504196273
2023-11-15 23:43:05   INFO  epoch: 15/24, acc_iter=104755, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:55, time_cost(all): 1 day, 10:06:47/18:17:25, loss=0.369783948318423, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=4.717095429806494, lr=0.02867978326922213
2023-11-15 23:44:04   INFO  epoch: 15/24, acc_iter=104805, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:33, time_cost(all): 1 day, 10:07:46/17:49:26, loss=0.369673006170246, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=4.763741703575506, lr=0.028639691496481545
2023-11-15 23:45:03   INFO  epoch: 15/24, acc_iter=104855, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:56, time_cost(all): 1 day, 10:08:45/17:08:15, loss=0.36956206402207, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=2.5869997811526755, lr=0.028599599723740973
2023-11-15 23:46:02   INFO  epoch: 15/24, acc_iter=104905, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:13, time_cost(all): 1 day, 10:09:44/18:17:53, loss=0.369451121873893, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=3.4989744785983095, lr=0.028559507951000374
2023-11-15 23:47:01   INFO  epoch: 15/24, acc_iter=104955, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:31, time_cost(all): 1 day, 10:10:43/18:03:02, loss=0.369340179725716, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.032497714778418, lr=0.028519416178259788
2023-11-15 23:48:00   INFO  epoch: 15/24, acc_iter=105005, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:56, time_cost(all): 1 day, 10:11:42/16:56:56, loss=0.369229237577539, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=1.9698087850946835, lr=0.028479324405519202
2023-11-15 23:48:59   INFO  epoch: 15/24, acc_iter=105055, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:43, time_cost(all): 1 day, 10:12:41/18:04:46, loss=0.369118295429363, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=1.9737457419870308, lr=0.028439232632778616
2023-11-15 23:49:58   INFO  epoch: 15/24, acc_iter=105105, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:54, time_cost(all): 1 day, 10:13:40/16:57:13, loss=0.369007353281186, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=3.0883253080955666, lr=0.028399140860038016
2023-11-15 23:50:57   INFO  epoch: 15/24, acc_iter=105155, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:38, time_cost(all): 1 day, 10:14:39/18:17:58, loss=0.368896411133009, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=3.2242079494178286, lr=0.02835904908729743
2023-11-15 23:51:56   INFO  epoch: 15/24, acc_iter=105205, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:33, time_cost(all): 1 day, 10:15:38/17:59:39, loss=0.368785468984832, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=1.0430621091988792, lr=0.028318957314556845
2023-11-15 23:52:55   INFO  epoch: 15/24, acc_iter=105255, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:41, time_cost(all): 1 day, 10:16:37/17:38:02, loss=0.368674526836656, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=0.9812609892230039, lr=0.02827886554181626
2023-11-15 23:53:54   INFO  epoch: 15/24, acc_iter=105305, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:45, time_cost(all): 1 day, 10:17:36/17:26:46, loss=0.368563584688479, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.306694353522223, lr=0.028238773769075673
2023-11-15 23:54:52   INFO  epoch: 15/24, acc_iter=105355, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 1 day, 10:18:34/18:12:02, loss=0.368452642540302, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=1.3419286608298149, lr=0.028198681996335087
2023-11-15 23:55:51   INFO  epoch: 16/24, acc_iter=105442, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:13:21, time_cost(all): 1 day, 10:19:33/18:03:55, loss=0.368259603202474, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=4.190697387797044, lr=0.02812892231176646
2023-11-15 23:56:50   INFO  epoch: 16/24, acc_iter=105492, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:04:34, time_cost(all): 1 day, 10:20:32/17:20:01, loss=0.368148661054298, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.7932475421294174, lr=0.028088830539025875
2023-11-15 23:57:49   INFO  epoch: 16/24, acc_iter=105542, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:10:39, time_cost(all): 1 day, 10:21:31/17:54:30, loss=0.368037718906121, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=0.699596887406192, lr=0.02804873876628529
2023-11-15 23:58:48   INFO  epoch: 16/24, acc_iter=105592, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:03:11, time_cost(all): 1 day, 10:22:30/17:11:54, loss=0.367926776757944, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.23(1.03), norm=3.7235706772379884, lr=0.02800864699354469
2023-11-15 23:59:47   INFO  epoch: 16/24, acc_iter=105642, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:07:22, time_cost(all): 1 day, 10:23:29/17:13:30, loss=0.367815834609767, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=1.9733089516653226, lr=0.027968555220804103
2023-11-16 00:00:46   INFO  epoch: 16/24, acc_iter=105692, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:05:04, time_cost(all): 1 day, 10:24:28/16:32:32, loss=0.367704892461591, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=3.123062542948694, lr=0.027928463448063517
2023-11-16 00:01:45   INFO  epoch: 16/24, acc_iter=105742, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:03:03, time_cost(all): 1 day, 10:25:27/16:31:29, loss=0.367593950313414, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.89(1.03), norm=3.211691708362918, lr=0.02788837167532293
2023-11-16 00:02:44   INFO  epoch: 16/24, acc_iter=105792, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:06:57, time_cost(all): 1 day, 10:26:26/18:01:38, loss=0.367483008165237, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=4.490927951251185, lr=0.027848279902582346
2023-11-16 00:03:43   INFO  epoch: 16/24, acc_iter=105842, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:01:44, time_cost(all): 1 day, 10:27:25/17:53:26, loss=0.36737206601706, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=4.502793142605566, lr=0.02780818812984176
2023-11-16 00:04:42   INFO  epoch: 16/24, acc_iter=105892, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:54:58, time_cost(all): 1 day, 10:28:24/17:57:43, loss=0.367261123868884, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=4.096730867213151, lr=0.027768096357101174
2023-11-16 00:05:41   INFO  epoch: 16/24, acc_iter=105942, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:58:27, time_cost(all): 1 day, 10:29:23/16:36:12, loss=0.367150181720707, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.0634904943365877, lr=0.027728004584360574
2023-11-16 00:06:40   INFO  epoch: 16/24, acc_iter=105992, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:57:30, time_cost(all): 1 day, 10:30:22/16:49:08, loss=0.36703923957253, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=1.489586061818357, lr=0.02768791281161999
2023-11-16 00:07:39   INFO  epoch: 16/24, acc_iter=106042, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:56:25, time_cost(all): 1 day, 10:31:21/17:09:55, loss=0.366928297424353, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=3.9998365379548995, lr=0.027647821038879403
2023-11-16 00:08:37   INFO  epoch: 16/24, acc_iter=106092, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:59:02, time_cost(all): 1 day, 10:32:19/16:37:14, loss=0.366817355276177, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=2.287821106058834, lr=0.027607729266138817
2023-11-16 00:09:36   INFO  epoch: 16/24, acc_iter=106142, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:50:17, time_cost(all): 1 day, 10:33:18/17:25:17, loss=0.366706413128, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=3.9454026965313247, lr=0.02756763749339823
2023-11-16 00:10:35   INFO  epoch: 16/24, acc_iter=106192, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:55:22, time_cost(all): 1 day, 10:34:17/17:52:26, loss=0.366595470979823, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.3001398505258184, lr=0.02752754572065763
2023-11-16 00:11:34   INFO  epoch: 16/24, acc_iter=106242, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:53:43, time_cost(all): 1 day, 10:35:16/17:01:11, loss=0.366484528831646, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=1.2618496494188913, lr=0.02748745394791706
2023-11-16 00:12:33   INFO  epoch: 16/24, acc_iter=106292, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:49:10, time_cost(all): 1 day, 10:36:15/16:49:32, loss=0.36637358668347, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=1.8746964596035292, lr=0.027447362175176473
2023-11-16 00:13:32   INFO  epoch: 16/24, acc_iter=106342, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:48:50, time_cost(all): 1 day, 10:37:14/16:22:39, loss=0.366262644535293, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=4.11353487521175, lr=0.027407270402435874
2023-11-16 00:14:31   INFO  epoch: 16/24, acc_iter=106392, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:49:42, time_cost(all): 1 day, 10:38:13/16:36:36, loss=0.366151702387116, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=0.6519622427172252, lr=0.027367178629695288
2023-11-16 00:15:30   INFO  epoch: 16/24, acc_iter=106442, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:52:12, time_cost(all): 1 day, 10:39:12/17:30:28, loss=0.366040760238939, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=2.862449734148584, lr=0.027327086856954702
2023-11-16 00:16:29   INFO  epoch: 16/24, acc_iter=106492, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:03, time_cost(all): 1 day, 10:40:11/17:31:35, loss=0.365929818090763, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=3.575957213703643, lr=0.027286995084214116
2023-11-16 00:17:28   INFO  epoch: 16/24, acc_iter=106542, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:51:08, time_cost(all): 1 day, 10:41:10/17:42:27, loss=0.365818875942586, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=3.780772578601949, lr=0.027246903311473517
2023-11-16 00:18:27   INFO  epoch: 16/24, acc_iter=106592, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:41:05, time_cost(all): 1 day, 10:42:09/17:10:41, loss=0.365707933794409, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=2.360317318650469, lr=0.02720681153873293
2023-11-16 00:19:26   INFO  epoch: 16/24, acc_iter=106642, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:46:01, time_cost(all): 1 day, 10:43:08/17:01:55, loss=0.365596991646232, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.388925841571629, lr=0.027166719765992345
2023-11-16 00:20:25   INFO  epoch: 16/24, acc_iter=106692, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:45:52, time_cost(all): 1 day, 10:44:07/16:22:18, loss=0.365486049498056, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.16(1.03), norm=1.349521855387399, lr=0.02712662799325176
2023-11-16 00:21:24   INFO  epoch: 16/24, acc_iter=106742, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:38:04, time_cost(all): 1 day, 10:45:06/16:13:05, loss=0.365375107349879, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=2.4377755646647357, lr=0.027086536220511173
2023-11-16 00:22:22   INFO  epoch: 16/24, acc_iter=106792, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:37:51, time_cost(all): 1 day, 10:46:04/16:17:31, loss=0.365264165201702, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=4.6999639373221065, lr=0.027046444447770587
2023-11-16 00:23:21   INFO  epoch: 16/24, acc_iter=106842, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:37:23, time_cost(all): 1 day, 10:47:03/17:10:36, loss=0.365153223053525, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=4.08503654216357, lr=0.02700635267503
2023-11-16 00:24:20   INFO  epoch: 16/24, acc_iter=106892, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:35:41, time_cost(all): 1 day, 10:48:02/16:18:50, loss=0.365042280905349, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.4392759703566815, lr=0.026966260902289402
2023-11-16 00:25:19   INFO  epoch: 16/24, acc_iter=106942, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:40:07, time_cost(all): 1 day, 10:49:01/17:19:18, loss=0.364931338757172, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=2.92817157308846, lr=0.026926169129548816
2023-11-16 00:26:18   INFO  epoch: 16/24, acc_iter=106992, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:40:29, time_cost(all): 1 day, 10:50:00/16:09:50, loss=0.364820396608995, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=3.5283029708100724, lr=0.02688607735680823
2023-11-16 00:27:17   INFO  epoch: 16/24, acc_iter=107042, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:24, time_cost(all): 1 day, 10:50:59/17:22:04, loss=0.364709454460818, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=3.4810517261380522, lr=0.026845985584067644
2023-11-16 00:28:16   INFO  epoch: 16/24, acc_iter=107092, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:40:38, time_cost(all): 1 day, 10:51:58/16:33:14, loss=0.364598512312642, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.822607306227602, lr=0.026805893811327045
2023-11-16 00:29:15   INFO  epoch: 16/24, acc_iter=107142, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:34:12, time_cost(all): 1 day, 10:52:57/16:55:28, loss=0.364487570164465, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=3.3134980664677487, lr=0.026765802038586473
2023-11-16 00:30:14   INFO  epoch: 16/24, acc_iter=107192, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:35:36, time_cost(all): 1 day, 10:53:56/17:38:24, loss=0.364376628016288, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=4.910163167333228, lr=0.026725710265845887
2023-11-16 00:31:13   INFO  epoch: 16/24, acc_iter=107242, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:29:23, time_cost(all): 1 day, 10:54:55/17:03:37, loss=0.364265685868111, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=0.6370696849621059, lr=0.026685618493105287
2023-11-16 00:32:12   INFO  epoch: 16/24, acc_iter=107292, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:30:00, time_cost(all): 1 day, 10:55:54/17:03:59, loss=0.364154743719935, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=3.1938426325209, lr=0.0266455267203647
2023-11-16 00:33:11   INFO  epoch: 16/24, acc_iter=107342, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:30:24, time_cost(all): 1 day, 10:56:53/17:04:39, loss=0.364043801571758, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.9(1.03), norm=0.6867370324326201, lr=0.026605434947624115
2023-11-16 00:34:10   INFO  epoch: 16/24, acc_iter=107392, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:28:39, time_cost(all): 1 day, 10:57:52/16:37:53, loss=0.363932859423581, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=1.1561942433094572, lr=0.02656534317488353
2023-11-16 00:35:09   INFO  epoch: 16/24, acc_iter=107442, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:31:24, time_cost(all): 1 day, 10:58:51/16:29:27, loss=0.363821917275404, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.22(1.03), norm=2.505490267477056, lr=0.02652525140214293
2023-11-16 00:36:07   INFO  epoch: 16/24, acc_iter=107492, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:26:26, time_cost(all): 1 day, 10:59:49/16:17:47, loss=0.363710975127228, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=1.8252644359002006, lr=0.026485159629402344
2023-11-16 00:37:06   INFO  epoch: 16/24, acc_iter=107542, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:23:20, time_cost(all): 1 day, 11:00:48/16:57:19, loss=0.363600032979051, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=4.314218175999125, lr=0.026445067856661758
2023-11-16 00:38:05   INFO  epoch: 16/24, acc_iter=107592, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:07, time_cost(all): 1 day, 11:01:47/16:41:24, loss=0.363489090830874, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=4.415678062555811, lr=0.026404976083921172
2023-11-16 00:39:04   INFO  epoch: 16/24, acc_iter=107642, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:25:31, time_cost(all): 1 day, 11:02:46/16:19:09, loss=0.363378148682697, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=3.5409322218917754, lr=0.026364884311180586
2023-11-16 00:40:03   INFO  epoch: 16/24, acc_iter=107692, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:21:40, time_cost(all): 1 day, 11:03:45/16:48:38, loss=0.363267206534521, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=2.190388089766407, lr=0.02632479253844
2023-11-16 00:41:02   INFO  epoch: 16/24, acc_iter=107742, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:19:58, time_cost(all): 1 day, 11:04:44/17:04:43, loss=0.363156264386344, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=2.079308050157569, lr=0.026284700765699415
2023-11-16 00:42:01   INFO  epoch: 16/24, acc_iter=107792, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:24:42, time_cost(all): 1 day, 11:05:43/16:56:38, loss=0.363045322238167, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.04(1.03), norm=2.917152676497329, lr=0.026244608992958815
2023-11-16 00:43:00   INFO  epoch: 16/24, acc_iter=107842, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:20:31, time_cost(all): 1 day, 11:06:42/15:59:01, loss=0.36293438008999, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=2.204676197869618, lr=0.02620451722021823
2023-11-16 00:43:59   INFO  epoch: 16/24, acc_iter=107892, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:31, time_cost(all): 1 day, 11:07:41/17:21:29, loss=0.362823437941814, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=4.0894692048511985, lr=0.026164425447477643
2023-11-16 00:44:58   INFO  epoch: 16/24, acc_iter=107942, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:17:58, time_cost(all): 1 day, 11:08:40/17:25:53, loss=0.362712495793637, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.72255947848874, lr=0.026124333674737057
2023-11-16 00:45:57   INFO  epoch: 16/24, acc_iter=107992, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:20:39, time_cost(all): 1 day, 11:09:39/15:55:09, loss=0.36260155364546, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.7833065388280205, lr=0.026084241901996458
2023-11-16 00:46:56   INFO  epoch: 16/24, acc_iter=108042, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:14:47, time_cost(all): 1 day, 11:10:38/17:22:08, loss=0.362490611497283, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=4.766445578046416, lr=0.026044150129255886
2023-11-16 00:47:55   INFO  epoch: 16/24, acc_iter=108092, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:14:26, time_cost(all): 1 day, 11:11:37/15:55:41, loss=0.362379669349107, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=2.899338516208309, lr=0.0260040583565153
2023-11-16 00:48:54   INFO  epoch: 16/24, acc_iter=108142, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:23, time_cost(all): 1 day, 11:12:36/16:22:04, loss=0.36226872720093, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=2.4600146609877367, lr=0.0259639665837747
2023-11-16 00:49:52   INFO  epoch: 16/24, acc_iter=108192, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:12:14, time_cost(all): 1 day, 11:13:34/16:28:30, loss=0.362157785052753, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=0.9337977700045603, lr=0.025923874811034114
2023-11-16 00:50:51   INFO  epoch: 16/24, acc_iter=108242, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:11:19, time_cost(all): 1 day, 11:14:33/16:08:11, loss=0.362046842904576, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=4.671435564978652, lr=0.02588378303829353
2023-11-16 00:51:50   INFO  epoch: 16/24, acc_iter=108292, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:13:22, time_cost(all): 1 day, 11:15:32/15:52:48, loss=0.3619359007564, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=2.8169548550989734, lr=0.025843691265552943
2023-11-16 00:52:49   INFO  epoch: 16/24, acc_iter=108342, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:09:08, time_cost(all): 1 day, 11:16:31/16:05:07, loss=0.361824958608223, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=4.711598577933545, lr=0.025803599492812357
2023-11-16 00:53:48   INFO  epoch: 16/24, acc_iter=108392, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:10:50, time_cost(all): 1 day, 11:17:30/17:05:03, loss=0.361714016460046, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=4.718358472323443, lr=0.025763507720071757
2023-11-16 00:54:47   INFO  epoch: 16/24, acc_iter=108442, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:12:46, time_cost(all): 1 day, 11:18:29/16:27:52, loss=0.361603074311869, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=3.233216219620274, lr=0.02572341594733117
2023-11-16 00:55:46   INFO  epoch: 16/24, acc_iter=108492, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:08:24, time_cost(all): 1 day, 11:19:28/16:41:10, loss=0.361492132163693, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.4985250621837416, lr=0.0256833241745906
2023-11-16 00:56:45   INFO  epoch: 16/24, acc_iter=108542, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:08:32, time_cost(all): 1 day, 11:20:27/16:45:52, loss=0.361381190015516, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=0.7747470136329733, lr=0.02564323240185
2023-11-16 00:57:44   INFO  epoch: 16/24, acc_iter=108592, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:04:01, time_cost(all): 1 day, 11:21:26/15:50:23, loss=0.361270247867339, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.1248641869454588, lr=0.025603140629109414
2023-11-16 00:58:43   INFO  epoch: 16/24, acc_iter=108642, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:05:54, time_cost(all): 1 day, 11:22:25/16:19:56, loss=0.361159305719162, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.542942109572174, lr=0.025563048856368828
2023-11-16 00:59:42   INFO  epoch: 16/24, acc_iter=108692, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:07:34, time_cost(all): 1 day, 11:23:24/16:01:02, loss=0.361048363570986, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.1865932435879065, lr=0.025522957083628242
2023-11-16 01:00:41   INFO  epoch: 16/24, acc_iter=108742, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:03:11, time_cost(all): 1 day, 11:24:23/16:43:13, loss=0.360937421422809, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.21(1.03), norm=1.0768618220356596, lr=0.025482865310887642
2023-11-16 01:01:40   INFO  epoch: 16/24, acc_iter=108792, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:00:30, time_cost(all): 1 day, 11:25:22/16:00:48, loss=0.360826479274632, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=1.1446744926924188, lr=0.025442773538147057
2023-11-16 01:02:39   INFO  epoch: 16/24, acc_iter=108842, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/0:59:56, time_cost(all): 1 day, 11:26:21/16:33:00, loss=0.360715537126455, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=0.9461666514710202, lr=0.02540268176540647
2023-11-16 01:03:37   INFO  epoch: 16/24, acc_iter=108892, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:58:56, time_cost(all): 1 day, 11:27:19/16:16:42, loss=0.360604594978279, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=0.6425236047252557, lr=0.025362589992665885
2023-11-16 01:04:36   INFO  epoch: 16/24, acc_iter=108942, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:02:02, time_cost(all): 1 day, 11:28:18/16:32:48, loss=0.360493652830102, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=4.955757321284947, lr=0.0253224982199253
2023-11-16 01:05:35   INFO  epoch: 16/24, acc_iter=108992, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/1:01:10, time_cost(all): 1 day, 11:29:17/16:12:10, loss=0.360382710681925, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.642386861174323, lr=0.025282406447184713
2023-11-16 01:06:34   INFO  epoch: 16/24, acc_iter=109042, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:13, time_cost(all): 1 day, 11:30:16/15:45:22, loss=0.360271768533748, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.04(1.03), norm=0.9129266365220102, lr=0.025242314674444127
2023-11-16 01:07:33   INFO  epoch: 16/24, acc_iter=109092, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:35, time_cost(all): 1 day, 11:31:15/16:43:59, loss=0.360160826385572, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=2.3683568459017073, lr=0.025202222901703528
2023-11-16 01:08:32   INFO  epoch: 16/24, acc_iter=109142, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:54:54, time_cost(all): 1 day, 11:32:14/16:58:13, loss=0.360049884237395, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=3.724077061717347, lr=0.025162131128962942
2023-11-16 01:09:31   INFO  epoch: 16/24, acc_iter=109192, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:24, time_cost(all): 1 day, 11:33:13/16:01:15, loss=0.359938942089218, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=2.674623004465454, lr=0.025122039356222356
2023-11-16 01:10:30   INFO  epoch: 16/24, acc_iter=109242, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:55:30, time_cost(all): 1 day, 11:34:12/16:12:22, loss=0.359827999941041, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.3197095956082667, lr=0.02508194758348177
2023-11-16 01:11:29   INFO  epoch: 16/24, acc_iter=109292, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:55:19, time_cost(all): 1 day, 11:35:11/16:10:24, loss=0.359717057792865, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.02(1.03), norm=4.299933204089056, lr=0.02504185581074117
2023-11-16 01:12:28   INFO  epoch: 16/24, acc_iter=109342, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:53:22, time_cost(all): 1 day, 11:36:10/16:24:21, loss=0.359606115644688, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=1.6804437982066398, lr=0.0250017640380006
2023-11-16 01:13:27   INFO  epoch: 16/24, acc_iter=109392, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:45, time_cost(all): 1 day, 11:37:09/15:51:28, loss=0.359495173496511, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=1.104185395933919, lr=0.024961672265260013
2023-11-16 01:14:26   INFO  epoch: 16/24, acc_iter=109442, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:48:16, time_cost(all): 1 day, 11:38:08/16:39:42, loss=0.359384231348334, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.2928125560167247, lr=0.024921580492519413
2023-11-16 01:15:25   INFO  epoch: 16/24, acc_iter=109492, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:46:59, time_cost(all): 1 day, 11:39:07/16:27:02, loss=0.359273289200158, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.83(1.03), norm=1.4526289165797623, lr=0.024881488719778827
2023-11-16 01:16:24   INFO  epoch: 16/24, acc_iter=109542, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:28, time_cost(all): 1 day, 11:40:06/15:55:22, loss=0.359162347051981, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.65521058775871, lr=0.02484139694703824
2023-11-16 01:17:22   INFO  epoch: 16/24, acc_iter=109592, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:49:01, time_cost(all): 1 day, 11:41:04/15:57:48, loss=0.359051404903804, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.296561575901149, lr=0.024801305174297655
2023-11-16 01:18:21   INFO  epoch: 16/24, acc_iter=109642, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:29, time_cost(all): 1 day, 11:42:03/16:32:04, loss=0.358940462755627, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=2.746382294644643, lr=0.024761213401557056
2023-11-16 01:19:20   INFO  epoch: 16/24, acc_iter=109692, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:38, time_cost(all): 1 day, 11:43:02/16:16:38, loss=0.358829520607451, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.0121648961291654, lr=0.02472112162881647
2023-11-16 01:20:19   INFO  epoch: 16/24, acc_iter=109742, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:27, time_cost(all): 1 day, 11:44:01/15:42:44, loss=0.358718578459274, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.94694417269297, lr=0.024681029856075884
2023-11-16 01:21:18   INFO  epoch: 16/24, acc_iter=109792, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:26, time_cost(all): 1 day, 11:45:00/16:18:46, loss=0.358607636311097, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=3.5573451969722245, lr=0.024640938083335298
2023-11-16 01:22:17   INFO  epoch: 16/24, acc_iter=109842, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:41:20, time_cost(all): 1 day, 11:45:59/16:44:02, loss=0.35849669416292, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=0.6067854149540562, lr=0.024600846310594712
2023-11-16 01:23:16   INFO  epoch: 16/24, acc_iter=109892, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:42:50, time_cost(all): 1 day, 11:46:58/16:42:20, loss=0.358385752014744, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=2.8516409573300128, lr=0.024560754537854126
2023-11-16 01:24:15   INFO  epoch: 16/24, acc_iter=109942, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:30, time_cost(all): 1 day, 11:47:57/16:34:19, loss=0.358274809866567, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=3.1707872187084782, lr=0.02452066276511354
2023-11-16 01:25:14   INFO  epoch: 16/24, acc_iter=109992, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:43, time_cost(all): 1 day, 11:48:56/15:29:08, loss=0.35816386771839, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.755225843343998, lr=0.02448057099237294
2023-11-16 01:26:13   INFO  epoch: 16/24, acc_iter=110042, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:21, time_cost(all): 1 day, 11:49:55/15:45:06, loss=0.358052925570213, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=4.0777920672602175, lr=0.024440479219632355
2023-11-16 01:27:12   INFO  epoch: 16/24, acc_iter=110092, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:36:25, time_cost(all): 1 day, 11:50:54/15:44:01, loss=0.357941983422037, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=1.076612013302028, lr=0.02440038744689177
2023-11-16 01:28:11   INFO  epoch: 16/24, acc_iter=110142, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:01, time_cost(all): 1 day, 11:51:53/15:36:30, loss=0.35783104127386, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=3.466878245472596, lr=0.024360295674151183
2023-11-16 01:29:10   INFO  epoch: 16/24, acc_iter=110192, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:22, time_cost(all): 1 day, 11:52:52/16:19:31, loss=0.357720099125683, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=1.7124429462626747, lr=0.024320203901410598
2023-11-16 01:30:09   INFO  epoch: 16/24, acc_iter=110242, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:32:29, time_cost(all): 1 day, 11:53:51/15:28:27, loss=0.357609156977506, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=3.549660174070525, lr=0.02428011212867001
2023-11-16 01:31:07   INFO  epoch: 16/24, acc_iter=110292, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:34:47, time_cost(all): 1 day, 11:54:49/15:11:39, loss=0.35749821482933, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.86(1.03), norm=3.2425780726263222, lr=0.024240020355929426
2023-11-16 01:32:06   INFO  epoch: 16/24, acc_iter=110342, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:16, time_cost(all): 1 day, 11:55:48/16:28:58, loss=0.357387272681153, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=3.950359033979664, lr=0.02419992858318884
2023-11-16 01:33:05   INFO  epoch: 16/24, acc_iter=110392, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:36, time_cost(all): 1 day, 11:56:47/16:24:39, loss=0.357276330532976, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.15(1.03), norm=3.1473017972589292, lr=0.02415983681044824
2023-11-16 01:34:04   INFO  epoch: 16/24, acc_iter=110442, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:43, time_cost(all): 1 day, 11:57:46/16:12:10, loss=0.357165388384799, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=4.684689867374947, lr=0.024119745037707654
2023-11-16 01:35:03   INFO  epoch: 16/24, acc_iter=110492, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:30:35, time_cost(all): 1 day, 11:58:45/16:00:29, loss=0.357054446236623, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=2.5165048700331525, lr=0.02407965326496707
2023-11-16 01:36:02   INFO  epoch: 16/24, acc_iter=110542, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:31, time_cost(all): 1 day, 11:59:44/15:56:44, loss=0.356943504088446, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=2.0958579143763485, lr=0.024039561492226483
2023-11-16 01:37:01   INFO  epoch: 16/24, acc_iter=110592, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:16, time_cost(all): 1 day, 12:00:43/15:13:11, loss=0.356832561940269, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=1.846480448473392, lr=0.023999469719485883
2023-11-16 01:38:00   INFO  epoch: 16/24, acc_iter=110642, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:27:30, time_cost(all): 1 day, 12:01:42/14:59:54, loss=0.356721619792092, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=4.652402952872353, lr=0.023959377946745297
2023-11-16 01:38:59   INFO  epoch: 16/24, acc_iter=110692, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:26:30, time_cost(all): 1 day, 12:02:41/15:43:31, loss=0.356610677643916, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=4.6342279599059095, lr=0.023919286174004725
2023-11-16 01:39:58   INFO  epoch: 16/24, acc_iter=110742, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:28, time_cost(all): 1 day, 12:03:40/16:17:58, loss=0.356499735495739, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.16(1.03), norm=4.468191913367617, lr=0.023879194401264126
2023-11-16 01:40:57   INFO  epoch: 16/24, acc_iter=110792, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:07, time_cost(all): 1 day, 12:04:39/15:24:12, loss=0.356388793347562, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.23(1.03), norm=1.2794297329670017, lr=0.02383910262852354
2023-11-16 01:41:56   INFO  epoch: 16/24, acc_iter=110842, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:52, time_cost(all): 1 day, 12:05:38/15:34:05, loss=0.356277851199385, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.654831125053089, lr=0.023799010855782954
2023-11-16 01:42:55   INFO  epoch: 16/24, acc_iter=110892, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:48, time_cost(all): 1 day, 12:06:37/16:11:41, loss=0.356166909051209, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=4.5461056258426185, lr=0.023758919083042368
2023-11-16 01:43:54   INFO  epoch: 16/24, acc_iter=110942, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:21:06, time_cost(all): 1 day, 12:07:36/16:08:22, loss=0.356055966903032, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=2.1524640265657475, lr=0.02371882731030177
2023-11-16 01:44:52   INFO  epoch: 16/24, acc_iter=110992, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:43, time_cost(all): 1 day, 12:08:34/16:07:21, loss=0.355945024754855, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=4.411443783874769, lr=0.023678735537561182
2023-11-16 01:45:51   INFO  epoch: 16/24, acc_iter=111042, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:52, time_cost(all): 1 day, 12:09:33/15:45:36, loss=0.355834082606678, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.95(1.03), norm=3.578445376480974, lr=0.023638643764820597
2023-11-16 01:46:50   INFO  epoch: 16/24, acc_iter=111092, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:18:14, time_cost(all): 1 day, 12:10:32/15:45:09, loss=0.355723140458502, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=0.7149597252449222, lr=0.02359855199208001
2023-11-16 01:47:49   INFO  epoch: 16/24, acc_iter=111142, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:17:15, time_cost(all): 1 day, 12:11:31/15:41:25, loss=0.355612198310325, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.5818788498392755, lr=0.023558460219339425
2023-11-16 01:48:48   INFO  epoch: 16/24, acc_iter=111192, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:27, time_cost(all): 1 day, 12:12:30/15:16:21, loss=0.355501256162148, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=3.9317209480065056, lr=0.02351836844659884
2023-11-16 01:49:47   INFO  epoch: 16/24, acc_iter=111242, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:26, time_cost(all): 1 day, 12:13:29/15:42:14, loss=0.355390314013971, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.9724863725731634, lr=0.023478276673858253
2023-11-16 01:50:46   INFO  epoch: 16/24, acc_iter=111292, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:12:53, time_cost(all): 1 day, 12:14:28/16:01:57, loss=0.355279371865795, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=0.589747361465556, lr=0.023438184901117654
2023-11-16 01:51:45   INFO  epoch: 16/24, acc_iter=111342, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:13:01, time_cost(all): 1 day, 12:15:27/15:29:51, loss=0.355168429717618, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=3.60521518692576, lr=0.023398093128377068
2023-11-16 01:52:44   INFO  epoch: 16/24, acc_iter=111392, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:10:59, time_cost(all): 1 day, 12:16:26/15:26:16, loss=0.355057487569441, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.11(1.03), norm=4.205300664876804, lr=0.023358001355636482
2023-11-16 01:53:43   INFO  epoch: 16/24, acc_iter=111442, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:08, time_cost(all): 1 day, 12:17:25/15:31:14, loss=0.354946545421264, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.7338464285046262, lr=0.023317909582895896
2023-11-16 01:54:42   INFO  epoch: 16/24, acc_iter=111492, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:31, time_cost(all): 1 day, 12:18:24/15:27:52, loss=0.354835603273087, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.885216589048642, lr=0.023277817810155296
2023-11-16 01:55:41   INFO  epoch: 16/24, acc_iter=111542, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:11, time_cost(all): 1 day, 12:19:23/15:19:30, loss=0.354724661124911, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=2.143513458888991, lr=0.02323772603741471
2023-11-16 01:56:40   INFO  epoch: 16/24, acc_iter=111592, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:39, time_cost(all): 1 day, 12:20:22/15:33:46, loss=0.354613718976734, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=3.119280509591741, lr=0.02319763426467414
2023-11-16 01:57:39   INFO  epoch: 16/24, acc_iter=111642, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:20, time_cost(all): 1 day, 12:21:21/15:38:01, loss=0.354502776828557, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.86805084867021, lr=0.02315754249193354
2023-11-16 01:58:37   INFO  epoch: 16/24, acc_iter=111692, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:41, time_cost(all): 1 day, 12:22:19/15:16:09, loss=0.35439183468038, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=4.174296518251142, lr=0.023117450719192953
2023-11-16 01:59:36   INFO  epoch: 16/24, acc_iter=111742, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:38, time_cost(all): 1 day, 12:23:18/14:46:14, loss=0.354280892532204, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=3.0235363566881017, lr=0.023077358946452367
2023-11-16 02:00:35   INFO  epoch: 16/24, acc_iter=111792, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:41, time_cost(all): 1 day, 12:24:17/15:36:00, loss=0.354169950384027, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=3.6189011443039774, lr=0.02303726717371178
2023-11-16 02:01:34   INFO  epoch: 16/24, acc_iter=111842, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:38, time_cost(all): 1 day, 12:25:16/15:04:44, loss=0.35405900823585, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.2(1.03), norm=2.4231683752380686, lr=0.02299717540097118
2023-11-16 02:02:33   INFO  epoch: 16/24, acc_iter=111892, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 1 day, 12:26:15/15:54:43, loss=0.353948066087673, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=3.5296538468016396, lr=0.022957083628230596
2023-11-16 02:03:32   INFO  epoch: 16/24, acc_iter=111942, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 1 day, 12:27:14/14:37:07, loss=0.353837123939497, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=0.9609161195846794, lr=0.02291699185549001
2023-11-16 02:04:31   INFO  epoch: 17/24, acc_iter=112029, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:14:25, time_cost(all): 1 day, 12:28:13/14:52:47, loss=0.353644084601669, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=3.8374863665588435, lr=0.022847232170921383
2023-11-16 02:05:30   INFO  epoch: 17/24, acc_iter=112079, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:02:11, time_cost(all): 1 day, 12:29:12/15:21:06, loss=0.353533142453492, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=4.990792145355874, lr=0.02280714039818081
2023-11-16 02:06:29   INFO  epoch: 17/24, acc_iter=112129, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:08:41, time_cost(all): 1 day, 12:30:11/16:00:23, loss=0.353422200305316, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=3.78460247500859, lr=0.02276704862544021
2023-11-16 02:07:28   INFO  epoch: 17/24, acc_iter=112179, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:07:17, time_cost(all): 1 day, 12:31:10/15:30:45, loss=0.353311258157139, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.3286938911610762, lr=0.022726956852699626
2023-11-16 02:08:27   INFO  epoch: 17/24, acc_iter=112229, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:04:32, time_cost(all): 1 day, 12:32:09/15:41:32, loss=0.353200316008962, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=3.9068790029252023, lr=0.02268686507995904
2023-11-16 02:09:26   INFO  epoch: 17/24, acc_iter=112279, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:57:58, time_cost(all): 1 day, 12:33:08/15:03:25, loss=0.353089373860785, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.427159305060071, lr=0.022646773307218454
2023-11-16 02:10:25   INFO  epoch: 17/24, acc_iter=112329, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:07:30, time_cost(all): 1 day, 12:34:07/15:23:11, loss=0.352978431712609, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.3256935739830613, lr=0.022606681534477854
2023-11-16 02:11:24   INFO  epoch: 17/24, acc_iter=112379, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:59:40, time_cost(all): 1 day, 12:35:06/15:38:23, loss=0.352867489564432, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=3.960410904515094, lr=0.02256658976173727
2023-11-16 02:12:22   INFO  epoch: 17/24, acc_iter=112429, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:03:28, time_cost(all): 1 day, 12:36:04/14:28:45, loss=0.352756547416255, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=2.676252512552311, lr=0.022526497988996683
2023-11-16 02:13:21   INFO  epoch: 17/24, acc_iter=112479, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:59:30, time_cost(all): 1 day, 12:37:03/14:59:02, loss=0.352645605268078, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.2237399481294573, lr=0.022486406216256097
2023-11-16 02:14:20   INFO  epoch: 17/24, acc_iter=112529, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:01:03, time_cost(all): 1 day, 12:38:02/14:50:32, loss=0.352534663119902, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.674311141315805, lr=0.02244631444351551
2023-11-16 02:15:19   INFO  epoch: 17/24, acc_iter=112579, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:53:27, time_cost(all): 1 day, 12:39:01/14:49:29, loss=0.352423720971725, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.04(1.03), norm=1.1017749707727473, lr=0.022406222670774925
2023-11-16 02:16:18   INFO  epoch: 17/24, acc_iter=112629, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:54:35, time_cost(all): 1 day, 12:40:00/15:34:01, loss=0.352312778823548, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.14(1.03), norm=4.044913810894805, lr=0.02236613089803434
2023-11-16 02:17:17   INFO  epoch: 17/24, acc_iter=112679, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:51:43, time_cost(all): 1 day, 12:40:59/15:25:56, loss=0.352201836675371, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=3.140677258789239, lr=0.02232603912529374
2023-11-16 02:18:16   INFO  epoch: 17/24, acc_iter=112729, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:54:16, time_cost(all): 1 day, 12:41:58/15:19:15, loss=0.352090894527195, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=4.714050999964454, lr=0.022285947352553154
2023-11-16 02:19:15   INFO  epoch: 17/24, acc_iter=112779, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:55:39, time_cost(all): 1 day, 12:42:57/15:14:00, loss=0.351979952379018, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=1.5563956699560413, lr=0.022245855579812568
2023-11-16 02:20:14   INFO  epoch: 17/24, acc_iter=112829, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:50:26, time_cost(all): 1 day, 12:43:56/15:31:13, loss=0.351869010230841, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=2.641859662891047, lr=0.022205763807071982
2023-11-16 02:21:13   INFO  epoch: 17/24, acc_iter=112879, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:52:17, time_cost(all): 1 day, 12:44:55/14:24:04, loss=0.351758068082664, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.09(1.03), norm=3.271289852687148, lr=0.022165672034331396
2023-11-16 02:22:12   INFO  epoch: 17/24, acc_iter=112929, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:51:20, time_cost(all): 1 day, 12:45:54/14:51:25, loss=0.351647125934488, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.900537437004099, lr=0.022125580261590796
2023-11-16 02:23:11   INFO  epoch: 17/24, acc_iter=112979, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:54:34, time_cost(all): 1 day, 12:46:53/14:58:09, loss=0.351536183786311, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.89(1.03), norm=2.895120722308425, lr=0.022085488488850225
2023-11-16 02:24:10   INFO  epoch: 17/24, acc_iter=113029, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:46:12, time_cost(all): 1 day, 12:47:52/14:25:40, loss=0.351425241638134, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.276451490033494, lr=0.02204539671610964
2023-11-16 02:25:09   INFO  epoch: 17/24, acc_iter=113079, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:46:58, time_cost(all): 1 day, 12:48:51/14:25:04, loss=0.351314299489957, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.1002618501497405, lr=0.02200530494336904
2023-11-16 02:26:07   INFO  epoch: 17/24, acc_iter=113129, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:46:36, time_cost(all): 1 day, 12:49:49/14:45:42, loss=0.351203357341781, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=3.17582271456708, lr=0.021965213170628453
2023-11-16 02:27:06   INFO  epoch: 17/24, acc_iter=113179, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:04, time_cost(all): 1 day, 12:50:48/15:18:07, loss=0.351092415193604, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.94(1.03), norm=4.015520128881883, lr=0.021925121397887867
2023-11-16 02:28:05   INFO  epoch: 17/24, acc_iter=113229, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:46:33, time_cost(all): 1 day, 12:51:47/14:30:06, loss=0.350981473045427, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=2.3303282446636775, lr=0.02188502962514728
2023-11-16 02:29:04   INFO  epoch: 17/24, acc_iter=113279, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:45:53, time_cost(all): 1 day, 12:52:46/15:35:13, loss=0.35087053089725, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=4.1221052374345195, lr=0.02184493785240668
2023-11-16 02:30:03   INFO  epoch: 17/24, acc_iter=113329, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:41:56, time_cost(all): 1 day, 12:53:45/14:49:38, loss=0.350759588749074, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.5963962168019235, lr=0.021804846079666096
2023-11-16 02:31:02   INFO  epoch: 17/24, acc_iter=113379, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:45:19, time_cost(all): 1 day, 12:54:44/15:16:35, loss=0.350648646600897, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.92(1.03), norm=3.5480576434050533, lr=0.02176475430692551
2023-11-16 02:32:01   INFO  epoch: 17/24, acc_iter=113429, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:45:06, time_cost(all): 1 day, 12:55:43/14:48:25, loss=0.35053770445272, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.7683894137757955, lr=0.021724662534184924
2023-11-16 02:33:00   INFO  epoch: 17/24, acc_iter=113479, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:35:34, time_cost(all): 1 day, 12:56:42/15:18:15, loss=0.350426762304543, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=2.9529503105793555, lr=0.02168457076144434
2023-11-16 02:33:59   INFO  epoch: 17/24, acc_iter=113529, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:40:03, time_cost(all): 1 day, 12:57:41/14:14:02, loss=0.350315820156367, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=4.397976779164738, lr=0.021644478988703753
2023-11-16 02:34:58   INFO  epoch: 17/24, acc_iter=113579, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:35:31, time_cost(all): 1 day, 12:58:40/14:52:05, loss=0.35020487800819, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=1.4287916096547226, lr=0.021604387215963167
2023-11-16 02:35:57   INFO  epoch: 17/24, acc_iter=113629, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:39:59, time_cost(all): 1 day, 12:59:39/14:51:02, loss=0.350093935860013, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=2.067087466709207, lr=0.021564295443222567
2023-11-16 02:36:56   INFO  epoch: 17/24, acc_iter=113679, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:37:34, time_cost(all): 1 day, 13:00:38/14:52:09, loss=0.349982993711836, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=2.165038206511217, lr=0.02152420367048198
2023-11-16 02:37:55   INFO  epoch: 17/24, acc_iter=113729, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:34:10, time_cost(all): 1 day, 13:01:37/15:20:18, loss=0.34987205156366, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=4.711696469697933, lr=0.021484111897741395
2023-11-16 02:38:54   INFO  epoch: 17/24, acc_iter=113779, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:30:35, time_cost(all): 1 day, 13:02:36/14:43:30, loss=0.349761109415483, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=2.5518555377013468, lr=0.02144402012500081
2023-11-16 02:39:53   INFO  epoch: 17/24, acc_iter=113829, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:32:16, time_cost(all): 1 day, 13:03:35/14:31:34, loss=0.349650167267306, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.1(1.03), norm=2.569036155346787, lr=0.02140392835226021
2023-11-16 02:40:51   INFO  epoch: 17/24, acc_iter=113879, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:33:38, time_cost(all): 1 day, 13:04:33/14:13:54, loss=0.349539225119129, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.382714445245158, lr=0.021363836579519638
2023-11-16 02:41:50   INFO  epoch: 17/24, acc_iter=113929, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:30:02, time_cost(all): 1 day, 13:05:32/15:16:47, loss=0.349428282970953, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=3.2047673075456533, lr=0.021323744806779052
2023-11-16 02:42:49   INFO  epoch: 17/24, acc_iter=113979, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:31:41, time_cost(all): 1 day, 13:06:31/15:04:07, loss=0.349317340822776, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.746478233258373, lr=0.021283653034038452
2023-11-16 02:43:48   INFO  epoch: 17/24, acc_iter=114029, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:32:43, time_cost(all): 1 day, 13:07:30/14:06:41, loss=0.349206398674599, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=3.4377774121414073, lr=0.021243561261297866
2023-11-16 02:44:47   INFO  epoch: 17/24, acc_iter=114079, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:29:54, time_cost(all): 1 day, 13:08:29/14:16:24, loss=0.349095456526422, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=3.3330287561905068, lr=0.02120346948855728
2023-11-16 02:45:46   INFO  epoch: 17/24, acc_iter=114129, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:28:29, time_cost(all): 1 day, 13:09:28/14:09:24, loss=0.348984514378246, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=3.211672653092859, lr=0.021163377715816695
2023-11-16 02:46:45   INFO  epoch: 17/24, acc_iter=114179, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:40, time_cost(all): 1 day, 13:10:27/14:16:22, loss=0.348873572230069, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=4.380014475986549, lr=0.021123285943076095
2023-11-16 02:47:44   INFO  epoch: 17/24, acc_iter=114229, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:25, time_cost(all): 1 day, 13:11:26/15:09:41, loss=0.348762630081892, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.98(1.03), norm=3.587235150969216, lr=0.02108319417033551
2023-11-16 02:48:43   INFO  epoch: 17/24, acc_iter=114279, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:21:06, time_cost(all): 1 day, 13:12:25/15:04:29, loss=0.348651687933715, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=2.439393780566606, lr=0.021043102397594923
2023-11-16 02:49:42   INFO  epoch: 17/24, acc_iter=114329, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:19:56, time_cost(all): 1 day, 13:13:24/14:54:50, loss=0.348540745785539, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=3.823829369276358, lr=0.021003010624854337
2023-11-16 02:50:41   INFO  epoch: 17/24, acc_iter=114379, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:20:07, time_cost(all): 1 day, 13:14:23/15:06:42, loss=0.348429803637362, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=4.369680448875723, lr=0.02096291885211375
2023-11-16 02:51:40   INFO  epoch: 17/24, acc_iter=114429, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:24:58, time_cost(all): 1 day, 13:15:22/15:08:18, loss=0.348318861489185, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=4.847357641116914, lr=0.020922827079373166
2023-11-16 02:52:39   INFO  epoch: 17/24, acc_iter=114479, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:24:15, time_cost(all): 1 day, 13:16:21/14:07:08, loss=0.348207919341008, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=0.5879192260476502, lr=0.02088273530663258
2023-11-16 02:53:38   INFO  epoch: 17/24, acc_iter=114529, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:22:26, time_cost(all): 1 day, 13:17:20/14:56:14, loss=0.348096977192832, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=1.165443725691679, lr=0.02084264353389198
2023-11-16 02:54:36   INFO  epoch: 17/24, acc_iter=114579, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:19:12, time_cost(all): 1 day, 13:18:18/14:09:36, loss=0.347986035044655, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=2.582285070048563, lr=0.020802551761151394
2023-11-16 02:55:35   INFO  epoch: 17/24, acc_iter=114629, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:18, time_cost(all): 1 day, 13:19:17/14:48:21, loss=0.347875092896478, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=2.5011888797526804, lr=0.02076245998841081
2023-11-16 02:56:34   INFO  epoch: 17/24, acc_iter=114679, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:12:39, time_cost(all): 1 day, 13:20:16/14:35:30, loss=0.347764150748301, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=4.901701400328492, lr=0.020722368215670223
2023-11-16 02:57:33   INFO  epoch: 17/24, acc_iter=114729, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:17:22, time_cost(all): 1 day, 13:21:15/14:43:22, loss=0.347653208600125, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=2.0281727627631474, lr=0.020682276442929637
2023-11-16 02:58:32   INFO  epoch: 17/24, acc_iter=114779, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:13:29, time_cost(all): 1 day, 13:22:14/14:33:07, loss=0.347542266451948, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.0736929912479012, lr=0.02064218467018905
2023-11-16 02:59:31   INFO  epoch: 17/24, acc_iter=114829, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:09:45, time_cost(all): 1 day, 13:23:13/14:18:47, loss=0.347431324303771, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=4.170870861681288, lr=0.020602092897448465
2023-11-16 03:00:30   INFO  epoch: 17/24, acc_iter=114879, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:36, time_cost(all): 1 day, 13:24:12/14:09:14, loss=0.347320382155594, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=2.201623249676323, lr=0.020562001124707865
2023-11-16 03:01:29   INFO  epoch: 17/24, acc_iter=114929, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:36, time_cost(all): 1 day, 13:25:11/14:09:25, loss=0.347209440007418, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.86(1.03), norm=4.244426935606759, lr=0.02052190935196728
2023-11-16 03:02:28   INFO  epoch: 17/24, acc_iter=114979, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:08:12, time_cost(all): 1 day, 13:26:10/14:43:07, loss=0.347098497859241, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=3.456533787000391, lr=0.020481817579226694
2023-11-16 03:03:27   INFO  epoch: 17/24, acc_iter=115029, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:08:23, time_cost(all): 1 day, 13:27:09/14:24:25, loss=0.346987555711064, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=1.3191055153571019, lr=0.020441725806486108
2023-11-16 03:04:26   INFO  epoch: 17/24, acc_iter=115079, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:24, time_cost(all): 1 day, 13:28:08/14:04:01, loss=0.346876613562887, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.01(1.03), norm=1.5662501005522982, lr=0.020401634033745522
2023-11-16 03:05:25   INFO  epoch: 17/24, acc_iter=115129, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:04:23, time_cost(all): 1 day, 13:29:07/14:10:31, loss=0.346765671414711, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=2.997185132918163, lr=0.020361542261004922
2023-11-16 03:06:24   INFO  epoch: 17/24, acc_iter=115179, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:09:23, time_cost(all): 1 day, 13:30:06/14:27:12, loss=0.346654729266534, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.377446055237698, lr=0.02032145048826435
2023-11-16 03:07:23   INFO  epoch: 17/24, acc_iter=115229, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:18, time_cost(all): 1 day, 13:31:05/14:38:33, loss=0.346543787118357, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.17(1.03), norm=3.1288138175644926, lr=0.020281358715523765
2023-11-16 03:08:21   INFO  epoch: 17/24, acc_iter=115279, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:03:02, time_cost(all): 1 day, 13:32:03/14:25:10, loss=0.34643284497018, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=4.175052044472199, lr=0.020241266942783165
2023-11-16 03:09:20   INFO  epoch: 17/24, acc_iter=115329, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:25, time_cost(all): 1 day, 13:33:02/14:21:09, loss=0.346321902822004, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.13(1.03), norm=1.1098058907948876, lr=0.02020117517004258
2023-11-16 03:10:19   INFO  epoch: 17/24, acc_iter=115379, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:03, time_cost(all): 1 day, 13:34:01/14:10:35, loss=0.346210960673827, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=2.2452118013450857, lr=0.020161083397301993
2023-11-16 03:11:18   INFO  epoch: 17/24, acc_iter=115429, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:00:27, time_cost(all): 1 day, 13:35:00/14:04:18, loss=0.34610001852565, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=3.9883176841436994, lr=0.020120991624561407
2023-11-16 03:12:17   INFO  epoch: 17/24, acc_iter=115479, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:58:31, time_cost(all): 1 day, 13:35:59/13:33:24, loss=0.345989076377473, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.458105904664475, lr=0.020080899851820808
2023-11-16 03:13:16   INFO  epoch: 17/24, acc_iter=115529, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:59:58, time_cost(all): 1 day, 13:36:58/13:52:48, loss=0.345878134229297, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=4.252797142956666, lr=0.020040808079080222
2023-11-16 03:14:15   INFO  epoch: 17/24, acc_iter=115579, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:59:19, time_cost(all): 1 day, 13:37:57/13:42:54, loss=0.34576719208112, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.12(1.03), norm=1.3038381914886457, lr=0.020000716306339636
2023-11-16 03:15:14   INFO  epoch: 17/24, acc_iter=115629, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:19, time_cost(all): 1 day, 13:38:56/14:04:07, loss=0.345656249932943, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=3.7847664807547323, lr=0.01996062453359905
2023-11-16 03:16:13   INFO  epoch: 17/24, acc_iter=115679, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:15, time_cost(all): 1 day, 13:39:55/13:41:07, loss=0.345545307784766, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.1088606008064623, lr=0.019920532760858464
2023-11-16 03:17:12   INFO  epoch: 17/24, acc_iter=115729, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:58:10, time_cost(all): 1 day, 13:40:54/14:35:57, loss=0.34543436563659, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.0680636499511678, lr=0.01988044098811788
2023-11-16 03:18:11   INFO  epoch: 17/24, acc_iter=115779, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:53:13, time_cost(all): 1 day, 13:41:53/14:15:26, loss=0.345323423488413, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=1.6610578842906687, lr=0.019840349215377293
2023-11-16 03:19:10   INFO  epoch: 17/24, acc_iter=115829, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:55:10, time_cost(all): 1 day, 13:42:52/14:04:15, loss=0.345212481340236, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=3.0969471170748037, lr=0.019800257442636693
2023-11-16 03:20:09   INFO  epoch: 17/24, acc_iter=115879, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:39, time_cost(all): 1 day, 13:43:51/13:36:42, loss=0.345101539192059, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=3.531481249950446, lr=0.019760165669896107
2023-11-16 03:21:08   INFO  epoch: 17/24, acc_iter=115929, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:53:04, time_cost(all): 1 day, 13:44:50/13:38:52, loss=0.344990597043883, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=0.5423372871996417, lr=0.01972007389715552
2023-11-16 03:22:06   INFO  epoch: 17/24, acc_iter=115979, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:50:41, time_cost(all): 1 day, 13:45:48/13:21:39, loss=0.344879654895706, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.0525017356698907, lr=0.019679982124414935
2023-11-16 03:23:05   INFO  epoch: 17/24, acc_iter=116029, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:51:49, time_cost(all): 1 day, 13:46:47/13:20:50, loss=0.344768712747529, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=3.9908872599903837, lr=0.019639890351674336
2023-11-16 03:24:04   INFO  epoch: 17/24, acc_iter=116079, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:51:04, time_cost(all): 1 day, 13:47:46/14:00:32, loss=0.344657770599352, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=1.4299903491027541, lr=0.019599798578933764
2023-11-16 03:25:03   INFO  epoch: 17/24, acc_iter=116129, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:21, time_cost(all): 1 day, 13:48:45/13:24:41, loss=0.344546828451176, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.86(1.03), norm=3.5061881510457518, lr=0.019559706806193178
2023-11-16 03:26:02   INFO  epoch: 17/24, acc_iter=116179, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:30, time_cost(all): 1 day, 13:49:44/13:17:36, loss=0.344435886302999, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=4.138179452623236, lr=0.019519615033452578
2023-11-16 03:27:01   INFO  epoch: 17/24, acc_iter=116229, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:46, time_cost(all): 1 day, 13:50:43/13:18:50, loss=0.344324944154822, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=2.9810495725855275, lr=0.019479523260711992
2023-11-16 03:28:00   INFO  epoch: 17/24, acc_iter=116279, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:54, time_cost(all): 1 day, 13:51:42/13:16:37, loss=0.344214002006645, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=1.9895845973628972, lr=0.019439431487971406
2023-11-16 03:28:59   INFO  epoch: 17/24, acc_iter=116329, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:32, time_cost(all): 1 day, 13:52:41/13:56:31, loss=0.344103059858469, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=4.026285482605638, lr=0.01939933971523082
2023-11-16 03:29:58   INFO  epoch: 17/24, acc_iter=116379, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:57, time_cost(all): 1 day, 13:53:40/13:23:50, loss=0.343992117710292, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=3.5946571449823304, lr=0.01935924794249022
2023-11-16 03:30:57   INFO  epoch: 17/24, acc_iter=116429, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:41:59, time_cost(all): 1 day, 13:54:39/13:41:04, loss=0.343881175562115, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=4.960585318165043, lr=0.019319156169749635
2023-11-16 03:31:56   INFO  epoch: 17/24, acc_iter=116479, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:23, time_cost(all): 1 day, 13:55:38/13:56:31, loss=0.343770233413938, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.7686247586933375, lr=0.01927906439700905
2023-11-16 03:32:55   INFO  epoch: 17/24, acc_iter=116529, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:34, time_cost(all): 1 day, 13:56:37/14:11:45, loss=0.343659291265762, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.1012209873305023, lr=0.019238972624268463
2023-11-16 03:33:54   INFO  epoch: 17/24, acc_iter=116579, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:23, time_cost(all): 1 day, 13:57:36/13:10:07, loss=0.343548349117585, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=3.9591760730140284, lr=0.019198880851527877
2023-11-16 03:34:53   INFO  epoch: 17/24, acc_iter=116629, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:18, time_cost(all): 1 day, 13:58:35/14:15:34, loss=0.343437406969408, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=0.5528175311287724, lr=0.01915878907878729
2023-11-16 03:35:51   INFO  epoch: 17/24, acc_iter=116679, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:37:34, time_cost(all): 1 day, 13:59:33/14:14:05, loss=0.343326464821231, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.6609407157058188, lr=0.019118697306046706
2023-11-16 03:36:50   INFO  epoch: 17/24, acc_iter=116729, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:13, time_cost(all): 1 day, 14:00:32/13:22:46, loss=0.343215522673055, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=4.968811012833656, lr=0.019078605533306106
2023-11-16 03:37:49   INFO  epoch: 17/24, acc_iter=116779, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:36:12, time_cost(all): 1 day, 14:01:31/14:15:56, loss=0.343104580524878, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=3.3484936551334967, lr=0.01903851376056552
2023-11-16 03:38:48   INFO  epoch: 17/24, acc_iter=116829, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:12, time_cost(all): 1 day, 14:02:30/13:07:44, loss=0.342993638376701, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.5923638849086053, lr=0.018998421987824934
2023-11-16 03:39:47   INFO  epoch: 17/24, acc_iter=116879, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:24, time_cost(all): 1 day, 14:03:29/13:10:02, loss=0.342882696228524, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=0.912513381219422, lr=0.01895833021508435
2023-11-16 03:40:46   INFO  epoch: 17/24, acc_iter=116929, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:30:48, time_cost(all): 1 day, 14:04:28/13:55:43, loss=0.342771754080348, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=0.800678354308887, lr=0.018918238442343763
2023-11-16 03:41:45   INFO  epoch: 17/24, acc_iter=116979, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:49, time_cost(all): 1 day, 14:05:27/14:06:29, loss=0.342660811932171, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=3.7836979049099178, lr=0.018878146669603177
2023-11-16 03:42:44   INFO  epoch: 17/24, acc_iter=117029, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:04, time_cost(all): 1 day, 14:06:26/14:17:58, loss=0.342549869783994, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=3.294912650477921, lr=0.01883805489686259
2023-11-16 03:43:43   INFO  epoch: 17/24, acc_iter=117079, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:30:29, time_cost(all): 1 day, 14:07:25/14:17:42, loss=0.342438927635817, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.524193586887056, lr=0.018797963124122005
2023-11-16 03:44:42   INFO  epoch: 17/24, acc_iter=117129, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:35, time_cost(all): 1 day, 14:08:24/13:53:38, loss=0.342327985487641, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=4.05080023230293, lr=0.018757871351381405
2023-11-16 03:45:41   INFO  epoch: 17/24, acc_iter=117179, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:28:36, time_cost(all): 1 day, 14:09:23/14:07:30, loss=0.342217043339464, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.8931465729470003, lr=0.01871777957864082
2023-11-16 03:46:40   INFO  epoch: 17/24, acc_iter=117229, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:26:55, time_cost(all): 1 day, 14:10:22/13:26:48, loss=0.342106101191287, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=4.803950651589943, lr=0.018677687805900234
2023-11-16 03:47:39   INFO  epoch: 17/24, acc_iter=117279, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:33, time_cost(all): 1 day, 14:11:21/13:47:20, loss=0.34199515904311, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.85(1.03), norm=1.5121450775512955, lr=0.018637596033159648
2023-11-16 03:48:38   INFO  epoch: 17/24, acc_iter=117329, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:57, time_cost(all): 1 day, 14:12:20/13:33:58, loss=0.341884216894934, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=2.8196843948239327, lr=0.018597504260419048
2023-11-16 03:49:36   INFO  epoch: 17/24, acc_iter=117379, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:32, time_cost(all): 1 day, 14:13:18/13:06:49, loss=0.341773274746757, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.12(1.03), norm=3.9631719762155133, lr=0.018557412487678462
2023-11-16 03:50:35   INFO  epoch: 17/24, acc_iter=117429, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:23:03, time_cost(all): 1 day, 14:14:17/13:06:37, loss=0.34166233259858, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=4.867721788900515, lr=0.01851732071493789
2023-11-16 03:51:34   INFO  epoch: 17/24, acc_iter=117479, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:38, time_cost(all): 1 day, 14:15:16/13:04:27, loss=0.341551390450403, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=1.5465491882083817, lr=0.01847722894219729
2023-11-16 03:52:33   INFO  epoch: 17/24, acc_iter=117529, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:21:09, time_cost(all): 1 day, 14:16:15/13:53:50, loss=0.341440448302227, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=2.383183964303319, lr=0.018437137169456705
2023-11-16 03:53:32   INFO  epoch: 17/24, acc_iter=117579, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:09, time_cost(all): 1 day, 14:17:14/12:53:53, loss=0.34132950615405, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=2.0236470135273708, lr=0.01839704539671612
2023-11-16 03:54:31   INFO  epoch: 17/24, acc_iter=117629, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:33, time_cost(all): 1 day, 14:18:13/13:24:08, loss=0.341218564005873, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.625977960072332, lr=0.018356953623975533
2023-11-16 03:55:30   INFO  epoch: 17/24, acc_iter=117679, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:40, time_cost(all): 1 day, 14:19:12/13:34:56, loss=0.341107621857696, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.7088324030436137, lr=0.018316861851234933
2023-11-16 03:56:29   INFO  epoch: 17/24, acc_iter=117729, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:20, time_cost(all): 1 day, 14:20:11/12:57:48, loss=0.34099667970952, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=4.761694342522835, lr=0.018276770078494348
2023-11-16 03:57:28   INFO  epoch: 17/24, acc_iter=117779, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:59, time_cost(all): 1 day, 14:21:10/12:49:01, loss=0.340885737561343, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=0.9429604202943012, lr=0.018236678305753762
2023-11-16 03:58:27   INFO  epoch: 17/24, acc_iter=117829, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:14, time_cost(all): 1 day, 14:22:09/13:43:08, loss=0.340774795413166, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.563168072976643, lr=0.018196586533013176
2023-11-16 03:59:26   INFO  epoch: 17/24, acc_iter=117879, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:06, time_cost(all): 1 day, 14:23:08/13:53:04, loss=0.340663853264989, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=4.165731689383133, lr=0.01815649476027259
2023-11-16 04:00:25   INFO  epoch: 17/24, acc_iter=117929, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:38, time_cost(all): 1 day, 14:24:07/13:17:53, loss=0.340552911116813, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=0.8831425942351487, lr=0.018116402987532004
2023-11-16 04:01:24   INFO  epoch: 17/24, acc_iter=117979, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:29, time_cost(all): 1 day, 14:25:06/13:14:53, loss=0.340441968968636, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=0.7220491492444923, lr=0.01807631121479142
2023-11-16 04:02:23   INFO  epoch: 17/24, acc_iter=118029, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:07, time_cost(all): 1 day, 14:26:05/12:45:40, loss=0.340331026820459, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.04(1.03), norm=2.257089828435469, lr=0.01803621944205082
2023-11-16 04:03:21   INFO  epoch: 17/24, acc_iter=118079, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:14, time_cost(all): 1 day, 14:27:03/12:39:24, loss=0.340220084672282, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.536256971665207, lr=0.017996127669310233
2023-11-16 04:04:20   INFO  epoch: 17/24, acc_iter=118129, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:16, time_cost(all): 1 day, 14:28:02/13:20:04, loss=0.340109142524106, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=0.7546912381017086, lr=0.017956035896569647
2023-11-16 04:05:19   INFO  epoch: 17/24, acc_iter=118179, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:44, time_cost(all): 1 day, 14:29:01/12:40:43, loss=0.339998200375929, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=0.7881433697162927, lr=0.01791594412382906
2023-11-16 04:06:18   INFO  epoch: 17/24, acc_iter=118229, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:54, time_cost(all): 1 day, 14:30:00/13:31:04, loss=0.339887258227752, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=4.213935387620836, lr=0.01787585235108846
2023-11-16 04:07:17   INFO  epoch: 17/24, acc_iter=118279, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:23, time_cost(all): 1 day, 14:30:59/13:15:20, loss=0.339776316079575, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.560950503777325, lr=0.017835760578347876
2023-11-16 04:08:16   INFO  epoch: 17/24, acc_iter=118329, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:36, time_cost(all): 1 day, 14:31:58/12:41:42, loss=0.339665373931399, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=1.9301494502875391, lr=0.017795668805607304
2023-11-16 04:09:15   INFO  epoch: 17/24, acc_iter=118379, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:36, time_cost(all): 1 day, 14:32:57/13:08:18, loss=0.339554431783222, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=0.725787562501992, lr=0.017755577032866704
2023-11-16 04:10:14   INFO  epoch: 17/24, acc_iter=118429, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:45, time_cost(all): 1 day, 14:33:56/12:40:39, loss=0.339443489635045, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.5007723074041093, lr=0.017715485260126118
2023-11-16 04:11:13   INFO  epoch: 17/24, acc_iter=118479, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 1 day, 14:34:55/12:35:46, loss=0.339332547486868, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=2.38446521144571, lr=0.017675393487385532
2023-11-16 04:12:12   INFO  epoch: 17/24, acc_iter=118529, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 1 day, 14:35:54/13:05:10, loss=0.339221605338692, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=3.629330134991279, lr=0.017635301714644946
2023-11-16 04:13:11   INFO  epoch: 18/24, acc_iter=118616, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:10:13, time_cost(all): 1 day, 14:36:53/13:09:51, loss=0.339028566000864, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=3.0440638644502016, lr=0.01756554203007632
2023-11-16 04:14:10   INFO  epoch: 18/24, acc_iter=118666, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:03:05, time_cost(all): 1 day, 14:37:52/12:56:45, loss=0.338917623852687, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=1.2795810721168674, lr=0.017525450257335734
2023-11-16 04:15:09   INFO  epoch: 18/24, acc_iter=118716, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:05:48, time_cost(all): 1 day, 14:38:51/12:54:33, loss=0.338806681704511, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=1.031672453405056, lr=0.017485358484595134
2023-11-16 04:16:08   INFO  epoch: 18/24, acc_iter=118766, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:01:41, time_cost(all): 1 day, 14:39:50/13:00:14, loss=0.338695739556334, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.4593906994094565, lr=0.01744526671185455
2023-11-16 04:17:06   INFO  epoch: 18/24, acc_iter=118816, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:07:14, time_cost(all): 1 day, 14:40:48/13:41:22, loss=0.338584797408157, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=2.981087614592256, lr=0.017405174939113977
2023-11-16 04:18:05   INFO  epoch: 18/24, acc_iter=118866, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:07:24, time_cost(all): 1 day, 14:41:47/13:01:59, loss=0.33847385525998, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=3.953506730389897, lr=0.017365083166373377
2023-11-16 04:19:04   INFO  epoch: 18/24, acc_iter=118916, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:04:50, time_cost(all): 1 day, 14:42:46/12:25:28, loss=0.338362913111803, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.92(1.03), norm=1.2067131691870663, lr=0.01732499139363279
2023-11-16 04:20:03   INFO  epoch: 18/24, acc_iter=118966, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:56:20, time_cost(all): 1 day, 14:43:45/13:04:15, loss=0.338251970963627, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=3.257686299027536, lr=0.017284899620892205
2023-11-16 04:21:02   INFO  epoch: 18/24, acc_iter=119016, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/1:58:58, time_cost(all): 1 day, 14:44:44/13:15:36, loss=0.33814102881545, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.21(1.03), norm=2.1450750713263123, lr=0.01724480784815162
2023-11-16 04:22:01   INFO  epoch: 18/24, acc_iter=119066, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:55:45, time_cost(all): 1 day, 14:45:43/13:00:45, loss=0.338030086667273, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=3.6465809592429923, lr=0.01720471607541102
2023-11-16 04:23:00   INFO  epoch: 18/24, acc_iter=119116, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:00:55, time_cost(all): 1 day, 14:46:42/12:42:39, loss=0.337919144519096, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=3.7822762531602705, lr=0.017164624302670434
2023-11-16 04:23:59   INFO  epoch: 18/24, acc_iter=119166, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/1:59:14, time_cost(all): 1 day, 14:47:41/12:26:47, loss=0.33780820237092, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.1604088344005232, lr=0.017124532529929848
2023-11-16 04:24:58   INFO  epoch: 18/24, acc_iter=119216, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:51:50, time_cost(all): 1 day, 14:48:40/13:18:54, loss=0.337697260222743, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=2.930601176889021, lr=0.017084440757189262
2023-11-16 04:25:57   INFO  epoch: 18/24, acc_iter=119266, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:51:00, time_cost(all): 1 day, 14:49:39/13:24:44, loss=0.337586318074566, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=2.628609949188296, lr=0.017044348984448676
2023-11-16 04:26:56   INFO  epoch: 18/24, acc_iter=119316, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:58:34, time_cost(all): 1 day, 14:50:38/12:31:12, loss=0.337475375926389, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.08(1.03), norm=2.0787776899748485, lr=0.01700425721170809
2023-11-16 04:27:55   INFO  epoch: 18/24, acc_iter=119366, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:51:31, time_cost(all): 1 day, 14:51:37/13:00:30, loss=0.337364433778213, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.9026207247294193, lr=0.016964165438967505
2023-11-16 04:28:54   INFO  epoch: 18/24, acc_iter=119416, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:57:32, time_cost(all): 1 day, 14:52:36/13:19:14, loss=0.337253491630036, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=4.684487696735472, lr=0.016924073666226905
2023-11-16 04:29:53   INFO  epoch: 18/24, acc_iter=119466, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:49:02, time_cost(all): 1 day, 14:53:35/12:28:40, loss=0.337142549481859, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.093683075352194, lr=0.01688398189348632
2023-11-16 04:30:51   INFO  epoch: 18/24, acc_iter=119516, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:48:59, time_cost(all): 1 day, 14:54:33/12:43:08, loss=0.337031607333682, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=2.238443182707372, lr=0.016843890120745733
2023-11-16 04:31:50   INFO  epoch: 18/24, acc_iter=119566, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:50:17, time_cost(all): 1 day, 14:55:32/12:29:31, loss=0.336920665185506, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=2.288813866824593, lr=0.016803798348005147
2023-11-16 04:32:49   INFO  epoch: 18/24, acc_iter=119616, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:49, time_cost(all): 1 day, 14:56:31/13:17:41, loss=0.336809723037329, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.545782087875896, lr=0.016763706575264548
2023-11-16 04:33:48   INFO  epoch: 18/24, acc_iter=119666, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:42:39, time_cost(all): 1 day, 14:57:30/13:21:36, loss=0.336698780889152, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.5096366989791106, lr=0.01672361480252396
2023-11-16 04:34:47   INFO  epoch: 18/24, acc_iter=119716, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:44:26, time_cost(all): 1 day, 14:58:29/12:35:38, loss=0.336587838740975, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=4.606483628305155, lr=0.01668352302978339
2023-11-16 04:35:46   INFO  epoch: 18/24, acc_iter=119766, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:36, time_cost(all): 1 day, 14:59:28/12:34:16, loss=0.336476896592799, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=1.5750098791642244, lr=0.01664343125704279
2023-11-16 04:36:45   INFO  epoch: 18/24, acc_iter=119816, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:47:28, time_cost(all): 1 day, 15:00:27/12:39:31, loss=0.336365954444622, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=0.9609130208769405, lr=0.016603339484302204
2023-11-16 04:37:44   INFO  epoch: 18/24, acc_iter=119866, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:41:04, time_cost(all): 1 day, 15:01:26/12:34:10, loss=0.336255012296445, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.97(1.03), norm=4.6963820545632515, lr=0.01656324771156162
2023-11-16 04:38:43   INFO  epoch: 18/24, acc_iter=119916, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:47:15, time_cost(all): 1 day, 15:02:25/13:06:59, loss=0.336144070148268, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.4515818105972196, lr=0.016523155938821033
2023-11-16 04:39:42   INFO  epoch: 18/24, acc_iter=119966, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:45:21, time_cost(all): 1 day, 15:03:24/12:06:40, loss=0.336033128000092, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=1.959397344924744, lr=0.016483064166080447
2023-11-16 04:40:41   INFO  epoch: 18/24, acc_iter=120016, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:45:52, time_cost(all): 1 day, 15:04:23/12:51:09, loss=0.335922185851915, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=2.173028217587827, lr=0.016442972393339847
2023-11-16 04:41:40   INFO  epoch: 18/24, acc_iter=120066, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:38:20, time_cost(all): 1 day, 15:05:22/12:14:03, loss=0.335811243703738, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=2.8804707688234195, lr=0.01640288062059926
2023-11-16 04:42:39   INFO  epoch: 18/24, acc_iter=120116, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:39:15, time_cost(all): 1 day, 15:06:21/12:14:00, loss=0.335700301555561, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.381733451039656, lr=0.016362788847858675
2023-11-16 04:43:38   INFO  epoch: 18/24, acc_iter=120166, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:36:50, time_cost(all): 1 day, 15:07:20/12:11:05, loss=0.335589359407385, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=0.683251670141481, lr=0.01632269707511809
2023-11-16 04:44:36   INFO  epoch: 18/24, acc_iter=120216, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:33:08, time_cost(all): 1 day, 15:08:18/12:18:03, loss=0.335478417259208, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=1.2260919157409713, lr=0.016282605302377504
2023-11-16 04:45:35   INFO  epoch: 18/24, acc_iter=120266, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:34:38, time_cost(all): 1 day, 15:09:17/12:00:15, loss=0.335367475111031, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=2.9379685737572787, lr=0.016242513529636918
2023-11-16 04:46:34   INFO  epoch: 18/24, acc_iter=120316, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:39:23, time_cost(all): 1 day, 15:10:16/12:50:24, loss=0.335256532962854, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.97(1.03), norm=2.7543223963848273, lr=0.016202421756896332
2023-11-16 04:47:33   INFO  epoch: 18/24, acc_iter=120366, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:32:02, time_cost(all): 1 day, 15:11:15/13:08:07, loss=0.335145590814678, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=3.47643585462829, lr=0.016162329984155732
2023-11-16 04:48:32   INFO  epoch: 18/24, acc_iter=120416, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:37:02, time_cost(all): 1 day, 15:12:14/12:35:42, loss=0.335034648666501, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.7097224420014925, lr=0.016122238211415146
2023-11-16 04:49:31   INFO  epoch: 18/24, acc_iter=120466, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:29:57, time_cost(all): 1 day, 15:13:13/12:31:03, loss=0.334923706518324, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=3.5378836097876225, lr=0.01608214643867456
2023-11-16 04:50:30   INFO  epoch: 18/24, acc_iter=120516, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:31:57, time_cost(all): 1 day, 15:14:12/12:21:08, loss=0.334812764370147, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.16(1.03), norm=3.1942060420396654, lr=0.016042054665933975
2023-11-16 04:51:29   INFO  epoch: 18/24, acc_iter=120566, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:27:54, time_cost(all): 1 day, 15:15:11/12:37:00, loss=0.334701822221971, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=4.30732590970044, lr=0.016001962893193375
2023-11-16 04:52:28   INFO  epoch: 18/24, acc_iter=120616, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:30:48, time_cost(all): 1 day, 15:16:10/12:51:22, loss=0.334590880073794, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=1.9737815135308518, lr=0.015961871120452803
2023-11-16 04:53:27   INFO  epoch: 18/24, acc_iter=120666, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:31:11, time_cost(all): 1 day, 15:17:09/12:43:26, loss=0.334479937925617, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=3.1150687021978594, lr=0.015921779347712217
2023-11-16 04:54:26   INFO  epoch: 18/24, acc_iter=120716, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:27:50, time_cost(all): 1 day, 15:18:08/12:13:56, loss=0.33436899577744, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.224900589450094, lr=0.015881687574971617
2023-11-16 04:55:25   INFO  epoch: 18/24, acc_iter=120766, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:28, time_cost(all): 1 day, 15:19:07/12:44:47, loss=0.334258053629264, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=1.3217082093056058, lr=0.01584159580223103
2023-11-16 04:56:24   INFO  epoch: 18/24, acc_iter=120816, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:29:18, time_cost(all): 1 day, 15:20:06/13:00:47, loss=0.334147111481087, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=3.549515810202177, lr=0.015801504029490446
2023-11-16 04:57:23   INFO  epoch: 18/24, acc_iter=120866, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:26:23, time_cost(all): 1 day, 15:21:05/12:44:54, loss=0.33403616933291, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=2.689605059508585, lr=0.01576141225674986
2023-11-16 04:58:21   INFO  epoch: 18/24, acc_iter=120916, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:20:48, time_cost(all): 1 day, 15:22:03/12:07:28, loss=0.333925227184733, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=4.504097717557844, lr=0.01572132048400926
2023-11-16 04:59:20   INFO  epoch: 18/24, acc_iter=120966, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:22:06, time_cost(all): 1 day, 15:23:02/12:48:06, loss=0.333814285036557, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=4.96406259668721, lr=0.015681228711268674
2023-11-16 05:00:19   INFO  epoch: 18/24, acc_iter=121016, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:18:27, time_cost(all): 1 day, 15:24:01/12:44:10, loss=0.33370334288838, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=4.1827637765431795, lr=0.015641136938528102
2023-11-16 05:01:18   INFO  epoch: 18/24, acc_iter=121066, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:21:27, time_cost(all): 1 day, 15:25:00/12:20:16, loss=0.333592400740203, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=0.7726545190242394, lr=0.015601045165787503
2023-11-16 05:02:17   INFO  epoch: 18/24, acc_iter=121116, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:17:31, time_cost(all): 1 day, 15:25:59/11:49:13, loss=0.333481458592026, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.397273438388984, lr=0.015560953393046917
2023-11-16 05:03:16   INFO  epoch: 18/24, acc_iter=121166, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:14:26, time_cost(all): 1 day, 15:26:58/12:44:04, loss=0.33337051644385, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=3.8537403087887028, lr=0.015520861620306331
2023-11-16 05:04:15   INFO  epoch: 18/24, acc_iter=121216, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:51, time_cost(all): 1 day, 15:27:57/11:47:54, loss=0.333259574295673, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=0.6766465084068911, lr=0.015480769847565745
2023-11-16 05:05:14   INFO  epoch: 18/24, acc_iter=121266, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:17:17, time_cost(all): 1 day, 15:28:56/11:57:18, loss=0.333148632147496, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=4.116540381872586, lr=0.015440678074825145
2023-11-16 05:06:13   INFO  epoch: 18/24, acc_iter=121316, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:57, time_cost(all): 1 day, 15:29:55/12:08:42, loss=0.333037689999319, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.12(1.03), norm=0.6258044429023113, lr=0.01540058630208456
2023-11-16 05:07:12   INFO  epoch: 18/24, acc_iter=121366, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:11:28, time_cost(all): 1 day, 15:30:54/12:32:52, loss=0.332926747851143, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=1.8747117317525954, lr=0.015360494529343974
2023-11-16 05:08:11   INFO  epoch: 18/24, acc_iter=121416, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:14:55, time_cost(all): 1 day, 15:31:53/11:40:37, loss=0.332815805702966, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=3.4196127078379597, lr=0.015320402756603388
2023-11-16 05:09:10   INFO  epoch: 18/24, acc_iter=121466, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:21, time_cost(all): 1 day, 15:32:52/12:16:53, loss=0.332704863554789, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=2.380046252432236, lr=0.015280310983862802
2023-11-16 05:10:09   INFO  epoch: 18/24, acc_iter=121516, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:13:02, time_cost(all): 1 day, 15:33:51/11:51:29, loss=0.332593921406612, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=3.607133089323739, lr=0.015240219211122216
2023-11-16 05:11:08   INFO  epoch: 18/24, acc_iter=121566, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:20, time_cost(all): 1 day, 15:34:50/11:48:34, loss=0.332482979258436, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=0.5748402762221608, lr=0.01520012743838163
2023-11-16 05:12:06   INFO  epoch: 18/24, acc_iter=121616, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:11:04, time_cost(all): 1 day, 15:35:48/12:05:53, loss=0.332372037110259, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=3.4924679291469682, lr=0.01516003566564103
2023-11-16 05:13:05   INFO  epoch: 18/24, acc_iter=121666, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:32, time_cost(all): 1 day, 15:36:47/12:33:53, loss=0.332261094962082, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.02(1.03), norm=1.5011866055718872, lr=0.015119943892900445
2023-11-16 05:14:04   INFO  epoch: 18/24, acc_iter=121716, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:07:47, time_cost(all): 1 day, 15:37:46/11:57:09, loss=0.332150152813905, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.93(1.03), norm=1.800957700427263, lr=0.015079852120159859
2023-11-16 05:15:03   INFO  epoch: 18/24, acc_iter=121766, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:03:17, time_cost(all): 1 day, 15:38:45/12:00:16, loss=0.332039210665729, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.3136932168902802, lr=0.015039760347419273
2023-11-16 05:16:02   INFO  epoch: 18/24, acc_iter=121816, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:06:03, time_cost(all): 1 day, 15:39:44/12:39:21, loss=0.331928268517552, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=1.8357214455296744, lr=0.014999668574678687
2023-11-16 05:17:01   INFO  epoch: 18/24, acc_iter=121866, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:05:47, time_cost(all): 1 day, 15:40:43/12:23:51, loss=0.331817326369375, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=2.9297494632299093, lr=0.014959576801938088
2023-11-16 05:18:00   INFO  epoch: 18/24, acc_iter=121916, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:28, time_cost(all): 1 day, 15:41:42/12:16:13, loss=0.331706384221198, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=2.4626719041294334, lr=0.014919485029197516
2023-11-16 05:18:59   INFO  epoch: 18/24, acc_iter=121966, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:41, time_cost(all): 1 day, 15:42:41/11:30:00, loss=0.331595442073022, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=4.045968810530273, lr=0.01487939325645693
2023-11-16 05:19:58   INFO  epoch: 18/24, acc_iter=122016, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/0:58:56, time_cost(all): 1 day, 15:43:40/12:03:27, loss=0.331484499924845, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=3.456396195801922, lr=0.01483930148371633
2023-11-16 05:20:57   INFO  epoch: 18/24, acc_iter=122066, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:00:37, time_cost(all): 1 day, 15:44:39/12:28:53, loss=0.331373557776668, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=2.358044920208011, lr=0.014799209710975744
2023-11-16 05:21:56   INFO  epoch: 18/24, acc_iter=122116, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:56:55, time_cost(all): 1 day, 15:45:38/11:27:39, loss=0.331262615628491, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=1.0230457979920782, lr=0.014759117938235158
2023-11-16 05:22:55   INFO  epoch: 18/24, acc_iter=122166, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:57:32, time_cost(all): 1 day, 15:46:37/12:33:10, loss=0.331151673480315, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=0.9736357633599411, lr=0.014719026165494573
2023-11-16 05:23:54   INFO  epoch: 18/24, acc_iter=122216, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:54:57, time_cost(all): 1 day, 15:47:36/11:56:45, loss=0.331040731332138, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=4.352791945873722, lr=0.014678934392753973
2023-11-16 05:24:53   INFO  epoch: 18/24, acc_iter=122266, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:57:45, time_cost(all): 1 day, 15:48:35/11:36:49, loss=0.330929789183961, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.11(1.03), norm=4.914166678353789, lr=0.014638842620013387
2023-11-16 05:25:51   INFO  epoch: 18/24, acc_iter=122316, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:53:23, time_cost(all): 1 day, 15:49:33/12:18:03, loss=0.330818847035784, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=0.6722403504173344, lr=0.014598750847272801
2023-11-16 05:26:50   INFO  epoch: 18/24, acc_iter=122366, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:55:33, time_cost(all): 1 day, 15:50:32/11:54:33, loss=0.330707904887608, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=4.275524271231695, lr=0.014558659074532215
2023-11-16 05:27:49   INFO  epoch: 18/24, acc_iter=122416, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:52:53, time_cost(all): 1 day, 15:51:31/11:58:04, loss=0.330596962739431, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=2.9608256264265282, lr=0.01451856730179163
2023-11-16 05:28:48   INFO  epoch: 18/24, acc_iter=122466, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:39, time_cost(all): 1 day, 15:52:30/12:11:15, loss=0.330486020591254, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=0.918816038279169, lr=0.014478475529051044
2023-11-16 05:29:47   INFO  epoch: 18/24, acc_iter=122516, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:02, time_cost(all): 1 day, 15:53:29/12:03:54, loss=0.330375078443077, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.89(1.03), norm=4.2219675592868455, lr=0.014438383756310458
2023-11-16 05:30:46   INFO  epoch: 18/24, acc_iter=122566, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:52:06, time_cost(all): 1 day, 15:54:28/11:43:03, loss=0.330264136294901, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=2.9787621948154586, lr=0.014398291983569858
2023-11-16 05:31:45   INFO  epoch: 18/24, acc_iter=122616, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:51:46, time_cost(all): 1 day, 15:55:27/11:52:30, loss=0.330153194146724, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=2.940676540876135, lr=0.014358200210829272
2023-11-16 05:32:44   INFO  epoch: 18/24, acc_iter=122666, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:49:28, time_cost(all): 1 day, 15:56:26/11:21:17, loss=0.330042251998547, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=0.6314449398394477, lr=0.014318108438088686
2023-11-16 05:33:43   INFO  epoch: 18/24, acc_iter=122716, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:46:41, time_cost(all): 1 day, 15:57:25/11:13:19, loss=0.32993130985037, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.88(1.03), norm=4.129522326708763, lr=0.0142780166653481
2023-11-16 05:34:42   INFO  epoch: 18/24, acc_iter=122766, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:47:17, time_cost(all): 1 day, 15:58:24/11:27:24, loss=0.329820367702194, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.1452547414988343, lr=0.0142379248926075
2023-11-16 05:35:41   INFO  epoch: 18/24, acc_iter=122816, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:16, time_cost(all): 1 day, 15:59:23/11:36:26, loss=0.329709425554017, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=2.507468122640077, lr=0.014197833119866929
2023-11-16 05:36:40   INFO  epoch: 18/24, acc_iter=122866, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:50, time_cost(all): 1 day, 16:00:22/12:17:06, loss=0.32959848340584, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.765039315029709, lr=0.014157741347126343
2023-11-16 05:37:39   INFO  epoch: 18/24, acc_iter=122916, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:46, time_cost(all): 1 day, 16:01:21/11:56:44, loss=0.329487541257663, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=3.0337066163101047, lr=0.014117649574385743
2023-11-16 05:38:38   INFO  epoch: 18/24, acc_iter=122966, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:44:35, time_cost(all): 1 day, 16:02:20/12:08:41, loss=0.329376599109487, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.159585167326239, lr=0.014077557801645157
2023-11-16 05:39:36   INFO  epoch: 18/24, acc_iter=123016, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:40:37, time_cost(all): 1 day, 16:03:18/12:15:11, loss=0.32926565696131, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=0.7527650877181515, lr=0.014037466028904572
2023-11-16 05:40:35   INFO  epoch: 18/24, acc_iter=123066, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:40:59, time_cost(all): 1 day, 16:04:17/11:26:01, loss=0.329154714813133, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=1.182440273663768, lr=0.013997374256163986
2023-11-16 05:41:34   INFO  epoch: 18/24, acc_iter=123116, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:39:44, time_cost(all): 1 day, 16:05:16/11:55:17, loss=0.329043772664956, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.8278124525022106, lr=0.013957282483423386
2023-11-16 05:42:33   INFO  epoch: 18/24, acc_iter=123166, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:37:59, time_cost(all): 1 day, 16:06:15/12:08:56, loss=0.32893283051678, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=1.7366598413399799, lr=0.0139171907106828
2023-11-16 05:43:32   INFO  epoch: 18/24, acc_iter=123216, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:38:54, time_cost(all): 1 day, 16:07:14/11:43:22, loss=0.328821888368603, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.03(1.03), norm=1.2961608293026021, lr=0.013877098937942214
2023-11-16 05:44:31   INFO  epoch: 18/24, acc_iter=123266, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:49, time_cost(all): 1 day, 16:08:13/11:05:23, loss=0.328710946220426, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.95(1.03), norm=1.9026270507370264, lr=0.013837007165201629
2023-11-16 05:45:30   INFO  epoch: 18/24, acc_iter=123316, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:16, time_cost(all): 1 day, 16:09:12/11:08:29, loss=0.328600004072249, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=0.9223829924924267, lr=0.013796915392461043
2023-11-16 05:46:29   INFO  epoch: 18/24, acc_iter=123366, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:07, time_cost(all): 1 day, 16:10:11/11:53:46, loss=0.328489061924073, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=1.384297997937869, lr=0.013756823619720457
2023-11-16 05:47:28   INFO  epoch: 18/24, acc_iter=123416, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:36, time_cost(all): 1 day, 16:11:10/11:50:12, loss=0.328378119775896, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=3.39104501547176, lr=0.013716731846979871
2023-11-16 05:48:27   INFO  epoch: 18/24, acc_iter=123466, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:03, time_cost(all): 1 day, 16:12:09/12:00:42, loss=0.328267177627719, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.522878159586198, lr=0.013676640074239271
2023-11-16 05:49:26   INFO  epoch: 18/24, acc_iter=123516, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:30:44, time_cost(all): 1 day, 16:13:08/11:05:19, loss=0.328156235479542, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.7656792721540464, lr=0.013636548301498685
2023-11-16 05:50:25   INFO  epoch: 18/24, acc_iter=123566, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:30:05, time_cost(all): 1 day, 16:14:07/11:03:00, loss=0.328045293331366, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=2.5557460117039206, lr=0.0135964565287581
2023-11-16 05:51:24   INFO  epoch: 18/24, acc_iter=123616, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:31:41, time_cost(all): 1 day, 16:15:06/11:27:11, loss=0.327934351183189, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=3.6907681246461235, lr=0.013556364756017514
2023-11-16 05:52:23   INFO  epoch: 18/24, acc_iter=123666, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:08, time_cost(all): 1 day, 16:16:05/12:00:44, loss=0.327823409035012, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=4.344572630506013, lr=0.013516272983276928
2023-11-16 05:53:21   INFO  epoch: 18/24, acc_iter=123716, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:31, time_cost(all): 1 day, 16:17:03/11:53:39, loss=0.327712466886835, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=0.8546969159805682, lr=0.013476181210536342
2023-11-16 05:54:20   INFO  epoch: 18/24, acc_iter=123766, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:40, time_cost(all): 1 day, 16:18:02/11:17:06, loss=0.327601524738659, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.92(1.03), norm=1.4554329943088815, lr=0.013436089437795756
2023-11-16 05:55:19   INFO  epoch: 18/24, acc_iter=123816, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:49, time_cost(all): 1 day, 16:19:01/11:52:08, loss=0.327490582590482, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=2.4045624486358763, lr=0.01339599766505517
2023-11-16 05:56:18   INFO  epoch: 18/24, acc_iter=123866, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:58, time_cost(all): 1 day, 16:20:00/11:37:02, loss=0.327379640442305, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.072016291536109, lr=0.01335590589231457
2023-11-16 05:57:17   INFO  epoch: 18/24, acc_iter=123916, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:23:42, time_cost(all): 1 day, 16:20:59/11:54:37, loss=0.327268698294128, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=1.089071749885797, lr=0.013315814119573985
2023-11-16 05:58:16   INFO  epoch: 18/24, acc_iter=123966, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:36, time_cost(all): 1 day, 16:21:58/11:52:21, loss=0.327157756145952, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=2.612819653586307, lr=0.013275722346833399
2023-11-16 05:59:15   INFO  epoch: 18/24, acc_iter=124016, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:55, time_cost(all): 1 day, 16:22:57/11:30:15, loss=0.327046813997775, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=0.730416604816259, lr=0.013235630574092813
2023-11-16 06:00:14   INFO  epoch: 18/24, acc_iter=124066, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:41, time_cost(all): 1 day, 16:23:56/11:08:39, loss=0.326935871849598, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=3.2261897440881215, lr=0.013195538801352213
2023-11-16 06:01:13   INFO  epoch: 18/24, acc_iter=124116, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:22, time_cost(all): 1 day, 16:24:55/11:34:40, loss=0.326824929701421, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=2.6445291525637704, lr=0.013155447028611628
2023-11-16 06:02:12   INFO  epoch: 18/24, acc_iter=124166, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:10, time_cost(all): 1 day, 16:25:54/11:45:10, loss=0.326713987553245, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=4.374700435120833, lr=0.013115355255871056
2023-11-16 06:03:11   INFO  epoch: 18/24, acc_iter=124216, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:29, time_cost(all): 1 day, 16:26:53/10:50:06, loss=0.326603045405068, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.3471766034616834, lr=0.013075263483130456
2023-11-16 06:04:10   INFO  epoch: 18/24, acc_iter=124266, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:24, time_cost(all): 1 day, 16:27:52/11:34:18, loss=0.326492103256891, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.0(1.03), norm=2.9989679139806964, lr=0.01303517171038987
2023-11-16 06:05:09   INFO  epoch: 18/24, acc_iter=124316, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:48, time_cost(all): 1 day, 16:28:51/11:22:41, loss=0.326381161108714, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=4.425002799286798, lr=0.012995079937649284
2023-11-16 06:06:08   INFO  epoch: 18/24, acc_iter=124366, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:53, time_cost(all): 1 day, 16:29:50/10:54:05, loss=0.326270218960538, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=3.134977647838519, lr=0.012954988164908698
2023-11-16 06:07:06   INFO  epoch: 18/24, acc_iter=124416, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:05, time_cost(all): 1 day, 16:30:48/11:42:10, loss=0.326159276812361, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=1.678076390011599, lr=0.012914896392168099
2023-11-16 06:08:05   INFO  epoch: 18/24, acc_iter=124466, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:50, time_cost(all): 1 day, 16:31:47/11:45:30, loss=0.326048334664184, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.09(1.03), norm=1.1476328452061653, lr=0.012874804619427513
2023-11-16 06:09:04   INFO  epoch: 18/24, acc_iter=124516, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:12, time_cost(all): 1 day, 16:32:46/11:00:02, loss=0.325937392516007, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=2.2588381277765266, lr=0.012834712846686927
2023-11-16 06:10:03   INFO  epoch: 18/24, acc_iter=124566, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:26, time_cost(all): 1 day, 16:33:45/11:17:56, loss=0.325826450367831, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=2.190962602173922, lr=0.012794621073946341
2023-11-16 06:11:02   INFO  epoch: 18/24, acc_iter=124616, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:03, time_cost(all): 1 day, 16:34:44/11:23:17, loss=0.325715508219654, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=1.8849960568145496, lr=0.012754529301205755
2023-11-16 06:12:01   INFO  epoch: 18/24, acc_iter=124666, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:42, time_cost(all): 1 day, 16:35:43/11:42:13, loss=0.325604566071477, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=3.105560107194086, lr=0.01271443752846517
2023-11-16 06:13:00   INFO  epoch: 18/24, acc_iter=124716, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:34, time_cost(all): 1 day, 16:36:42/10:50:27, loss=0.3254936239233, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.597030263390783, lr=0.012674345755724584
2023-11-16 06:13:59   INFO  epoch: 18/24, acc_iter=124766, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:41, time_cost(all): 1 day, 16:37:41/11:10:20, loss=0.325382681775124, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.12(1.03), norm=2.9147291459296065, lr=0.012634253982983984
2023-11-16 06:14:58   INFO  epoch: 18/24, acc_iter=124816, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:47, time_cost(all): 1 day, 16:38:40/10:50:00, loss=0.325271739626947, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.07(1.03), norm=1.7170761361904399, lr=0.012594162210243398
2023-11-16 06:15:57   INFO  epoch: 18/24, acc_iter=124866, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:36, time_cost(all): 1 day, 16:39:39/10:37:05, loss=0.32516079747877, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=4.337133168588906, lr=0.012554070437502812
2023-11-16 06:16:56   INFO  epoch: 18/24, acc_iter=124916, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:43, time_cost(all): 1 day, 16:40:38/10:31:53, loss=0.325049855330593, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=4.7819377746402525, lr=0.012513978664762226
2023-11-16 06:17:55   INFO  epoch: 18/24, acc_iter=124966, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:48, time_cost(all): 1 day, 16:41:37/11:31:01, loss=0.324938913182417, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=0.9292427871877313, lr=0.012473886892021627
2023-11-16 06:18:54   INFO  epoch: 18/24, acc_iter=125016, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:43, time_cost(all): 1 day, 16:42:36/11:19:58, loss=0.32482797103424, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.496882440970424, lr=0.01243379511928104
2023-11-16 06:19:53   INFO  epoch: 18/24, acc_iter=125066, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:41, time_cost(all): 1 day, 16:43:35/11:22:02, loss=0.324717028886063, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=2.5189287164520797, lr=0.012393703346540469
2023-11-16 06:20:52   INFO  epoch: 18/24, acc_iter=125116, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:45, time_cost(all): 1 day, 16:44:34/11:18:21, loss=0.324606086737886, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=1.4300840764521627, lr=0.01235361157379987
2023-11-16 06:21:50   INFO  epoch: 19/24, acc_iter=125203, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:05:24, time_cost(all): 1 day, 16:45:32/10:53:25, loss=0.324413047400059, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=2.0731821598866005, lr=0.012283851889231256
2023-11-16 06:22:49   INFO  epoch: 19/24, acc_iter=125253, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:03, time_cost(all): 1 day, 16:46:31/11:03:38, loss=0.324302105251882, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.12608941775877, lr=0.012243760116490657
2023-11-16 06:23:48   INFO  epoch: 19/24, acc_iter=125303, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:11:44, time_cost(all): 1 day, 16:47:30/10:31:51, loss=0.324191163103705, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=1.8332512390883697, lr=0.012203668343750071
2023-11-16 06:24:47   INFO  epoch: 19/24, acc_iter=125353, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:08:10, time_cost(all): 1 day, 16:48:29/10:40:49, loss=0.324080220955528, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=3.5409801362979563, lr=0.012163576571009485
2023-11-16 06:25:46   INFO  epoch: 19/24, acc_iter=125403, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:04:53, time_cost(all): 1 day, 16:49:28/11:18:14, loss=0.323969278807352, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=2.9173758877445657, lr=0.0121234847982689
2023-11-16 06:26:45   INFO  epoch: 19/24, acc_iter=125453, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:02:31, time_cost(all): 1 day, 16:50:27/11:05:22, loss=0.323858336659175, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=1.2888134060979746, lr=0.0120833930255283
2023-11-16 06:27:44   INFO  epoch: 19/24, acc_iter=125503, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:01:39, time_cost(all): 1 day, 16:51:26/10:30:20, loss=0.323747394510998, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=4.546184078897391, lr=0.012043301252787714
2023-11-16 06:28:43   INFO  epoch: 19/24, acc_iter=125553, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:56:42, time_cost(all): 1 day, 16:52:25/11:11:38, loss=0.323636452362821, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=1.4251476668127534, lr=0.012003209480047142
2023-11-16 06:29:42   INFO  epoch: 19/24, acc_iter=125603, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:03:56, time_cost(all): 1 day, 16:53:24/11:00:12, loss=0.323525510214645, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=1.2543993246759635, lr=0.011963117707306542
2023-11-16 06:30:41   INFO  epoch: 19/24, acc_iter=125653, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:01:16, time_cost(all): 1 day, 16:54:23/10:20:24, loss=0.323414568066468, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.216305354987858, lr=0.011923025934565956
2023-11-16 06:31:40   INFO  epoch: 19/24, acc_iter=125703, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:53:32, time_cost(all): 1 day, 16:55:22/10:20:12, loss=0.323303625918291, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=4.979765115597974, lr=0.01188293416182537
2023-11-16 06:32:39   INFO  epoch: 19/24, acc_iter=125753, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:00:37, time_cost(all): 1 day, 16:56:21/11:14:31, loss=0.323192683770115, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=1.2009678731680862, lr=0.011842842389084784
2023-11-16 06:33:38   INFO  epoch: 19/24, acc_iter=125803, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:54:52, time_cost(all): 1 day, 16:57:20/10:29:56, loss=0.323081741621938, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=4.073271501715587, lr=0.011802750616344185
2023-11-16 06:34:37   INFO  epoch: 19/24, acc_iter=125853, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:58:10, time_cost(all): 1 day, 16:58:19/10:59:34, loss=0.322970799473761, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.464577665260204, lr=0.011762658843603599
2023-11-16 06:35:35   INFO  epoch: 19/24, acc_iter=125903, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:57:31, time_cost(all): 1 day, 16:59:17/10:55:58, loss=0.322859857325584, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=1.290750154956485, lr=0.011722567070863013
2023-11-16 06:36:34   INFO  epoch: 19/24, acc_iter=125953, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:56:53, time_cost(all): 1 day, 17:00:16/10:28:02, loss=0.322748915177408, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.930365811175847, lr=0.011682475298122427
2023-11-16 06:37:33   INFO  epoch: 19/24, acc_iter=126003, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:49:12, time_cost(all): 1 day, 17:01:15/10:38:51, loss=0.322637973029231, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=2.8236292239242435, lr=0.011642383525381841
2023-11-16 06:38:32   INFO  epoch: 19/24, acc_iter=126053, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:51:14, time_cost(all): 1 day, 17:02:14/10:24:10, loss=0.322527030881054, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=1.286649430220692, lr=0.011602291752641256
2023-11-16 06:39:31   INFO  epoch: 19/24, acc_iter=126103, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:47:24, time_cost(all): 1 day, 17:03:13/10:23:16, loss=0.322416088732877, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=4.429030831842015, lr=0.01156219997990067
2023-11-16 06:40:30   INFO  epoch: 19/24, acc_iter=126153, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:44:47, time_cost(all): 1 day, 17:04:12/10:29:22, loss=0.3223051465847, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.6431807692811664, lr=0.01152210820716007
2023-11-16 06:41:29   INFO  epoch: 19/24, acc_iter=126203, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:44:06, time_cost(all): 1 day, 17:05:11/10:35:26, loss=0.322194204436524, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=1.154249610966433, lr=0.011482016434419484
2023-11-16 06:42:28   INFO  epoch: 19/24, acc_iter=126253, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:50:07, time_cost(all): 1 day, 17:06:10/10:30:30, loss=0.322083262288347, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=1.3902393951847254, lr=0.011441924661678898
2023-11-16 06:43:27   INFO  epoch: 19/24, acc_iter=126303, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:44:53, time_cost(all): 1 day, 17:07:09/10:33:36, loss=0.32197232014017, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=2.9796128939387025, lr=0.011401832888938312
2023-11-16 06:44:26   INFO  epoch: 19/24, acc_iter=126353, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:50:49, time_cost(all): 1 day, 17:08:08/10:35:36, loss=0.321861377991993, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=4.915197895977797, lr=0.011361741116197713
2023-11-16 06:45:25   INFO  epoch: 19/24, acc_iter=126403, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:47:25, time_cost(all): 1 day, 17:09:07/10:21:12, loss=0.321750435843817, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=4.458495347555054, lr=0.011321649343457127
2023-11-16 06:46:24   INFO  epoch: 19/24, acc_iter=126453, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:45:56, time_cost(all): 1 day, 17:10:06/10:16:58, loss=0.32163949369564, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=1.1891985937390506, lr=0.011281557570716555
2023-11-16 06:47:23   INFO  epoch: 19/24, acc_iter=126503, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:38:50, time_cost(all): 1 day, 17:11:05/10:54:31, loss=0.321528551547463, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=3.987696598367659, lr=0.011241465797975955
2023-11-16 06:48:22   INFO  epoch: 19/24, acc_iter=126553, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:41:08, time_cost(all): 1 day, 17:12:04/10:38:33, loss=0.321417609399286, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=0.7556374044812552, lr=0.01120137402523537
2023-11-16 06:49:20   INFO  epoch: 19/24, acc_iter=126603, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:45:32, time_cost(all): 1 day, 17:13:02/10:50:11, loss=0.32130666725111, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=1.9861624495148789, lr=0.011161282252494784
2023-11-16 06:50:19   INFO  epoch: 19/24, acc_iter=126653, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:37:35, time_cost(all): 1 day, 17:14:01/10:31:51, loss=0.321195725102933, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.2982982482446195, lr=0.011121190479754198
2023-11-16 06:51:18   INFO  epoch: 19/24, acc_iter=126703, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:35:56, time_cost(all): 1 day, 17:15:00/10:32:02, loss=0.321084782954756, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.1319823938848754, lr=0.011081098707013612
2023-11-16 06:52:17   INFO  epoch: 19/24, acc_iter=126753, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:41:31, time_cost(all): 1 day, 17:15:59/10:27:39, loss=0.320973840806579, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.06(1.03), norm=3.5443625256700613, lr=0.011041006934273012
2023-11-16 06:53:16   INFO  epoch: 19/24, acc_iter=126803, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:54, time_cost(all): 1 day, 17:16:58/10:04:33, loss=0.320862898658403, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=3.3863360636390514, lr=0.011000915161532426
2023-11-16 06:54:15   INFO  epoch: 19/24, acc_iter=126853, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:40:33, time_cost(all): 1 day, 17:17:57/10:58:12, loss=0.320751956510226, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=3.5664333593317044, lr=0.010960823388791854
2023-11-16 06:55:14   INFO  epoch: 19/24, acc_iter=126903, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:38:14, time_cost(all): 1 day, 17:18:56/10:13:03, loss=0.320641014362049, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=2.919364656707092, lr=0.010920731616051255
2023-11-16 06:56:13   INFO  epoch: 19/24, acc_iter=126953, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:34:28, time_cost(all): 1 day, 17:19:55/10:51:06, loss=0.320530072213872, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=2.2777953801453523, lr=0.010880639843310669
2023-11-16 06:57:12   INFO  epoch: 19/24, acc_iter=127003, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:34:17, time_cost(all): 1 day, 17:20:54/10:27:32, loss=0.320419130065696, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=3.137930762381576, lr=0.010840548070570083
2023-11-16 06:58:11   INFO  epoch: 19/24, acc_iter=127053, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:28:31, time_cost(all): 1 day, 17:21:53/10:09:45, loss=0.320308187917519, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=1.20286394586564, lr=0.010800456297829497
2023-11-16 06:59:10   INFO  epoch: 19/24, acc_iter=127103, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:30:00, time_cost(all): 1 day, 17:22:52/10:27:06, loss=0.320197245769342, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=2.799109217457642, lr=0.010760364525088897
2023-11-16 07:00:09   INFO  epoch: 19/24, acc_iter=127153, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:27:33, time_cost(all): 1 day, 17:23:51/10:13:44, loss=0.320086303621165, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.1529260004403876, lr=0.010720272752348312
2023-11-16 07:01:08   INFO  epoch: 19/24, acc_iter=127203, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:31:22, time_cost(all): 1 day, 17:24:50/10:00:14, loss=0.319975361472989, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.268190147216747, lr=0.010680180979607726
2023-11-16 07:02:07   INFO  epoch: 19/24, acc_iter=127253, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:31:48, time_cost(all): 1 day, 17:25:49/10:28:57, loss=0.319864419324812, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=2.721777529200767, lr=0.01064008920686714
2023-11-16 07:03:05   INFO  epoch: 19/24, acc_iter=127303, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:22:50, time_cost(all): 1 day, 17:26:47/10:15:09, loss=0.319753477176635, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.1(1.03), norm=3.498946942722495, lr=0.010599997434126554
2023-11-16 07:04:04   INFO  epoch: 19/24, acc_iter=127353, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:26:27, time_cost(all): 1 day, 17:27:46/9:49:50, loss=0.319642535028458, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=0.8722004247214493, lr=0.010559905661385968
2023-11-16 07:05:03   INFO  epoch: 19/24, acc_iter=127403, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:22:51, time_cost(all): 1 day, 17:28:45/9:51:47, loss=0.319531592880282, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=1.229599596501453, lr=0.010519813888645382
2023-11-16 07:06:02   INFO  epoch: 19/24, acc_iter=127453, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:25:35, time_cost(all): 1 day, 17:29:44/10:20:42, loss=0.319420650732105, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=4.812687904110619, lr=0.010479722115904783
2023-11-16 07:07:01   INFO  epoch: 19/24, acc_iter=127503, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:23:42, time_cost(all): 1 day, 17:30:43/10:34:00, loss=0.319309708583928, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=4.806747069545271, lr=0.010439630343164197
2023-11-16 07:08:00   INFO  epoch: 19/24, acc_iter=127553, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:20:03, time_cost(all): 1 day, 17:31:42/10:16:19, loss=0.319198766435751, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.87(1.03), norm=0.9623537724592464, lr=0.010399538570423611
2023-11-16 07:08:59   INFO  epoch: 19/24, acc_iter=127603, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:51, time_cost(all): 1 day, 17:32:41/9:43:20, loss=0.319087824287575, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=0.8640714192803942, lr=0.010359446797683025
2023-11-16 07:09:58   INFO  epoch: 19/24, acc_iter=127653, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:24:15, time_cost(all): 1 day, 17:33:40/10:02:07, loss=0.318976882139398, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=0.5417671916254174, lr=0.010319355024942425
2023-11-16 07:10:57   INFO  epoch: 19/24, acc_iter=127703, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:15:26, time_cost(all): 1 day, 17:34:39/10:33:54, loss=0.318865939991221, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=0.7294729347400934, lr=0.01027926325220184
2023-11-16 07:11:56   INFO  epoch: 19/24, acc_iter=127753, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:15:06, time_cost(all): 1 day, 17:35:38/9:48:09, loss=0.318754997843044, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=3.21168539657544, lr=0.010239171479461268
2023-11-16 07:12:55   INFO  epoch: 19/24, acc_iter=127803, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:14:48, time_cost(all): 1 day, 17:36:37/10:34:51, loss=0.318644055694868, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.17(1.03), norm=0.6988272994713569, lr=0.010199079706720668
2023-11-16 07:13:54   INFO  epoch: 19/24, acc_iter=127853, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:14:57, time_cost(all): 1 day, 17:37:36/10:11:44, loss=0.318533113546691, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.01(1.03), norm=4.32909629424423, lr=0.010158987933980082
2023-11-16 07:14:53   INFO  epoch: 19/24, acc_iter=127903, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:03, time_cost(all): 1 day, 17:38:35/10:21:38, loss=0.318422171398514, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=4.7053251521339945, lr=0.010118896161239496
2023-11-16 07:15:52   INFO  epoch: 19/24, acc_iter=127953, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:16:19, time_cost(all): 1 day, 17:39:34/9:53:25, loss=0.318311229250337, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.99(1.03), norm=2.2098904546630465, lr=0.01007880438849891
2023-11-16 07:16:50   INFO  epoch: 19/24, acc_iter=128003, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:15:37, time_cost(all): 1 day, 17:40:32/9:52:06, loss=0.318200287102161, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.07(1.03), norm=1.7103746278123506, lr=0.01003871261575831
2023-11-16 07:17:49   INFO  epoch: 19/24, acc_iter=128053, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:11:05, time_cost(all): 1 day, 17:41:31/9:37:52, loss=0.318089344953984, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=1.558922155158512, lr=0.009999279998928376
2023-11-16 07:18:48   INFO  epoch: 19/24, acc_iter=128103, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:11, time_cost(all): 1 day, 17:42:30/10:05:08, loss=0.317978402805807, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=4.457898504732851, lr=0.009978349735218217
2023-11-16 07:19:47   INFO  epoch: 19/24, acc_iter=128153, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:07:09, time_cost(all): 1 day, 17:43:29/10:02:59, loss=0.31786746065763, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=1.6684401902499426, lr=0.009957419471508057
2023-11-16 07:20:46   INFO  epoch: 19/24, acc_iter=128203, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:52, time_cost(all): 1 day, 17:44:28/9:58:17, loss=0.317756518509454, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=3.6438566896266624, lr=0.009936489207797897
2023-11-16 07:21:45   INFO  epoch: 19/24, acc_iter=128253, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:09:05, time_cost(all): 1 day, 17:45:27/10:17:24, loss=0.317645576361277, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.3664162809531386, lr=0.009915558944087736
2023-11-16 07:22:44   INFO  epoch: 19/24, acc_iter=128303, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:06:19, time_cost(all): 1 day, 17:46:26/10:05:15, loss=0.3175346342131, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=2.685547634382649, lr=0.009894628680377576
2023-11-16 07:23:43   INFO  epoch: 19/24, acc_iter=128353, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:08:11, time_cost(all): 1 day, 17:47:25/9:28:37, loss=0.317423692064923, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=3.738795646557496, lr=0.009873698416667416
2023-11-16 07:24:42   INFO  epoch: 19/24, acc_iter=128403, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:04:34, time_cost(all): 1 day, 17:48:24/10:16:01, loss=0.317312749916747, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=1.9490516886960605, lr=0.009852768152957256
2023-11-16 07:25:41   INFO  epoch: 19/24, acc_iter=128453, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:03:38, time_cost(all): 1 day, 17:49:23/9:56:39, loss=0.31720180776857, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=2.222533534287762, lr=0.009831837889247097
2023-11-16 07:26:40   INFO  epoch: 19/24, acc_iter=128503, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:02:49, time_cost(all): 1 day, 17:50:22/9:54:52, loss=0.317090865620393, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.89325410018941, lr=0.009810907625536937
2023-11-16 07:27:39   INFO  epoch: 19/24, acc_iter=128553, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:03:52, time_cost(all): 1 day, 17:51:21/10:04:16, loss=0.316979923472216, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=4.57035160286109, lr=0.009789977361826777
2023-11-16 07:28:38   INFO  epoch: 19/24, acc_iter=128603, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:04:16, time_cost(all): 1 day, 17:52:20/9:29:00, loss=0.31686898132404, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=4.603790563439442, lr=0.009769047098116617
2023-11-16 07:29:37   INFO  epoch: 19/24, acc_iter=128653, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:59:56, time_cost(all): 1 day, 17:53:19/10:14:25, loss=0.316758039175863, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.2135563514293273, lr=0.009748116834406457
2023-11-16 07:30:35   INFO  epoch: 19/24, acc_iter=128703, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:16, time_cost(all): 1 day, 17:54:17/9:50:28, loss=0.316647097027686, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.8490666705291745, lr=0.009727186570696296
2023-11-16 07:31:34   INFO  epoch: 19/24, acc_iter=128753, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:58:02, time_cost(all): 1 day, 17:55:16/10:10:34, loss=0.316536154879509, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=3.0858121918208354, lr=0.009706256306986136
2023-11-16 07:32:33   INFO  epoch: 19/24, acc_iter=128803, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:56:43, time_cost(all): 1 day, 17:56:15/9:30:24, loss=0.316425212731333, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=2.668722350450997, lr=0.009685326043275978
2023-11-16 07:33:32   INFO  epoch: 19/24, acc_iter=128853, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:53, time_cost(all): 1 day, 17:57:14/9:30:17, loss=0.316314270583156, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=1.2659944668535812, lr=0.009664395779565817
2023-11-16 07:34:31   INFO  epoch: 19/24, acc_iter=128903, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:56:30, time_cost(all): 1 day, 17:58:13/10:09:16, loss=0.316203328434979, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=4.54033931877844, lr=0.009643465515855657
2023-11-16 07:35:30   INFO  epoch: 19/24, acc_iter=128953, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:52:02, time_cost(all): 1 day, 17:59:12/9:36:34, loss=0.316092386286802, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=1.3360380684591986, lr=0.009622535252145497
2023-11-16 07:36:29   INFO  epoch: 19/24, acc_iter=129003, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:52:05, time_cost(all): 1 day, 18:00:11/9:33:40, loss=0.315981444138626, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=4.012529972995521, lr=0.009601604988435337
2023-11-16 07:37:28   INFO  epoch: 19/24, acc_iter=129053, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:16, time_cost(all): 1 day, 18:01:10/10:07:49, loss=0.315870501990449, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=1.7496956820500236, lr=0.009580674724725177
2023-11-16 07:38:27   INFO  epoch: 19/24, acc_iter=129103, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:54:19, time_cost(all): 1 day, 18:02:09/9:48:39, loss=0.315759559842272, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.556159349809508, lr=0.009559744461015016
2023-11-16 07:39:26   INFO  epoch: 19/24, acc_iter=129153, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:48:30, time_cost(all): 1 day, 18:03:08/9:38:07, loss=0.315648617694095, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=3.4123918692207864, lr=0.009538814197304858
2023-11-16 07:40:25   INFO  epoch: 19/24, acc_iter=129203, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:49:00, time_cost(all): 1 day, 18:04:07/10:01:07, loss=0.315537675545919, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=0.9717148682197401, lr=0.009517883933594698
2023-11-16 07:41:24   INFO  epoch: 19/24, acc_iter=129253, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:46:54, time_cost(all): 1 day, 18:05:06/10:05:22, loss=0.315426733397742, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=2.5897436640324214, lr=0.009496953669884537
2023-11-16 07:42:23   INFO  epoch: 19/24, acc_iter=129303, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:20, time_cost(all): 1 day, 18:06:05/9:37:43, loss=0.315315791249565, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=3.4241221282072054, lr=0.009476023406174377
2023-11-16 07:43:22   INFO  epoch: 19/24, acc_iter=129353, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:46:42, time_cost(all): 1 day, 18:07:04/9:34:51, loss=0.315204849101388, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=0.5635709769763166, lr=0.009455093142464217
2023-11-16 07:44:20   INFO  epoch: 19/24, acc_iter=129403, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:48:04, time_cost(all): 1 day, 18:08:02/9:28:30, loss=0.315093906953212, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.22(1.03), norm=4.232401326636252, lr=0.009434162878754057
2023-11-16 07:45:19   INFO  epoch: 19/24, acc_iter=129453, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:26, time_cost(all): 1 day, 18:09:01/9:44:46, loss=0.314982964805035, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=4.832008582204261, lr=0.009413232615043897
2023-11-16 07:46:18   INFO  epoch: 19/24, acc_iter=129503, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:05, time_cost(all): 1 day, 18:10:00/9:25:33, loss=0.314872022656858, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.92(1.03), norm=1.9137445249531004, lr=0.009392302351333738
2023-11-16 07:47:17   INFO  epoch: 19/24, acc_iter=129553, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:45:01, time_cost(all): 1 day, 18:10:59/10:00:37, loss=0.314761080508681, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.06(1.03), norm=1.7829802379978232, lr=0.009371372087623578
2023-11-16 07:48:16   INFO  epoch: 19/24, acc_iter=129603, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:41:02, time_cost(all): 1 day, 18:11:58/9:34:43, loss=0.314650138360505, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=4.619203179482601, lr=0.009350441823913418
2023-11-16 07:49:15   INFO  epoch: 19/24, acc_iter=129653, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:40:33, time_cost(all): 1 day, 18:12:57/9:44:10, loss=0.314539196212328, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=1.273695761028376, lr=0.009329511560203257
2023-11-16 07:50:14   INFO  epoch: 19/24, acc_iter=129703, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:41:57, time_cost(all): 1 day, 18:13:56/9:38:58, loss=0.314428254064151, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.99(1.03), norm=4.3581564459579205, lr=0.009308581296493097
2023-11-16 07:51:13   INFO  epoch: 19/24, acc_iter=129753, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:37:25, time_cost(all): 1 day, 18:14:55/9:44:41, loss=0.314317311915974, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=1.9380536175102838, lr=0.009287651032782937
2023-11-16 07:52:12   INFO  epoch: 19/24, acc_iter=129803, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:27, time_cost(all): 1 day, 18:15:54/9:01:28, loss=0.314206369767798, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.35008668820988, lr=0.009266720769072777
2023-11-16 07:53:11   INFO  epoch: 19/24, acc_iter=129853, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:22, time_cost(all): 1 day, 18:16:53/9:15:21, loss=0.314095427619621, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=2.644293558103093, lr=0.009245790505362618
2023-11-16 07:54:10   INFO  epoch: 19/24, acc_iter=129903, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:09, time_cost(all): 1 day, 18:17:52/9:09:20, loss=0.313984485471444, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=4.819476071259046, lr=0.009224860241652458
2023-11-16 07:55:09   INFO  epoch: 19/24, acc_iter=129953, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:39, time_cost(all): 1 day, 18:18:51/9:41:23, loss=0.313873543323267, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=0.7219387132447728, lr=0.009203929977942298
2023-11-16 07:56:08   INFO  epoch: 19/24, acc_iter=130003, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:59, time_cost(all): 1 day, 18:19:50/9:27:36, loss=0.313762601175091, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.97(1.03), norm=4.758494316097451, lr=0.009182999714232138
2023-11-16 07:57:07   INFO  epoch: 19/24, acc_iter=130053, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:32:48, time_cost(all): 1 day, 18:20:49/9:42:55, loss=0.313651659026914, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.889493908032375, lr=0.009162069450521977
2023-11-16 07:58:05   INFO  epoch: 19/24, acc_iter=130103, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:30:51, time_cost(all): 1 day, 18:21:47/8:57:25, loss=0.313540716878737, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.9(1.03), norm=4.746415087717223, lr=0.009141139186811817
2023-11-16 07:59:04   INFO  epoch: 19/24, acc_iter=130153, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:03, time_cost(all): 1 day, 18:22:46/8:54:57, loss=0.31342977473056, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=1.6111097290411704, lr=0.009120208923101657
2023-11-16 08:00:03   INFO  epoch: 19/24, acc_iter=130203, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:29:24, time_cost(all): 1 day, 18:23:45/9:24:12, loss=0.313318832582384, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=3.0900420766332752, lr=0.009099278659391499
2023-11-16 08:01:02   INFO  epoch: 19/24, acc_iter=130253, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:24, time_cost(all): 1 day, 18:24:44/9:01:03, loss=0.313207890434207, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=1.6401749566319788, lr=0.009078348395681338
2023-11-16 08:02:01   INFO  epoch: 19/24, acc_iter=130303, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:29:13, time_cost(all): 1 day, 18:25:43/9:04:24, loss=0.31309694828603, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=3.415847174985894, lr=0.009057418131971178
2023-11-16 08:03:00   INFO  epoch: 19/24, acc_iter=130353, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:27:53, time_cost(all): 1 day, 18:26:42/9:01:12, loss=0.312986006137853, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.89(1.03), norm=4.553232026290048, lr=0.009036487868261018
2023-11-16 08:03:59   INFO  epoch: 19/24, acc_iter=130403, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:52, time_cost(all): 1 day, 18:27:41/9:35:21, loss=0.312875063989677, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=0.993558756271981, lr=0.009015557604550858
2023-11-16 08:04:58   INFO  epoch: 19/24, acc_iter=130453, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:10, time_cost(all): 1 day, 18:28:40/9:17:25, loss=0.3127641218415, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=2.4470209090720862, lr=0.008994627340840698
2023-11-16 08:05:57   INFO  epoch: 19/24, acc_iter=130503, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:25:08, time_cost(all): 1 day, 18:29:39/8:54:05, loss=0.312653179693323, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=1.512864544393226, lr=0.008973697077130537
2023-11-16 08:06:56   INFO  epoch: 19/24, acc_iter=130553, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:40, time_cost(all): 1 day, 18:30:38/9:18:04, loss=0.312542237545146, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.898089846107928, lr=0.008952766813420379
2023-11-16 08:07:55   INFO  epoch: 19/24, acc_iter=130603, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:13, time_cost(all): 1 day, 18:31:37/9:24:47, loss=0.31243129539697, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=4.434659767122391, lr=0.008931836549710219
2023-11-16 08:08:54   INFO  epoch: 19/24, acc_iter=130653, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:35, time_cost(all): 1 day, 18:32:36/9:08:19, loss=0.312320353248793, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.84(1.03), norm=1.127431672175077, lr=0.008910906286000058
2023-11-16 08:09:53   INFO  epoch: 19/24, acc_iter=130703, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:55, time_cost(all): 1 day, 18:33:35/9:30:29, loss=0.312209411100616, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=2.1720478498857845, lr=0.008889976022289898
2023-11-16 08:10:52   INFO  epoch: 19/24, acc_iter=130753, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:11, time_cost(all): 1 day, 18:34:34/9:16:18, loss=0.312098468952439, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=4.429924875531489, lr=0.008869045758579738
2023-11-16 08:11:50   INFO  epoch: 19/24, acc_iter=130803, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:17:42, time_cost(all): 1 day, 18:35:32/8:48:06, loss=0.311987526804263, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=4.983252298360845, lr=0.008848115494869578
2023-11-16 08:12:49   INFO  epoch: 19/24, acc_iter=130853, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:42, time_cost(all): 1 day, 18:36:31/9:14:17, loss=0.311876584656086, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.92(1.03), norm=4.103236058675678, lr=0.00882718523115942
2023-11-16 08:13:48   INFO  epoch: 19/24, acc_iter=130903, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:52, time_cost(all): 1 day, 18:37:30/9:06:49, loss=0.311765642507909, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=4.0555678527852885, lr=0.008806254967449259
2023-11-16 08:14:47   INFO  epoch: 19/24, acc_iter=130953, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:59, time_cost(all): 1 day, 18:38:29/8:50:52, loss=0.311654700359732, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=1.9361395424324062, lr=0.008785324703739099
2023-11-16 08:15:46   INFO  epoch: 19/24, acc_iter=131003, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:58, time_cost(all): 1 day, 18:39:28/8:53:14, loss=0.311543758211556, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=1.471511746203244, lr=0.008764394440028939
2023-11-16 08:16:45   INFO  epoch: 19/24, acc_iter=131053, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:02, time_cost(all): 1 day, 18:40:27/8:54:42, loss=0.311432816063379, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=3.1432770457288584, lr=0.008743464176318778
2023-11-16 08:17:44   INFO  epoch: 19/24, acc_iter=131103, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:34, time_cost(all): 1 day, 18:41:26/9:30:01, loss=0.311321873915202, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=4.648127921730949, lr=0.008722533912608618
2023-11-16 08:18:43   INFO  epoch: 19/24, acc_iter=131153, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:17, time_cost(all): 1 day, 18:42:25/8:47:45, loss=0.311210931767025, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=4.76757040954781, lr=0.008701603648898458
2023-11-16 08:19:42   INFO  epoch: 19/24, acc_iter=131203, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:21, time_cost(all): 1 day, 18:43:24/9:08:11, loss=0.311099989618849, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=4.152102263938274, lr=0.008680673385188298
2023-11-16 08:20:41   INFO  epoch: 19/24, acc_iter=131253, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:51, time_cost(all): 1 day, 18:44:23/8:54:26, loss=0.310989047470672, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=2.720732701624168, lr=0.00865974312147814
2023-11-16 08:21:40   INFO  epoch: 19/24, acc_iter=131303, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:32, time_cost(all): 1 day, 18:45:22/9:26:04, loss=0.310878105322495, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=0.956967202233671, lr=0.008638812857767979
2023-11-16 08:22:39   INFO  epoch: 19/24, acc_iter=131353, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:57, time_cost(all): 1 day, 18:46:21/8:38:37, loss=0.310767163174318, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=1.3866512389502401, lr=0.008617882594057819
2023-11-16 08:23:38   INFO  epoch: 19/24, acc_iter=131403, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:41, time_cost(all): 1 day, 18:47:20/9:19:05, loss=0.310656221026142, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.99(1.03), norm=4.818220343750132, lr=0.008596952330347659
2023-11-16 08:24:37   INFO  epoch: 19/24, acc_iter=131453, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:41, time_cost(all): 1 day, 18:48:19/8:58:24, loss=0.310545278877965, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.351261259226005, lr=0.008576022066637498
2023-11-16 08:25:35   INFO  epoch: 19/24, acc_iter=131503, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:49, time_cost(all): 1 day, 18:49:17/8:59:45, loss=0.310434336729788, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=2.6907788041322873, lr=0.008555091802927338
2023-11-16 08:26:34   INFO  epoch: 19/24, acc_iter=131553, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:32, time_cost(all): 1 day, 18:50:16/8:41:00, loss=0.310323394581611, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=1.1674919089562918, lr=0.00853416153921718
2023-11-16 08:27:33   INFO  epoch: 19/24, acc_iter=131603, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:45, time_cost(all): 1 day, 18:51:15/8:41:56, loss=0.310212452433435, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=4.115898037151764, lr=0.00851323127550702
2023-11-16 08:28:32   INFO  epoch: 19/24, acc_iter=131653, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:45, time_cost(all): 1 day, 18:52:14/8:37:24, loss=0.310101510285258, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.91(1.03), norm=1.6364215339388049, lr=0.00849230101179686
2023-11-16 08:29:31   INFO  epoch: 19/24, acc_iter=131703, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:42, time_cost(all): 1 day, 18:53:13/9:07:54, loss=0.309990568137081, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.86(1.03), norm=4.421336471524901, lr=0.008471370748086699
2023-11-16 08:30:30   INFO  epoch: 20/24, acc_iter=131790, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:12:15, time_cost(all): 1 day, 18:54:12/8:58:46, loss=0.309797528799254, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=4.262722241359906, lr=0.008434952089231021
2023-11-16 08:31:29   INFO  epoch: 20/24, acc_iter=131840, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:15, time_cost(all): 1 day, 18:55:11/9:05:07, loss=0.309686586651077, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.4869281272990325, lr=0.00841402182552086
2023-11-16 08:32:28   INFO  epoch: 20/24, acc_iter=131890, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:06:31, time_cost(all): 1 day, 18:56:10/8:34:46, loss=0.3095756445029, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.23(1.03), norm=3.2600888793823573, lr=0.0083930915618107
2023-11-16 08:33:27   INFO  epoch: 20/24, acc_iter=131940, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:04:47, time_cost(all): 1 day, 18:57:09/9:07:13, loss=0.309464702354723, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=3.585236291588287, lr=0.00837216129810054
2023-11-16 08:34:26   INFO  epoch: 20/24, acc_iter=131990, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:02:14, time_cost(all): 1 day, 18:58:08/8:48:19, loss=0.309353760206547, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.04(1.03), norm=3.0612940270799998, lr=0.00835123103439038
2023-11-16 08:35:25   INFO  epoch: 20/24, acc_iter=132040, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:58:26, time_cost(all): 1 day, 18:59:07/8:59:54, loss=0.30924281805837, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=2.885003624744465, lr=0.008330300770680222
2023-11-16 08:36:24   INFO  epoch: 20/24, acc_iter=132090, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:02:09, time_cost(all): 1 day, 19:00:06/8:56:16, loss=0.309131875910193, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.7993935181579026, lr=0.008309370506970061
2023-11-16 08:37:23   INFO  epoch: 20/24, acc_iter=132140, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:00:31, time_cost(all): 1 day, 19:01:05/8:48:07, loss=0.309020933762016, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.021685574692708, lr=0.008288440243259901
2023-11-16 08:38:22   INFO  epoch: 20/24, acc_iter=132190, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:02:40, time_cost(all): 1 day, 19:02:04/8:56:45, loss=0.308909991613839, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=2.3638623020396965, lr=0.008267509979549741
2023-11-16 08:39:20   INFO  epoch: 20/24, acc_iter=132240, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:00:29, time_cost(all): 1 day, 19:03:02/8:23:41, loss=0.308799049465663, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=2.8089048092339164, lr=0.00824657971583958
2023-11-16 08:40:19   INFO  epoch: 20/24, acc_iter=132290, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:59:14, time_cost(all): 1 day, 19:04:01/8:36:12, loss=0.308688107317486, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=2.6022288943375265, lr=0.00822564945212942
2023-11-16 08:41:18   INFO  epoch: 20/24, acc_iter=132340, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:42, time_cost(all): 1 day, 19:05:00/8:16:58, loss=0.308577165169309, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=3.9926956255102217, lr=0.00820471918841926
2023-11-16 08:42:17   INFO  epoch: 20/24, acc_iter=132390, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:55:18, time_cost(all): 1 day, 19:05:59/9:04:26, loss=0.308466223021132, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.1138743680025867, lr=0.008183788924709102
2023-11-16 08:43:16   INFO  epoch: 20/24, acc_iter=132440, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:55:14, time_cost(all): 1 day, 19:06:58/8:51:43, loss=0.308355280872956, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.4344639674023787, lr=0.008162858660998942
2023-11-16 08:44:15   INFO  epoch: 20/24, acc_iter=132490, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:50:25, time_cost(all): 1 day, 19:07:57/8:59:56, loss=0.308244338724779, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=0.9160409258084027, lr=0.008141928397288781
2023-11-16 08:45:14   INFO  epoch: 20/24, acc_iter=132540, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:49:20, time_cost(all): 1 day, 19:08:56/8:46:22, loss=0.308133396576602, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=3.9230434986530183, lr=0.008120998133578621
2023-11-16 08:46:13   INFO  epoch: 20/24, acc_iter=132590, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:55:54, time_cost(all): 1 day, 19:09:55/8:31:54, loss=0.308022454428425, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=0.5425454695128071, lr=0.008100067869868461
2023-11-16 08:47:12   INFO  epoch: 20/24, acc_iter=132640, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:54:42, time_cost(all): 1 day, 19:10:54/8:14:49, loss=0.307911512280249, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=2.7118061189645424, lr=0.0080791376061583
2023-11-16 08:48:11   INFO  epoch: 20/24, acc_iter=132690, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:52:40, time_cost(all): 1 day, 19:11:53/8:35:10, loss=0.307800570132072, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.3672883365897273, lr=0.00805820734244814
2023-11-16 08:49:10   INFO  epoch: 20/24, acc_iter=132740, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:48:59, time_cost(all): 1 day, 19:12:52/8:55:41, loss=0.307689627983895, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=2.602593017158273, lr=0.00803727707873798
2023-11-16 08:50:09   INFO  epoch: 20/24, acc_iter=132790, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:49:55, time_cost(all): 1 day, 19:13:51/8:30:45, loss=0.307578685835718, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=3.7980133056833236, lr=0.008016346815027822
2023-11-16 08:51:08   INFO  epoch: 20/24, acc_iter=132840, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:47:50, time_cost(all): 1 day, 19:14:50/8:54:42, loss=0.307467743687542, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.19(1.03), norm=2.661381478191679, lr=0.007995416551317662
2023-11-16 08:52:07   INFO  epoch: 20/24, acc_iter=132890, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:49:54, time_cost(all): 1 day, 19:15:49/8:12:46, loss=0.307356801539365, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=0.5034495116551565, lr=0.007974486287607501
2023-11-16 08:53:05   INFO  epoch: 20/24, acc_iter=132940, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:48:26, time_cost(all): 1 day, 19:16:47/8:05:47, loss=0.307245859391188, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=0.7934059862269973, lr=0.007953556023897341
2023-11-16 08:54:04   INFO  epoch: 20/24, acc_iter=132990, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:48:19, time_cost(all): 1 day, 19:17:46/8:04:25, loss=0.307134917243011, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=0.9246388960118492, lr=0.007932625760187181
2023-11-16 08:55:03   INFO  epoch: 20/24, acc_iter=133040, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:45:44, time_cost(all): 1 day, 19:18:45/8:39:28, loss=0.307023975094835, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=3.632454114768316, lr=0.007911695496477023
2023-11-16 08:56:02   INFO  epoch: 20/24, acc_iter=133090, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:42:52, time_cost(all): 1 day, 19:19:44/8:26:58, loss=0.306913032946658, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=4.9180114333976395, lr=0.007890765232766862
2023-11-16 08:57:01   INFO  epoch: 20/24, acc_iter=133140, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:45:07, time_cost(all): 1 day, 19:20:43/8:27:17, loss=0.306802090798481, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.04(1.03), norm=0.8969342527502034, lr=0.007869834969056702
2023-11-16 08:58:00   INFO  epoch: 20/24, acc_iter=133190, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:36:13, time_cost(all): 1 day, 19:21:42/8:20:25, loss=0.306691148650304, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=2.272673355753163, lr=0.007848904705346542
2023-11-16 08:58:59   INFO  epoch: 20/24, acc_iter=133240, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:44:08, time_cost(all): 1 day, 19:22:41/8:21:27, loss=0.306580206502128, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=0.7996546509203565, lr=0.007827974441636382
2023-11-16 08:59:58   INFO  epoch: 20/24, acc_iter=133290, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:41:32, time_cost(all): 1 day, 19:23:40/8:20:28, loss=0.306469264353951, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=0.7508451817453479, lr=0.007807044177926221
2023-11-16 09:00:57   INFO  epoch: 20/24, acc_iter=133340, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:42:31, time_cost(all): 1 day, 19:24:39/8:24:44, loss=0.306358322205774, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=1.389294257054427, lr=0.007786113914216061
2023-11-16 09:01:56   INFO  epoch: 20/24, acc_iter=133390, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:39:32, time_cost(all): 1 day, 19:25:38/8:09:03, loss=0.306247380057597, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=0.5407183281628023, lr=0.007765183650505902
2023-11-16 09:02:55   INFO  epoch: 20/24, acc_iter=133440, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:34:21, time_cost(all): 1 day, 19:26:37/8:23:28, loss=0.306136437909421, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.759517343903249, lr=0.007744253386795742
2023-11-16 09:03:54   INFO  epoch: 20/24, acc_iter=133490, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:33:26, time_cost(all): 1 day, 19:27:36/8:24:38, loss=0.306025495761244, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=3.6947717224298104, lr=0.007723323123085581
2023-11-16 09:04:53   INFO  epoch: 20/24, acc_iter=133540, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:34:29, time_cost(all): 1 day, 19:28:35/8:32:17, loss=0.305914553613067, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.4876303068929553, lr=0.007702392859375422
2023-11-16 09:05:52   INFO  epoch: 20/24, acc_iter=133590, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:30:37, time_cost(all): 1 day, 19:29:34/8:04:11, loss=0.30580361146489, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=4.105232975160444, lr=0.007681462595665262
2023-11-16 09:06:50   INFO  epoch: 20/24, acc_iter=133640, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:28:37, time_cost(all): 1 day, 19:30:32/8:38:01, loss=0.305692669316714, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=1.5913760323649795, lr=0.007660532331955102
2023-11-16 09:07:49   INFO  epoch: 20/24, acc_iter=133690, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:32:52, time_cost(all): 1 day, 19:31:31/8:31:16, loss=0.305581727168537, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=0.5003276138773977, lr=0.007639602068244941
2023-11-16 09:08:48   INFO  epoch: 20/24, acc_iter=133740, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:28:44, time_cost(all): 1 day, 19:32:30/8:18:35, loss=0.30547078502036, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.3129450453528415, lr=0.007618671804534782
2023-11-16 09:09:47   INFO  epoch: 20/24, acc_iter=133790, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:29:48, time_cost(all): 1 day, 19:33:29/8:06:31, loss=0.305359842872183, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=0.9608028636463808, lr=0.007597741540824622
2023-11-16 09:10:46   INFO  epoch: 20/24, acc_iter=133840, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:29:40, time_cost(all): 1 day, 19:34:28/7:59:58, loss=0.305248900724007, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=0.574640463262859, lr=0.007576811277114463
2023-11-16 09:11:45   INFO  epoch: 20/24, acc_iter=133890, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:23:46, time_cost(all): 1 day, 19:35:27/7:57:57, loss=0.30513795857583, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.16(1.03), norm=1.2109267708199034, lr=0.007555881013404302
2023-11-16 09:12:44   INFO  epoch: 20/24, acc_iter=133940, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:21:58, time_cost(all): 1 day, 19:36:26/8:08:59, loss=0.305027016427653, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=2.207117470435198, lr=0.007534950749694142
2023-11-16 09:13:43   INFO  epoch: 20/24, acc_iter=133990, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:27:26, time_cost(all): 1 day, 19:37:25/8:16:54, loss=0.304916074279476, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=3.32402465220326, lr=0.007514020485983982
2023-11-16 09:14:42   INFO  epoch: 20/24, acc_iter=134040, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:20:41, time_cost(all): 1 day, 19:38:24/7:55:17, loss=0.3048051321313, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=3.6446617223128617, lr=0.007493090222273822
2023-11-16 09:15:41   INFO  epoch: 20/24, acc_iter=134090, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:20:41, time_cost(all): 1 day, 19:39:23/8:17:23, loss=0.304694189983123, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.06(1.03), norm=4.809236636785785, lr=0.007472159958563662
2023-11-16 09:16:40   INFO  epoch: 20/24, acc_iter=134140, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:19:37, time_cost(all): 1 day, 19:40:22/8:16:56, loss=0.304583247834946, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=1.6785838743223522, lr=0.007451229694853503
2023-11-16 09:17:39   INFO  epoch: 20/24, acc_iter=134190, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:23:04, time_cost(all): 1 day, 19:41:21/8:17:54, loss=0.304472305686769, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=1.3887840508574527, lr=0.007430299431143343
2023-11-16 09:18:38   INFO  epoch: 20/24, acc_iter=134240, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:18:21, time_cost(all): 1 day, 19:42:20/8:22:10, loss=0.304361363538593, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=0.743796892930727, lr=0.007409369167433183
2023-11-16 09:19:37   INFO  epoch: 20/24, acc_iter=134290, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:20:39, time_cost(all): 1 day, 19:43:19/7:42:27, loss=0.304250421390416, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=2.6377957204889833, lr=0.007388438903723022
2023-11-16 09:20:35   INFO  epoch: 20/24, acc_iter=134340, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:16:03, time_cost(all): 1 day, 19:44:17/8:15:11, loss=0.304139479242239, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.23(1.03), norm=0.5887804951252519, lr=0.007367508640012862
2023-11-16 09:21:34   INFO  epoch: 20/24, acc_iter=134390, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:13:59, time_cost(all): 1 day, 19:45:16/7:49:59, loss=0.304028537094062, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=1.6685469699180047, lr=0.007346578376302702
2023-11-16 09:22:33   INFO  epoch: 20/24, acc_iter=134440, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:15:45, time_cost(all): 1 day, 19:46:15/7:57:18, loss=0.303917594945886, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=1.5179697644300911, lr=0.007325648112592542
2023-11-16 09:23:32   INFO  epoch: 20/24, acc_iter=134490, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:18:36, time_cost(all): 1 day, 19:47:14/7:44:46, loss=0.303806652797709, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.377470731476349, lr=0.007304717848882382
2023-11-16 09:24:31   INFO  epoch: 20/24, acc_iter=134540, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:37, time_cost(all): 1 day, 19:48:13/7:44:20, loss=0.303695710649532, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=1.7479605324800567, lr=0.007283787585172223
2023-11-16 09:25:30   INFO  epoch: 20/24, acc_iter=134590, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:16:34, time_cost(all): 1 day, 19:49:12/7:47:08, loss=0.303584768501355, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=2.062496459971539, lr=0.007262857321462063
2023-11-16 09:26:29   INFO  epoch: 20/24, acc_iter=134640, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:08:56, time_cost(all): 1 day, 19:50:11/7:39:07, loss=0.303473826353179, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=2.854155941914045, lr=0.007241927057751903
2023-11-16 09:27:28   INFO  epoch: 20/24, acc_iter=134690, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:07:52, time_cost(all): 1 day, 19:51:10/8:00:23, loss=0.303362884205002, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=2.967292831591628, lr=0.007220996794041742
2023-11-16 09:28:27   INFO  epoch: 20/24, acc_iter=134740, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:13:34, time_cost(all): 1 day, 19:52:09/7:29:53, loss=0.303251942056825, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.09(1.03), norm=4.241987276224615, lr=0.007200066530331582
2023-11-16 09:29:26   INFO  epoch: 20/24, acc_iter=134790, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:06:27, time_cost(all): 1 day, 19:53:08/7:34:04, loss=0.303140999908648, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=4.4511115163782335, lr=0.007179136266621423
2023-11-16 09:30:25   INFO  epoch: 20/24, acc_iter=134840, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:09, time_cost(all): 1 day, 19:54:07/7:42:34, loss=0.303030057760472, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.943104793410215, lr=0.007158206002911263
2023-11-16 09:31:24   INFO  epoch: 20/24, acc_iter=134890, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:07:25, time_cost(all): 1 day, 19:55:06/7:59:24, loss=0.302919115612295, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.86(1.03), norm=4.571502469533404, lr=0.007137275739201103
2023-11-16 09:32:23   INFO  epoch: 20/24, acc_iter=134940, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:14, time_cost(all): 1 day, 19:56:05/7:44:25, loss=0.302808173464118, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=2.829585500477087, lr=0.007116345475490943
2023-11-16 09:33:22   INFO  epoch: 20/24, acc_iter=134990, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:07:12, time_cost(all): 1 day, 19:57:04/7:42:28, loss=0.302697231315941, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=3.3094719474343703, lr=0.007095415211780783
2023-11-16 09:34:20   INFO  epoch: 20/24, acc_iter=135040, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:04:28, time_cost(all): 1 day, 19:58:02/7:33:33, loss=0.302586289167765, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=1.2485665627520393, lr=0.007074484948070623
2023-11-16 09:35:19   INFO  epoch: 20/24, acc_iter=135090, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:29, time_cost(all): 1 day, 19:59:01/8:08:01, loss=0.302475347019588, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=0.9807061251839553, lr=0.007053554684360463
2023-11-16 09:36:18   INFO  epoch: 20/24, acc_iter=135140, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:01:43, time_cost(all): 1 day, 20:00:00/8:00:55, loss=0.302364404871411, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=0.8561429484186478, lr=0.007032624420650303
2023-11-16 09:37:17   INFO  epoch: 20/24, acc_iter=135190, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:03:35, time_cost(all): 1 day, 20:00:59/7:38:33, loss=0.302253462723234, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=3.82795316512115, lr=0.007011694156940144
2023-11-16 09:38:16   INFO  epoch: 20/24, acc_iter=135240, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:02:02, time_cost(all): 1 day, 20:01:58/7:26:43, loss=0.302142520575058, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=0.9833979742072183, lr=0.006990763893229984
2023-11-16 09:39:15   INFO  epoch: 20/24, acc_iter=135290, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/1:01:02, time_cost(all): 1 day, 20:02:57/7:23:30, loss=0.302031578426881, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.8608294552060345, lr=0.006969833629519823
2023-11-16 09:40:14   INFO  epoch: 20/24, acc_iter=135340, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/1:01:14, time_cost(all): 1 day, 20:03:56/8:04:01, loss=0.301920636278704, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=0.6474577287861268, lr=0.006948903365809663
2023-11-16 09:41:13   INFO  epoch: 20/24, acc_iter=135390, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:57:58, time_cost(all): 1 day, 20:04:55/7:35:38, loss=0.301809694130527, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.9566232303222943, lr=0.006927973102099503
2023-11-16 09:42:12   INFO  epoch: 20/24, acc_iter=135440, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:17, time_cost(all): 1 day, 20:05:54/7:20:22, loss=0.301698751982351, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=2.4924493428615198, lr=0.006907042838389343
2023-11-16 09:43:11   INFO  epoch: 20/24, acc_iter=135490, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:39, time_cost(all): 1 day, 20:06:53/7:54:32, loss=0.301587809834174, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=3.9729005300747806, lr=0.006886112574679183
2023-11-16 09:44:10   INFO  epoch: 20/24, acc_iter=135540, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:56:30, time_cost(all): 1 day, 20:07:52/7:49:37, loss=0.301476867685997, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=2.3757952490776892, lr=0.006865182310969023
2023-11-16 09:45:09   INFO  epoch: 20/24, acc_iter=135590, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:53:55, time_cost(all): 1 day, 20:08:51/7:24:37, loss=0.30136592553782, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=0.6950701525768122, lr=0.006844252047258864
2023-11-16 09:46:08   INFO  epoch: 20/24, acc_iter=135640, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:51:06, time_cost(all): 1 day, 20:09:50/7:16:30, loss=0.301254983389644, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.18(1.03), norm=2.9096346329548743, lr=0.006823321783548704
2023-11-16 09:47:07   INFO  epoch: 20/24, acc_iter=135690, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:53:26, time_cost(all): 1 day, 20:10:49/7:36:00, loss=0.301144041241467, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.0868107651926344, lr=0.006802391519838543
2023-11-16 09:48:05   INFO  epoch: 20/24, acc_iter=135740, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:51:30, time_cost(all): 1 day, 20:11:47/7:31:24, loss=0.30103309909329, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=4.705106750162348, lr=0.006781461256128383
2023-11-16 09:49:04   INFO  epoch: 20/24, acc_iter=135790, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:47:34, time_cost(all): 1 day, 20:12:46/7:52:54, loss=0.300922156945113, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=0.814963348296637, lr=0.006760530992418223
2023-11-16 09:50:03   INFO  epoch: 20/24, acc_iter=135840, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:50:48, time_cost(all): 1 day, 20:13:45/7:31:40, loss=0.300811214796937, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=0.5676333076308386, lr=0.006739600728708064
2023-11-16 09:51:02   INFO  epoch: 20/24, acc_iter=135890, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:48:06, time_cost(all): 1 day, 20:14:44/7:14:14, loss=0.30070027264876, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=1.6845597098854446, lr=0.006718670464997904
2023-11-16 09:52:01   INFO  epoch: 20/24, acc_iter=135940, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:49:12, time_cost(all): 1 day, 20:15:43/7:27:37, loss=0.300589330500583, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=3.723379385405802, lr=0.006697740201287744
2023-11-16 09:53:00   INFO  epoch: 20/24, acc_iter=135990, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:44:36, time_cost(all): 1 day, 20:16:42/7:11:30, loss=0.300478388352406, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.55967026370131, lr=0.006676809937577584
2023-11-16 09:53:59   INFO  epoch: 20/24, acc_iter=136040, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:33, time_cost(all): 1 day, 20:17:41/7:27:02, loss=0.30036744620423, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=3.3186426506059483, lr=0.006655879673867424
2023-11-16 09:54:58   INFO  epoch: 20/24, acc_iter=136090, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:42, time_cost(all): 1 day, 20:18:40/7:48:14, loss=0.300256504056053, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.86(1.03), norm=2.714754524527155, lr=0.006634949410157263
2023-11-16 09:55:57   INFO  epoch: 20/24, acc_iter=136140, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:05, time_cost(all): 1 day, 20:19:39/7:40:19, loss=0.300145561907876, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=3.3915265859802823, lr=0.006614019146447104
2023-11-16 09:56:56   INFO  epoch: 20/24, acc_iter=136190, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:40:47, time_cost(all): 1 day, 20:20:38/7:35:03, loss=0.300034619759699, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=0.900123402023856, lr=0.006593088882736944
2023-11-16 09:57:55   INFO  epoch: 20/24, acc_iter=136240, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:42:47, time_cost(all): 1 day, 20:21:37/7:37:37, loss=0.299923677611523, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=2.279527078382672, lr=0.006572158619026785
2023-11-16 09:58:54   INFO  epoch: 20/24, acc_iter=136290, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:59, time_cost(all): 1 day, 20:22:36/7:08:44, loss=0.299812735463346, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.96(1.03), norm=2.78964637032536, lr=0.006551228355316623
2023-11-16 09:59:53   INFO  epoch: 20/24, acc_iter=136340, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:14, time_cost(all): 1 day, 20:23:35/7:25:51, loss=0.299701793315169, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=1.3799184141437293, lr=0.006530298091606464
2023-11-16 10:00:52   INFO  epoch: 20/24, acc_iter=136390, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:39:28, time_cost(all): 1 day, 20:24:34/7:10:30, loss=0.299590851166992, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=1.3154728770724868, lr=0.006509367827896304
2023-11-16 10:01:51   INFO  epoch: 20/24, acc_iter=136440, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:35:50, time_cost(all): 1 day, 20:25:33/7:26:34, loss=0.299479909018816, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.84(1.03), norm=3.9806023855382198, lr=0.006488437564186144
2023-11-16 10:02:49   INFO  epoch: 20/24, acc_iter=136490, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:35:55, time_cost(all): 1 day, 20:26:31/7:19:33, loss=0.299368966870639, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.03(1.03), norm=1.090265477347013, lr=0.006467507300475983
2023-11-16 10:03:48   INFO  epoch: 20/24, acc_iter=136540, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:17, time_cost(all): 1 day, 20:27:30/7:24:40, loss=0.299258024722462, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=2.4003783030531647, lr=0.006446577036765824
2023-11-16 10:04:47   INFO  epoch: 20/24, acc_iter=136590, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:32:26, time_cost(all): 1 day, 20:28:29/7:07:00, loss=0.299147082574285, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.01(1.03), norm=4.204774256964168, lr=0.006425646773055664
2023-11-16 10:05:46   INFO  epoch: 20/24, acc_iter=136640, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:08, time_cost(all): 1 day, 20:29:28/7:26:49, loss=0.299036140426109, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=4.277452260865267, lr=0.006404716509345505
2023-11-16 10:06:45   INFO  epoch: 20/24, acc_iter=136690, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:02, time_cost(all): 1 day, 20:30:27/7:04:51, loss=0.298925198277932, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=3.3758389741439117, lr=0.006383786245635344
2023-11-16 10:07:44   INFO  epoch: 20/24, acc_iter=136740, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:57, time_cost(all): 1 day, 20:31:26/6:53:16, loss=0.298814256129755, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=0.7281983627097318, lr=0.006362855981925184
2023-11-16 10:08:43   INFO  epoch: 20/24, acc_iter=136790, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:05, time_cost(all): 1 day, 20:32:25/7:15:23, loss=0.298703313981578, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=0.9391836377624523, lr=0.006341925718215024
2023-11-16 10:09:42   INFO  epoch: 20/24, acc_iter=136840, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:27, time_cost(all): 1 day, 20:33:24/7:21:25, loss=0.298592371833402, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=2.6192561342974985, lr=0.006320995454504864
2023-11-16 10:10:41   INFO  epoch: 20/24, acc_iter=136890, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:22, time_cost(all): 1 day, 20:34:23/7:04:48, loss=0.298481429685225, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=3.1139946108544927, lr=0.006300065190794704
2023-11-16 10:11:40   INFO  epoch: 20/24, acc_iter=136940, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:28:01, time_cost(all): 1 day, 20:35:22/7:18:08, loss=0.298370487537048, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=0.800863266233758, lr=0.006279134927084545
2023-11-16 10:12:39   INFO  epoch: 20/24, acc_iter=136990, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:12, time_cost(all): 1 day, 20:36:21/7:07:59, loss=0.298259545388871, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=1.950927664421502, lr=0.006258204663374385
2023-11-16 10:13:38   INFO  epoch: 20/24, acc_iter=137040, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:35, time_cost(all): 1 day, 20:37:20/7:24:50, loss=0.298148603240695, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=2.7125883333700096, lr=0.006237274399664225
2023-11-16 10:14:37   INFO  epoch: 20/24, acc_iter=137090, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:18, time_cost(all): 1 day, 20:38:19/7:26:11, loss=0.298037661092518, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=2.1306639878186098, lr=0.006216344135954064
2023-11-16 10:15:36   INFO  epoch: 20/24, acc_iter=137140, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:27, time_cost(all): 1 day, 20:39:18/7:23:24, loss=0.297926718944341, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=0.6688886976001378, lr=0.006195413872243904
2023-11-16 10:16:34   INFO  epoch: 20/24, acc_iter=137190, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:16, time_cost(all): 1 day, 20:40:16/7:15:26, loss=0.297815776796164, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=2.650592345734431, lr=0.006174483608533745
2023-11-16 10:17:33   INFO  epoch: 20/24, acc_iter=137240, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:26, time_cost(all): 1 day, 20:41:15/7:19:32, loss=0.297704834647988, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.15(1.03), norm=1.0346137856737656, lr=0.006153553344823585
2023-11-16 10:18:32   INFO  epoch: 20/24, acc_iter=137290, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:44, time_cost(all): 1 day, 20:42:14/7:13:23, loss=0.297593892499811, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=1.448605221951692, lr=0.006132623081113424
2023-11-16 10:19:31   INFO  epoch: 20/24, acc_iter=137340, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:59, time_cost(all): 1 day, 20:43:13/7:02:14, loss=0.297482950351634, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=3.859853967444012, lr=0.006111692817403265
2023-11-16 10:20:30   INFO  epoch: 20/24, acc_iter=137390, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:29, time_cost(all): 1 day, 20:44:12/6:43:45, loss=0.297372008203457, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=2.9574728118799234, lr=0.006090762553693105
2023-11-16 10:21:29   INFO  epoch: 20/24, acc_iter=137440, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:48, time_cost(all): 1 day, 20:45:11/7:14:13, loss=0.297261066055281, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.14(1.03), norm=1.2910424295386616, lr=0.006069832289982945
2023-11-16 10:22:28   INFO  epoch: 20/24, acc_iter=137490, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:17:05, time_cost(all): 1 day, 20:46:10/6:55:47, loss=0.297150123907104, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=3.510331636301057, lr=0.006048902026272784
2023-11-16 10:23:27   INFO  epoch: 20/24, acc_iter=137540, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:25, time_cost(all): 1 day, 20:47:09/7:00:27, loss=0.297039181758927, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=2.6566802629767206, lr=0.006027971762562625
2023-11-16 10:24:26   INFO  epoch: 20/24, acc_iter=137590, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:15:10, time_cost(all): 1 day, 20:48:08/6:41:17, loss=0.29692823961075, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=4.76739379182788, lr=0.006007041498852465
2023-11-16 10:25:25   INFO  epoch: 20/24, acc_iter=137640, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:45, time_cost(all): 1 day, 20:49:07/7:02:26, loss=0.296817297462574, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=2.040709472602538, lr=0.005986111235142305
2023-11-16 10:26:24   INFO  epoch: 20/24, acc_iter=137690, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:53, time_cost(all): 1 day, 20:50:06/6:41:43, loss=0.296706355314397, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.4600195107459752, lr=0.005965180971432145
2023-11-16 10:27:23   INFO  epoch: 20/24, acc_iter=137740, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:12:00, time_cost(all): 1 day, 20:51:05/6:34:58, loss=0.29659541316622, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.6524541593224633, lr=0.005944250707721985
2023-11-16 10:28:22   INFO  epoch: 20/24, acc_iter=137790, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:11:00, time_cost(all): 1 day, 20:52:04/7:09:44, loss=0.296484471018043, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=2.51152987452404, lr=0.005923320444011825
2023-11-16 10:29:21   INFO  epoch: 20/24, acc_iter=137840, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:50, time_cost(all): 1 day, 20:53:03/7:00:56, loss=0.296373528869867, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=3.0132638163742382, lr=0.005902390180301666
2023-11-16 10:30:19   INFO  epoch: 20/24, acc_iter=137890, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:34, time_cost(all): 1 day, 20:54:01/6:39:18, loss=0.29626258672169, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.93(1.03), norm=3.13293365170315, lr=0.005881459916591505
2023-11-16 10:31:18   INFO  epoch: 20/24, acc_iter=137940, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:39, time_cost(all): 1 day, 20:55:00/6:48:27, loss=0.296151644573513, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=3.1187472264846194, lr=0.005860529652881345
2023-11-16 10:32:17   INFO  epoch: 20/24, acc_iter=137990, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:36, time_cost(all): 1 day, 20:55:59/6:48:22, loss=0.296040702425336, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=0.7208013664933026, lr=0.005839599389171185
2023-11-16 10:33:16   INFO  epoch: 20/24, acc_iter=138040, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:47, time_cost(all): 1 day, 20:56:58/6:55:54, loss=0.29592976027716, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=3.950402803599374, lr=0.005818669125461026
2023-11-16 10:34:15   INFO  epoch: 20/24, acc_iter=138090, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:51, time_cost(all): 1 day, 20:57:57/6:34:32, loss=0.295818818128983, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=3.3757827850811983, lr=0.005797738861750865
2023-11-16 10:35:14   INFO  epoch: 20/24, acc_iter=138140, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:45, time_cost(all): 1 day, 20:58:56/7:02:35, loss=0.295707875980806, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=4.139647822371254, lr=0.005776808598040705
2023-11-16 10:36:13   INFO  epoch: 20/24, acc_iter=138190, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:42, time_cost(all): 1 day, 20:59:55/6:52:23, loss=0.295596933832629, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=1.7141281038621416, lr=0.005755878334330546
2023-11-16 10:37:12   INFO  epoch: 20/24, acc_iter=138240, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:40, time_cost(all): 1 day, 21:00:54/6:39:40, loss=0.295485991684453, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=0.8796178817563369, lr=0.005734948070620385
2023-11-16 10:38:11   INFO  epoch: 20/24, acc_iter=138290, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:44, time_cost(all): 1 day, 21:01:53/6:24:35, loss=0.295375049536276, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.03(1.03), norm=1.076362966955799, lr=0.005714017806910225
2023-11-16 10:39:10   INFO  epoch: 21/24, acc_iter=138377, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:13:37, time_cost(all): 1 day, 21:02:52/7:00:40, loss=0.295182010198448, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.0335224129330869, lr=0.005677599148054547
2023-11-16 10:40:09   INFO  epoch: 21/24, acc_iter=138427, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:02:21, time_cost(all): 1 day, 21:03:51/6:55:45, loss=0.295071068050272, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=0.8670622378387145, lr=0.005656668884344387
2023-11-16 10:41:08   INFO  epoch: 21/24, acc_iter=138477, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:09:33, time_cost(all): 1 day, 21:04:50/6:21:49, loss=0.294960125902095, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.0136354700489663, lr=0.005635738620634227
2023-11-16 10:42:07   INFO  epoch: 21/24, acc_iter=138527, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:06:48, time_cost(all): 1 day, 21:05:49/6:22:15, loss=0.294849183753918, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.736397104145271, lr=0.005614808356924067
2023-11-16 10:43:06   INFO  epoch: 21/24, acc_iter=138577, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:06:35, time_cost(all): 1 day, 21:06:48/6:26:52, loss=0.294738241605741, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=4.243438819979643, lr=0.005593878093213907
2023-11-16 10:44:04   INFO  epoch: 21/24, acc_iter=138627, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/2:07:56, time_cost(all): 1 day, 21:07:46/6:48:43, loss=0.294627299457564, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=2.0418752632983943, lr=0.005572947829503747
2023-11-16 10:45:03   INFO  epoch: 21/24, acc_iter=138677, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:02:31, time_cost(all): 1 day, 21:08:45/6:23:13, loss=0.294516357309388, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=4.753548663501324, lr=0.005552017565793588
2023-11-16 10:46:02   INFO  epoch: 21/24, acc_iter=138727, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:00:35, time_cost(all): 1 day, 21:09:44/6:16:04, loss=0.294405415161211, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.476306926758036, lr=0.005531087302083427
2023-11-16 10:47:01   INFO  epoch: 21/24, acc_iter=138777, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:00:54, time_cost(all): 1 day, 21:10:43/6:26:00, loss=0.294294473013034, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=3.628433931691439, lr=0.005510157038373267
2023-11-16 10:48:00   INFO  epoch: 21/24, acc_iter=138827, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/2:03:50, time_cost(all): 1 day, 21:11:42/6:31:59, loss=0.294183530864857, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=3.0896457042849024, lr=0.005489226774663108
2023-11-16 10:48:59   INFO  epoch: 21/24, acc_iter=138877, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:53:11, time_cost(all): 1 day, 21:12:41/6:41:59, loss=0.294072588716681, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=0.7679307076063777, lr=0.005468296510952948
2023-11-16 10:49:58   INFO  epoch: 21/24, acc_iter=138927, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:14, time_cost(all): 1 day, 21:13:40/6:29:49, loss=0.293961646568504, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=1.2425836084483988, lr=0.005447366247242787
2023-11-16 10:50:57   INFO  epoch: 21/24, acc_iter=138977, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:55:00, time_cost(all): 1 day, 21:14:39/6:21:14, loss=0.293850704420327, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=4.830706793651486, lr=0.005426435983532628
2023-11-16 10:51:56   INFO  epoch: 21/24, acc_iter=139027, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:53:01, time_cost(all): 1 day, 21:15:38/6:15:17, loss=0.29373976227215, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.277064421151167, lr=0.005405505719822468
2023-11-16 10:52:55   INFO  epoch: 21/24, acc_iter=139077, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:49:15, time_cost(all): 1 day, 21:16:37/6:37:04, loss=0.293628820123974, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.1(1.03), norm=0.592536691270408, lr=0.005384575456112308
2023-11-16 10:53:54   INFO  epoch: 21/24, acc_iter=139127, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:58:19, time_cost(all): 1 day, 21:17:36/6:42:51, loss=0.293517877975797, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=4.273558217974637, lr=0.005363645192402148
2023-11-16 10:54:53   INFO  epoch: 21/24, acc_iter=139177, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:47:54, time_cost(all): 1 day, 21:18:35/6:08:00, loss=0.29340693582762, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=1.5275954724268765, lr=0.005342714928691988
2023-11-16 10:55:52   INFO  epoch: 21/24, acc_iter=139227, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:54:39, time_cost(all): 1 day, 21:19:34/6:42:24, loss=0.293295993679443, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=2.015252341715822, lr=0.005321784664981828
2023-11-16 10:56:51   INFO  epoch: 21/24, acc_iter=139277, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:52:42, time_cost(all): 1 day, 21:20:33/6:38:49, loss=0.293185051531267, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=4.74753158654959, lr=0.005300854401271668
2023-11-16 10:57:49   INFO  epoch: 21/24, acc_iter=139327, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:47:00, time_cost(all): 1 day, 21:21:31/6:28:40, loss=0.29307410938309, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.11(1.03), norm=1.7818067222095186, lr=0.005279924137561508
2023-11-16 10:58:48   INFO  epoch: 21/24, acc_iter=139377, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:43:55, time_cost(all): 1 day, 21:22:30/6:23:13, loss=0.292963167234913, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=2.0883359991053645, lr=0.005258993873851348
2023-11-16 10:59:47   INFO  epoch: 21/24, acc_iter=139427, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:43:46, time_cost(all): 1 day, 21:23:29/6:29:10, loss=0.292852225086736, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.7749703046621228, lr=0.005238063610141188
2023-11-16 11:00:46   INFO  epoch: 21/24, acc_iter=139477, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:51:25, time_cost(all): 1 day, 21:24:28/6:36:50, loss=0.29274128293856, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=4.748095394308755, lr=0.005217133346431028
2023-11-16 11:01:45   INFO  epoch: 21/24, acc_iter=139527, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:50:41, time_cost(all): 1 day, 21:25:27/6:15:19, loss=0.292630340790383, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.94(1.03), norm=1.0682416677556827, lr=0.005196203082720867
2023-11-16 11:02:44   INFO  epoch: 21/24, acc_iter=139577, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:49:21, time_cost(all): 1 day, 21:26:26/6:36:27, loss=0.292519398642206, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.7865724664002534, lr=0.005175272819010707
2023-11-16 11:03:43   INFO  epoch: 21/24, acc_iter=139627, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:42:03, time_cost(all): 1 day, 21:27:25/6:23:26, loss=0.29240845649403, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=2.8287790100146717, lr=0.005154342555300548
2023-11-16 11:04:42   INFO  epoch: 21/24, acc_iter=139677, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:38:53, time_cost(all): 1 day, 21:28:24/6:20:11, loss=0.292297514345853, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=4.6413818181314745, lr=0.005133412291590388
2023-11-16 11:05:41   INFO  epoch: 21/24, acc_iter=139727, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:39:47, time_cost(all): 1 day, 21:29:23/6:15:00, loss=0.292186572197676, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=1.6177125285957619, lr=0.005112482027880227
2023-11-16 11:06:40   INFO  epoch: 21/24, acc_iter=139777, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:45:10, time_cost(all): 1 day, 21:30:22/6:13:17, loss=0.292075630049499, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=2.05846619549274, lr=0.005091551764170068
2023-11-16 11:07:39   INFO  epoch: 21/24, acc_iter=139827, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:44:26, time_cost(all): 1 day, 21:31:21/6:28:18, loss=0.291964687901322, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=3.182642024923586, lr=0.005070621500459908
2023-11-16 11:08:38   INFO  epoch: 21/24, acc_iter=139877, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:42:36, time_cost(all): 1 day, 21:32:20/5:56:32, loss=0.291853745753146, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=2.4330632747910306, lr=0.005049691236749748
2023-11-16 11:09:37   INFO  epoch: 21/24, acc_iter=139927, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:36:57, time_cost(all): 1 day, 21:33:19/6:19:02, loss=0.291742803604969, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.964481890393931, lr=0.005028760973039587
2023-11-16 11:10:36   INFO  epoch: 21/24, acc_iter=139977, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:38:45, time_cost(all): 1 day, 21:34:18/5:54:40, loss=0.291631861456792, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.735941485407067, lr=0.005007830709329428
2023-11-16 11:11:34   INFO  epoch: 21/24, acc_iter=140027, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:37:56, time_cost(all): 1 day, 21:35:16/5:51:35, loss=0.291520919308615, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.12(1.03), norm=1.4301562964079995, lr=0.004986900445619268
2023-11-16 11:12:33   INFO  epoch: 21/24, acc_iter=140077, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:35:35, time_cost(all): 1 day, 21:36:15/6:00:10, loss=0.291409977160439, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=4.531161707683025, lr=0.004965970181909108
2023-11-16 11:13:32   INFO  epoch: 21/24, acc_iter=140127, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:34:10, time_cost(all): 1 day, 21:37:14/6:03:45, loss=0.291299035012262, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.93(1.03), norm=4.651185048698851, lr=0.004945039918198948
2023-11-16 11:14:31   INFO  epoch: 21/24, acc_iter=140177, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:36:49, time_cost(all): 1 day, 21:38:13/5:52:28, loss=0.291188092864085, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=3.7547386844173687, lr=0.004924109654488788
2023-11-16 11:15:30   INFO  epoch: 21/24, acc_iter=140227, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:36:32, time_cost(all): 1 day, 21:39:12/6:02:14, loss=0.291077150715908, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.04(1.03), norm=4.752875907500958, lr=0.004903179390778628
2023-11-16 11:16:29   INFO  epoch: 21/24, acc_iter=140277, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:29:43, time_cost(all): 1 day, 21:40:11/6:15:17, loss=0.290966208567732, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=3.580776325031339, lr=0.004882249127068469
2023-11-16 11:17:28   INFO  epoch: 21/24, acc_iter=140327, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:31:41, time_cost(all): 1 day, 21:41:10/5:48:25, loss=0.290855266419555, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=3.5659066000891046, lr=0.004861318863358308
2023-11-16 11:18:27   INFO  epoch: 21/24, acc_iter=140377, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:31:28, time_cost(all): 1 day, 21:42:09/5:45:32, loss=0.290744324271378, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=4.170106199364471, lr=0.004840388599648148
2023-11-16 11:19:26   INFO  epoch: 21/24, acc_iter=140427, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:27:28, time_cost(all): 1 day, 21:43:08/6:03:38, loss=0.290633382123201, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=4.371364443363748, lr=0.004819458335937989
2023-11-16 11:20:25   INFO  epoch: 21/24, acc_iter=140477, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:28:42, time_cost(all): 1 day, 21:44:07/6:05:26, loss=0.290522439975025, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.07(1.03), norm=0.8911034192284801, lr=0.004798528072227829
2023-11-16 11:21:24   INFO  epoch: 21/24, acc_iter=140527, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:27:35, time_cost(all): 1 day, 21:45:06/5:51:00, loss=0.290411497826848, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=4.921147394971915, lr=0.004777597808517668
2023-11-16 11:22:23   INFO  epoch: 21/24, acc_iter=140577, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:21:59, time_cost(all): 1 day, 21:46:05/5:47:13, loss=0.290300555678671, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.7063617928529196, lr=0.004756667544807509
2023-11-16 11:23:22   INFO  epoch: 21/24, acc_iter=140627, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:22:38, time_cost(all): 1 day, 21:47:04/6:01:11, loss=0.290189613530494, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=3.279448361691747, lr=0.004735737281097349
2023-11-16 11:24:21   INFO  epoch: 21/24, acc_iter=140677, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:21:57, time_cost(all): 1 day, 21:48:03/6:06:08, loss=0.290078671382318, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.11(1.03), norm=4.094949657810469, lr=0.004714807017387189
2023-11-16 11:25:19   INFO  epoch: 21/24, acc_iter=140727, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:19:07, time_cost(all): 1 day, 21:49:01/5:56:34, loss=0.289967729234141, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.15(1.03), norm=4.460582692584424, lr=0.004693876753677029
2023-11-16 11:26:18   INFO  epoch: 21/24, acc_iter=140777, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:21:55, time_cost(all): 1 day, 21:50:00/6:10:33, loss=0.289856787085964, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=2.1984271781106415, lr=0.004672946489966869
2023-11-16 11:27:17   INFO  epoch: 21/24, acc_iter=140827, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:19:54, time_cost(all): 1 day, 21:50:59/5:50:44, loss=0.289745844937787, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=4.1789446838961695, lr=0.004652016226256709
2023-11-16 11:28:16   INFO  epoch: 21/24, acc_iter=140877, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:21:59, time_cost(all): 1 day, 21:51:58/6:06:25, loss=0.289634902789611, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=4.3753554849704965, lr=0.004631085962546549
2023-11-16 11:29:15   INFO  epoch: 21/24, acc_iter=140927, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:21:15, time_cost(all): 1 day, 21:52:57/5:37:37, loss=0.289523960641434, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=4.425943526343421, lr=0.004610155698836389
2023-11-16 11:30:14   INFO  epoch: 21/24, acc_iter=140977, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:16:06, time_cost(all): 1 day, 21:53:56/5:43:30, loss=0.289413018493257, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=1.001533869627145, lr=0.004589225435126229
2023-11-16 11:31:13   INFO  epoch: 21/24, acc_iter=141027, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:16:07, time_cost(all): 1 day, 21:54:55/6:03:20, loss=0.28930207634508, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=2.149976646546128, lr=0.004568295171416069
2023-11-16 11:32:12   INFO  epoch: 21/24, acc_iter=141077, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:17:42, time_cost(all): 1 day, 21:55:54/5:48:57, loss=0.289191134196904, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.274178499369884, lr=0.004547364907705909
2023-11-16 11:33:11   INFO  epoch: 21/24, acc_iter=141127, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:17:49, time_cost(all): 1 day, 21:56:53/5:40:34, loss=0.289080192048727, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=1.9100423905781827, lr=0.004526434643995749
2023-11-16 11:34:10   INFO  epoch: 21/24, acc_iter=141177, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:14:32, time_cost(all): 1 day, 21:57:52/6:02:52, loss=0.28896924990055, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=2.6102984074949607, lr=0.004505504380285589
2023-11-16 11:35:09   INFO  epoch: 21/24, acc_iter=141227, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:12:53, time_cost(all): 1 day, 21:58:51/5:38:11, loss=0.288858307752373, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=3.3391839989453893, lr=0.00448457411657543
2023-11-16 11:36:08   INFO  epoch: 21/24, acc_iter=141277, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:14:11, time_cost(all): 1 day, 21:59:50/6:01:48, loss=0.288747365604197, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.761781175142452, lr=0.00446364385286527
2023-11-16 11:37:07   INFO  epoch: 21/24, acc_iter=141327, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:13:40, time_cost(all): 1 day, 22:00:49/5:57:21, loss=0.28863642345602, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=0.9502063218798064, lr=0.004442713589155109
2023-11-16 11:38:06   INFO  epoch: 21/24, acc_iter=141377, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:07:07, time_cost(all): 1 day, 22:01:48/5:48:11, loss=0.288525481307843, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.23(1.03), norm=1.7925097781354615, lr=0.00442178332544495
2023-11-16 11:39:04   INFO  epoch: 21/24, acc_iter=141427, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:06:14, time_cost(all): 1 day, 22:02:46/5:52:17, loss=0.288414539159666, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=1.1837804301581807, lr=0.004400853061734789
2023-11-16 11:40:03   INFO  epoch: 21/24, acc_iter=141477, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:05:39, time_cost(all): 1 day, 22:03:45/5:46:08, loss=0.28830359701149, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.802592541190305, lr=0.004379922798024629
2023-11-16 11:41:02   INFO  epoch: 21/24, acc_iter=141527, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:50, time_cost(all): 1 day, 22:04:44/5:37:47, loss=0.288192654863313, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=3.5487528294839996, lr=0.004358992534314468
2023-11-16 11:42:01   INFO  epoch: 21/24, acc_iter=141577, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:03:24, time_cost(all): 1 day, 22:05:43/5:22:55, loss=0.288081712715136, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=4.2622680969678655, lr=0.004338062270604309
2023-11-16 11:43:00   INFO  epoch: 21/24, acc_iter=141627, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:02:58, time_cost(all): 1 day, 22:06:42/5:43:47, loss=0.287970770566959, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.2553420041279395, lr=0.004317132006894149
2023-11-16 11:43:59   INFO  epoch: 21/24, acc_iter=141677, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:01:39, time_cost(all): 1 day, 22:07:41/5:23:13, loss=0.287859828418783, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=3.772967840510588, lr=0.004296201743183989
2023-11-16 11:44:58   INFO  epoch: 21/24, acc_iter=141727, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:52, time_cost(all): 1 day, 22:08:40/5:21:18, loss=0.287748886270606, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=4.819108475886349, lr=0.004275271479473829
2023-11-16 11:45:57   INFO  epoch: 21/24, acc_iter=141777, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:02:53, time_cost(all): 1 day, 22:09:39/5:38:29, loss=0.287637944122429, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.85(1.03), norm=2.052815245441752, lr=0.004254341215763669
2023-11-16 11:46:56   INFO  epoch: 21/24, acc_iter=141827, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:01:04, time_cost(all): 1 day, 22:10:38/5:30:49, loss=0.287527001974252, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=2.376892212805771, lr=0.004233410952053509
2023-11-16 11:47:55   INFO  epoch: 21/24, acc_iter=141877, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:59:29, time_cost(all): 1 day, 22:11:37/5:31:57, loss=0.287416059826076, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=4.742959811118719, lr=0.00421248068834335
2023-11-16 11:48:54   INFO  epoch: 21/24, acc_iter=141927, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:58:38, time_cost(all): 1 day, 22:12:36/5:21:13, loss=0.287305117677899, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.6781405443869675, lr=0.004191550424633189
2023-11-16 11:49:53   INFO  epoch: 21/24, acc_iter=141977, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/1:00:10, time_cost(all): 1 day, 22:13:35/5:41:45, loss=0.287194175529722, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=1.81626908277306, lr=0.004170620160923029
2023-11-16 11:50:52   INFO  epoch: 21/24, acc_iter=142027, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:55:43, time_cost(all): 1 day, 22:14:34/5:29:02, loss=0.287083233381545, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.88(1.03), norm=0.7546529243883181, lr=0.00414968989721287
2023-11-16 11:51:51   INFO  epoch: 21/24, acc_iter=142077, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:55:14, time_cost(all): 1 day, 22:15:33/5:25:25, loss=0.286972291233369, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=4.324326698103, lr=0.00412875963350271
2023-11-16 11:52:49   INFO  epoch: 21/24, acc_iter=142127, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:54:06, time_cost(all): 1 day, 22:16:31/5:24:12, loss=0.286861349085192, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=2.5420343347522207, lr=0.004107829369792549
2023-11-16 11:53:48   INFO  epoch: 21/24, acc_iter=142177, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:53:52, time_cost(all): 1 day, 22:17:30/5:16:41, loss=0.286750406937015, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=2.877192173192213, lr=0.00408689910608239
2023-11-16 11:54:47   INFO  epoch: 21/24, acc_iter=142227, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:11, time_cost(all): 1 day, 22:18:29/5:14:23, loss=0.286639464788838, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.1(1.03), norm=1.0274198068726463, lr=0.00406596884237223
2023-11-16 11:55:46   INFO  epoch: 21/24, acc_iter=142277, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:51:11, time_cost(all): 1 day, 22:19:28/5:31:32, loss=0.286528522640662, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=3.7335278068506335, lr=0.00404503857866207
2023-11-16 11:56:45   INFO  epoch: 21/24, acc_iter=142327, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:53:00, time_cost(all): 1 day, 22:20:27/5:28:03, loss=0.286417580492485, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=1.8966122488824406, lr=0.00402410831495191
2023-11-16 11:57:44   INFO  epoch: 21/24, acc_iter=142377, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:49:38, time_cost(all): 1 day, 22:21:26/5:18:19, loss=0.286306638344308, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.393717006872162, lr=0.00400317805124175
2023-11-16 11:58:43   INFO  epoch: 21/24, acc_iter=142427, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:49:28, time_cost(all): 1 day, 22:22:25/5:17:55, loss=0.286195696196131, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.1(1.03), norm=1.8909418452956799, lr=0.00398224778753159
2023-11-16 11:59:42   INFO  epoch: 21/24, acc_iter=142477, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:50:09, time_cost(all): 1 day, 22:23:24/5:15:05, loss=0.286084754047955, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=1.4864291070772229, lr=0.00396131752382143
2023-11-16 12:00:41   INFO  epoch: 21/24, acc_iter=142527, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:48:50, time_cost(all): 1 day, 22:24:23/5:35:25, loss=0.285973811899778, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.1(1.03), norm=2.022709526015917, lr=0.00394038726011127
2023-11-16 12:01:40   INFO  epoch: 21/24, acc_iter=142577, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:46:08, time_cost(all): 1 day, 22:25:22/5:31:00, loss=0.285862869751601, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=0.7961707088785792, lr=0.00391945699640111
2023-11-16 12:02:39   INFO  epoch: 21/24, acc_iter=142627, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:09, time_cost(all): 1 day, 22:26:21/5:26:38, loss=0.285751927603424, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.83(1.03), norm=2.6207388611099436, lr=0.00389852673269095
2023-11-16 12:03:38   INFO  epoch: 21/24, acc_iter=142677, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:43:25, time_cost(all): 1 day, 22:27:20/5:07:20, loss=0.285640985455248, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=1.4640498014414312, lr=0.00387759646898079
2023-11-16 12:04:37   INFO  epoch: 21/24, acc_iter=142727, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:12, time_cost(all): 1 day, 22:28:19/5:25:24, loss=0.285530043307071, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=1.7370274885647703, lr=0.00385666620527063
2023-11-16 12:05:36   INFO  epoch: 21/24, acc_iter=142777, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:42:49, time_cost(all): 1 day, 22:29:18/5:25:55, loss=0.285419101158894, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=3.1997580390894202, lr=0.00383573594156047
2023-11-16 12:06:34   INFO  epoch: 21/24, acc_iter=142827, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:06, time_cost(all): 1 day, 22:30:16/5:03:34, loss=0.285308159010717, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=1.5559120274196416, lr=0.003814805677850311
2023-11-16 12:07:33   INFO  epoch: 21/24, acc_iter=142877, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:40:18, time_cost(all): 1 day, 22:31:15/5:16:14, loss=0.285197216862541, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=3.1379418769351632, lr=0.00379387541414015
2023-11-16 12:08:32   INFO  epoch: 21/24, acc_iter=142927, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:38:44, time_cost(all): 1 day, 22:32:14/5:11:04, loss=0.285086274714364, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=2.633092909068132, lr=0.00377294515042999
2023-11-16 12:09:31   INFO  epoch: 21/24, acc_iter=142977, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:37:53, time_cost(all): 1 day, 22:33:13/5:04:06, loss=0.284975332566187, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=0.9875057615289572, lr=0.003752014886719831
2023-11-16 12:10:30   INFO  epoch: 21/24, acc_iter=143027, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:36:05, time_cost(all): 1 day, 22:34:12/5:09:04, loss=0.28486439041801, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=2.0732626255024584, lr=0.003731084623009671
2023-11-16 12:11:29   INFO  epoch: 21/24, acc_iter=143077, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:34:50, time_cost(all): 1 day, 22:35:11/4:59:14, loss=0.284753448269834, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=2.1353944072699917, lr=0.003710154359299511
2023-11-16 12:12:28   INFO  epoch: 21/24, acc_iter=143127, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:34:37, time_cost(all): 1 day, 22:36:10/4:59:32, loss=0.284642506121657, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.9064619976809487, lr=0.003689224095589351
2023-11-16 12:13:27   INFO  epoch: 21/24, acc_iter=143177, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:33:18, time_cost(all): 1 day, 22:37:09/5:04:38, loss=0.28453156397348, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=0.6028322897432516, lr=0.003668293831879191
2023-11-16 12:14:26   INFO  epoch: 21/24, acc_iter=143227, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:34:37, time_cost(all): 1 day, 22:38:08/5:14:03, loss=0.284420621825303, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=2.683045930460835, lr=0.003647363568169031
2023-11-16 12:15:25   INFO  epoch: 21/24, acc_iter=143277, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:31:03, time_cost(all): 1 day, 22:39:07/4:53:53, loss=0.284309679677127, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=0.8490116384724464, lr=0.003626433304458871
2023-11-16 12:16:24   INFO  epoch: 21/24, acc_iter=143327, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:24, time_cost(all): 1 day, 22:40:06/4:55:19, loss=0.28419873752895, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=3.0038468052880205, lr=0.003605503040748711
2023-11-16 12:17:23   INFO  epoch: 21/24, acc_iter=143377, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:28:52, time_cost(all): 1 day, 22:41:05/4:56:58, loss=0.284087795380773, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=4.348067606465623, lr=0.00358457277703855
2023-11-16 12:18:22   INFO  epoch: 21/24, acc_iter=143427, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:16, time_cost(all): 1 day, 22:42:04/5:13:03, loss=0.283976853232596, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=2.1666650851378373, lr=0.00356364251332839
2023-11-16 12:19:21   INFO  epoch: 21/24, acc_iter=143477, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:16, time_cost(all): 1 day, 22:43:03/5:12:22, loss=0.28386591108442, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.366147899840741, lr=0.003542712249618231
2023-11-16 12:20:19   INFO  epoch: 21/24, acc_iter=143527, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:38, time_cost(all): 1 day, 22:44:01/4:56:11, loss=0.283754968936243, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=1.3081039892021593, lr=0.00352178198590807
2023-11-16 12:21:18   INFO  epoch: 21/24, acc_iter=143577, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:27:02, time_cost(all): 1 day, 22:45:00/4:55:55, loss=0.283644026788066, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=3.8252518214385454, lr=0.00350085172219791
2023-11-16 12:22:17   INFO  epoch: 21/24, acc_iter=143627, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:32, time_cost(all): 1 day, 22:45:59/4:48:07, loss=0.283533084639889, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=4.138988418013548, lr=0.003479921458487751
2023-11-16 12:23:16   INFO  epoch: 21/24, acc_iter=143677, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:08, time_cost(all): 1 day, 22:46:58/4:53:45, loss=0.283422142491713, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.8942872935939472, lr=0.003458991194777591
2023-11-16 12:24:15   INFO  epoch: 21/24, acc_iter=143727, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:34, time_cost(all): 1 day, 22:47:57/4:59:23, loss=0.283311200343536, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.22(1.03), norm=2.1933671460745803, lr=0.00343806093106743
2023-11-16 12:25:14   INFO  epoch: 21/24, acc_iter=143777, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:53, time_cost(all): 1 day, 22:48:56/4:56:37, loss=0.283200258195359, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.891196121764161, lr=0.003417130667357271
2023-11-16 12:26:13   INFO  epoch: 21/24, acc_iter=143827, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:21:31, time_cost(all): 1 day, 22:49:55/5:03:33, loss=0.283089316047182, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=2.2097596824817156, lr=0.003396200403647111
2023-11-16 12:27:12   INFO  epoch: 21/24, acc_iter=143877, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:09, time_cost(all): 1 day, 22:50:54/5:08:10, loss=0.282978373899006, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.06(1.03), norm=1.842572220017211, lr=0.003375270139936951
2023-11-16 12:28:11   INFO  epoch: 21/24, acc_iter=143927, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:18:49, time_cost(all): 1 day, 22:51:53/5:05:13, loss=0.282867431750829, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.30260286480192, lr=0.003354339876226791
2023-11-16 12:29:10   INFO  epoch: 21/24, acc_iter=143977, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:14, time_cost(all): 1 day, 22:52:52/4:48:03, loss=0.282756489602652, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=1.4823671885707945, lr=0.003333409612516631
2023-11-16 12:30:09   INFO  epoch: 21/24, acc_iter=144027, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:50, time_cost(all): 1 day, 22:53:51/4:47:39, loss=0.282645547454475, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=1.3844986459992301, lr=0.003312479348806471
2023-11-16 12:31:08   INFO  epoch: 21/24, acc_iter=144077, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:42, time_cost(all): 1 day, 22:54:50/4:46:45, loss=0.282534605306299, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=2.477208902135553, lr=0.003291549085096311
2023-11-16 12:32:07   INFO  epoch: 21/24, acc_iter=144127, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:16:01, time_cost(all): 1 day, 22:55:49/5:02:15, loss=0.282423663158122, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=2.3037239706006027, lr=0.003270618821386151
2023-11-16 12:33:06   INFO  epoch: 21/24, acc_iter=144177, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:15:08, time_cost(all): 1 day, 22:56:48/4:47:09, loss=0.282312721009945, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=2.537066509540366, lr=0.003249688557675991
2023-11-16 12:34:04   INFO  epoch: 21/24, acc_iter=144227, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:52, time_cost(all): 1 day, 22:57:46/4:47:36, loss=0.282201778861768, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=2.8038203199855483, lr=0.003228758293965831
2023-11-16 12:35:03   INFO  epoch: 21/24, acc_iter=144277, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:10, time_cost(all): 1 day, 22:58:45/4:48:16, loss=0.282090836713592, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=3.3049308388026453, lr=0.003207828030255671
2023-11-16 12:36:02   INFO  epoch: 21/24, acc_iter=144327, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:56, time_cost(all): 1 day, 22:59:44/4:35:20, loss=0.281979894565415, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=3.508351064167094, lr=0.003186897766545511
2023-11-16 12:37:01   INFO  epoch: 21/24, acc_iter=144377, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:32, time_cost(all): 1 day, 23:00:43/4:58:29, loss=0.281868952417238, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=1.0645006154195569, lr=0.003165967502835351
2023-11-16 12:38:00   INFO  epoch: 21/24, acc_iter=144427, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:43, time_cost(all): 1 day, 23:01:42/4:49:05, loss=0.281758010269061, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=3.7723936881086417, lr=0.003145037239125192
2023-11-16 12:38:59   INFO  epoch: 21/24, acc_iter=144477, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:39, time_cost(all): 1 day, 23:02:41/4:54:53, loss=0.281647068120885, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.05(1.03), norm=3.5012374986550965, lr=0.003124106975415031
2023-11-16 12:39:58   INFO  epoch: 21/24, acc_iter=144527, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:56, time_cost(all): 1 day, 23:03:40/4:53:57, loss=0.281536125972708, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.87(1.03), norm=0.9706228907084732, lr=0.003103176711704871
2023-11-16 12:40:57   INFO  epoch: 21/24, acc_iter=144577, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:56, time_cost(all): 1 day, 23:04:39/4:42:12, loss=0.281425183824531, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.6682311030773027, lr=0.003082246447994712
2023-11-16 12:41:56   INFO  epoch: 21/24, acc_iter=144627, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:33, time_cost(all): 1 day, 23:05:38/4:44:43, loss=0.281314241676354, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=1.8763746093762272, lr=0.003061316184284552
2023-11-16 12:42:55   INFO  epoch: 21/24, acc_iter=144677, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:27, time_cost(all): 1 day, 23:06:37/4:49:26, loss=0.281203299528178, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=4.157118143605343, lr=0.003040385920574392
2023-11-16 12:43:54   INFO  epoch: 21/24, acc_iter=144727, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:49, time_cost(all): 1 day, 23:07:36/4:40:24, loss=0.281092357380001, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.533815442680756, lr=0.003019455656864232
2023-11-16 12:44:53   INFO  epoch: 21/24, acc_iter=144777, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:37, time_cost(all): 1 day, 23:08:35/4:46:31, loss=0.280981415231824, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.22(1.03), norm=0.5440246956833783, lr=0.002998525393154072
2023-11-16 12:45:52   INFO  epoch: 21/24, acc_iter=144827, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:45, time_cost(all): 1 day, 23:09:34/4:28:39, loss=0.280870473083647, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=2.81747778307095, lr=0.002977595129443912
2023-11-16 12:46:51   INFO  epoch: 21/24, acc_iter=144877, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 1 day, 23:10:33/4:42:24, loss=0.280759530935471, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.8129280157207248, lr=0.002956664865733752
2023-11-16 12:47:49   INFO  epoch: 22/24, acc_iter=144964, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:04:15, time_cost(all): 1 day, 23:11:31/4:28:13, loss=0.280566491597643, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=1.1539194339774306, lr=0.002920246206878074
2023-11-16 12:48:48   INFO  epoch: 22/24, acc_iter=145014, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:12:25, time_cost(all): 1 day, 23:12:30/4:20:17, loss=0.280455549449466, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=3.741308727157394, lr=0.002899315943167914
2023-11-16 12:49:47   INFO  epoch: 22/24, acc_iter=145064, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:10:30, time_cost(all): 1 day, 23:13:29/4:32:56, loss=0.28034460730129, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.01(1.03), norm=3.2276279547255347, lr=0.002878385679457754
2023-11-16 12:50:46   INFO  epoch: 22/24, acc_iter=145114, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:01:43, time_cost(all): 1 day, 23:14:28/4:24:19, loss=0.280233665153113, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.85(1.03), norm=3.378601125699067, lr=0.002857455415747593
2023-11-16 12:51:45   INFO  epoch: 22/24, acc_iter=145164, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:04:25, time_cost(all): 1 day, 23:15:27/4:17:37, loss=0.280122723004936, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=2.467992189685963, lr=0.002836525152037433
2023-11-16 12:52:44   INFO  epoch: 22/24, acc_iter=145214, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:57:24, time_cost(all): 1 day, 23:16:26/4:21:37, loss=0.280011780856759, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.84(1.03), norm=4.054153876070995, lr=0.002815594888327273
2023-11-16 12:53:43   INFO  epoch: 22/24, acc_iter=145264, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:00:56, time_cost(all): 1 day, 23:17:25/4:28:14, loss=0.279900838708583, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.4520977177478975, lr=0.002794664624617113
2023-11-16 12:54:42   INFO  epoch: 22/24, acc_iter=145314, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/2:03:41, time_cost(all): 1 day, 23:18:24/4:18:04, loss=0.279789896560406, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=3.3682227765025288, lr=0.002773734360906954
2023-11-16 12:55:41   INFO  epoch: 22/24, acc_iter=145364, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:05:56, time_cost(all): 1 day, 23:19:23/4:34:03, loss=0.279678954412229, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=1.8466174748110387, lr=0.002752804097196793
2023-11-16 12:56:40   INFO  epoch: 22/24, acc_iter=145414, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:53:41, time_cost(all): 1 day, 23:20:22/4:27:54, loss=0.279568012264052, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.96970930758624, lr=0.002731873833486633
2023-11-16 12:57:39   INFO  epoch: 22/24, acc_iter=145464, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/2:03:28, time_cost(all): 1 day, 23:21:21/4:27:39, loss=0.279457070115876, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=4.182905704964995, lr=0.002710943569776474
2023-11-16 12:58:38   INFO  epoch: 22/24, acc_iter=145514, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:11, time_cost(all): 1 day, 23:22:20/4:15:50, loss=0.279346127967699, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=1.5483076720633817, lr=0.002690013306066314
2023-11-16 12:59:37   INFO  epoch: 22/24, acc_iter=145564, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:52:48, time_cost(all): 1 day, 23:23:19/4:16:10, loss=0.279235185819522, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.2143886518090214, lr=0.002669083042356153
2023-11-16 13:00:36   INFO  epoch: 22/24, acc_iter=145614, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:59:45, time_cost(all): 1 day, 23:24:18/4:09:53, loss=0.279124243671345, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=2.546685377604115, lr=0.002648152778645994
2023-11-16 13:01:34   INFO  epoch: 22/24, acc_iter=145664, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:52:29, time_cost(all): 1 day, 23:25:16/4:25:45, loss=0.279013301523168, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.2(1.03), norm=1.4323746771042174, lr=0.002627222514935834
2023-11-16 13:02:33   INFO  epoch: 22/24, acc_iter=145714, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:51:23, time_cost(all): 1 day, 23:26:15/4:27:17, loss=0.278902359374992, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.11(1.03), norm=1.414840887931971, lr=0.002606292251225674
2023-11-16 13:03:32   INFO  epoch: 22/24, acc_iter=145764, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:51:58, time_cost(all): 1 day, 23:27:14/4:18:12, loss=0.278791417226815, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=3.6649629169162665, lr=0.002585361987515514
2023-11-16 13:04:31   INFO  epoch: 22/24, acc_iter=145814, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:54:38, time_cost(all): 1 day, 23:28:13/4:08:45, loss=0.278680475078638, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=0.6220203148841152, lr=0.002564431723805354
2023-11-16 13:05:30   INFO  epoch: 22/24, acc_iter=145864, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:50:05, time_cost(all): 1 day, 23:29:12/4:08:42, loss=0.278569532930461, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.01(1.03), norm=4.34729213014018, lr=0.002543501460095194
2023-11-16 13:06:29   INFO  epoch: 22/24, acc_iter=145914, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:51:59, time_cost(all): 1 day, 23:30:11/4:19:04, loss=0.278458590782285, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=2.8038669610429965, lr=0.002522571196385034
2023-11-16 13:07:28   INFO  epoch: 22/24, acc_iter=145964, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:53:32, time_cost(all): 1 day, 23:31:10/4:22:48, loss=0.278347648634108, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.558196051453348, lr=0.002501640932674874
2023-11-16 13:08:27   INFO  epoch: 22/24, acc_iter=146014, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:49:19, time_cost(all): 1 day, 23:32:09/4:11:33, loss=0.278236706485931, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=3.439595889465189, lr=0.002480710668964714
2023-11-16 13:09:26   INFO  epoch: 22/24, acc_iter=146064, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:49:03, time_cost(all): 1 day, 23:33:08/4:24:10, loss=0.278125764337754, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=1.2480366574058095, lr=0.002459780405254554
2023-11-16 13:10:25   INFO  epoch: 22/24, acc_iter=146114, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:43:17, time_cost(all): 1 day, 23:34:07/4:00:02, loss=0.278014822189578, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.89(1.03), norm=4.756880708177355, lr=0.002438850141544394
2023-11-16 13:11:24   INFO  epoch: 22/24, acc_iter=146164, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:46:20, time_cost(all): 1 day, 23:35:06/4:07:46, loss=0.277903880041401, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=2.5398480567375503, lr=0.002417919877834234
2023-11-16 13:12:23   INFO  epoch: 22/24, acc_iter=146214, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:46:12, time_cost(all): 1 day, 23:36:05/4:11:49, loss=0.277792937893224, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.98(1.03), norm=1.5317235050634863, lr=0.002396989614124074
2023-11-16 13:13:22   INFO  epoch: 22/24, acc_iter=146264, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:40:50, time_cost(all): 1 day, 23:37:04/4:07:59, loss=0.277681995745047, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=2.5382636235717895, lr=0.002376059350413915
2023-11-16 13:14:21   INFO  epoch: 22/24, acc_iter=146314, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:43:20, time_cost(all): 1 day, 23:38:03/3:57:14, loss=0.277571053596871, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=1.2536736893476688, lr=0.002355129086703754
2023-11-16 13:15:19   INFO  epoch: 22/24, acc_iter=146364, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:44:26, time_cost(all): 1 day, 23:39:01/4:13:53, loss=0.277460111448694, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=2.313711749008979, lr=0.002334198822993594
2023-11-16 13:16:18   INFO  epoch: 22/24, acc_iter=146414, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:41:00, time_cost(all): 1 day, 23:40:00/4:04:09, loss=0.277349169300517, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=4.286307259485671, lr=0.002313268559283435
2023-11-16 13:17:17   INFO  epoch: 22/24, acc_iter=146464, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:40:11, time_cost(all): 1 day, 23:40:59/4:15:56, loss=0.27723822715234, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=1.21979169468943, lr=0.002292338295573275
2023-11-16 13:18:16   INFO  epoch: 22/24, acc_iter=146514, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:35:49, time_cost(all): 1 day, 23:41:58/4:10:44, loss=0.277127285004164, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=0.6816005397400329, lr=0.002271408031863115
2023-11-16 13:19:15   INFO  epoch: 22/24, acc_iter=146564, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:41:11, time_cost(all): 1 day, 23:42:57/3:51:56, loss=0.277016342855987, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.96(1.03), norm=3.064660863788117, lr=0.002250477768152955
2023-11-16 13:20:14   INFO  epoch: 22/24, acc_iter=146614, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:31:26, time_cost(all): 1 day, 23:43:56/3:50:51, loss=0.27690540070781, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=2.624601564555519, lr=0.002229547504442795
2023-11-16 13:21:13   INFO  epoch: 22/24, acc_iter=146664, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:31:25, time_cost(all): 1 day, 23:44:55/4:06:31, loss=0.276794458559633, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=4.95697325707846, lr=0.002208617240732635
2023-11-16 13:22:12   INFO  epoch: 22/24, acc_iter=146714, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:33:15, time_cost(all): 1 day, 23:45:54/4:03:46, loss=0.276683516411457, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=2.4257499197321453, lr=0.002187686977022475
2023-11-16 13:23:11   INFO  epoch: 22/24, acc_iter=146764, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:33:27, time_cost(all): 1 day, 23:46:53/4:07:33, loss=0.27657257426328, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=3.4972939444847126, lr=0.002166756713312314
2023-11-16 13:24:10   INFO  epoch: 22/24, acc_iter=146814, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:36:37, time_cost(all): 1 day, 23:47:52/3:56:39, loss=0.276461632115103, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.85(1.03), norm=3.2871232747058574, lr=0.002145826449602156
2023-11-16 13:25:09   INFO  epoch: 22/24, acc_iter=146864, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:35:29, time_cost(all): 1 day, 23:48:51/3:46:43, loss=0.276350689966926, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=3.647675847417891, lr=0.002124896185891996
2023-11-16 13:26:08   INFO  epoch: 22/24, acc_iter=146914, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:26:14, time_cost(all): 1 day, 23:49:50/3:59:38, loss=0.27623974781875, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=2.865515231846288, lr=0.002103965922181835
2023-11-16 13:27:07   INFO  epoch: 22/24, acc_iter=146964, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:28:50, time_cost(all): 1 day, 23:50:49/3:51:33, loss=0.276128805670573, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=4.6353090860516835, lr=0.002083035658471675
2023-11-16 13:28:06   INFO  epoch: 22/24, acc_iter=147014, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:31:57, time_cost(all): 1 day, 23:51:48/4:00:18, loss=0.276017863522396, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.98(1.03), norm=2.908646921645586, lr=0.002062105394761515
2023-11-16 13:29:04   INFO  epoch: 22/24, acc_iter=147064, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:30:47, time_cost(all): 1 day, 23:52:46/4:00:59, loss=0.275906921374219, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=2.8619375182655715, lr=0.002041175131051355
2023-11-16 13:30:03   INFO  epoch: 22/24, acc_iter=147114, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:28:18, time_cost(all): 1 day, 23:53:45/3:55:05, loss=0.275795979226043, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=2.819140014280153, lr=0.002020244867341195
2023-11-16 13:31:02   INFO  epoch: 22/24, acc_iter=147164, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:23:18, time_cost(all): 1 day, 23:54:44/3:59:33, loss=0.275685037077866, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=1.5038606055403472, lr=0.001999314603631034
2023-11-16 13:32:01   INFO  epoch: 22/24, acc_iter=147214, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:26:15, time_cost(all): 1 day, 23:55:43/3:59:20, loss=0.275574094929689, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=0.5877544798010844, lr=0.001978384339920874
2023-11-16 13:33:00   INFO  epoch: 22/24, acc_iter=147264, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:26:51, time_cost(all): 1 day, 23:56:42/3:46:44, loss=0.275463152781512, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=3.3593306919823864, lr=0.001957454076210714
2023-11-16 13:33:59   INFO  epoch: 22/24, acc_iter=147314, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:22:21, time_cost(all): 1 day, 23:57:41/3:39:47, loss=0.275352210633336, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=4.596462889171279, lr=0.001936523812500555
2023-11-16 13:34:58   INFO  epoch: 22/24, acc_iter=147364, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:22:15, time_cost(all): 1 day, 23:58:40/3:53:26, loss=0.275241268485159, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=0.9369621345761977, lr=0.001915593548790395
2023-11-16 13:35:57   INFO  epoch: 22/24, acc_iter=147414, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:22:33, time_cost(all): 1 day, 23:59:39/3:56:11, loss=0.275130326336982, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.08(1.03), norm=4.268346200714854, lr=0.001894663285080235
2023-11-16 13:36:56   INFO  epoch: 22/24, acc_iter=147464, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:20:36, time_cost(all): 2 days, 0:00:38/3:37:05, loss=0.275019384188805, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=3.7713540261027365, lr=0.001873733021370075
2023-11-16 13:37:55   INFO  epoch: 22/24, acc_iter=147514, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:18:21, time_cost(all): 2 days, 0:01:37/3:51:41, loss=0.274908442040629, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=4.379530578616366, lr=0.001852802757659915
2023-11-16 13:38:54   INFO  epoch: 22/24, acc_iter=147564, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:15:01, time_cost(all): 2 days, 0:02:36/3:46:03, loss=0.274797499892452, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.85(1.03), norm=1.7307157942608702, lr=0.001831872493949754
2023-11-16 13:39:53   INFO  epoch: 22/24, acc_iter=147614, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:18:23, time_cost(all): 2 days, 0:03:35/3:33:50, loss=0.274686557744275, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=3.9077991372741154, lr=0.001810942230239596
2023-11-16 13:40:52   INFO  epoch: 22/24, acc_iter=147664, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:14:19, time_cost(all): 2 days, 0:04:34/3:48:13, loss=0.274575615596098, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.83(1.03), norm=1.8790505575183327, lr=0.001790011966529436
2023-11-16 13:41:51   INFO  epoch: 22/24, acc_iter=147714, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:15:36, time_cost(all): 2 days, 0:05:33/3:42:16, loss=0.274464673447922, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.106433388936387, lr=0.001769081702819275
2023-11-16 13:42:50   INFO  epoch: 22/24, acc_iter=147764, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:16:51, time_cost(all): 2 days, 0:06:32/3:42:19, loss=0.274353731299745, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.012519451928212, lr=0.001748151439109115
2023-11-16 13:43:48   INFO  epoch: 22/24, acc_iter=147814, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:18, time_cost(all): 2 days, 0:07:30/3:39:47, loss=0.274242789151568, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=0.58571157161637, lr=0.001727221175398955
2023-11-16 13:44:47   INFO  epoch: 22/24, acc_iter=147864, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:55, time_cost(all): 2 days, 0:08:29/3:36:49, loss=0.274131847003391, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=3.001824822993702, lr=0.001706290911688795
2023-11-16 13:45:46   INFO  epoch: 22/24, acc_iter=147914, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:08:11, time_cost(all): 2 days, 0:09:28/3:39:15, loss=0.274020904855215, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=1.1213695474242087, lr=0.001685360647978636
2023-11-16 13:46:45   INFO  epoch: 22/24, acc_iter=147964, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:12:14, time_cost(all): 2 days, 0:10:27/3:45:17, loss=0.273909962707038, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=3.5607599022459917, lr=0.001664430384268476
2023-11-16 13:47:44   INFO  epoch: 22/24, acc_iter=148014, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:08:57, time_cost(all): 2 days, 0:11:26/3:37:39, loss=0.273799020558861, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.7496409229753995, lr=0.001643500120558316
2023-11-16 13:48:43   INFO  epoch: 22/24, acc_iter=148064, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:09:10, time_cost(all): 2 days, 0:12:25/3:23:51, loss=0.273688078410684, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.07(1.03), norm=3.403691520659643, lr=0.001622569856848156
2023-11-16 13:49:42   INFO  epoch: 22/24, acc_iter=148114, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:04:31, time_cost(all): 2 days, 0:13:24/3:25:57, loss=0.273577136262508, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=1.9147404670497177, lr=0.001601639593137996
2023-11-16 13:50:41   INFO  epoch: 22/24, acc_iter=148164, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:04:19, time_cost(all): 2 days, 0:14:23/3:31:02, loss=0.273466194114331, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.3888666678382124, lr=0.001580709329427835
2023-11-16 13:51:40   INFO  epoch: 22/24, acc_iter=148214, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:06:01, time_cost(all): 2 days, 0:15:22/3:33:16, loss=0.273355251966154, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=1.8494960049694287, lr=0.001559779065717675
2023-11-16 13:52:39   INFO  epoch: 22/24, acc_iter=148264, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:03, time_cost(all): 2 days, 0:16:21/3:21:36, loss=0.273244309817977, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.682280372622155, lr=0.001538848802007517
2023-11-16 13:53:38   INFO  epoch: 22/24, acc_iter=148314, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:03:52, time_cost(all): 2 days, 0:17:20/3:22:59, loss=0.273133367669801, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=4.085142394082128, lr=0.001517918538297356
2023-11-16 13:54:37   INFO  epoch: 22/24, acc_iter=148364, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:04:31, time_cost(all): 2 days, 0:18:19/3:33:18, loss=0.273022425521624, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=4.08311896177846, lr=0.001496988274587196
2023-11-16 13:55:36   INFO  epoch: 22/24, acc_iter=148414, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/0:59:28, time_cost(all): 2 days, 0:19:18/3:21:09, loss=0.272911483373447, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.03(1.03), norm=3.483916903443449, lr=0.001476058010877036
2023-11-16 13:56:35   INFO  epoch: 22/24, acc_iter=148464, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:57:49, time_cost(all): 2 days, 0:20:17/3:27:39, loss=0.27280054122527, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.5075302979426017, lr=0.001455127747166876
2023-11-16 13:57:33   INFO  epoch: 22/24, acc_iter=148514, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:55:58, time_cost(all): 2 days, 0:21:15/3:25:05, loss=0.272689599077094, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=4.0131137947935525, lr=0.001434197483456716
2023-11-16 13:58:32   INFO  epoch: 22/24, acc_iter=148564, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:58:26, time_cost(all): 2 days, 0:22:14/3:32:11, loss=0.272578656928917, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=1.108957964172725, lr=0.001413267219746557
2023-11-16 13:59:31   INFO  epoch: 22/24, acc_iter=148614, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:50, time_cost(all): 2 days, 0:23:13/3:26:07, loss=0.27246771478074, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=2.8738241318722313, lr=0.001392336956036397
2023-11-16 14:00:30   INFO  epoch: 22/24, acc_iter=148664, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:53:20, time_cost(all): 2 days, 0:24:12/3:29:18, loss=0.272356772632563, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=1.1435538190758254, lr=0.001371406692326237
2023-11-16 14:01:29   INFO  epoch: 22/24, acc_iter=148714, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:52:19, time_cost(all): 2 days, 0:25:11/3:15:54, loss=0.272245830484387, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.91(1.03), norm=3.5685449703393735, lr=0.001350476428616076
2023-11-16 14:02:28   INFO  epoch: 22/24, acc_iter=148764, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:55:40, time_cost(all): 2 days, 0:26:10/3:10:51, loss=0.27213488833621, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.1113069912511455, lr=0.001329546164905916
2023-11-16 14:03:27   INFO  epoch: 22/24, acc_iter=148814, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:52:03, time_cost(all): 2 days, 0:27:09/3:23:23, loss=0.272023946188033, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=1.2136001107482368, lr=0.001308615901195756
2023-11-16 14:04:26   INFO  epoch: 22/24, acc_iter=148864, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:54:20, time_cost(all): 2 days, 0:28:08/3:22:39, loss=0.271913004039856, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=0.8900217920133553, lr=0.001287685637485598
2023-11-16 14:05:25   INFO  epoch: 22/24, acc_iter=148914, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:51:59, time_cost(all): 2 days, 0:29:07/3:20:52, loss=0.27180206189168, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.11(1.03), norm=1.5246925410418017, lr=0.001266755373775437
2023-11-16 14:06:24   INFO  epoch: 22/24, acc_iter=148964, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:05, time_cost(all): 2 days, 0:30:06/3:22:01, loss=0.271691119743503, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=4.850495076541371, lr=0.001245825110065275
2023-11-16 14:07:23   INFO  epoch: 22/24, acc_iter=149014, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:48:23, time_cost(all): 2 days, 0:31:05/3:17:07, loss=0.271580177595326, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=1.3740946907409035, lr=0.001224894846355115
2023-11-16 14:08:22   INFO  epoch: 22/24, acc_iter=149064, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:46:16, time_cost(all): 2 days, 0:32:04/3:12:39, loss=0.271469235447149, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.04655416117536, lr=0.001203964582644957
2023-11-16 14:09:21   INFO  epoch: 22/24, acc_iter=149114, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:45:46, time_cost(all): 2 days, 0:33:03/3:08:59, loss=0.271358293298973, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.247063166032522, lr=0.001183034318934796
2023-11-16 14:10:20   INFO  epoch: 22/24, acc_iter=149164, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:47, time_cost(all): 2 days, 0:34:02/3:19:00, loss=0.271247351150796, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.94(1.03), norm=3.8706120068818004, lr=0.001162104055224636
2023-11-16 14:11:18   INFO  epoch: 22/24, acc_iter=149214, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:46:51, time_cost(all): 2 days, 0:35:00/3:06:36, loss=0.271136409002619, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.686995103378865, lr=0.001141173791514476
2023-11-16 14:12:17   INFO  epoch: 22/24, acc_iter=149264, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:44:10, time_cost(all): 2 days, 0:35:59/3:09:02, loss=0.271025466854442, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=3.485965308289312, lr=0.001120243527804316
2023-11-16 14:13:16   INFO  epoch: 22/24, acc_iter=149314, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:42:44, time_cost(all): 2 days, 0:36:58/3:09:54, loss=0.270914524706266, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=1.4437773595478294, lr=0.001099313264094156
2023-11-16 14:14:15   INFO  epoch: 22/24, acc_iter=149364, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:42:12, time_cost(all): 2 days, 0:37:57/3:10:36, loss=0.270803582558089, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=2.1701021322405314, lr=0.001078383000383997
2023-11-16 14:15:14   INFO  epoch: 22/24, acc_iter=149414, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:41:27, time_cost(all): 2 days, 0:38:56/2:58:10, loss=0.270692640409912, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=0.951803750366691, lr=0.001057452736673837
2023-11-16 14:16:13   INFO  epoch: 22/24, acc_iter=149464, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:06, time_cost(all): 2 days, 0:39:55/3:12:42, loss=0.270581698261735, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=3.7937162790943892, lr=0.001036522472963677
2023-11-16 14:17:12   INFO  epoch: 22/24, acc_iter=149514, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:40:28, time_cost(all): 2 days, 0:40:54/3:10:13, loss=0.270470756113559, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=3.870882231916748, lr=0.001015592209253516
2023-11-16 14:18:11   INFO  epoch: 22/24, acc_iter=149564, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:36:34, time_cost(all): 2 days, 0:41:53/3:11:33, loss=0.270359813965382, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=2.4943399176638197, lr=0.000998506372821887
2023-11-16 14:19:10   INFO  epoch: 22/24, acc_iter=149614, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:38:22, time_cost(all): 2 days, 0:42:52/3:05:26, loss=0.270248871817205, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=3.5598338274788808, lr=0.000992649929996794
2023-11-16 14:20:09   INFO  epoch: 22/24, acc_iter=149664, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:37:19, time_cost(all): 2 days, 0:43:51/2:57:43, loss=0.270137929669028, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=3.7759897519828325, lr=0.000986793487171701
2023-11-16 14:21:08   INFO  epoch: 22/24, acc_iter=149714, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:33:22, time_cost(all): 2 days, 0:44:50/3:01:55, loss=0.270026987520852, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=1.2352234933620856, lr=0.000980937044346608
2023-11-16 14:22:07   INFO  epoch: 22/24, acc_iter=149764, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:34:41, time_cost(all): 2 days, 0:45:49/2:53:27, loss=0.269916045372675, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.95(1.03), norm=4.207008691408934, lr=0.000975080601521515
2023-11-16 14:23:06   INFO  epoch: 22/24, acc_iter=149814, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:34:46, time_cost(all): 2 days, 0:46:48/2:57:17, loss=0.269805103224498, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.96(1.03), norm=2.660829878294554, lr=0.000969224158696421
2023-11-16 14:24:05   INFO  epoch: 22/24, acc_iter=149864, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:32:21, time_cost(all): 2 days, 0:47:47/3:04:09, loss=0.269694161076321, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=1.656020254790699, lr=0.000963367715871328
2023-11-16 14:25:03   INFO  epoch: 22/24, acc_iter=149914, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:32:22, time_cost(all): 2 days, 0:48:45/2:51:03, loss=0.269583218928145, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=1.110681525070178, lr=0.000957511273046235
2023-11-16 14:26:02   INFO  epoch: 22/24, acc_iter=149964, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:29:36, time_cost(all): 2 days, 0:49:44/2:46:57, loss=0.269472276779968, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=3.0801785633852723, lr=0.000951654830221142
2023-11-16 14:27:01   INFO  epoch: 22/24, acc_iter=150014, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:28:21, time_cost(all): 2 days, 0:50:43/2:50:49, loss=0.269361334631791, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=2.4591527056155003, lr=0.000945798387396049
2023-11-16 14:28:00   INFO  epoch: 22/24, acc_iter=150064, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:27:12, time_cost(all): 2 days, 0:51:42/2:56:23, loss=0.269250392483614, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=2.81056577689549, lr=0.000939941944570955
2023-11-16 14:28:59   INFO  epoch: 22/24, acc_iter=150114, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:23, time_cost(all): 2 days, 0:52:41/2:51:07, loss=0.269139450335438, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=3.431417734328862, lr=0.000934085501745862
2023-11-16 14:29:58   INFO  epoch: 22/24, acc_iter=150164, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:55, time_cost(all): 2 days, 0:53:40/2:43:18, loss=0.269028508187261, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.02133634939464, lr=0.000928229058920769
2023-11-16 14:30:57   INFO  epoch: 22/24, acc_iter=150214, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:25:32, time_cost(all): 2 days, 0:54:39/2:54:54, loss=0.268917566039084, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.08(1.03), norm=1.0318220035442949, lr=0.000922372616095676
2023-11-16 14:31:56   INFO  epoch: 22/24, acc_iter=150264, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:22, time_cost(all): 2 days, 0:55:38/2:41:20, loss=0.268806623890907, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.482181641120146, lr=0.000916516173270583
2023-11-16 14:32:55   INFO  epoch: 22/24, acc_iter=150314, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:22:42, time_cost(all): 2 days, 0:56:37/2:54:13, loss=0.268695681742731, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=1.0671758709258021, lr=0.000910659730445489
2023-11-16 14:33:54   INFO  epoch: 22/24, acc_iter=150364, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:23:26, time_cost(all): 2 days, 0:57:36/2:54:52, loss=0.268584739594554, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=1.8105521651027752, lr=0.000904803287620396
2023-11-16 14:34:53   INFO  epoch: 22/24, acc_iter=150414, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:28, time_cost(all): 2 days, 0:58:35/2:50:29, loss=0.268473797446377, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.21(1.03), norm=3.938796262760093, lr=0.000898946844795303
2023-11-16 14:35:52   INFO  epoch: 22/24, acc_iter=150464, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:19:39, time_cost(all): 2 days, 0:59:34/2:39:21, loss=0.2683628552982, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=0.832665430478458, lr=0.00089309040197021
2023-11-16 14:36:51   INFO  epoch: 22/24, acc_iter=150514, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:20:07, time_cost(all): 2 days, 1:00:33/2:49:58, loss=0.268251913150024, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=1.6352940889054608, lr=0.000887233959145117
2023-11-16 14:37:50   INFO  epoch: 22/24, acc_iter=150564, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:47, time_cost(all): 2 days, 1:01:32/2:49:50, loss=0.268140971001847, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=2.9227301853192182, lr=0.000881377516320024
2023-11-16 14:38:48   INFO  epoch: 22/24, acc_iter=150614, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:17:19, time_cost(all): 2 days, 1:02:30/2:40:56, loss=0.26803002885367, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=3.1660566153697145, lr=0.00087552107349493
2023-11-16 14:39:47   INFO  epoch: 22/24, acc_iter=150664, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:16:24, time_cost(all): 2 days, 1:03:29/2:41:12, loss=0.267919086705493, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=4.095456905531598, lr=0.000869664630669837
2023-11-16 14:40:46   INFO  epoch: 22/24, acc_iter=150714, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:15:01, time_cost(all): 2 days, 1:04:28/2:45:54, loss=0.267808144557317, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=3.1235182724302595, lr=0.000863808187844744
2023-11-16 14:41:45   INFO  epoch: 22/24, acc_iter=150764, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:13:47, time_cost(all): 2 days, 1:05:27/2:38:39, loss=0.26769720240914, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=0.9205676208341627, lr=0.000857951745019651
2023-11-16 14:42:44   INFO  epoch: 22/24, acc_iter=150814, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:14:09, time_cost(all): 2 days, 1:06:26/2:41:47, loss=0.267586260260963, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.2527040432779355, lr=0.000852095302194558
2023-11-16 14:43:43   INFO  epoch: 22/24, acc_iter=150864, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:07, time_cost(all): 2 days, 1:07:25/2:44:20, loss=0.267475318112786, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=4.3600647203935665, lr=0.000846238859369464
2023-11-16 14:44:42   INFO  epoch: 22/24, acc_iter=150914, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:26, time_cost(all): 2 days, 1:08:24/2:30:42, loss=0.26736437596461, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=4.9544802369131045, lr=0.000840382416544371
2023-11-16 14:45:41   INFO  epoch: 22/24, acc_iter=150964, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:04, time_cost(all): 2 days, 1:09:23/2:38:31, loss=0.267253433816433, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=0.7834115146735547, lr=0.000834525973719278
2023-11-16 14:46:40   INFO  epoch: 22/24, acc_iter=151014, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:52, time_cost(all): 2 days, 1:10:22/2:41:40, loss=0.267142491668256, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.151754766235873, lr=0.000828669530894185
2023-11-16 14:47:39   INFO  epoch: 22/24, acc_iter=151064, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:18, time_cost(all): 2 days, 1:11:21/2:39:41, loss=0.267031549520079, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=4.362410406245957, lr=0.000822813088069092
2023-11-16 14:48:38   INFO  epoch: 22/24, acc_iter=151114, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:45, time_cost(all): 2 days, 1:12:20/2:35:21, loss=0.266920607371903, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=4.1201205730374335, lr=0.000816956645243998
2023-11-16 14:49:37   INFO  epoch: 22/24, acc_iter=151164, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:42, time_cost(all): 2 days, 1:13:19/2:30:18, loss=0.266809665223726, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.93(1.03), norm=1.2139684779341378, lr=0.000811100202418905
2023-11-16 14:50:36   INFO  epoch: 22/24, acc_iter=151214, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:42, time_cost(all): 2 days, 1:14:18/2:25:03, loss=0.266698723075549, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.7781175762090224, lr=0.000805243759593812
2023-11-16 14:51:35   INFO  epoch: 22/24, acc_iter=151264, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:39, time_cost(all): 2 days, 1:15:17/2:25:08, loss=0.266587780927372, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=1.5961196835747902, lr=0.000799387316768719
2023-11-16 14:52:33   INFO  epoch: 22/24, acc_iter=151314, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:33, time_cost(all): 2 days, 1:16:15/2:33:10, loss=0.266476838779196, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.91(1.03), norm=1.0634104715450818, lr=0.000793530873943626
2023-11-16 14:53:32   INFO  epoch: 22/24, acc_iter=151364, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:40, time_cost(all): 2 days, 1:17:14/2:34:34, loss=0.266365896631019, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=0.8018741659673638, lr=0.000787674431118533
2023-11-16 14:54:31   INFO  epoch: 22/24, acc_iter=151414, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:39, time_cost(all): 2 days, 1:18:13/2:21:58, loss=0.266254954482842, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=4.396160117928281, lr=0.000781817988293439
2023-11-16 14:55:30   INFO  epoch: 22/24, acc_iter=151464, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:43, time_cost(all): 2 days, 1:19:12/2:30:58, loss=0.266144012334665, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=1.3091778812975998, lr=0.000775961545468346
2023-11-16 14:56:29   INFO  epoch: 23/24, acc_iter=151551, cur_iter=50/6587, batch_size=24, time_cost(epoch): 0:00:58/2:10:37, time_cost(all): 2 days, 1:20:11/2:22:31, loss=0.265950972996838, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.7235436961634814, lr=0.000765771334952684
2023-11-16 14:57:28   INFO  epoch: 23/24, acc_iter=151601, cur_iter=100/6587, batch_size=24, time_cost(epoch): 0:01:57/2:10:13, time_cost(all): 2 days, 1:21:10/2:29:40, loss=0.265840030848661, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=1.425909062357049, lr=0.000759914892127591
2023-11-16 14:58:27   INFO  epoch: 23/24, acc_iter=151651, cur_iter=150/6587, batch_size=24, time_cost(epoch): 0:02:56/2:07:19, time_cost(all): 2 days, 1:22:09/2:26:48, loss=0.265729088700484, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.95(1.03), norm=4.072764844183788, lr=0.000754058449302497
2023-11-16 14:59:26   INFO  epoch: 23/24, acc_iter=151701, cur_iter=200/6587, batch_size=24, time_cost(epoch): 0:03:55/2:02:23, time_cost(all): 2 days, 1:23:08/2:22:16, loss=0.265618146552308, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.9651836456270195, lr=0.000748202006477404
2023-11-16 15:00:25   INFO  epoch: 23/24, acc_iter=151751, cur_iter=250/6587, batch_size=24, time_cost(epoch): 0:04:54/2:09:54, time_cost(all): 2 days, 1:24:07/2:23:17, loss=0.265507204404131, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=1.3789122079956153, lr=0.000742345563652311
2023-11-16 15:01:24   INFO  epoch: 23/24, acc_iter=151801, cur_iter=300/6587, batch_size=24, time_cost(epoch): 0:05:53/1:58:45, time_cost(all): 2 days, 1:25:06/2:14:02, loss=0.265396262255954, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=2.899129230381755, lr=0.000736489120827218
2023-11-16 15:02:23   INFO  epoch: 23/24, acc_iter=151851, cur_iter=350/6587, batch_size=24, time_cost(epoch): 0:06:52/2:05:40, time_cost(all): 2 days, 1:26:05/2:12:55, loss=0.265285320107777, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.87(1.03), norm=2.325777681044369, lr=0.000730632678002125
2023-11-16 15:03:22   INFO  epoch: 23/24, acc_iter=151901, cur_iter=400/6587, batch_size=24, time_cost(epoch): 0:07:51/1:59:30, time_cost(all): 2 days, 1:27:04/2:23:50, loss=0.265174377959601, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=4.119247287812774, lr=0.000724776235177032
2023-11-16 15:04:21   INFO  epoch: 23/24, acc_iter=151951, cur_iter=450/6587, batch_size=24, time_cost(epoch): 0:08:50/2:04:38, time_cost(all): 2 days, 1:28:03/2:22:45, loss=0.265063435811424, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=3.9350999068680927, lr=0.000718919792351938
2023-11-16 15:05:20   INFO  epoch: 23/24, acc_iter=152001, cur_iter=500/6587, batch_size=24, time_cost(epoch): 0:09:49/1:53:50, time_cost(all): 2 days, 1:29:02/2:22:48, loss=0.264952493663247, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.05(1.03), norm=4.145811787988892, lr=0.000713063349526845
2023-11-16 15:06:18   INFO  epoch: 23/24, acc_iter=152051, cur_iter=550/6587, batch_size=24, time_cost(epoch): 0:10:48/1:54:30, time_cost(all): 2 days, 1:30:00/2:18:19, loss=0.26484155151507, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.11(1.03), norm=4.348964098022591, lr=0.000707206906701752
2023-11-16 15:07:17   INFO  epoch: 23/24, acc_iter=152101, cur_iter=600/6587, batch_size=24, time_cost(epoch): 0:11:47/2:02:37, time_cost(all): 2 days, 1:30:59/2:19:33, loss=0.264730609366894, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=3.9016837739517665, lr=0.000701350463876659
2023-11-16 15:08:16   INFO  epoch: 23/24, acc_iter=152151, cur_iter=650/6587, batch_size=24, time_cost(epoch): 0:12:46/1:53:17, time_cost(all): 2 days, 1:31:58/2:15:41, loss=0.264619667218717, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=1.6188454613702172, lr=0.000695494021051566
2023-11-16 15:09:15   INFO  epoch: 23/24, acc_iter=152201, cur_iter=700/6587, batch_size=24, time_cost(epoch): 0:13:45/1:54:57, time_cost(all): 2 days, 1:32:57/2:16:08, loss=0.26450872507054, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=1.4926858780015655, lr=0.000689637578226472
2023-11-16 15:10:14   INFO  epoch: 23/24, acc_iter=152251, cur_iter=750/6587, batch_size=24, time_cost(epoch): 0:14:43/1:52:57, time_cost(all): 2 days, 1:33:56/2:07:40, loss=0.264397782922363, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=2.3922569375261915, lr=0.000683781135401379
2023-11-16 15:11:13   INFO  epoch: 23/24, acc_iter=152301, cur_iter=800/6587, batch_size=24, time_cost(epoch): 0:15:42/1:48:26, time_cost(all): 2 days, 1:34:55/2:10:27, loss=0.264286840774187, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.21(1.03), norm=3.6374369581114014, lr=0.000677924692576286
2023-11-16 15:12:12   INFO  epoch: 23/24, acc_iter=152351, cur_iter=850/6587, batch_size=24, time_cost(epoch): 0:16:41/1:56:42, time_cost(all): 2 days, 1:35:54/2:10:15, loss=0.26417589862601, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=2.4625093360278063, lr=0.000672068249751193
2023-11-16 15:13:11   INFO  epoch: 23/24, acc_iter=152401, cur_iter=900/6587, batch_size=24, time_cost(epoch): 0:17:40/1:55:21, time_cost(all): 2 days, 1:36:53/2:06:08, loss=0.264064956477833, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.05(1.03), norm=2.673494592318626, lr=0.0006662118069261
2023-11-16 15:14:10   INFO  epoch: 23/24, acc_iter=152451, cur_iter=950/6587, batch_size=24, time_cost(epoch): 0:18:39/1:53:56, time_cost(all): 2 days, 1:37:52/2:09:05, loss=0.263954014329656, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.1(1.03), norm=0.9112917412330268, lr=0.000660355364101007
2023-11-16 15:15:09   INFO  epoch: 23/24, acc_iter=152501, cur_iter=1000/6587, batch_size=24, time_cost(epoch): 0:19:38/1:47:07, time_cost(all): 2 days, 1:38:51/2:08:47, loss=0.26384307218148, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.86(1.03), norm=0.7449132534430737, lr=0.000654498921275913
2023-11-16 15:16:08   INFO  epoch: 23/24, acc_iter=152551, cur_iter=1050/6587, batch_size=24, time_cost(epoch): 0:20:37/1:51:23, time_cost(all): 2 days, 1:39:50/2:02:49, loss=0.263732130033303, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.2(1.03), norm=3.096199996611203, lr=0.00064864247845082
2023-11-16 15:17:07   INFO  epoch: 23/24, acc_iter=152601, cur_iter=1100/6587, batch_size=24, time_cost(epoch): 0:21:36/1:44:54, time_cost(all): 2 days, 1:40:49/2:01:01, loss=0.263621187885126, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.91(1.03), norm=3.1556072407924, lr=0.000642786035625727
2023-11-16 15:18:06   INFO  epoch: 23/24, acc_iter=152651, cur_iter=1150/6587, batch_size=24, time_cost(epoch): 0:22:35/1:47:00, time_cost(all): 2 days, 1:41:48/2:04:58, loss=0.263510245736949, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.459974416343503, lr=0.000636929592800634
2023-11-16 15:19:05   INFO  epoch: 23/24, acc_iter=152701, cur_iter=1200/6587, batch_size=24, time_cost(epoch): 0:23:34/1:46:54, time_cost(all): 2 days, 1:42:47/1:59:37, loss=0.263399303588773, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=2.563901452504903, lr=0.000631073149975541
2023-11-16 15:20:03   INFO  epoch: 23/24, acc_iter=152751, cur_iter=1250/6587, batch_size=24, time_cost(epoch): 0:24:33/1:43:09, time_cost(all): 2 days, 1:43:45/1:56:38, loss=0.263288361440596, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.99(1.03), norm=2.1792394110318827, lr=0.000625216707150447
2023-11-16 15:21:02   INFO  epoch: 23/24, acc_iter=152801, cur_iter=1300/6587, batch_size=24, time_cost(epoch): 0:25:32/1:46:37, time_cost(all): 2 days, 1:44:44/2:04:14, loss=0.263177419292419, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.2(1.03), norm=4.820023759365618, lr=0.000619360264325354
2023-11-16 15:22:01   INFO  epoch: 23/24, acc_iter=152851, cur_iter=1350/6587, batch_size=24, time_cost(epoch): 0:26:31/1:38:12, time_cost(all): 2 days, 1:45:43/1:54:07, loss=0.263066477144242, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=4.110416727555321, lr=0.000613503821500261
2023-11-16 15:23:00   INFO  epoch: 23/24, acc_iter=152901, cur_iter=1400/6587, batch_size=24, time_cost(epoch): 0:27:30/1:36:56, time_cost(all): 2 days, 1:46:42/2:02:02, loss=0.262955534996065, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.761075501592047, lr=0.000607647378675168
2023-11-16 15:23:59   INFO  epoch: 23/24, acc_iter=152951, cur_iter=1450/6587, batch_size=24, time_cost(epoch): 0:28:28/1:37:26, time_cost(all): 2 days, 1:47:41/1:53:25, loss=0.262844592847889, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=2.7356642595570917, lr=0.000601790935850075
2023-11-16 15:24:58   INFO  epoch: 23/24, acc_iter=153001, cur_iter=1500/6587, batch_size=24, time_cost(epoch): 0:29:27/1:40:20, time_cost(all): 2 days, 1:48:40/1:54:31, loss=0.262733650699712, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.89(1.03), norm=1.7839911958731085, lr=0.000595934493024981
2023-11-16 15:25:57   INFO  epoch: 23/24, acc_iter=153051, cur_iter=1550/6587, batch_size=24, time_cost(epoch): 0:30:26/1:42:56, time_cost(all): 2 days, 1:49:39/1:58:11, loss=0.262622708551535, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.994268249360012, lr=0.000590078050199888
2023-11-16 15:26:56   INFO  epoch: 23/24, acc_iter=153101, cur_iter=1600/6587, batch_size=24, time_cost(epoch): 0:31:25/1:36:10, time_cost(all): 2 days, 1:50:38/1:54:44, loss=0.262511766403358, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=1.21571094662223, lr=0.000584221607374795
2023-11-16 15:27:55   INFO  epoch: 23/24, acc_iter=153151, cur_iter=1650/6587, batch_size=24, time_cost(epoch): 0:32:24/1:35:38, time_cost(all): 2 days, 1:51:37/1:57:38, loss=0.262400824255182, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=1.8456735508624693, lr=0.000578365164549702
2023-11-16 15:28:54   INFO  epoch: 23/24, acc_iter=153201, cur_iter=1700/6587, batch_size=24, time_cost(epoch): 0:33:23/1:32:24, time_cost(all): 2 days, 1:52:36/1:52:01, loss=0.262289882107005, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=1.9095203404665968, lr=0.000572508721724609
2023-11-16 15:29:53   INFO  epoch: 23/24, acc_iter=153251, cur_iter=1750/6587, batch_size=24, time_cost(epoch): 0:34:22/1:34:05, time_cost(all): 2 days, 1:53:35/1:47:38, loss=0.262178939958828, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=1.997769821092676, lr=0.000566652278899515
2023-11-16 15:30:52   INFO  epoch: 23/24, acc_iter=153301, cur_iter=1800/6587, batch_size=24, time_cost(epoch): 0:35:21/1:35:04, time_cost(all): 2 days, 1:54:34/1:49:55, loss=0.262067997810651, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=3.220848302482485, lr=0.000560795836074422
2023-11-16 15:31:51   INFO  epoch: 23/24, acc_iter=153351, cur_iter=1850/6587, batch_size=24, time_cost(epoch): 0:36:20/1:29:48, time_cost(all): 2 days, 1:55:33/1:48:10, loss=0.261957055662475, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=4.946188247816882, lr=0.000554939393249329
2023-11-16 15:32:50   INFO  epoch: 23/24, acc_iter=153401, cur_iter=1900/6587, batch_size=24, time_cost(epoch): 0:37:19/1:30:26, time_cost(all): 2 days, 1:56:32/1:46:14, loss=0.261846113514298, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=1.3994832253286091, lr=0.000549082950424236
2023-11-16 15:33:48   INFO  epoch: 23/24, acc_iter=153451, cur_iter=1950/6587, batch_size=24, time_cost(epoch): 0:38:18/1:31:27, time_cost(all): 2 days, 1:57:30/1:51:15, loss=0.261735171366121, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=2.6708416224842746, lr=0.000543226507599143
2023-11-16 15:34:47   INFO  epoch: 23/24, acc_iter=153501, cur_iter=2000/6587, batch_size=24, time_cost(epoch): 0:39:17/1:34:02, time_cost(all): 2 days, 1:58:29/1:42:47, loss=0.261624229217944, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=0.910245716444456, lr=0.000537370064774049
2023-11-16 15:35:46   INFO  epoch: 23/24, acc_iter=153551, cur_iter=2050/6587, batch_size=24, time_cost(epoch): 0:40:16/1:28:25, time_cost(all): 2 days, 1:59:28/1:43:52, loss=0.261513287069768, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.85(1.03), norm=4.803822842377728, lr=0.000531513621948956
2023-11-16 15:36:45   INFO  epoch: 23/24, acc_iter=153601, cur_iter=2100/6587, batch_size=24, time_cost(epoch): 0:41:15/1:30:20, time_cost(all): 2 days, 2:00:27/1:44:42, loss=0.261402344921591, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=4.232254426508913, lr=0.000525657179123863
2023-11-16 15:37:44   INFO  epoch: 23/24, acc_iter=153651, cur_iter=2150/6587, batch_size=24, time_cost(epoch): 0:42:13/1:24:22, time_cost(all): 2 days, 2:01:26/1:44:51, loss=0.261291402773414, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.02(1.03), norm=2.302223511077282, lr=0.00051980073629877
2023-11-16 15:38:43   INFO  epoch: 23/24, acc_iter=153701, cur_iter=2200/6587, batch_size=24, time_cost(epoch): 0:43:12/1:23:49, time_cost(all): 2 days, 2:02:25/1:43:19, loss=0.261180460625237, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=3.471306289282641, lr=0.000513944293473677
2023-11-16 15:39:42   INFO  epoch: 23/24, acc_iter=153751, cur_iter=2250/6587, batch_size=24, time_cost(epoch): 0:44:11/1:23:11, time_cost(all): 2 days, 2:03:24/1:46:52, loss=0.261069518477061, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=1.1144070741565564, lr=0.000508087850648584
2023-11-16 15:40:41   INFO  epoch: 23/24, acc_iter=153801, cur_iter=2300/6587, batch_size=24, time_cost(epoch): 0:45:10/1:27:12, time_cost(all): 2 days, 2:04:23/1:43:14, loss=0.260958576328884, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=4.904747164944484, lr=0.00050223140782349
2023-11-16 15:41:40   INFO  epoch: 23/24, acc_iter=153851, cur_iter=2350/6587, batch_size=24, time_cost(epoch): 0:46:09/1:24:46, time_cost(all): 2 days, 2:05:22/1:40:14, loss=0.260847634180707, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.4712289756954853, lr=0.000496374964998397
2023-11-16 15:42:39   INFO  epoch: 23/24, acc_iter=153901, cur_iter=2400/6587, batch_size=24, time_cost(epoch): 0:47:08/1:22:03, time_cost(all): 2 days, 2:06:21/1:42:30, loss=0.26073669203253, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=2.976713075628454, lr=0.000490518522173304
2023-11-16 15:43:38   INFO  epoch: 23/24, acc_iter=153951, cur_iter=2450/6587, batch_size=24, time_cost(epoch): 0:48:07/1:25:03, time_cost(all): 2 days, 2:07:20/1:37:21, loss=0.260625749884354, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=3.7842085926204034, lr=0.000484662079348211
2023-11-16 15:44:37   INFO  epoch: 23/24, acc_iter=154001, cur_iter=2500/6587, batch_size=24, time_cost(epoch): 0:49:06/1:17:58, time_cost(all): 2 days, 2:08:19/1:39:19, loss=0.260514807736177, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=3.7983081619638006, lr=0.000478805636523118
2023-11-16 15:45:36   INFO  epoch: 23/24, acc_iter=154051, cur_iter=2550/6587, batch_size=24, time_cost(epoch): 0:50:05/1:15:57, time_cost(all): 2 days, 2:09:18/1:40:47, loss=0.260403865588, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=2.9345432976127697, lr=0.000472949193698024
2023-11-16 15:46:35   INFO  epoch: 23/24, acc_iter=154101, cur_iter=2600/6587, batch_size=24, time_cost(epoch): 0:51:04/1:15:18, time_cost(all): 2 days, 2:10:17/1:34:26, loss=0.260292923439823, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=1.1297025616085339, lr=0.000467092750872931
2023-11-16 15:47:33   INFO  epoch: 23/24, acc_iter=154151, cur_iter=2650/6587, batch_size=24, time_cost(epoch): 0:52:03/1:17:44, time_cost(all): 2 days, 2:11:15/1:31:08, loss=0.260181981291647, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=0.8157818444491014, lr=0.000461236308047838
2023-11-16 15:48:32   INFO  epoch: 23/24, acc_iter=154201, cur_iter=2700/6587, batch_size=24, time_cost(epoch): 0:53:02/1:17:00, time_cost(all): 2 days, 2:12:14/1:31:56, loss=0.26007103914347, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=4.37553783336502, lr=0.000455379865222745
2023-11-16 15:49:31   INFO  epoch: 23/24, acc_iter=154251, cur_iter=2750/6587, batch_size=24, time_cost(epoch): 0:54:01/1:11:42, time_cost(all): 2 days, 2:13:13/1:29:41, loss=0.259960096995293, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.6689337676857745, lr=0.000449523422397652
2023-11-16 15:50:30   INFO  epoch: 23/24, acc_iter=154301, cur_iter=2800/6587, batch_size=24, time_cost(epoch): 0:55:00/1:11:04, time_cost(all): 2 days, 2:14:12/1:28:52, loss=0.259849154847116, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=4.089870136759901, lr=0.000443666979572558
2023-11-16 15:51:29   INFO  epoch: 23/24, acc_iter=154351, cur_iter=2850/6587, batch_size=24, time_cost(epoch): 0:55:58/1:13:00, time_cost(all): 2 days, 2:15:11/1:29:54, loss=0.25973821269894, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.05(1.03), norm=4.4942602517916725, lr=0.000437810536747465
2023-11-16 15:52:28   INFO  epoch: 23/24, acc_iter=154401, cur_iter=2900/6587, batch_size=24, time_cost(epoch): 0:56:57/1:15:26, time_cost(all): 2 days, 2:16:10/1:32:52, loss=0.259627270550763, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.92(1.03), norm=3.449037230498462, lr=0.000431954093922372
2023-11-16 15:53:27   INFO  epoch: 23/24, acc_iter=154451, cur_iter=2950/6587, batch_size=24, time_cost(epoch): 0:57:56/1:12:29, time_cost(all): 2 days, 2:17:09/1:25:23, loss=0.259516328402586, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=1.1312801598793931, lr=0.000426097651097279
2023-11-16 15:54:26   INFO  epoch: 23/24, acc_iter=154501, cur_iter=3000/6587, batch_size=24, time_cost(epoch): 0:58:55/1:12:31, time_cost(all): 2 days, 2:18:08/1:30:53, loss=0.259405386254409, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.561229328076724, lr=0.000420241208272186
2023-11-16 15:55:25   INFO  epoch: 23/24, acc_iter=154551, cur_iter=3050/6587, batch_size=24, time_cost(epoch): 0:59:54/1:09:25, time_cost(all): 2 days, 2:19:07/1:27:22, loss=0.259294444106233, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=1.2229146778773767, lr=0.000414384765447092
2023-11-16 15:56:24   INFO  epoch: 23/24, acc_iter=154601, cur_iter=3100/6587, batch_size=24, time_cost(epoch): 1:00:53/1:05:28, time_cost(all): 2 days, 2:20:06/1:21:47, loss=0.259183501958056, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=1.2047414324109484, lr=0.000408528322621999
2023-11-16 15:57:23   INFO  epoch: 23/24, acc_iter=154651, cur_iter=3150/6587, batch_size=24, time_cost(epoch): 1:01:52/1:06:03, time_cost(all): 2 days, 2:21:05/1:27:12, loss=0.259072559809879, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.18(1.03), norm=2.4777478666133854, lr=0.000402671879796906
2023-11-16 15:58:22   INFO  epoch: 23/24, acc_iter=154701, cur_iter=3200/6587, batch_size=24, time_cost(epoch): 1:02:51/1:05:12, time_cost(all): 2 days, 2:22:04/1:19:16, loss=0.258961617661702, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=1.6861971867564116, lr=0.000396815436971813
2023-11-16 15:59:21   INFO  epoch: 23/24, acc_iter=154751, cur_iter=3250/6587, batch_size=24, time_cost(epoch): 1:03:50/1:05:49, time_cost(all): 2 days, 2:23:03/1:21:27, loss=0.258850675513526, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.110000938988109, lr=0.00039095899414672
2023-11-16 16:00:20   INFO  epoch: 23/24, acc_iter=154801, cur_iter=3300/6587, batch_size=24, time_cost(epoch): 1:04:49/1:03:19, time_cost(all): 2 days, 2:24:02/1:21:12, loss=0.258739733365349, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=1.9068362678408945, lr=0.000385102551321626
2023-11-16 16:01:18   INFO  epoch: 23/24, acc_iter=154851, cur_iter=3350/6587, batch_size=24, time_cost(epoch): 1:05:48/1:04:20, time_cost(all): 2 days, 2:25:00/1:21:33, loss=0.258628791217172, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.16(1.03), norm=4.229696254372719, lr=0.000379246108496533
2023-11-16 16:02:17   INFO  epoch: 23/24, acc_iter=154901, cur_iter=3400/6587, batch_size=24, time_cost(epoch): 1:06:47/1:04:48, time_cost(all): 2 days, 2:25:59/1:17:00, loss=0.258517849068995, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.0(1.03), norm=1.3224327599192989, lr=0.00037338966567144
2023-11-16 16:03:16   INFO  epoch: 23/24, acc_iter=154951, cur_iter=3450/6587, batch_size=24, time_cost(epoch): 1:07:46/1:04:29, time_cost(all): 2 days, 2:26:58/1:14:36, loss=0.258406906920819, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.23(1.03), norm=4.923989777738026, lr=0.000367533222846347
2023-11-16 16:04:15   INFO  epoch: 23/24, acc_iter=155001, cur_iter=3500/6587, batch_size=24, time_cost(epoch): 1:08:45/1:03:30, time_cost(all): 2 days, 2:27:57/1:19:27, loss=0.258295964772642, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.86(1.03), norm=1.4035668640145897, lr=0.000361676780021254
2023-11-16 16:05:14   INFO  epoch: 23/24, acc_iter=155051, cur_iter=3550/6587, batch_size=24, time_cost(epoch): 1:09:43/0:57:12, time_cost(all): 2 days, 2:28:56/1:16:53, loss=0.258185022624465, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=3.081403532624711, lr=0.00035582033719616
2023-11-16 16:06:13   INFO  epoch: 23/24, acc_iter=155101, cur_iter=3600/6587, batch_size=24, time_cost(epoch): 1:10:42/0:58:08, time_cost(all): 2 days, 2:29:55/1:16:07, loss=0.258074080476288, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=4.703241026790909, lr=0.000349963894371067
2023-11-16 16:07:12   INFO  epoch: 23/24, acc_iter=155151, cur_iter=3650/6587, batch_size=24, time_cost(epoch): 1:11:41/0:59:00, time_cost(all): 2 days, 2:30:54/1:10:50, loss=0.257963138328112, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=1.7462516264515635, lr=0.000344107451545974
2023-11-16 16:08:11   INFO  epoch: 23/24, acc_iter=155201, cur_iter=3700/6587, batch_size=24, time_cost(epoch): 1:12:40/0:56:52, time_cost(all): 2 days, 2:31:53/1:10:06, loss=0.257852196179935, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=0.8721549255767285, lr=0.000338251008720881
2023-11-16 16:09:10   INFO  epoch: 23/24, acc_iter=155251, cur_iter=3750/6587, batch_size=24, time_cost(epoch): 1:13:39/0:56:10, time_cost(all): 2 days, 2:32:52/1:14:42, loss=0.257741254031758, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=0.6143656145657685, lr=0.000332394565895788
2023-11-16 16:10:09   INFO  epoch: 23/24, acc_iter=155301, cur_iter=3800/6587, batch_size=24, time_cost(epoch): 1:14:38/0:52:14, time_cost(all): 2 days, 2:33:51/1:09:36, loss=0.257630311883581, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=4.824507779285016, lr=0.000326538123070695
2023-11-16 16:11:08   INFO  epoch: 23/24, acc_iter=155351, cur_iter=3850/6587, batch_size=24, time_cost(epoch): 1:15:37/0:54:08, time_cost(all): 2 days, 2:34:50/1:07:32, loss=0.257519369735405, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=4.078747826279858, lr=0.000320681680245601
2023-11-16 16:12:07   INFO  epoch: 23/24, acc_iter=155401, cur_iter=3900/6587, batch_size=24, time_cost(epoch): 1:16:36/0:54:28, time_cost(all): 2 days, 2:35:49/1:06:28, loss=0.257408427587228, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=0.6396299361649253, lr=0.000314825237420508
2023-11-16 16:13:06   INFO  epoch: 23/24, acc_iter=155451, cur_iter=3950/6587, batch_size=24, time_cost(epoch): 1:17:35/0:52:02, time_cost(all): 2 days, 2:36:48/1:08:45, loss=0.257297485439051, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=2.136167237457528, lr=0.000308968794595415
2023-11-16 16:14:05   INFO  epoch: 23/24, acc_iter=155501, cur_iter=4000/6587, batch_size=24, time_cost(epoch): 1:18:34/0:51:16, time_cost(all): 2 days, 2:37:47/1:10:52, loss=0.257186543290874, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=1.217010203661211, lr=0.000303112351770322
2023-11-16 16:15:03   INFO  epoch: 23/24, acc_iter=155551, cur_iter=4050/6587, batch_size=24, time_cost(epoch): 1:19:33/0:50:12, time_cost(all): 2 days, 2:38:45/1:07:39, loss=0.257075601142698, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.94(1.03), norm=1.8972863675610516, lr=0.000297255908945229
2023-11-16 16:16:02   INFO  epoch: 23/24, acc_iter=155601, cur_iter=4100/6587, batch_size=24, time_cost(epoch): 1:20:32/0:47:05, time_cost(all): 2 days, 2:39:44/1:05:48, loss=0.256964658994521, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=0.935899581699843, lr=0.000291399466120135
2023-11-16 16:17:01   INFO  epoch: 23/24, acc_iter=155651, cur_iter=4150/6587, batch_size=24, time_cost(epoch): 1:21:31/0:47:29, time_cost(all): 2 days, 2:40:43/1:03:41, loss=0.256853716846344, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.86(1.03), norm=4.068943583069995, lr=0.000285543023295042
2023-11-16 16:18:00   INFO  epoch: 23/24, acc_iter=155701, cur_iter=4200/6587, batch_size=24, time_cost(epoch): 1:22:30/0:49:09, time_cost(all): 2 days, 2:41:42/1:03:08, loss=0.256742774698167, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=2.0142311920932494, lr=0.000279686580469949
2023-11-16 16:18:59   INFO  epoch: 23/24, acc_iter=155751, cur_iter=4250/6587, batch_size=24, time_cost(epoch): 1:23:28/0:45:59, time_cost(all): 2 days, 2:42:41/0:59:31, loss=0.256631832549991, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.7944050064958614, lr=0.000273830137644856
2023-11-16 16:19:58   INFO  epoch: 23/24, acc_iter=155801, cur_iter=4300/6587, batch_size=24, time_cost(epoch): 1:24:27/0:43:53, time_cost(all): 2 days, 2:43:40/1:04:35, loss=0.256520890401814, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.84(1.03), norm=2.212404129365874, lr=0.000267973694819763
2023-11-16 16:20:57   INFO  epoch: 23/24, acc_iter=155851, cur_iter=4350/6587, batch_size=24, time_cost(epoch): 1:25:26/0:45:22, time_cost(all): 2 days, 2:44:39/0:59:29, loss=0.256409948253637, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.88(1.03), norm=3.3943937825763832, lr=0.000262117251994669
2023-11-16 16:21:56   INFO  epoch: 23/24, acc_iter=155901, cur_iter=4400/6587, batch_size=24, time_cost(epoch): 1:26:25/0:43:21, time_cost(all): 2 days, 2:45:38/0:56:57, loss=0.25629900610546, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=1.9977785526902192, lr=0.000256260809169576
2023-11-16 16:22:55   INFO  epoch: 23/24, acc_iter=155951, cur_iter=4450/6587, batch_size=24, time_cost(epoch): 1:27:24/0:42:48, time_cost(all): 2 days, 2:46:37/0:56:59, loss=0.256188063957284, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.09(1.03), norm=3.2233716048023915, lr=0.000250404366344483
2023-11-16 16:23:54   INFO  epoch: 23/24, acc_iter=156001, cur_iter=4500/6587, batch_size=24, time_cost(epoch): 1:28:23/0:39:41, time_cost(all): 2 days, 2:47:36/0:58:28, loss=0.256077121809107, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=2.009034224416057, lr=0.00024454792351939
2023-11-16 16:24:53   INFO  epoch: 23/24, acc_iter=156051, cur_iter=4550/6587, batch_size=24, time_cost(epoch): 1:29:22/0:38:45, time_cost(all): 2 days, 2:48:35/0:56:02, loss=0.25596617966093, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=3.854140933196908, lr=0.000238691480694297
2023-11-16 16:25:52   INFO  epoch: 23/24, acc_iter=156101, cur_iter=4600/6587, batch_size=24, time_cost(epoch): 1:30:21/0:39:20, time_cost(all): 2 days, 2:49:34/0:55:01, loss=0.255855237512753, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=1.9344946162745897, lr=0.000232835037869204
2023-11-16 16:26:51   INFO  epoch: 23/24, acc_iter=156151, cur_iter=4650/6587, batch_size=24, time_cost(epoch): 1:31:20/0:39:10, time_cost(all): 2 days, 2:50:33/0:54:14, loss=0.255744295364577, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.85(1.03), norm=1.3712277599631946, lr=0.00022697859504411
2023-11-16 16:27:50   INFO  epoch: 23/24, acc_iter=156201, cur_iter=4700/6587, batch_size=24, time_cost(epoch): 1:32:19/0:36:42, time_cost(all): 2 days, 2:51:32/0:56:00, loss=0.2556333532164, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=4.220219906058421, lr=0.000221122152219017
2023-11-16 16:28:48   INFO  epoch: 23/24, acc_iter=156251, cur_iter=4750/6587, batch_size=24, time_cost(epoch): 1:33:18/0:36:59, time_cost(all): 2 days, 2:52:30/0:54:45, loss=0.255522411068223, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.22(1.03), norm=2.4759681826602598, lr=0.000215265709393924
2023-11-16 16:29:47   INFO  epoch: 23/24, acc_iter=156301, cur_iter=4800/6587, batch_size=24, time_cost(epoch): 1:34:17/0:35:57, time_cost(all): 2 days, 2:53:29/0:52:56, loss=0.255411468920046, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.4172739188111796, lr=0.000209409266568831
2023-11-16 16:30:46   INFO  epoch: 23/24, acc_iter=156351, cur_iter=4850/6587, batch_size=24, time_cost(epoch): 1:35:16/0:32:44, time_cost(all): 2 days, 2:54:28/0:50:08, loss=0.25530052677187, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.89(1.03), norm=3.2552050051557617, lr=0.000203552823743738
2023-11-16 16:31:45   INFO  epoch: 23/24, acc_iter=156401, cur_iter=4900/6587, batch_size=24, time_cost(epoch): 1:36:15/0:33:57, time_cost(all): 2 days, 2:55:27/0:48:18, loss=0.255189584623693, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=4.884444416281525, lr=0.000197696380918644
2023-11-16 16:32:44   INFO  epoch: 23/24, acc_iter=156451, cur_iter=4950/6587, batch_size=24, time_cost(epoch): 1:37:13/0:33:22, time_cost(all): 2 days, 2:56:26/0:47:21, loss=0.255078642475516, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=2.5418491980501896, lr=0.000191839938093551
2023-11-16 16:33:43   INFO  epoch: 23/24, acc_iter=156501, cur_iter=5000/6587, batch_size=24, time_cost(epoch): 1:38:12/0:31:49, time_cost(all): 2 days, 2:57:25/0:48:26, loss=0.254967700327339, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=3.5502026540610268, lr=0.000185983495268458
2023-11-16 16:34:42   INFO  epoch: 23/24, acc_iter=156551, cur_iter=5050/6587, batch_size=24, time_cost(epoch): 1:39:11/0:30:00, time_cost(all): 2 days, 2:58:24/0:47:18, loss=0.254856758179163, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=4.172053771042769, lr=0.000180127052443365
2023-11-16 16:35:41   INFO  epoch: 23/24, acc_iter=156601, cur_iter=5100/6587, batch_size=24, time_cost(epoch): 1:40:10/0:29:24, time_cost(all): 2 days, 2:59:23/0:47:05, loss=0.254745816030986, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=4.54172409111422, lr=0.000174270609618271
2023-11-16 16:36:40   INFO  epoch: 23/24, acc_iter=156651, cur_iter=5150/6587, batch_size=24, time_cost(epoch): 1:41:09/0:28:25, time_cost(all): 2 days, 3:00:22/0:43:31, loss=0.254634873882809, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=2.0306467446257774, lr=0.000168414166793178
2023-11-16 16:37:39   INFO  epoch: 23/24, acc_iter=156701, cur_iter=5200/6587, batch_size=24, time_cost(epoch): 1:42:08/0:26:02, time_cost(all): 2 days, 3:01:21/0:45:53, loss=0.254523931734632, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=3.5965045258329655, lr=0.000162557723968085
2023-11-16 16:38:38   INFO  epoch: 23/24, acc_iter=156751, cur_iter=5250/6587, batch_size=24, time_cost(epoch): 1:43:07/0:25:00, time_cost(all): 2 days, 3:02:20/0:42:43, loss=0.254412989586456, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=4.822058403667957, lr=0.000156701281142992
2023-11-16 16:39:37   INFO  epoch: 23/24, acc_iter=156801, cur_iter=5300/6587, batch_size=24, time_cost(epoch): 1:44:06/0:24:29, time_cost(all): 2 days, 3:03:19/0:41:17, loss=0.254302047438279, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.08181560811272, lr=0.000150844838317899
2023-11-16 16:40:36   INFO  epoch: 23/24, acc_iter=156851, cur_iter=5350/6587, batch_size=24, time_cost(epoch): 1:45:05/0:24:30, time_cost(all): 2 days, 3:04:18/0:40:05, loss=0.254191105290102, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.4689337735957082, lr=0.000144988395492806
2023-11-16 16:41:35   INFO  epoch: 23/24, acc_iter=156901, cur_iter=5400/6587, batch_size=24, time_cost(epoch): 1:46:04/0:23:39, time_cost(all): 2 days, 3:05:17/0:38:44, loss=0.254080163141925, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.13(1.03), norm=4.602743030404243, lr=0.000139131952667712
2023-11-16 16:42:33   INFO  epoch: 23/24, acc_iter=156951, cur_iter=5450/6587, batch_size=24, time_cost(epoch): 1:47:03/0:22:33, time_cost(all): 2 days, 3:06:15/0:40:18, loss=0.253969220993749, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.0(1.03), norm=4.4704000247904325, lr=0.000133275509842619
2023-11-16 16:43:32   INFO  epoch: 23/24, acc_iter=157001, cur_iter=5500/6587, batch_size=24, time_cost(epoch): 1:48:02/0:20:53, time_cost(all): 2 days, 3:07:14/0:36:28, loss=0.253858278845572, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.85(1.03), norm=1.9114500345854206, lr=0.000127419067017526
2023-11-16 16:44:31   INFO  epoch: 23/24, acc_iter=157051, cur_iter=5550/6587, batch_size=24, time_cost(epoch): 1:49:01/0:20:18, time_cost(all): 2 days, 3:08:13/0:36:32, loss=0.253747336697395, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=0.526482371494392, lr=0.000121562624192433
2023-11-16 16:45:30   INFO  epoch: 23/24, acc_iter=157101, cur_iter=5600/6587, batch_size=24, time_cost(epoch): 1:50:00/0:19:00, time_cost(all): 2 days, 3:09:12/0:36:31, loss=0.253636394549218, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=2.8563756744818507, lr=0.00011570618136734
2023-11-16 16:46:29   INFO  epoch: 23/24, acc_iter=157151, cur_iter=5650/6587, batch_size=24, time_cost(epoch): 1:50:58/0:18:58, time_cost(all): 2 days, 3:10:11/0:36:06, loss=0.253525452401042, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=4.7003357823784215, lr=0.000109849738542246
2023-11-16 16:47:28   INFO  epoch: 23/24, acc_iter=157201, cur_iter=5700/6587, batch_size=24, time_cost(epoch): 1:51:57/0:16:44, time_cost(all): 2 days, 3:11:10/0:34:04, loss=0.253414510252865, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=2.8439038131227883, lr=0.000103993295717153
2023-11-16 16:48:27   INFO  epoch: 23/24, acc_iter=157251, cur_iter=5750/6587, batch_size=24, time_cost(epoch): 1:52:56/0:15:51, time_cost(all): 2 days, 3:12:09/0:34:06, loss=0.253303568104688, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=3.7208808885657345, lr=9.813685289206e-05
2023-11-16 16:49:26   INFO  epoch: 23/24, acc_iter=157301, cur_iter=5800/6587, batch_size=24, time_cost(epoch): 1:53:55/0:14:53, time_cost(all): 2 days, 3:13:08/0:31:20, loss=0.253192625956511, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.9651680077653526, lr=9.2280410066967e-05
2023-11-16 16:50:25   INFO  epoch: 23/24, acc_iter=157351, cur_iter=5850/6587, batch_size=24, time_cost(epoch): 1:54:54/0:14:23, time_cost(all): 2 days, 3:14:07/0:30:34, loss=0.253081683808335, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=1.2805192170647133, lr=8.6423967241874e-05
2023-11-16 16:51:24   INFO  epoch: 23/24, acc_iter=157401, cur_iter=5900/6587, batch_size=24, time_cost(epoch): 1:55:53/0:13:14, time_cost(all): 2 days, 3:15:06/0:31:36, loss=0.252970741660158, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=1.3864292142043018, lr=8.056752441678e-05
2023-11-16 16:52:23   INFO  epoch: 23/24, acc_iter=157451, cur_iter=5950/6587, batch_size=24, time_cost(epoch): 1:56:52/0:12:55, time_cost(all): 2 days, 3:16:05/0:29:41, loss=0.252859799511981, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=2.6062353050351335, lr=7.4711081591687e-05
2023-11-16 16:53:22   INFO  epoch: 23/24, acc_iter=157501, cur_iter=6000/6587, batch_size=24, time_cost(epoch): 1:57:51/0:11:45, time_cost(all): 2 days, 3:17:04/0:26:54, loss=0.252748857363804, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=3.9173492498020863, lr=6.8854638766594e-05
2023-11-16 16:54:21   INFO  epoch: 23/24, acc_iter=157551, cur_iter=6050/6587, batch_size=24, time_cost(epoch): 1:58:50/0:10:51, time_cost(all): 2 days, 3:18:03/0:28:10, loss=0.252637915215628, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=1.022960967204233, lr=6.2998195941501e-05
2023-11-16 16:55:20   INFO  epoch: 23/24, acc_iter=157601, cur_iter=6100/6587, batch_size=24, time_cost(epoch): 1:59:49/0:09:29, time_cost(all): 2 days, 3:19:02/0:26:36, loss=0.252526973067451, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.597939588435446, lr=5.7141753116408e-05
2023-11-16 16:56:18   INFO  epoch: 23/24, acc_iter=157651, cur_iter=6150/6587, batch_size=24, time_cost(epoch): 2:00:48/0:08:28, time_cost(all): 2 days, 3:20:00/0:24:06, loss=0.252416030919274, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.456521151920574, lr=5.1285310291315e-05
2023-11-16 16:57:17   INFO  epoch: 23/24, acc_iter=157701, cur_iter=6200/6587, batch_size=24, time_cost(epoch): 2:01:47/0:07:56, time_cost(all): 2 days, 3:20:59/0:23:50, loss=0.252305088771097, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.04(1.03), norm=1.1832517814412682, lr=4.5428867466221e-05
2023-11-16 16:58:16   INFO  epoch: 23/24, acc_iter=157751, cur_iter=6250/6587, batch_size=24, time_cost(epoch): 2:02:46/0:06:53, time_cost(all): 2 days, 3:21:58/0:23:25, loss=0.252194146622921, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.23(1.03), norm=2.301915336506327, lr=3.9572424641128e-05
2023-11-16 16:59:15   INFO  epoch: 23/24, acc_iter=157801, cur_iter=6300/6587, batch_size=24, time_cost(epoch): 2:03:45/0:05:52, time_cost(all): 2 days, 3:22:57/0:21:51, loss=0.252083204474744, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=1.1735664683988971, lr=3.3715981816035e-05
2023-11-16 17:00:14   INFO  epoch: 23/24, acc_iter=157851, cur_iter=6350/6587, batch_size=24, time_cost(epoch): 2:04:43/0:04:49, time_cost(all): 2 days, 3:23:56/0:21:04, loss=0.251972262326567, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=1.735779683495827, lr=2.7859538990942e-05
2023-11-16 17:01:13   INFO  epoch: 23/24, acc_iter=157901, cur_iter=6400/6587, batch_size=24, time_cost(epoch): 2:05:42/0:03:49, time_cost(all): 2 days, 3:24:55/0:20:14, loss=0.25186132017839, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.19(1.03), norm=1.3611532601796104, lr=2.2003096165849e-05
2023-11-16 17:02:12   INFO  epoch: 23/24, acc_iter=157951, cur_iter=6450/6587, batch_size=24, time_cost(epoch): 2:06:41/0:02:39, time_cost(all): 2 days, 3:25:54/0:18:54, loss=0.251750378030214, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.0(1.03), norm=3.739345617415067, lr=1.6146653340755e-05
2023-11-16 17:03:11   INFO  epoch: 23/24, acc_iter=158001, cur_iter=6500/6587, batch_size=24, time_cost(epoch): 2:07:40/0:01:44, time_cost(all): 2 days, 3:26:53/0:17:51, loss=0.251639435882037, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=4.5125602449624695, lr=1.0290210515662e-05
2023-11-16 17:04:10   INFO  epoch: 23/24, acc_iter=158051, cur_iter=6550/6587, batch_size=24, time_cost(epoch): 2:08:39/0:00:41, time_cost(all): 2 days, 3:27:52/0:16:38, loss=0.25152849373386, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.04(1.03), norm=3.0575164927351968, lr=4.433767690569e-06
2023-11-16 17:04:10   INFO  **********************End training cfgs/picture_model/picture_waymo_detection_0.2(detection)**********************



2023-11-16 17:04:10   INFO  **********************Start evaluation cfgs/picture_model/picture_waymo_detection_0.2(detection)**********************
2023-11-16 17:04:10   INFO  Loading Waymo dataset
2023-11-16 17:04:10   INFO  Total skipped info 0
2023-11-16 17:04:10   INFO  Total samples for Waymo dataset: 39987
2023-11-16 17:04:10   INFO  ==> Loading parameters from checkpoint xxxxxx to CPU
2023-11-16 17:04:10   INFO  ==> Checkpoint trained from version: pcdet+0.6.0+0000000
2023-11-16 17:04:10   INFO  ==> Done (loaded 448/448)
2023-11-16 17:04:10   INFO  *************** EPOCH 24 EVALUATION *****************
2023-11-16 17:14:48   INFO  *************** Performance of EPOCH 24 *****************
2023-11-16 17:14:48   INFO  Generate label finished(sec_per_example: 0.0151 second).
2023-11-16 17:14:48   INFO  recall_roi_0.3: 0.000000
2023-11-16 17:14:48   INFO  recall_rcnn_0.3: 0.846827
2023-11-16 17:14:48   INFO  recall_roi_0.5: 0.000000
2023-11-16 17:14:48   INFO  recall_rcnn_0.5: 0.803597
2023-11-16 17:14:48   INFO  recall_roi_0.7: 0.000000
2023-11-16 17:14:48   INFO  recall_rcnn_0.7: 0.583581
2023-11-16 17:14:48   INFO  Average predicted number of objects(39987 samples): 120.153
2023-11-16 17:33:27   INFO  
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/AP: 0.7921 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/APH: 0.7895 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_1/APL: 0.7921 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/AP: 0.7166 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/APH: 0.7135 
OBJECT_TYPE_TYPE_VEHICLE_LEVEL_2/APL: 0.7166 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/AP: 0.8324 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/APH: 0.7742 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_1/APL: 0.8324 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/AP: 0.7588 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/APH: 0.7052 
OBJECT_TYPE_TYPE_PEDESTRIAN_LEVEL_2/APL: 0.7588 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/AP: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/APH: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_1/APL: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/AP: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/APH: 0.0000 
OBJECT_TYPE_TYPE_SIGN_LEVEL_2/APL: 0.0000 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/AP: 0.7685 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/APH: 0.7586 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_1/APL: 0.7685 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/AP: 0.7398 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/APH: 0.7353 
OBJECT_TYPE_TYPE_CYCLIST_LEVEL_2/APL: 0.7398 

2023-11-16 17:33:27   INFO  Result is save to xxxxxxxxxxxxxxxxx
2023-11-16 17:33:27   INFO  ****************Evaluation done.*****************
2023-11-16 17:33:27   INFO  Epoch 24 has been evaluated
2023-11-16 17:33:27   INFO  **********************End evaluation cfgs/picture_model/picture_waymo_detection_0.2(detection)**********************
