2023-12-12 15:43:21   INFO  **********************Start logging**********************
2023-12-12 15:43:21   INFO  CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7
2023-12-12 15:43:21   INFO  total_batch_size: 32
2023-12-12 15:43:21   INFO  cfg_file         ./cfgs/picture_models/picture_nuscenes_detection.yaml
2023-12-12 15:43:21   INFO  batch_size       4
2023-12-12 15:43:21   INFO  epochs           24
2023-12-12 15:43:21   INFO  workers          4
2023-12-12 15:43:21   INFO  extra_tag        default
2023-12-12 15:43:21   INFO  ckpt             None
2023-12-12 15:43:21   INFO  pretrained_model nuscenes_pretrain_model.pth
2023-12-12 15:43:21   INFO  launcher         pytorch
2023-12-12 15:43:21   INFO  tcp_port         18888
2023-12-12 15:43:21   INFO  sync_bn          True
2023-12-12 15:43:21   INFO  fix_random_seed  False
2023-12-12 15:43:21   INFO  ckpt_save_interval 20
2023-12-12 15:43:21   INFO  local_rank       0
2023-12-12 15:43:21   INFO  max_ckpt_save_num 30
2023-12-12 15:43:21   INFO  merge_all_iters_to_one_epoch False
2023-12-12 15:43:21   INFO  set_cfgs         None
2023-12-12 15:43:21   INFO  max_waiting_mins 0
2023-12-12 15:43:21   INFO  start_epoch      0
2023-12-12 15:43:21   INFO  num_epochs_to_eval 0
2023-12-12 15:43:21   INFO  save_to_file     False
2023-12-12 15:43:21   INFO  use_tqdm_to_record False
2023-12-12 15:43:21   INFO  logger_iter_interval 50
2023-12-12 15:43:21   INFO  ckpt_save_time_interval 300
2023-12-12 15:43:21   INFO  wo_gpu_stat      False
2023-12-12 15:43:21   INFO  fp16             False
2023-12-12 15:43:21   INFO  cfg.ROOT_DIR: xxxxxxxxxxxxx
2023-12-12 15:43:21   INFO  cfg.LOCAL_RANK: 0
2023-12-12 15:43:21   INFO  cfg.CLASS_NAMES: ['car', 'truck', 'construction_vehicle', 'bus', 'trailer', 'barrier', 'motorcycle', 'bicycle', 'pedestrian', 'traffic_cone']
2023-12-12 15:43:21   INFO  
cfg.DATA_CONFIG = edict()
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATASET: NuScenesDataset
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_PATH: ../data/nuscenes
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.VERSION: v1.0-trainval
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.PRED_VELOCITY: True
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.SET_NAN_VELOCITY_TO_ZEROS: True
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.FILTER_MIN_POINTS_IN_GT: 1
2023-12-12 15:43:21   INFO  
cfg.DATA_CONFIG.DATA_SPLIT = edict()
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_SPLIT.train: train
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_SPLIT.test: val
2023-12-12 15:43:21   INFO  
cfg.DATA_CONFIG.INFO_PATH = edict()
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.INFO_PATH.train: ['nuscenes_infos_10sweeps_train.pkl']
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.INFO_PATH.test: ['nuscenes_infos_10sweeps_val.pkl']
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.POINT_CLOUD_RANGE: [-51.2, -51.2, -5.0, 51.2, 51.2, 3.0]
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.BALANCED_RESAMPLING: True
2023-12-12 15:43:21   INFO  
cfg.DATA_CONFIG.DATA_AUGMENTOR = edict()
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.DISABLE_AUG_LIST: ['placeholder']
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_AUGMENTOR.AUG_CONFIG_LIST: [{'NAME': 'gt_sampling', 'USE_ROAD_PLANE': False, 'DB_INFO_PATH': ['nuscenes_dbinfos_10sweeps_withvelo.pkl'], 'USE_SHARED_MEMORY': True, 'DB_DATA_PATH': ['nuscenes_gt_database__10sweeps_global.npy'], 'PREPARE': {'filter_by_min_points': ['car:5', 'truck:5', 'construction_vehicle:5', 'bus:5', 'trailer:5', 'barrier:5', 'motorcycle:5', 'bicycle:5', 'pedestrian:5', 'traffic_cone:5']}, 'SAMPLE_GROUPS': ['car:2', 'truck:3', 'construction_vehicle:7', 'bus:4', 'trailer:6', 'barrier:2', 'motorcycle:6', 'bicycle:6', 'pedestrian:2', 'traffic_cone:2'], 'NUM_POINT_FEATURES': 5, 'DATABASE_WITH_FAKELIDAR': False, 'REMOVE_EXTRA_WIDTH': [0.0, 0.0, 0.0], 'LIMIT_WHOLE_SCENE': True}, {'NAME': 'random_world_flip', 'ALONG_AXIS_LIST': ['x', 'y']}, {'NAME': 'random_world_rotation', 'WORLD_ROT_ANGLE': [-0.78539816, 0.78539816]}, {'NAME': 'random_world_scaling', 'WORLD_SCALE_RANGE': [0.9, 1.1]}, {'NAME': 'random_world_translation', 'NOISE_TRANSLATE_STD': [0.5, 0.5, 0.5]}]
2023-12-12 15:43:21   INFO  
cfg.DATA_CONFIG.POINT_FEATURE_ENCODING = edict()
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.encoding_type: absolute_coordinates_encoding
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.used_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.POINT_FEATURE_ENCODING.src_feature_list: ['x', 'y', 'z', 'intensity', 'timestamp']
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG.DATA_PROCESSOR: [{'NAME': 'mask_points_and_boxes_outside_rangeV2', 'REMOVE_OUTSIDE_BOXES': True}, {'NAME': 'shuffle_points', 'SHUFFLE_ENABLED': {'train': True, 'test': True}}, {'NAME': 'transform_points_to_voxels_placeholder', 'VOXEL_SIZE': [0.3, 0.3, 8.0]}]
2023-12-12 15:43:21   INFO  cfg.DATA_CONFIG._BASE_CONFIG_: cfgs/dataset_configs/nuscenes_dataset.yaml
2023-12-12 15:43:21   INFO  
cfg.MODEL = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.NAME: TransFusion
2023-12-12 15:43:21   INFO  
cfg.MODEL.VFE = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.VFE.NAME: DynPillarVFE
2023-12-12 15:43:21   INFO  cfg.MODEL.VFE.WITH_DISTANCE: False
2023-12-12 15:43:21   INFO  cfg.MODEL.VFE.USE_ABSLOTE_XYZ: True
2023-12-12 15:43:21   INFO  cfg.MODEL.VFE.USE_NORM: True
2023-12-12 15:43:21   INFO  cfg.MODEL.VFE.NUM_FILTERS: [256, 256]
2023-12-12 15:43:21   INFO  
cfg.MODEL.BACKBONE_3D = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.NAME: DSVT
2023-12-12 15:43:21   INFO  
cfg.MODEL.BACKBONE_3D.INPUT_LAYER = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.sparse_shape: [360, 360, 1]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.downsample_stride: []
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.d_model: [256]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.set_info: [[90, 4]]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.window_shape: [[30, 30, 1]]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.hybrid_factor: [1, 1, 1]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.shifts_list: [[[0, 0, 0], [15, 15, 0]]]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.INPUT_LAYER.normalize_pos: False
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.block_name: ['DSVTBlock']
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.set_info: [[90, 4]]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.d_model: [256]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.nhead: [8]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.dim_feedforward: [256]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.dropout: 0.0
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.activation: gelu
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.output_shape: [360, 360]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_3D.conv_out_channel: 256
2023-12-12 15:43:21   INFO  
cfg.MODEL.MAP_TO_BEV = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.MAP_TO_BEV.NAME: PointPillarScatter3d
2023-12-12 15:43:21   INFO  cfg.MODEL.MAP_TO_BEV.INPUT_SHAPE: [360, 360, 1]
2023-12-12 15:43:21   INFO  cfg.MODEL.MAP_TO_BEV.NUM_BEV_FEATURES: 256
2023-12-12 15:43:21   INFO  
cfg.MODEL.BACKBONE_2D = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.NAME: BaseBEVResBackbone
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.LAYER_NUMS: [1, 2, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.LAYER_STRIDES: [1, 2, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.NUM_FILTERS: [256, 256, 256]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.UPSAMPLE_STRIDES: [0.5, 1, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.BACKBONE_2D.NUM_UPSAMPLE_FILTERS: [256, 256, 256]
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.CLASS_AGNOSTIC: False
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.NAME: TransFusionHead
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.num_proposals: 200
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.query_radius: 20
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.auxiliary: True
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.in_channels: None
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.hidden_channel: 256
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.num_classes: 10
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.num_decoder_layers: 1
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.num_heads: 8
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.nms_kernel_size: 3
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.ffn_channel: 256
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.dropout: 0.0
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bn_momentum: 0.1
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.activation: relu
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.dataset: nuScenes
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.point_cloud_range: [-51.2, -51.2, -5.0, 51.2, 51.2, 3.0]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.grid_size: [360, 360, 1]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.voxel_size: [0.3, 0.3, 8.0]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.out_size_factor: 2
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.gaussian_overlap: 0.1
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.min_radius: 2
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.pos_weight: -1
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.code_weights: [1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 0.2, 0.2]
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg.assigner = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.type: HungarianAssigner3D
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_calculator = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_calculator.type: BboxOverlaps3D
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_calculator.coordinate: lidar
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg.assigner.cls_cost = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.cls_cost.type: FocalLossCost
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.cls_cost.gamma: 2.0
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.cls_cost.alpha: 0.25
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.cls_cost.weight: 0.15
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg.assigner.reg_cost = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.reg_cost.type: BBoxBEVL1Cost
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.reg_cost.weight: 0.25
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_cost = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_cost.type: IoU3DCost
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.train_cfg.assigner.iou_cost.weight: 0.25
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.test_cfg = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.dataset: nuScenes
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.grid_size: [360, 360, 1]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.out_size_factor: 2
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.voxel_size: [0.3, 0.3]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.pc_range: [-51.2, -51.2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.nms_type: nms_gpu
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG.NMS_TYPE: nms_gpu
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG.NMS_THRESH: 0.2
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG.NMS_PRE_MAXSIZE: 1000
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG.NMS_POST_MAXSIZE: 100
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.NMS_CONFIG.SCORE_THRES: 0.0
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.USE_IOU_TO_RECTIFY_SCORE: True
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.test_cfg.IOU_RECTIFIER: [0.5]
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.common_heads = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.center: [2, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.height: [1, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.dim: [3, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.rot: [2, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.vel: [2, 2]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.common_heads.iou: [1, 2]
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.bbox_coder = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.type: TransFusionBBoxCoder
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.pc_range: [-54.0, -54.0]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.post_center_range: [-61.2, -61.2, -10.0, 61.2, 61.2, 10.0]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.score_threshold: 0.0
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.out_size_factor: 2
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.voxel_size: [0.3, 0.3]
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.bbox_coder.code_size: 10
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_iou_rescore_weight: 0.5
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.loss_cls = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.type: FocalLoss
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.use_sigmoid: True
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.gamma: 2.0
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.alpha: 0.25
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.reduction: mean
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_cls.loss_weight: 1.0
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.loss_heatmap = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_heatmap.type: GaussianFocalLoss
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_heatmap.reduction: mean
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_heatmap.loss_weight: 1.0
2023-12-12 15:43:21   INFO  
cfg.MODEL.DENSE_HEAD.loss_bbox = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_bbox.type: L1Loss
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_bbox.reduction: mean
2023-12-12 15:43:21   INFO  cfg.MODEL.DENSE_HEAD.loss_bbox.loss_weight: 0.25
2023-12-12 15:43:21   INFO  
cfg.MODEL.POST_PROCESSING = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.RECALL_THRESH_LIST: [0.3, 0.5, 0.7]
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.SCORE_THRESH: 0.1
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.OUTPUT_RAW_SCORE: False
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.EVAL_METRIC: kitti
2023-12-12 15:43:21   INFO  
cfg.MODEL.POST_PROCESSING.NMS_CONFIG = edict()
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.NMS_CONFIG.MULTI_CLASSES_NMS: True
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.NMS_CONFIG.NMS_TYPE: nms_gpu
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.NMS_CONFIG.NMS_THRESH: 0.2
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.NMS_CONFIG.NMS_PRE_MAXSIZE: 1000
2023-12-12 15:43:21   INFO  cfg.MODEL.POST_PROCESSING.NMS_CONFIG.NMS_POST_MAXSIZE: 83
2023-12-12 15:43:21   INFO  
cfg.OPTIMIZATION = edict()
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.BATCH_SIZE_PER_GPU: 4
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.NUM_EPOCHS: 24
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.OPTIMIZER: adamw
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.LR: 0.005
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.WEIGHT_DECAY: 0.05
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.MOMENTUM: 0.9
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.MOMS: [0.95, 0.85]
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.PCT_START: 0.4
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.DIV_FACTOR: 10
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.DECAY_STEP_LIST: [35, 45]
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.LR_DECAY: 0.1
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.LR_CLIP: 1e-07
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.LR_WARMUP: False
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.WARMUP_EPOCH: 1
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.GRAD_NORM_CLIP: 35
2023-12-12 15:43:21   INFO  cfg.OPTIMIZATION.LOSS_SCALE_FP16: 4.0
2023-12-12 15:43:21   INFO  
cfg.HOOK = edict()
2023-12-12 15:43:21   INFO  
cfg.HOOK.DisableAugmentationHook = edict()
2023-12-12 15:43:21   INFO  cfg.HOOK.DisableAugmentationHook.DISABLE_AUG_LIST: ['gt_sampling']
2023-12-12 15:43:21   INFO  cfg.HOOK.DisableAugmentationHook.NUM_LAST_EPOCHS: 4
2023-12-12 15:43:21   INFO  cfg.TAG: picture_nuscenes_detection
2023-12-12 15:43:21   INFO  cfg.EXP_GROUP_PATH: cfgs/picture_models
2023-12-12 15:43:21   INFO  Database filter by min points car: 339949 => 294532
2023-12-12 15:43:21   INFO  Database filter by min points truck: 65262 => 60344
2023-12-12 15:43:21   INFO  Database filter by min points construction_vehicle: 11050 => 10589
2023-12-12 15:43:21   INFO  Database filter by min points bus: 12286 => 11619
2023-12-12 15:43:21   INFO  Database filter by min points trailer: 19202 => 17934
2023-12-12 15:43:21   INFO  Database filter by min points barrier: 107507 => 101993
2023-12-12 15:43:21   INFO  Database filter by min points motorcycle: 8846 => 8055
2023-12-12 15:43:21   INFO  Database filter by min points bicycle: 8185 => 7531
2023-12-12 15:43:21   INFO  Database filter by min points pedestrian: 161928 => 148520
2023-12-12 15:43:21   INFO  Database filter by min points traffic_cone: 62964 => 55504
2023-12-12 15:43:21   INFO  Loading GT database to shared memory
2023-12-12 15:43:27   INFO  GT database has been saved to shared memory
2023-12-12 15:43:27   INFO  Loading NuScenes dataset
2023-12-12 15:43:29   INFO  Total samples for NuScenes dataset: 28130
2023-12-12 15:43:32   INFO  Total samples after balanced resampling: 123580
2023-12-12 15:43:32   INFO  DistributedDataParallel(
  (module): TransFusion(
    (vfe): DynamicPillarVFE(
      (pfn_layers): ModuleList(
        (0): PFNLayerV2(
          (linear): Linear(in_features=11, out_features=64, bias=False)
          (norm): SyncBatchNorm(64, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
        (1): PFNLayerV2(
          (linear): Linear(in_features=256, out_features=256, bias=False)
          (norm): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (relu): ReLU()
        )
      )
    )
    (backbone_3d): DSVT(
      (input_layer): DSVTInputLayer(
        (posembed_layers): ModuleList(
          (0): ModuleList(
            (0): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (1): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (2): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
            (3): ModuleList(
              (0): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
              (1): PositionEmbeddingLearned(
                (position_embedding_head): Sequential(
                  (0): Linear(in_features=2, out_features=256, bias=True)
                  (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
                  (2): ReLU(inplace=True)
                  (3): Linear(in_features=256, out_features=256, bias=True)
                )
              )
            )
          )
        )
      )
      (stage_0): ModuleList(
        (0): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (1): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (2): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
        (3): DSVTBlock(
          (encoder_list): ModuleList(
            (0): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
            (1): DSVT_EncoderLayer(
              (win_attn): SetAttention(
                (self_attn): MultiheadAttention(
                  (out_proj): NonDynamicallyQuantizableLinear(in_features=256, out_features=256, bias=True)
                )
                (linear1): Linear(in_features=256, out_features=256, bias=True)
                (dropout): Dropout(p=0, inplace=False)
                (linear2): Linear(in_features=256, out_features=256, bias=True)
                (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
                (dropout1): Identity()
                (dropout2): Identity()
              )
              (norm): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
            )
          )
        )
      )
      (residual_norm_stage_0): ModuleList(
        (0): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
        (3): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
      )
    )
    (map_to_bev_module): PointPillarScatter3d()
    (pfe): None
    (backbone_2d): BaseBEVResBackbone(
      (blocks): ModuleList(
        (0): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (1): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
        (2): Sequential(
          (0): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(2, 2), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
            (downsample_layer): Sequential(
              (0): Conv2d(256, 256, kernel_size=(1, 1), stride=(2, 2), bias=False)
              (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            )
          )
          (1): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
          (2): BasicBlock(
            (conv1): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu1): ReLU()
            (conv2): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
            (bn2): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
            (relu2): ReLU()
          )
        )
      )
      (deblocks): ModuleList(
        (0): Sequential(
          (0): Conv2d(256, 256, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (1): Sequential(
          (0): ConvTranspose2d(256, 256, kernel_size=(1, 1), stride=(1, 1), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
        (2): Sequential(
          (0): ConvTranspose2d(256, 256, kernel_size=(2, 2), stride=(2, 2), bias=False)
          (1): SyncBatchNorm(256, eps=0.001, momentum=0.01, affine=True, track_running_stats=True)
          (2): ReLU()
        )
      )
    )
    (dense_head): TransFusionHeadV2(
      (loss_cls): SigmoidFocalClassificationLoss()
      (loss_bbox): L1Loss()
      (loss_heatmap): GaussianFocalLoss()
      (shared_conv): Conv2d(384, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
      (heatmap_head): Sequential(
        (0): BasicBlock2D(
          (conv): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
          (bn): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
          (relu): ReLU(inplace=True)
        )
        (1): Conv2d(256, 10, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
      )
      (class_encoding): Conv1d(10, 256, kernel_size=(1,), stride=(1,))
      (decoder): ModuleList(
        (0): TransformerDecoderLayer(
          (self_attn): MultiheadAttention(
            (out_proj): Linear(in_features=256, out_features=256, bias=True)
          )
          (multihead_attn): MultiheadAttention(
            (out_proj): Linear(in_features=256, out_features=256, bias=True)
          )
          (linear1): Linear(in_features=256, out_features=256, bias=True)
          (dropout): Dropout(p=0.0, inplace=False)
          (linear2): Linear(in_features=256, out_features=256, bias=True)
          (norm1): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          (norm2): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          (norm3): LayerNorm((256,), eps=1e-05, elementwise_affine=True)
          (dropout1): Dropout(p=0.0, inplace=False)
          (dropout2): Dropout(p=0.0, inplace=False)
          (dropout3): Dropout(p=0.0, inplace=False)
          (self_posembed): PositionEmbeddingLearned(
            (position_embedding_head): Sequential(
              (0): Conv1d(2, 256, kernel_size=(1,), stride=(1,))
              (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (2): ReLU(inplace=True)
              (3): Conv1d(256, 256, kernel_size=(1,), stride=(1,))
            )
          )
          (cross_posembed): PositionEmbeddingLearned(
            (position_embedding_head): Sequential(
              (0): Conv1d(2, 256, kernel_size=(1,), stride=(1,))
              (1): SyncBatchNorm(256, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (2): ReLU(inplace=True)
              (3): Conv1d(256, 256, kernel_size=(1,), stride=(1,))
            )
          )
        )
      )
      (prediction_heads): ModuleList(
        (0): FFN(
          (center): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 2, kernel_size=(1,), stride=(1,))
          )
          (height): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 1, kernel_size=(1,), stride=(1,))
          )
          (dim): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 3, kernel_size=(1,), stride=(1,))
          )
          (rot): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 2, kernel_size=(1,), stride=(1,))
          )
          (vel): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 2, kernel_size=(1,), stride=(1,))
          )
          (iou): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 1, kernel_size=(1,), stride=(1,))
          )
          (heatmap): Sequential(
            (0): BasicBlock1D(
              (conv): Conv1d(256, 64, kernel_size=(1,), stride=(1,))
              (bn): SyncBatchNorm(64, eps=1e-05, momentum=0.1, affine=True, track_running_stats=True)
              (relu): ReLU(inplace=True)
            )
            (1): Conv1d(64, 10, kernel_size=(1,), stride=(1,))
          )
        )
      )
    )
    (point_head): None
    (roi_head): None
  )
)
2023-12-12 15:44:37   INFO  Total number of parameters: 12543841
2023-12-12 15:44:37   INFO  **********************Start training cfgs/picture_models/picture_nuscenes_detection(default)**********************
2023-12-12 15:45:22   INFO  epoch: 0/24, acc_iter=50, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:07:34, time_cost(all): 0:00:55/1 day, 4:39:48, loss=3.496392422344086, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.9(1.03), norm=0.8243638988594859, lr=0.006213749352667012
2023-12-12 15:46:17   INFO  epoch: 0/24, acc_iter=100, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:36, time_cost(all): 0:01:50/1 day, 3:09:43, loss=3.343950715904923, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=3.6980294330020307, lr=0.007427498705334024
2023-12-12 15:47:13   INFO  epoch: 0/24, acc_iter=150, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:05:40, time_cost(all): 0:02:46/1 day, 3:53:06, loss=3.191509009465761, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=0.8528261636508854, lr=0.008641248058001037
2023-12-12 15:48:08   INFO  epoch: 0/24, acc_iter=200, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:05:27, time_cost(all): 0:03:41/1 day, 5:29:49, loss=3.039067303026599, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.97(1.03), norm=2.0507953482477195, lr=0.009854997410668049
2023-12-12 15:49:03   INFO  epoch: 0/24, acc_iter=250, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:08:43, time_cost(all): 0:04:36/1 day, 3:56:12, loss=2.886625596587436, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=2.676314548579949, lr=0.01106874676333506
2023-12-12 15:49:59   INFO  epoch: 0/24, acc_iter=300, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:03:17, time_cost(all): 0:05:32/1 day, 5:02:15, loss=2.734183890148274, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=0.8350444880541774, lr=0.012282496116002073
2023-12-12 15:50:54   INFO  epoch: 0/24, acc_iter=350, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:07:01, time_cost(all): 0:06:27/1 day, 4:57:52, loss=2.581742183709112, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=3.645749668521735, lr=0.013496245468669083
2023-12-12 15:51:49   INFO  epoch: 0/24, acc_iter=400, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:01:40, time_cost(all): 0:07:22/1 day, 5:06:29, loss=2.429300477269949, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.472724750144969, lr=0.014709994821336097
2023-12-12 15:52:45   INFO  epoch: 0/24, acc_iter=450, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:02:35, time_cost(all): 0:08:18/1 day, 3:04:53, loss=2.276858770830787, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.905330479232699, lr=0.015923744174003107
2023-12-12 15:53:40   INFO  epoch: 0/24, acc_iter=500, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:32, time_cost(all): 0:09:13/1 day, 3:16:58, loss=2.124417064391624, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=4.625602297007916, lr=0.01713749352667012
2023-12-12 15:54:35   INFO  epoch: 0/24, acc_iter=550, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:58:18, time_cost(all): 0:10:08/1 day, 4:58:17, loss=1.971975357952462, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=3.872903906136866, lr=0.01835124287933713
2023-12-12 15:55:31   INFO  epoch: 0/24, acc_iter=600, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:02:55, time_cost(all): 0:11:04/1 day, 5:32:26, loss=1.8195336515133, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=1.0367384233544685, lr=0.019564992232004145
2023-12-12 15:56:26   INFO  epoch: 0/24, acc_iter=650, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:02:01, time_cost(all): 0:11:59/1 day, 3:05:25, loss=1.667091945074137, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=3.7911160121992458, lr=0.020778741584671155
2023-12-12 15:57:21   INFO  epoch: 0/24, acc_iter=700, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:56:30, time_cost(all): 0:12:54/1 day, 5:16:21, loss=1.514650238634975, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=3.544925332929238, lr=0.02199249093733817
2023-12-12 15:58:17   INFO  epoch: 0/24, acc_iter=750, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:56:23, time_cost(all): 0:13:50/1 day, 5:23:32, loss=1.362208532195812, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=4.571744216000075, lr=0.02320624029000518
2023-12-12 15:59:12   INFO  epoch: 0/24, acc_iter=800, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:57:56, time_cost(all): 0:14:45/1 day, 3:50:40, loss=1.209766825756649, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=1.5310045502022491, lr=0.024419989642672193
2023-12-12 16:00:07   INFO  epoch: 0/24, acc_iter=850, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:57:47, time_cost(all): 0:15:40/1 day, 4:43:57, loss=1.057325119317488, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=2.4627050060035196, lr=0.025633738995339203
2023-12-12 16:01:03   INFO  epoch: 0/24, acc_iter=900, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:52:03, time_cost(all): 0:16:36/1 day, 3:06:43, loss=0.904883412878325, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=3.285863814378287, lr=0.026847488348006217
2023-12-12 16:01:58   INFO  epoch: 0/24, acc_iter=950, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:47, time_cost(all): 0:17:31/1 day, 5:34:57, loss=0.752441706439163, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=3.0955500574624755, lr=0.02806123770067323
2023-12-12 16:02:53   INFO  epoch: 0/24, acc_iter=1000, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:54:47, time_cost(all): 0:18:26/1 day, 2:48:10, loss=0.629591995684287, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=0.8585417253440644, lr=0.02927498705334024
2023-12-12 16:03:49   INFO  epoch: 0/24, acc_iter=1050, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:18, time_cost(all): 0:19:22/1 day, 4:28:23, loss=0.599812955382911, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.9731999249120247, lr=0.030488736406007255
2023-12-12 16:04:44   INFO  epoch: 0/24, acc_iter=1100, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:46, time_cost(all): 0:20:17/1 day, 3:03:46, loss=0.599625910765823, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=2.977632164009295, lr=0.031702485758674265
2023-12-12 16:05:39   INFO  epoch: 0/24, acc_iter=1150, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:50:31, time_cost(all): 0:21:12/1 day, 4:32:15, loss=0.599438866148734, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=3.251760088339528, lr=0.03291623511134128
2023-12-12 16:06:35   INFO  epoch: 0/24, acc_iter=1200, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:49:54, time_cost(all): 0:22:08/1 day, 4:30:41, loss=0.599251821531646, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=4.15480759223279, lr=0.03412998446400829
2023-12-12 16:07:30   INFO  epoch: 0/24, acc_iter=1250, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:57, time_cost(all): 0:23:03/1 day, 4:35:17, loss=0.599064776914557, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=3.0532698433536476, lr=0.0353437338166753
2023-12-12 16:08:26   INFO  epoch: 0/24, acc_iter=1300, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:45:58, time_cost(all): 0:23:59/1 day, 5:26:43, loss=0.598877732297469, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=3.756900436386461, lr=0.036557483169342306
2023-12-12 16:09:21   INFO  epoch: 0/24, acc_iter=1350, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:59, time_cost(all): 0:24:54/1 day, 2:51:00, loss=0.59869068768038, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.07(1.03), norm=4.493926442388792, lr=0.037771232522009326
2023-12-12 16:10:16   INFO  epoch: 0/24, acc_iter=1400, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:44:44, time_cost(all): 0:25:49/1 day, 4:58:40, loss=0.598503643063292, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=2.0755217176124887, lr=0.03898498187467633
2023-12-12 16:11:12   INFO  epoch: 0/24, acc_iter=1450, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:43:31, time_cost(all): 0:26:45/1 day, 5:10:22, loss=0.598316598446203, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=3.37828750196912, lr=0.04019873122734335
2023-12-12 16:12:07   INFO  epoch: 0/24, acc_iter=1500, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:23, time_cost(all): 0:27:40/1 day, 5:13:57, loss=0.598129553829115, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=2.190705520783281, lr=0.041412480580010354
2023-12-12 16:13:02   INFO  epoch: 0/24, acc_iter=1550, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:04, time_cost(all): 0:28:35/1 day, 3:35:14, loss=0.597942509212026, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=4.580004574806528, lr=0.042626229932677374
2023-12-12 16:13:58   INFO  epoch: 0/24, acc_iter=1600, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:40:18, time_cost(all): 0:29:31/1 day, 4:01:40, loss=0.597755464594938, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=1.355189797287015, lr=0.04383997928534438
2023-12-12 16:14:53   INFO  epoch: 0/24, acc_iter=1650, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:52, time_cost(all): 0:30:26/1 day, 4:19:00, loss=0.597568419977849, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=3.8058396117910176, lr=0.045053728638011395
2023-12-12 16:15:48   INFO  epoch: 0/24, acc_iter=1700, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:41:06, time_cost(all): 0:31:21/1 day, 5:16:51, loss=0.597381375360761, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.94(1.03), norm=4.243945566944917, lr=0.0462674779906784
2023-12-12 16:16:44   INFO  epoch: 0/24, acc_iter=1750, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:39:38, time_cost(all): 0:32:17/1 day, 3:09:45, loss=0.597194330743672, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.93(1.03), norm=4.632994978631183, lr=0.04748122734334542
2023-12-12 16:17:39   INFO  epoch: 0/24, acc_iter=1800, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:53, time_cost(all): 0:33:12/1 day, 5:02:38, loss=0.597007286126583, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=3.273795777169204, lr=0.04869497669601243
2023-12-12 16:18:34   INFO  epoch: 0/24, acc_iter=1850, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:16, time_cost(all): 0:34:07/1 day, 3:59:03, loss=0.596820241509495, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=2.106727327851549, lr=0.04990872604867944
2023-12-12 16:19:30   INFO  epoch: 0/24, acc_iter=1900, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:34, time_cost(all): 0:35:03/1 day, 4:06:43, loss=0.596633196892406, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=3.435983571239559, lr=0.05280618850336614
2023-12-12 16:20:25   INFO  epoch: 0/24, acc_iter=1950, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:13, time_cost(all): 0:35:58/1 day, 3:50:09, loss=0.596446152275318, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=3.0695750170780656, lr=0.05584056188503367
2023-12-12 16:21:20   INFO  epoch: 0/24, acc_iter=2000, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:34, time_cost(all): 0:36:53/1 day, 4:03:44, loss=0.596259107658229, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=3.5770356597074544, lr=0.05887493526670119
2023-12-12 16:22:16   INFO  epoch: 0/24, acc_iter=2050, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:21, time_cost(all): 0:37:49/1 day, 4:53:59, loss=0.596072063041141, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=1.5191271259709196, lr=0.06190930864836872
2023-12-12 16:23:11   INFO  epoch: 0/24, acc_iter=2100, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:11, time_cost(all): 0:38:44/1 day, 3:07:59, loss=0.595885018424052, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.9(1.03), norm=3.641844971461073, lr=0.06494368203003625
2023-12-12 16:24:06   INFO  epoch: 0/24, acc_iter=2150, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:32, time_cost(all): 0:39:39/1 day, 2:38:25, loss=0.595697973806964, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=1.1106641328359497, lr=0.06797805541170378
2023-12-12 16:25:02   INFO  epoch: 0/24, acc_iter=2200, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:37, time_cost(all): 0:40:35/1 day, 3:48:53, loss=0.595510929189875, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=4.616236593563804, lr=0.07101242879337132
2023-12-12 16:25:57   INFO  epoch: 0/24, acc_iter=2250, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:39, time_cost(all): 0:41:30/1 day, 5:00:10, loss=0.595323884572787, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=4.478942086384041, lr=0.07404680217503884
2023-12-12 16:26:52   INFO  epoch: 0/24, acc_iter=2300, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:31, time_cost(all): 0:42:25/1 day, 2:53:18, loss=0.595136839955698, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.06(1.03), norm=2.1074473001404876, lr=0.07708117555670638
2023-12-12 16:27:48   INFO  epoch: 0/24, acc_iter=2350, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:30, time_cost(all): 0:43:21/1 day, 4:39:35, loss=0.594949795338609, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=0.7032445374962363, lr=0.0801155489383739
2023-12-12 16:28:43   INFO  epoch: 0/24, acc_iter=2400, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:14, time_cost(all): 0:44:16/1 day, 3:43:47, loss=0.594762750721521, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=2.4186990119706833, lr=0.08314992232004143
2023-12-12 16:29:39   INFO  epoch: 0/24, acc_iter=2450, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:43, time_cost(all): 0:45:12/1 day, 3:23:38, loss=0.594575706104432, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=1.2629436409540045, lr=0.08618429570170896
2023-12-12 16:30:34   INFO  epoch: 0/24, acc_iter=2500, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:39, time_cost(all): 0:46:07/1 day, 2:59:36, loss=0.594388661487344, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=4.266346117057463, lr=0.08921866908337649
2023-12-12 16:31:29   INFO  epoch: 0/24, acc_iter=2550, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:13, time_cost(all): 0:47:02/1 day, 3:40:55, loss=0.594201616870255, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.01(1.03), norm=2.63311054991666, lr=0.09225304246504401
2023-12-12 16:32:25   INFO  epoch: 0/24, acc_iter=2600, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:07, time_cost(all): 0:47:58/1 day, 4:13:22, loss=0.594014572253167, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.547091943028581, lr=0.09528741584671155
2023-12-12 16:33:20   INFO  epoch: 0/24, acc_iter=2650, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:56, time_cost(all): 0:48:53/1 day, 4:34:12, loss=0.593827527636078, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.01(1.03), norm=4.551782214490748, lr=0.09832178922837907
2023-12-12 16:34:15   INFO  epoch: 0/24, acc_iter=2700, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:31, time_cost(all): 0:49:48/1 day, 3:17:07, loss=0.59364048301899, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.2697124418357215, lr=0.10135616261004661
2023-12-12 16:35:11   INFO  epoch: 0/24, acc_iter=2750, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:59, time_cost(all): 0:50:44/1 day, 4:23:03, loss=0.593453438401901, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=1.9919540423369786, lr=0.10439053599171413
2023-12-12 16:36:06   INFO  epoch: 0/24, acc_iter=2800, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:21, time_cost(all): 0:51:39/1 day, 3:18:51, loss=0.593266393784813, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=2.6770164703438404, lr=0.10742490937338166
2023-12-12 16:37:01   INFO  epoch: 0/24, acc_iter=2850, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:43, time_cost(all): 0:52:34/1 day, 4:04:57, loss=0.593079349167724, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.3521801828180735, lr=0.11045928275504921
2023-12-12 16:37:57   INFO  epoch: 0/24, acc_iter=2900, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:33, time_cost(all): 0:53:30/1 day, 3:24:22, loss=0.592892304550636, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=2.7736534512022715, lr=0.11349365613671673
2023-12-12 16:38:52   INFO  epoch: 0/24, acc_iter=2950, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:40, time_cost(all): 0:54:25/1 day, 2:59:12, loss=0.592705259933547, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=3.5907267498789506, lr=0.11652802951838426
2023-12-12 16:39:47   INFO  epoch: 0/24, acc_iter=3000, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:09, time_cost(all): 0:55:20/1 day, 2:56:03, loss=0.592518215316459, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.1(1.03), norm=1.615397218852127, lr=0.11956240290005178
2023-12-12 16:40:43   INFO  epoch: 0/24, acc_iter=3050, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:22, time_cost(all): 0:56:16/1 day, 4:37:59, loss=0.59233117069937, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=0.760642289858129, lr=0.1225967762817193
2023-12-12 16:41:38   INFO  epoch: 0/24, acc_iter=3100, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:30, time_cost(all): 0:57:11/1 day, 3:15:21, loss=0.592144126082282, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=3.0219282206367044, lr=0.12563114966338684
2023-12-12 16:42:33   INFO  epoch: 0/24, acc_iter=3150, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:22, time_cost(all): 0:58:06/1 day, 4:20:45, loss=0.591957081465193, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=2.2143678147156827, lr=0.12866552304505435
2023-12-12 16:43:29   INFO  epoch: 0/24, acc_iter=3200, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:47, time_cost(all): 0:59:02/1 day, 2:40:24, loss=0.591770036848104, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.03(1.03), norm=1.4991185502342987, lr=0.13169989642672192
2023-12-12 16:44:24   INFO  epoch: 0/24, acc_iter=3250, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:05, time_cost(all): 0:59:57/1 day, 3:01:02, loss=0.591582992231016, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.957066649941907, lr=0.13473426980838943
2023-12-12 16:45:19   INFO  epoch: 0/24, acc_iter=3300, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:24, time_cost(all): 1:00:52/1 day, 2:18:45, loss=0.591395947613927, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=2.7360277186371054, lr=0.13776864319005697
2023-12-12 16:46:15   INFO  epoch: 0/24, acc_iter=3350, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:32, time_cost(all): 1:01:48/1 day, 4:16:51, loss=0.591208902996839, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=1.1448605462160413, lr=0.1408030165717245
2023-12-12 16:47:10   INFO  epoch: 0/24, acc_iter=3400, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:42, time_cost(all): 1:02:43/1 day, 2:06:14, loss=0.59102185837975, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.09(1.03), norm=3.5865947991008413, lr=0.14383738995339201
2023-12-12 16:48:05   INFO  epoch: 0/24, acc_iter=3450, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:17, time_cost(all): 1:03:38/1 day, 4:17:29, loss=0.590834813762662, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.6825993055904425, lr=0.14687176333505955
2023-12-12 16:49:01   INFO  epoch: 0/24, acc_iter=3500, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:40, time_cost(all): 1:04:34/1 day, 3:58:47, loss=0.590647769145573, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.83507853202442, lr=0.1499061367167271
2023-12-12 16:49:56   INFO  epoch: 0/24, acc_iter=3550, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:49, time_cost(all): 1:05:29/1 day, 3:12:23, loss=0.590460724528485, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=0.7806994751361105, lr=0.1529405100983946
2023-12-12 16:50:52   INFO  epoch: 0/24, acc_iter=3600, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:59, time_cost(all): 1:06:25/1 day, 2:22:12, loss=0.590273679911396, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=2.2518978549931523, lr=0.15597488348006214
2023-12-12 16:51:47   INFO  epoch: 0/24, acc_iter=3650, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:45, time_cost(all): 1:07:20/1 day, 3:05:35, loss=0.590086635294308, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=2.2810074462632954, lr=0.15900925686172968
2023-12-12 16:52:42   INFO  epoch: 0/24, acc_iter=3700, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:00, time_cost(all): 1:08:15/1 day, 3:52:21, loss=0.589899590677219, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=3.1637261445050258, lr=0.16204363024339719
2023-12-12 16:53:38   INFO  epoch: 0/24, acc_iter=3750, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:01, time_cost(all): 1:09:11/1 day, 4:31:34, loss=0.589712546060131, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=1.6447274434555348, lr=0.16507800362506475
2023-12-12 16:54:33   INFO  epoch: 0/24, acc_iter=3800, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:10, time_cost(all): 1:10:06/1 day, 4:24:40, loss=0.589525501443042, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=3.351111472959375, lr=0.16811237700673226
2023-12-12 16:55:28   INFO  epoch: 0/24, acc_iter=3850, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:12, time_cost(all): 1:11:01/1 day, 1:59:34, loss=0.589338456825953, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=4.438960251543579, lr=0.17114675038839977
2023-12-12 16:56:24   INFO  epoch: 1/24, acc_iter=3912, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:13:14, time_cost(all): 1:11:57/1 day, 4:15:25, loss=0.589106521500764, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=4.389243564638546, lr=0.1749093733816675
2023-12-12 16:57:19   INFO  epoch: 1/24, acc_iter=3962, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:12:21, time_cost(all): 1:12:52/1 day, 4:25:38, loss=0.588919476883675, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=0.6666804110696076, lr=0.17794374676333502
2023-12-12 16:58:14   INFO  epoch: 1/24, acc_iter=4012, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:08:44, time_cost(all): 1:13:47/1 day, 3:06:03, loss=0.588732432266587, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=4.611229454795903, lr=0.18097812014500259
2023-12-12 16:59:10   INFO  epoch: 1/24, acc_iter=4062, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:04:28, time_cost(all): 1:14:43/1 day, 2:44:26, loss=0.588545387649498, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=2.791213882479486, lr=0.1840124935266701
2023-12-12 17:00:05   INFO  epoch: 1/24, acc_iter=4112, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:05:41, time_cost(all): 1:15:38/1 day, 3:51:37, loss=0.58835834303241, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=4.329882996131477, lr=0.1870468669083376
2023-12-12 17:01:00   INFO  epoch: 1/24, acc_iter=4162, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:05:40, time_cost(all): 1:16:33/1 day, 2:20:50, loss=0.588171298415321, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=0.7771730683944278, lr=0.19008124029000517
2023-12-12 17:01:56   INFO  epoch: 1/24, acc_iter=4212, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:02:19, time_cost(all): 1:17:29/1 day, 3:46:36, loss=0.587984253798232, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.05(1.03), norm=2.336193713201417, lr=0.19311561367167268
2023-12-12 17:02:51   INFO  epoch: 1/24, acc_iter=4262, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:04:49, time_cost(all): 1:18:24/1 day, 2:49:18, loss=0.587797209181144, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=2.9682836725275443, lr=0.1961499870533402
2023-12-12 17:03:46   INFO  epoch: 1/24, acc_iter=4312, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:44, time_cost(all): 1:19:19/1 day, 3:54:19, loss=0.587610164564055, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.654744925776565, lr=0.19918436043500776
2023-12-12 17:04:42   INFO  epoch: 1/24, acc_iter=4362, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/0:59:12, time_cost(all): 1:20:15/1 day, 3:23:02, loss=0.587423119946967, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=1.4550895008513525, lr=0.20221873381667527
2023-12-12 17:05:37   INFO  epoch: 1/24, acc_iter=4412, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:03:55, time_cost(all): 1:21:10/1 day, 2:31:25, loss=0.587236075329878, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=3.320987257495055, lr=0.20525310719834278
2023-12-12 17:06:32   INFO  epoch: 1/24, acc_iter=4462, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:02:15, time_cost(all): 1:22:05/1 day, 4:05:41, loss=0.58704903071279, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.03(1.03), norm=2.412670246561545, lr=0.20828748058001034
2023-12-12 17:07:28   INFO  epoch: 1/24, acc_iter=4512, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:59:48, time_cost(all): 1:23:01/1 day, 2:22:51, loss=0.586861986095701, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.12(1.03), norm=2.8112867326428463, lr=0.21132185396167785
2023-12-12 17:08:23   INFO  epoch: 1/24, acc_iter=4562, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:08, time_cost(all): 1:23:56/1 day, 3:52:45, loss=0.586674941478613, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.94(1.03), norm=3.7183195635759194, lr=0.21435622734334542
2023-12-12 17:09:18   INFO  epoch: 1/24, acc_iter=4612, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:54:37, time_cost(all): 1:24:51/1 day, 2:27:28, loss=0.586487896861524, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=3.9140830398719224, lr=0.21739060072501293
2023-12-12 17:10:14   INFO  epoch: 1/24, acc_iter=4662, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:59:12, time_cost(all): 1:25:47/1 day, 2:02:37, loss=0.586300852244436, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.02(1.03), norm=1.7627640022416857, lr=0.22042497410668044
2023-12-12 17:11:09   INFO  epoch: 1/24, acc_iter=4712, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:03, time_cost(all): 1:26:42/1 day, 3:14:56, loss=0.586113807627347, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.2182373152543953, lr=0.223459347488348
2023-12-12 17:12:05   INFO  epoch: 1/24, acc_iter=4762, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:52:38, time_cost(all): 1:27:38/1 day, 2:27:33, loss=0.585926763010259, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.670037987247441, lr=0.2264937208700155
2023-12-12 17:13:00   INFO  epoch: 1/24, acc_iter=4812, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:54, time_cost(all): 1:28:33/1 day, 2:53:24, loss=0.58573971839317, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=4.642351892977154, lr=0.22952809425168302
2023-12-12 17:13:55   INFO  epoch: 1/24, acc_iter=4862, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:51:49, time_cost(all): 1:29:28/1 day, 2:22:01, loss=0.585552673776081, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=4.697545237786519, lr=0.23256246763335053
2023-12-12 17:14:51   INFO  epoch: 1/24, acc_iter=4912, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:50:04, time_cost(all): 1:30:24/1 day, 3:20:17, loss=0.585365629158993, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=4.584807520634028, lr=0.2355968410150181
2023-12-12 17:15:46   INFO  epoch: 1/24, acc_iter=4962, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:53:06, time_cost(all): 1:31:19/1 day, 3:57:06, loss=0.585178584541904, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.04(1.03), norm=3.157268642584677, lr=0.2386312143966856
2023-12-12 17:16:41   INFO  epoch: 1/24, acc_iter=5012, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:48:54, time_cost(all): 1:32:14/1 day, 1:52:45, loss=0.584991539924816, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.034965748354244, lr=0.24166558777835317
2023-12-12 17:17:37   INFO  epoch: 1/24, acc_iter=5062, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:38, time_cost(all): 1:33:10/1 day, 4:09:18, loss=0.584804495307727, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.09(1.03), norm=3.6514319330636504, lr=0.24469996116002068
2023-12-12 17:18:32   INFO  epoch: 1/24, acc_iter=5112, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:38, time_cost(all): 1:34:05/1 day, 2:48:47, loss=0.584617450690639, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=3.1401332243742766, lr=0.2477343345416882
2023-12-12 17:19:27   INFO  epoch: 1/24, acc_iter=5162, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:45:51, time_cost(all): 1:35:00/1 day, 3:07:41, loss=0.58443040607355, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=1.9723573353331454, lr=0.25076870792335576
2023-12-12 17:20:23   INFO  epoch: 1/24, acc_iter=5212, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:47, time_cost(all): 1:35:56/1 day, 2:06:58, loss=0.584243361456462, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=2.8095651391690297, lr=0.25380308130502327
2023-12-12 17:21:18   INFO  epoch: 1/24, acc_iter=5262, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:43:26, time_cost(all): 1:36:51/1 day, 4:12:31, loss=0.584056316839373, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=1.391330465995587, lr=0.2568374546866908
2023-12-12 17:22:13   INFO  epoch: 1/24, acc_iter=5312, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:46, time_cost(all): 1:37:46/1 day, 4:06:18, loss=0.583869272222285, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=3.218556923436477, lr=0.25987182806835835
2023-12-12 17:23:09   INFO  epoch: 1/24, acc_iter=5362, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:05, time_cost(all): 1:38:42/1 day, 2:25:37, loss=0.583682227605196, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=4.913616437587268, lr=0.26290620145002586
2023-12-12 17:24:04   INFO  epoch: 1/24, acc_iter=5412, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:05, time_cost(all): 1:39:37/1 day, 2:30:00, loss=0.583495182988108, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=2.02285589237864, lr=0.2659405748316934
2023-12-12 17:24:59   INFO  epoch: 1/24, acc_iter=5462, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:40:37, time_cost(all): 1:40:32/1 day, 2:35:07, loss=0.583308138371019, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=0.5760114971928896, lr=0.26897494821336093
2023-12-12 17:25:55   INFO  epoch: 1/24, acc_iter=5512, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:39:36, time_cost(all): 1:41:28/1 day, 3:28:11, loss=0.583121093753931, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=1.1826758187660333, lr=0.27200932159502844
2023-12-12 17:26:50   INFO  epoch: 1/24, acc_iter=5562, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:19, time_cost(all): 1:42:23/1 day, 1:57:59, loss=0.582934049136842, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.94(1.03), norm=2.4292376288439854, lr=0.275043694976696
2023-12-12 17:27:45   INFO  epoch: 1/24, acc_iter=5612, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:05, time_cost(all): 1:43:18/1 day, 1:42:56, loss=0.582747004519753, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=3.05670200166371, lr=0.2780780683583635
2023-12-12 17:28:41   INFO  epoch: 1/24, acc_iter=5662, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:25, time_cost(all): 1:44:14/1 day, 3:51:30, loss=0.582559959902665, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.09(1.03), norm=0.9689276397807824, lr=0.281112441740031
2023-12-12 17:29:36   INFO  epoch: 1/24, acc_iter=5712, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:21, time_cost(all): 1:45:09/1 day, 2:32:00, loss=0.582372915285576, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=2.845438191184326, lr=0.28414681512169854
2023-12-12 17:30:31   INFO  epoch: 1/24, acc_iter=5762, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:16, time_cost(all): 1:46:04/1 day, 1:58:21, loss=0.582185870668488, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.99(1.03), norm=2.6374112727105468, lr=0.2871811885033661
2023-12-12 17:31:27   INFO  epoch: 1/24, acc_iter=5812, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:33:57, time_cost(all): 1:47:00/1 day, 2:40:40, loss=0.581998826051399, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.17(1.03), norm=1.177419372945793, lr=0.2902155618850336
2023-12-12 17:32:22   INFO  epoch: 1/24, acc_iter=5862, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:32:46, time_cost(all): 1:47:55/1 day, 3:29:33, loss=0.581811781434311, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.1(1.03), norm=2.891025755778278, lr=0.2932499352667011
2023-12-12 17:33:18   INFO  epoch: 1/24, acc_iter=5912, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:52, time_cost(all): 1:48:51/1 day, 1:39:54, loss=0.581624736817222, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=3.7881113056306286, lr=0.2962843086483687
2023-12-12 17:34:13   INFO  epoch: 1/24, acc_iter=5962, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:19, time_cost(all): 1:49:46/1 day, 2:57:17, loss=0.581437692200134, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=2.8990183417323783, lr=0.2993186820300362
2023-12-12 17:35:08   INFO  epoch: 1/24, acc_iter=6012, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:33:07, time_cost(all): 1:50:41/1 day, 1:51:35, loss=0.581250647583045, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.22(1.03), norm=1.3362658025034586, lr=0.3023530554117037
2023-12-12 17:36:04   INFO  epoch: 1/24, acc_iter=6062, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:23, time_cost(all): 1:51:37/1 day, 2:38:15, loss=0.581063602965957, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.4376180443148012, lr=0.3053874287933713
2023-12-12 17:36:59   INFO  epoch: 1/24, acc_iter=6112, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:36, time_cost(all): 1:52:32/1 day, 2:21:31, loss=0.580876558348868, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=4.968139388608686, lr=0.30842180217503884
2023-12-12 17:37:54   INFO  epoch: 1/24, acc_iter=6162, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:37, time_cost(all): 1:53:27/1 day, 1:22:06, loss=0.58068951373178, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=3.900123237344186, lr=0.31145617555670635
2023-12-12 17:38:50   INFO  epoch: 1/24, acc_iter=6212, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:56, time_cost(all): 1:54:23/1 day, 3:48:54, loss=0.580502469114691, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=2.567172615383283, lr=0.31449054893837386
2023-12-12 17:39:45   INFO  epoch: 1/24, acc_iter=6262, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:10, time_cost(all): 1:55:18/1 day, 1:35:53, loss=0.580315424497603, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.09(1.03), norm=3.3063324145574806, lr=0.31752492232004137
2023-12-12 17:40:40   INFO  epoch: 1/24, acc_iter=6312, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:16, time_cost(all): 1:56:13/1 day, 2:00:56, loss=0.580128379880514, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=0.9494306801379186, lr=0.32055929570170894
2023-12-12 17:41:36   INFO  epoch: 1/24, acc_iter=6362, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:35, time_cost(all): 1:57:09/1 day, 1:16:18, loss=0.579941335263425, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=2.566501432214804, lr=0.32359366908337645
2023-12-12 17:42:31   INFO  epoch: 1/24, acc_iter=6412, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:49, time_cost(all): 1:58:04/1 day, 1:50:59, loss=0.579754290646337, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=0.6954636942322326, lr=0.32662804246504396
2023-12-12 17:43:26   INFO  epoch: 1/24, acc_iter=6462, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:09, time_cost(all): 1:58:59/1 day, 3:30:38, loss=0.579567246029248, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=4.7275382169091955, lr=0.3296624158467115
2023-12-12 17:44:22   INFO  epoch: 1/24, acc_iter=6512, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:44, time_cost(all): 1:59:55/1 day, 2:52:55, loss=0.57938020141216, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.91(1.03), norm=4.309639645507798, lr=0.33269678922837903
2023-12-12 17:45:17   INFO  epoch: 1/24, acc_iter=6562, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:22:17, time_cost(all): 2:00:50/1 day, 3:28:39, loss=0.579193156795071, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.93(1.03), norm=1.6681521907351817, lr=0.33573116261004654
2023-12-12 17:46:12   INFO  epoch: 1/24, acc_iter=6612, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:48, time_cost(all): 2:01:45/1 day, 1:10:11, loss=0.579006112177983, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=3.414351039653859, lr=0.3387655359917141
2023-12-12 17:47:08   INFO  epoch: 1/24, acc_iter=6662, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:11, time_cost(all): 2:02:41/1 day, 1:29:48, loss=0.578819067560894, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.07(1.03), norm=4.2797967962091805, lr=0.3417999093733816
2023-12-12 17:48:03   INFO  epoch: 1/24, acc_iter=6712, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:11, time_cost(all): 2:03:36/1 day, 1:18:29, loss=0.578632022943806, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=2.7829739432767315, lr=0.3448342827550491
2023-12-12 17:48:58   INFO  epoch: 1/24, acc_iter=6762, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:42, time_cost(all): 2:04:31/1 day, 3:03:41, loss=0.578444978326717, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.94(1.03), norm=3.9423912862906665, lr=0.34786865613671664
2023-12-12 17:49:54   INFO  epoch: 1/24, acc_iter=6812, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:02, time_cost(all): 2:05:27/1 day, 1:14:32, loss=0.578257933709629, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=1.6360980627215036, lr=0.3509030295183842
2023-12-12 17:50:49   INFO  epoch: 1/24, acc_iter=6862, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:47, time_cost(all): 2:06:22/1 day, 1:43:43, loss=0.57807088909254, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=2.0182214540212193, lr=0.3539374029000517
2023-12-12 17:51:44   INFO  epoch: 1/24, acc_iter=6912, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:25, time_cost(all): 2:07:17/1 day, 1:32:32, loss=0.577883844475451, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.06(1.03), norm=3.8296164987647816, lr=0.3569717762817192
2023-12-12 17:52:40   INFO  epoch: 1/24, acc_iter=6962, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:07, time_cost(all): 2:08:13/1 day, 2:46:28, loss=0.577696799858363, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.16(1.03), norm=3.3856662812935587, lr=0.36000614966338684
2023-12-12 17:53:35   INFO  epoch: 1/24, acc_iter=7012, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:19, time_cost(all): 2:09:08/1 day, 2:59:49, loss=0.577509755241274, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.21(1.03), norm=2.214493726410845, lr=0.36304052304505435
2023-12-12 17:54:30   INFO  epoch: 1/24, acc_iter=7062, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:17, time_cost(all): 2:10:03/1 day, 3:15:33, loss=0.577322710624186, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=1.1848956235411472, lr=0.36607489642672186
2023-12-12 17:55:26   INFO  epoch: 1/24, acc_iter=7112, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:00, time_cost(all): 2:10:59/1 day, 3:04:46, loss=0.577135666007097, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=2.3054195869301863, lr=0.3691092698083894
2023-12-12 17:56:21   INFO  epoch: 1/24, acc_iter=7162, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:01, time_cost(all): 2:11:54/1 day, 2:23:39, loss=0.576948621390009, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.12(1.03), norm=4.089463281791829, lr=0.37214364319005694
2023-12-12 17:57:17   INFO  epoch: 1/24, acc_iter=7212, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:32, time_cost(all): 2:12:50/1 day, 1:56:27, loss=0.57676157677292, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=2.882403120660642, lr=0.37517801657172445
2023-12-12 17:58:12   INFO  epoch: 1/24, acc_iter=7262, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:26, time_cost(all): 2:13:45/1 day, 1:01:24, loss=0.576574532155832, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=2.446305030307994, lr=0.37821238995339196
2023-12-12 17:59:07   INFO  epoch: 1/24, acc_iter=7312, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:16, time_cost(all): 2:14:40/1 day, 1:44:14, loss=0.576387487538743, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=1.9402557766877406, lr=0.3812467633350595
2023-12-12 18:00:03   INFO  epoch: 1/24, acc_iter=7362, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:28, time_cost(all): 2:15:36/1 day, 2:04:42, loss=0.576200442921655, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=2.7567563890002225, lr=0.38428113671672703
2023-12-12 18:00:58   INFO  epoch: 1/24, acc_iter=7412, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:52, time_cost(all): 2:16:31/1 day, 1:19:39, loss=0.576013398304566, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=2.508181579233167, lr=0.38731551009839454
2023-12-12 18:01:53   INFO  epoch: 1/24, acc_iter=7462, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:46, time_cost(all): 2:17:26/1 day, 1:45:25, loss=0.575826353687478, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=3.9440493611222323, lr=0.3903498834800621
2023-12-12 18:02:49   INFO  epoch: 1/24, acc_iter=7512, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:50, time_cost(all): 2:18:22/1 day, 0:58:50, loss=0.575639309070389, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=4.351319644796114, lr=0.3933842568617296
2023-12-12 18:03:44   INFO  epoch: 1/24, acc_iter=7562, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:59, time_cost(all): 2:19:17/1 day, 1:25:47, loss=0.575452264453301, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.07(1.03), norm=2.255829359982706, lr=0.39641863024339713
2023-12-12 18:04:39   INFO  epoch: 1/24, acc_iter=7612, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:01:59, time_cost(all): 2:20:12/1 day, 1:09:46, loss=0.575265219836212, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.86(1.03), norm=2.6835054113456906, lr=0.39945300362506464
2023-12-12 18:05:35   INFO  epoch: 1/24, acc_iter=7662, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 2:21:08/1 day, 1:32:28, loss=0.575078175219124, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.89(1.03), norm=3.493378124125769, lr=0.4024873770067322
2023-12-12 18:06:30   INFO  epoch: 1/24, acc_iter=7712, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 2:22:03/1 day, 2:16:18, loss=0.574891130602035, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=0.9638589504077306, lr=0.4055217503883997
2023-12-12 18:07:25   INFO  epoch: 2/24, acc_iter=7774, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:10:59, time_cost(all): 2:22:58/1 day, 0:58:22, loss=0.574659195276845, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=3.4438535284245626, lr=0.40928437338166745
2023-12-12 18:08:21   INFO  epoch: 2/24, acc_iter=7824, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:11:08, time_cost(all): 2:23:54/1 day, 0:57:13, loss=0.574472150659757, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=4.45396660619702, lr=0.41231874676333496
2023-12-12 18:09:16   INFO  epoch: 2/24, acc_iter=7874, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:05, time_cost(all): 2:24:49/1 day, 1:46:57, loss=0.574285106042668, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.94(1.03), norm=2.315830852801887, lr=0.41535312014500253
2023-12-12 18:10:11   INFO  epoch: 2/24, acc_iter=7924, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:07:08, time_cost(all): 2:25:44/1 day, 3:03:51, loss=0.57409806142558, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=4.0042665781253355, lr=0.41838749352667004
2023-12-12 18:11:07   INFO  epoch: 2/24, acc_iter=7974, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:04:05, time_cost(all): 2:26:40/1 day, 2:00:54, loss=0.573911016808491, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=3.3460327511215264, lr=0.4214218669083376
2023-12-12 18:12:02   INFO  epoch: 2/24, acc_iter=8024, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:03:52, time_cost(all): 2:27:35/1 day, 0:47:24, loss=0.573723972191402, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.22(1.03), norm=1.4253139409239644, lr=0.4244562402900051
2023-12-12 18:12:57   INFO  epoch: 2/24, acc_iter=8074, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:04:20, time_cost(all): 2:28:30/1 day, 2:26:39, loss=0.573536927574314, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=2.2354782049593886, lr=0.4274906136716727
2023-12-12 18:13:53   INFO  epoch: 2/24, acc_iter=8124, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:04:34, time_cost(all): 2:29:26/1 day, 1:37:43, loss=0.573349882957225, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.16(1.03), norm=2.5251669354450095, lr=0.4305249870533402
2023-12-12 18:14:48   INFO  epoch: 2/24, acc_iter=8174, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:01:47, time_cost(all): 2:30:21/1 day, 2:34:57, loss=0.573162838340137, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.0(1.03), norm=4.95439405632086, lr=0.4335593604350077
2023-12-12 18:15:43   INFO  epoch: 2/24, acc_iter=8224, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:53, time_cost(all): 2:31:16/1 day, 2:55:03, loss=0.572975793723048, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.5384767845554146, lr=0.43659373381667527
2023-12-12 18:16:39   INFO  epoch: 2/24, acc_iter=8274, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:59:22, time_cost(all): 2:32:12/1 day, 2:30:19, loss=0.57278874910596, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=3.650146284575256, lr=0.4396281071983428
2023-12-12 18:17:34   INFO  epoch: 2/24, acc_iter=8324, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:57:33, time_cost(all): 2:33:07/1 day, 2:44:11, loss=0.572601704488871, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=3.575452969601126, lr=0.4426624805800103
2023-12-12 18:18:30   INFO  epoch: 2/24, acc_iter=8374, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:56:23, time_cost(all): 2:34:03/1 day, 2:33:10, loss=0.572414659871783, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.19(1.03), norm=1.1400506522479081, lr=0.44569685396167785
2023-12-12 18:19:25   INFO  epoch: 2/24, acc_iter=8424, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:19, time_cost(all): 2:34:58/1 day, 3:11:01, loss=0.572227615254694, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=3.9454472216241587, lr=0.44873122734334536
2023-12-12 18:20:20   INFO  epoch: 2/24, acc_iter=8474, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:55:30, time_cost(all): 2:35:53/1 day, 1:45:56, loss=0.572040570637606, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=4.496528096663119, lr=0.4517656007250129
2023-12-12 18:21:16   INFO  epoch: 2/24, acc_iter=8524, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:51, time_cost(all): 2:36:49/1 day, 3:06:31, loss=0.571853526020517, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=1.681071292244328, lr=0.4547999741066804
2023-12-12 18:22:11   INFO  epoch: 2/24, acc_iter=8574, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:43, time_cost(all): 2:37:44/1 day, 1:42:19, loss=0.571666481403429, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.6155433170407985, lr=0.45783434748834795
2023-12-12 18:23:06   INFO  epoch: 2/24, acc_iter=8624, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:11, time_cost(all): 2:38:39/1 day, 2:38:48, loss=0.57147943678634, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=1.9499426731448763, lr=0.46086872087001546
2023-12-12 18:24:02   INFO  epoch: 2/24, acc_iter=8674, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:52:35, time_cost(all): 2:39:35/1 day, 2:35:50, loss=0.571292392169252, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=1.476062399857831, lr=0.46390309425168297
2023-12-12 18:24:57   INFO  epoch: 2/24, acc_iter=8724, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:51:36, time_cost(all): 2:40:30/1 day, 1:58:18, loss=0.571105347552163, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=1.4562448166443105, lr=0.46693746763335053
2023-12-12 18:25:52   INFO  epoch: 2/24, acc_iter=8774, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:58, time_cost(all): 2:41:25/1 day, 2:19:02, loss=0.570918302935075, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.06(1.03), norm=2.3663100742721093, lr=0.46997184101501804
2023-12-12 18:26:48   INFO  epoch: 2/24, acc_iter=8824, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:49:55, time_cost(all): 2:42:21/1 day, 2:18:03, loss=0.570731258317986, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=3.2697611938046114, lr=0.47300621439668555
2023-12-12 18:27:43   INFO  epoch: 2/24, acc_iter=8874, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:48:25, time_cost(all): 2:43:16/1 day, 1:16:19, loss=0.570544213700897, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=3.5604330850352204, lr=0.4760405877783531
2023-12-12 18:28:38   INFO  epoch: 2/24, acc_iter=8924, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:18, time_cost(all): 2:44:11/1 day, 1:38:59, loss=0.570357169083809, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=4.752295170814984, lr=0.4790749611600207
2023-12-12 18:29:34   INFO  epoch: 2/24, acc_iter=8974, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:20, time_cost(all): 2:45:07/1 day, 1:43:17, loss=0.57017012446672, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=4.155258646199294, lr=0.4821093345416882
2023-12-12 18:30:29   INFO  epoch: 2/24, acc_iter=9024, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:53, time_cost(all): 2:46:02/1 day, 0:32:17, loss=0.569983079849632, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=2.457701439391165, lr=0.4851437079233557
2023-12-12 18:31:24   INFO  epoch: 2/24, acc_iter=9074, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:31, time_cost(all): 2:46:57/1 day, 0:30:33, loss=0.569796035232543, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=3.779562350515627, lr=0.48817808130502327
2023-12-12 18:32:20   INFO  epoch: 2/24, acc_iter=9124, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:02, time_cost(all): 2:47:53/1 day, 2:45:49, loss=0.569608990615455, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=1.9421332654955077, lr=0.4912124546866908
2023-12-12 18:33:15   INFO  epoch: 2/24, acc_iter=9174, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:42:54, time_cost(all): 2:48:48/1 day, 2:07:39, loss=0.569421945998366, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.9(1.03), norm=4.037762099402794, lr=0.4942468280683583
2023-12-12 18:34:10   INFO  epoch: 2/24, acc_iter=9224, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:27, time_cost(all): 2:49:43/1 day, 0:32:53, loss=0.569234901381278, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.12(1.03), norm=4.534019106376996, lr=0.49728120145002586
2023-12-12 18:35:06   INFO  epoch: 2/24, acc_iter=9274, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:09, time_cost(all): 2:50:39/1 day, 1:44:38, loss=0.569047856764189, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.92(1.03), norm=0.5769903950548032, lr=0.49996444227248527
2023-12-12 18:36:01   INFO  epoch: 2/24, acc_iter=9324, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:39:55, time_cost(all): 2:51:34/1 day, 2:38:31, loss=0.568860812147101, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.384642222568134, lr=0.4996225410463819
2023-12-12 18:36:56   INFO  epoch: 2/24, acc_iter=9374, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:05, time_cost(all): 2:52:29/1 day, 0:25:33, loss=0.568673767530012, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.98(1.03), norm=4.433612471490835, lr=0.4992806398202785
2023-12-12 18:37:52   INFO  epoch: 2/24, acc_iter=9424, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:34, time_cost(all): 2:53:25/1 day, 1:04:25, loss=0.568486722912923, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=4.604711690838067, lr=0.4989387385941751
2023-12-12 18:38:47   INFO  epoch: 2/24, acc_iter=9474, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:40:38, time_cost(all): 2:54:20/1 day, 0:59:10, loss=0.568299678295835, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.4861117519903577, lr=0.4985968373680717
2023-12-12 18:39:43   INFO  epoch: 2/24, acc_iter=9524, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:11, time_cost(all): 2:55:16/1 day, 1:38:04, loss=0.568112633678746, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=2.4596395890862612, lr=0.49825493614196836
2023-12-12 18:40:38   INFO  epoch: 2/24, acc_iter=9574, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:34, time_cost(all): 2:56:11/1 day, 1:27:38, loss=0.567925589061658, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=2.198136616179158, lr=0.49791303491586497
2023-12-12 18:41:33   INFO  epoch: 2/24, acc_iter=9624, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:37:00, time_cost(all): 2:57:06/1 day, 0:49:40, loss=0.567738544444569, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=4.805497991570194, lr=0.4975711336897616
2023-12-12 18:42:29   INFO  epoch: 2/24, acc_iter=9674, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:21, time_cost(all): 2:58:02/1 day, 1:54:00, loss=0.567551499827481, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=4.303563652737386, lr=0.4972292324636582
2023-12-12 18:43:24   INFO  epoch: 2/24, acc_iter=9724, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:44, time_cost(all): 2:58:57/1 day, 0:24:32, loss=0.567364455210392, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=1.727321936567542, lr=0.4968873312375548
2023-12-12 18:44:19   INFO  epoch: 2/24, acc_iter=9774, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:08, time_cost(all): 2:59:52/1 day, 1:29:21, loss=0.567177410593304, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=3.265243274018144, lr=0.4965454300114514
2023-12-12 18:45:15   INFO  epoch: 2/24, acc_iter=9824, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:51, time_cost(all): 3:00:48/1 day, 2:10:46, loss=0.566990365976215, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=3.4188777773959016, lr=0.496203528785348
2023-12-12 18:46:10   INFO  epoch: 2/24, acc_iter=9874, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:28, time_cost(all): 3:01:43/1 day, 0:43:40, loss=0.566803321359127, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.97(1.03), norm=1.201552620344641, lr=0.4958616275592447
2023-12-12 18:47:05   INFO  epoch: 2/24, acc_iter=9924, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:21, time_cost(all): 3:02:38/1 day, 0:58:32, loss=0.566616276742038, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=1.3846383821450696, lr=0.4955197263331413
2023-12-12 18:48:01   INFO  epoch: 2/24, acc_iter=9974, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:36, time_cost(all): 3:03:34/1 day, 1:54:45, loss=0.56642923212495, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=4.608503331513336, lr=0.4951778251070379
2023-12-12 18:48:56   INFO  epoch: 2/24, acc_iter=10024, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:26, time_cost(all): 3:04:29/1 day, 1:08:21, loss=0.566242187507861, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.93(1.03), norm=2.8616566840003306, lr=0.4948359238809345
2023-12-12 18:49:51   INFO  epoch: 2/24, acc_iter=10074, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:17, time_cost(all): 3:05:24/1 day, 2:07:34, loss=0.566055142890773, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=4.994032019626009, lr=0.4944940226548311
2023-12-12 18:50:47   INFO  epoch: 2/24, acc_iter=10124, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:22, time_cost(all): 3:06:20/1 day, 1:08:47, loss=0.565868098273684, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.03(1.03), norm=1.9406337569738379, lr=0.4941521214287277
2023-12-12 18:51:42   INFO  epoch: 2/24, acc_iter=10174, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:24:51, time_cost(all): 3:07:15/1 day, 1:34:16, loss=0.565681053656595, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=2.998517935453935, lr=0.49381022020262433
2023-12-12 18:52:37   INFO  epoch: 2/24, acc_iter=10224, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:44, time_cost(all): 3:08:10/1 day, 2:28:17, loss=0.565494009039507, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=1.586066977603107, lr=0.493468318976521
2023-12-12 18:53:33   INFO  epoch: 2/24, acc_iter=10274, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:49, time_cost(all): 3:09:06/1 day, 0:38:20, loss=0.565306964422418, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.6329303301783522, lr=0.4931264177504176
2023-12-12 18:54:28   INFO  epoch: 2/24, acc_iter=10324, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:22:46, time_cost(all): 3:10:01/1 day, 0:18:52, loss=0.56511991980533, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.19(1.03), norm=3.3599427388798273, lr=0.4927845165243142
2023-12-12 18:55:23   INFO  epoch: 2/24, acc_iter=10374, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:49, time_cost(all): 3:10:56/1 day, 1:50:38, loss=0.564932875188241, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=4.117741084434494, lr=0.4924426152982108
2023-12-12 18:56:19   INFO  epoch: 2/24, acc_iter=10424, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:22:29, time_cost(all): 3:11:52/1 day, 1:37:52, loss=0.564745830571153, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=1.4584638941936472, lr=0.4921007140721074
2023-12-12 18:57:14   INFO  epoch: 2/24, acc_iter=10474, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:26, time_cost(all): 3:12:47/1 day, 0:57:06, loss=0.564558785954064, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.4706228754611281, lr=0.49175881284600403
2023-12-12 18:58:09   INFO  epoch: 2/24, acc_iter=10524, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:21, time_cost(all): 3:13:42/1 day, 2:28:09, loss=0.564371741336976, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=1.5179509305994756, lr=0.49141691161990064
2023-12-12 18:59:05   INFO  epoch: 2/24, acc_iter=10574, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:19, time_cost(all): 3:14:38/1 day, 2:18:44, loss=0.564184696719887, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.18(1.03), norm=2.5183832263868307, lr=0.4910750103937973
2023-12-12 19:00:00   INFO  epoch: 2/24, acc_iter=10624, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:21, time_cost(all): 3:15:33/1 day, 0:43:59, loss=0.563997652102799, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=0.986671269558636, lr=0.4907331091676939
2023-12-12 19:00:56   INFO  epoch: 2/24, acc_iter=10674, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:02, time_cost(all): 3:16:29/1 day, 0:09:35, loss=0.56381060748571, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.86(1.03), norm=4.135591123753338, lr=0.4903912079415905
2023-12-12 19:01:51   INFO  epoch: 2/24, acc_iter=10724, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:55, time_cost(all): 3:17:24/1 day, 0:19:02, loss=0.563623562868622, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.4596011361410923, lr=0.49004930671548713
2023-12-12 19:02:46   INFO  epoch: 2/24, acc_iter=10774, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:38, time_cost(all): 3:18:19/1 day, 0:35:54, loss=0.563436518251533, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=2.0411624608288195, lr=0.48970740548938374
2023-12-12 19:03:42   INFO  epoch: 2/24, acc_iter=10824, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:26, time_cost(all): 3:19:15/1 day, 2:24:31, loss=0.563249473634445, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.88(1.03), norm=2.784655366214064, lr=0.48936550426328035
2023-12-12 19:04:37   INFO  epoch: 2/24, acc_iter=10874, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:45, time_cost(all): 3:20:10/1 day, 2:19:20, loss=0.563062429017356, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=1.9856355407871296, lr=0.48902360303717696
2023-12-12 19:05:32   INFO  epoch: 2/24, acc_iter=10924, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:30, time_cost(all): 3:21:05/1 day, 1:43:03, loss=0.562875384400267, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=2.298766778173333, lr=0.4886817018110736
2023-12-12 19:06:28   INFO  epoch: 2/24, acc_iter=10974, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:34, time_cost(all): 3:22:01/1 day, 0:53:07, loss=0.562688339783179, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.036067421240234, lr=0.48833980058497023
2023-12-12 19:07:23   INFO  epoch: 2/24, acc_iter=11024, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:15, time_cost(all): 3:22:56/1 day, 1:20:20, loss=0.56250129516609, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.879078500489613, lr=0.48799789935886684
2023-12-12 19:08:18   INFO  epoch: 2/24, acc_iter=11074, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:33, time_cost(all): 3:23:51/1 day, 0:31:56, loss=0.562314250549002, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.89(1.03), norm=2.8524813772341107, lr=0.48765599813276345
2023-12-12 19:09:14   INFO  epoch: 2/24, acc_iter=11124, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:48, time_cost(all): 3:24:47/1 day, 1:04:30, loss=0.562127205931913, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=0.8782983969941822, lr=0.48731409690666005
2023-12-12 19:10:09   INFO  epoch: 2/24, acc_iter=11174, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:27, time_cost(all): 3:25:42/1 day, 1:27:10, loss=0.561940161314825, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=1.7449346736934381, lr=0.48697219568055666
2023-12-12 19:11:04   INFO  epoch: 2/24, acc_iter=11224, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:39, time_cost(all): 3:26:37/1 day, 1:22:43, loss=0.561753116697736, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=1.5311282650257434, lr=0.48663029445445327
2023-12-12 19:12:00   INFO  epoch: 2/24, acc_iter=11274, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:40, time_cost(all): 3:27:33/1 day, 1:37:29, loss=0.561566072080648, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.1(1.03), norm=3.8942068372280265, lr=0.48628839322834994
2023-12-12 19:12:55   INFO  epoch: 2/24, acc_iter=11324, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:05:04, time_cost(all): 3:28:28/1 day, 0:18:28, loss=0.561379027463559, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=2.410607306705309, lr=0.48594649200224654
2023-12-12 19:13:50   INFO  epoch: 2/24, acc_iter=11374, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:53, time_cost(all): 3:29:23/1 day, 0:53:21, loss=0.561191982846471, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=4.775020662626698, lr=0.48560459077614315
2023-12-12 19:14:46   INFO  epoch: 2/24, acc_iter=11424, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:00, time_cost(all): 3:30:19/1 day, 0:29:31, loss=0.561004938229382, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=2.6706137453994536, lr=0.48526268955003976
2023-12-12 19:15:41   INFO  epoch: 2/24, acc_iter=11474, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:09, time_cost(all): 3:31:14/1 day, 1:56:25, loss=0.560817893612294, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.16(1.03), norm=2.486028268907263, lr=0.48492078832393637
2023-12-12 19:16:36   INFO  epoch: 2/24, acc_iter=11524, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:10, time_cost(all): 3:32:09/1 day, 1:43:53, loss=0.560630848995205, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.9177805729793476, lr=0.484578887097833
2023-12-12 19:17:32   INFO  epoch: 2/24, acc_iter=11574, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 3:33:05/1 day, 1:40:03, loss=0.560443804378116, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=1.1058368053596763, lr=0.4842369858717296
2023-12-12 19:18:27   INFO  epoch: 3/24, acc_iter=11636, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:08:19, time_cost(all): 3:34:00/1 day, 1:11:31, loss=0.560211869052927, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=0.7406559582349277, lr=0.4838130283513614
2023-12-12 19:19:22   INFO  epoch: 3/24, acc_iter=11686, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:50, time_cost(all): 3:34:55/1 day, 0:43:03, loss=0.560024824435838, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=1.455419338320885, lr=0.483471127125258
2023-12-12 19:20:18   INFO  epoch: 3/24, acc_iter=11736, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:46, time_cost(all): 3:35:51/1 day, 0:16:35, loss=0.55983777981875, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.91(1.03), norm=0.6862785227982944, lr=0.48312922589915464
2023-12-12 19:21:13   INFO  epoch: 3/24, acc_iter=11786, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:04:30, time_cost(all): 3:36:46/1 day, 0:10:20, loss=0.559650735201661, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.03(1.03), norm=2.399360508767621, lr=0.48278732467305124
2023-12-12 19:22:09   INFO  epoch: 3/24, acc_iter=11836, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:04:34, time_cost(all): 3:37:42/1 day, 0:02:39, loss=0.559463690584573, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.8247176619970986, lr=0.48244542344694785
2023-12-12 19:23:04   INFO  epoch: 3/24, acc_iter=11886, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:03:47, time_cost(all): 3:38:37/23:47:09, loss=0.559276645967484, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=4.8070060163383985, lr=0.4821035222208445
2023-12-12 19:23:59   INFO  epoch: 3/24, acc_iter=11936, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:03:38, time_cost(all): 3:39:32/1 day, 0:38:31, loss=0.559089601350396, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=4.37363022104015, lr=0.4817616209947411
2023-12-12 19:24:55   INFO  epoch: 3/24, acc_iter=11986, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:05:21, time_cost(all): 3:40:28/1 day, 1:17:52, loss=0.558902556733307, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=1.566168604006846, lr=0.48141971976863773
2023-12-12 19:25:50   INFO  epoch: 3/24, acc_iter=12036, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:49, time_cost(all): 3:41:23/1 day, 1:46:04, loss=0.558715512116218, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=2.075348836221139, lr=0.48107781854253434
2023-12-12 19:26:45   INFO  epoch: 3/24, acc_iter=12086, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/0:59:44, time_cost(all): 3:42:18/1 day, 0:54:13, loss=0.55852846749913, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=3.837365797652246, lr=0.48073591731643095
2023-12-12 19:27:41   INFO  epoch: 3/24, acc_iter=12136, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:01:21, time_cost(all): 3:43:14/1 day, 0:52:32, loss=0.558341422882041, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=4.098643133284973, lr=0.48039401609032756
2023-12-12 19:28:36   INFO  epoch: 3/24, acc_iter=12186, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:11, time_cost(all): 3:44:09/23:36:09, loss=0.558154378264953, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=2.4773566534575138, lr=0.48005211486422417
2023-12-12 19:29:31   INFO  epoch: 3/24, acc_iter=12236, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:02:09, time_cost(all): 3:45:04/1 day, 1:20:31, loss=0.557967333647864, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.731777354245239, lr=0.47971021363812083
2023-12-12 19:30:27   INFO  epoch: 3/24, acc_iter=12286, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:20, time_cost(all): 3:46:00/1 day, 0:43:42, loss=0.557780289030776, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.07(1.03), norm=1.533038721940795, lr=0.47936831241201744
2023-12-12 19:31:22   INFO  epoch: 3/24, acc_iter=12336, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:54:34, time_cost(all): 3:46:55/1 day, 0:32:09, loss=0.557593244413687, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=1.5747498076184738, lr=0.47902641118591405
2023-12-12 19:32:17   INFO  epoch: 3/24, acc_iter=12386, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:54:28, time_cost(all): 3:47:50/1 day, 1:14:59, loss=0.557406199796599, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=3.7104138867210787, lr=0.47868450995981066
2023-12-12 19:33:13   INFO  epoch: 3/24, acc_iter=12436, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:54:54, time_cost(all): 3:48:46/23:30:47, loss=0.55721915517951, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.21(1.03), norm=3.4063663019611297, lr=0.47834260873370726
2023-12-12 19:34:08   INFO  epoch: 3/24, acc_iter=12486, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:56:19, time_cost(all): 3:49:41/23:47:29, loss=0.557032110562422, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.11(1.03), norm=4.631627128540719, lr=0.4780007075076039
2023-12-12 19:35:03   INFO  epoch: 3/24, acc_iter=12536, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:52:26, time_cost(all): 3:50:36/1 day, 0:24:45, loss=0.556845065945333, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=4.102153003921078, lr=0.4776588062815005
2023-12-12 19:35:59   INFO  epoch: 3/24, acc_iter=12586, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:50:37, time_cost(all): 3:51:32/1 day, 1:01:49, loss=0.556658021328244, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=3.737687408727027, lr=0.47731690505539714
2023-12-12 19:36:54   INFO  epoch: 3/24, acc_iter=12636, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:21, time_cost(all): 3:52:27/23:30:50, loss=0.556470976711156, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=2.6835415250892964, lr=0.47697500382929375
2023-12-12 19:37:49   INFO  epoch: 3/24, acc_iter=12686, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:11, time_cost(all): 3:53:22/1 day, 1:24:37, loss=0.556283932094067, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=3.0651692535661264, lr=0.47663310260319036
2023-12-12 19:38:45   INFO  epoch: 3/24, acc_iter=12736, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:51:57, time_cost(all): 3:54:18/1 day, 1:40:40, loss=0.556096887476979, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=1.071431515396755, lr=0.47629120137708697
2023-12-12 19:39:40   INFO  epoch: 3/24, acc_iter=12786, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:49:27, time_cost(all): 3:55:13/1 day, 1:25:07, loss=0.55590984285989, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.8576380155394756, lr=0.4759493001509836
2023-12-12 19:40:35   INFO  epoch: 3/24, acc_iter=12836, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:48:48, time_cost(all): 3:56:08/23:56:31, loss=0.555722798242802, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.9298540616739777, lr=0.4756073989248802
2023-12-12 19:41:31   INFO  epoch: 3/24, acc_iter=12886, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:48:50, time_cost(all): 3:57:04/1 day, 1:26:24, loss=0.555535753625713, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=4.244834633115124, lr=0.4752654976987768
2023-12-12 19:42:26   INFO  epoch: 3/24, acc_iter=12936, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:26, time_cost(all): 3:57:59/1 day, 0:42:05, loss=0.555348709008625, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.21(1.03), norm=4.044868622340107, lr=0.47492359647267346
2023-12-12 19:43:22   INFO  epoch: 3/24, acc_iter=12986, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:43:23, time_cost(all): 3:58:55/1 day, 1:06:30, loss=0.555161664391536, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=3.255843710345744, lr=0.47458169524657007
2023-12-12 19:44:17   INFO  epoch: 3/24, acc_iter=13036, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:42:55, time_cost(all): 3:59:50/1 day, 1:28:38, loss=0.554974619774448, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=3.94667873492181, lr=0.4742397940204667
2023-12-12 19:45:12   INFO  epoch: 3/24, acc_iter=13086, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:41:52, time_cost(all): 4:00:45/1 day, 0:57:20, loss=0.554787575157359, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.2(1.03), norm=4.238761402949117, lr=0.4738978927943633
2023-12-12 19:46:08   INFO  epoch: 3/24, acc_iter=13136, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:44, time_cost(all): 4:01:41/23:24:12, loss=0.554600530540271, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=1.2360964946313464, lr=0.4735559915682599
2023-12-12 19:47:03   INFO  epoch: 3/24, acc_iter=13186, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:27, time_cost(all): 4:02:36/1 day, 0:55:38, loss=0.554413485923182, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=4.810284546791869, lr=0.4732140903421565
2023-12-12 19:47:58   INFO  epoch: 3/24, acc_iter=13236, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:39, time_cost(all): 4:03:31/1 day, 1:19:58, loss=0.554226441306094, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=4.735451801310229, lr=0.4728721891160531
2023-12-12 19:48:54   INFO  epoch: 3/24, acc_iter=13286, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:43, time_cost(all): 4:04:27/23:32:16, loss=0.554039396689005, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.09421489260061, lr=0.4725302878899498
2023-12-12 19:49:49   INFO  epoch: 3/24, acc_iter=13336, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:06, time_cost(all): 4:05:22/1 day, 0:43:35, loss=0.553852352071917, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=0.7265403743132247, lr=0.4721883866638464
2023-12-12 19:50:44   INFO  epoch: 3/24, acc_iter=13386, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:43, time_cost(all): 4:06:17/1 day, 0:57:44, loss=0.553665307454828, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.07(1.03), norm=2.0571284958598692, lr=0.471846485437743
2023-12-12 19:51:40   INFO  epoch: 3/24, acc_iter=13436, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:35:42, time_cost(all): 4:07:13/1 day, 0:10:47, loss=0.553478262837739, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=3.811312903162936, lr=0.4715045842116396
2023-12-12 19:52:35   INFO  epoch: 3/24, acc_iter=13486, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:50, time_cost(all): 4:08:08/23:45:39, loss=0.553291218220651, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=4.24619934596763, lr=0.4711626829855362
2023-12-12 19:53:30   INFO  epoch: 3/24, acc_iter=13536, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:38, time_cost(all): 4:09:03/23:44:02, loss=0.553104173603562, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.1221659882521717, lr=0.4708207817594328
2023-12-12 19:54:26   INFO  epoch: 3/24, acc_iter=13586, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:10, time_cost(all): 4:09:59/1 day, 1:31:26, loss=0.552917128986474, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=1.5455015324732575, lr=0.4704788805333294
2023-12-12 19:55:21   INFO  epoch: 3/24, acc_iter=13636, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:35:01, time_cost(all): 4:10:54/1 day, 0:15:42, loss=0.552730084369385, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.19(1.03), norm=1.0997609204144834, lr=0.4701369793072261
2023-12-12 19:56:16   INFO  epoch: 3/24, acc_iter=13686, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:27, time_cost(all): 4:11:49/1 day, 1:22:34, loss=0.552543039752297, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=2.6127569062481846, lr=0.4697950780811227
2023-12-12 19:57:12   INFO  epoch: 3/24, acc_iter=13736, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:31:18, time_cost(all): 4:12:45/1 day, 1:28:47, loss=0.552355995135208, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.87(1.03), norm=2.3628402532659285, lr=0.4694531768550193
2023-12-12 19:58:07   INFO  epoch: 3/24, acc_iter=13786, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:05, time_cost(all): 4:13:40/1 day, 0:49:25, loss=0.55216895051812, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=1.3929646312685282, lr=0.4691112756289159
2023-12-12 19:59:02   INFO  epoch: 3/24, acc_iter=13836, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:30, time_cost(all): 4:14:35/23:45:18, loss=0.551981905901031, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=1.7663713222429425, lr=0.4687693744028125
2023-12-12 19:59:58   INFO  epoch: 3/24, acc_iter=13886, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:45, time_cost(all): 4:15:31/1 day, 0:44:45, loss=0.551794861283943, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=4.79303112267034, lr=0.46842747317670913
2023-12-12 20:00:53   INFO  epoch: 3/24, acc_iter=13936, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:57, time_cost(all): 4:16:26/1 day, 0:06:33, loss=0.551607816666854, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.18(1.03), norm=3.982465843047463, lr=0.46808557195060574
2023-12-12 20:01:48   INFO  epoch: 3/24, acc_iter=13986, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:57, time_cost(all): 4:17:21/23:56:01, loss=0.551420772049765, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.95(1.03), norm=4.859284908456984, lr=0.4677436707245024
2023-12-12 20:02:44   INFO  epoch: 3/24, acc_iter=14036, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:00, time_cost(all): 4:18:17/1 day, 0:02:39, loss=0.551233727432677, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.252852219129236, lr=0.467401769498399
2023-12-12 20:03:39   INFO  epoch: 3/24, acc_iter=14086, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:32, time_cost(all): 4:19:12/1 day, 1:06:08, loss=0.551046682815588, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=2.6473373678515335, lr=0.4670598682722956
2023-12-12 20:04:34   INFO  epoch: 3/24, acc_iter=14136, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:09, time_cost(all): 4:20:07/1 day, 0:42:21, loss=0.5508596381985, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=2.427528069691311, lr=0.46671796704619223
2023-12-12 20:05:30   INFO  epoch: 3/24, acc_iter=14186, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:18, time_cost(all): 4:21:03/1 day, 1:21:03, loss=0.550672593581411, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=2.203655102270308, lr=0.46637606582008884
2023-12-12 20:06:25   INFO  epoch: 3/24, acc_iter=14236, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:53, time_cost(all): 4:21:58/1 day, 0:07:24, loss=0.550485548964323, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=4.766714917301817, lr=0.46603416459398544
2023-12-12 20:07:21   INFO  epoch: 3/24, acc_iter=14286, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:43, time_cost(all): 4:22:54/23:20:38, loss=0.550298504347234, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.936862826507176, lr=0.46569226336788205
2023-12-12 20:08:16   INFO  epoch: 3/24, acc_iter=14336, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:31, time_cost(all): 4:23:49/1 day, 0:27:37, loss=0.550111459730146, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.9(1.03), norm=1.1842183799151278, lr=0.4653503621417787
2023-12-12 20:09:11   INFO  epoch: 3/24, acc_iter=14386, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:10, time_cost(all): 4:24:44/1 day, 0:07:16, loss=0.549924415113057, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=4.415795240830561, lr=0.4650084609156753
2023-12-12 20:10:07   INFO  epoch: 3/24, acc_iter=14436, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:28, time_cost(all): 4:25:40/23:20:13, loss=0.549737370495969, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=1.0387833745483657, lr=0.46466655968957193
2023-12-12 20:11:02   INFO  epoch: 3/24, acc_iter=14486, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:27, time_cost(all): 4:26:35/1 day, 1:03:24, loss=0.54955032587888, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.05(1.03), norm=2.6212976909135857, lr=0.46432465846346854
2023-12-12 20:11:57   INFO  epoch: 3/24, acc_iter=14536, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:59, time_cost(all): 4:27:30/23:05:48, loss=0.549363281261792, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=1.9188999680663112, lr=0.46398275723736515
2023-12-12 20:12:53   INFO  epoch: 3/24, acc_iter=14586, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:21, time_cost(all): 4:28:26/23:24:56, loss=0.549176236644703, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=0.7886011838022984, lr=0.46364085601126176
2023-12-12 20:13:48   INFO  epoch: 3/24, acc_iter=14636, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:26, time_cost(all): 4:29:21/1 day, 0:43:05, loss=0.548989192027615, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.7160965843651, lr=0.46329895478515837
2023-12-12 20:14:43   INFO  epoch: 3/24, acc_iter=14686, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:29, time_cost(all): 4:30:16/1 day, 1:09:08, loss=0.548802147410526, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.19(1.03), norm=3.711584603352908, lr=0.46295705355905503
2023-12-12 20:15:39   INFO  epoch: 3/24, acc_iter=14736, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:42, time_cost(all): 4:31:12/23:05:20, loss=0.548615102793438, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.09(1.03), norm=2.3456639139180675, lr=0.46261515233295164
2023-12-12 20:16:34   INFO  epoch: 3/24, acc_iter=14786, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:11, time_cost(all): 4:32:07/1 day, 0:33:29, loss=0.548428058176349, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.1(1.03), norm=1.304513943390318, lr=0.46227325110684825
2023-12-12 20:17:29   INFO  epoch: 3/24, acc_iter=14836, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:13, time_cost(all): 4:33:02/23:49:47, loss=0.54824101355926, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.0(1.03), norm=2.994068235336224, lr=0.46193134988074486
2023-12-12 20:18:25   INFO  epoch: 3/24, acc_iter=14886, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:46, time_cost(all): 4:33:58/23:46:02, loss=0.548053968942172, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.109024643992007, lr=0.46158944865464147
2023-12-12 20:19:20   INFO  epoch: 3/24, acc_iter=14936, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:07, time_cost(all): 4:34:53/23:01:59, loss=0.547866924325083, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.8285747763274083, lr=0.4612475474285381
2023-12-12 20:20:15   INFO  epoch: 3/24, acc_iter=14986, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:18, time_cost(all): 4:35:48/1 day, 0:27:16, loss=0.547679879707995, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.237891930313207, lr=0.4609056462024347
2023-12-12 20:21:11   INFO  epoch: 3/24, acc_iter=15036, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:16, time_cost(all): 4:36:44/1 day, 0:13:34, loss=0.547492835090906, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=3.376868011958979, lr=0.46056374497633135
2023-12-12 20:22:06   INFO  epoch: 3/24, acc_iter=15086, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:54, time_cost(all): 4:37:39/23:21:10, loss=0.547305790473818, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.21(1.03), norm=0.9029296454415976, lr=0.46022184375022795
2023-12-12 20:23:01   INFO  epoch: 3/24, acc_iter=15136, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:30, time_cost(all): 4:38:34/22:57:25, loss=0.547118745856729, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.87(1.03), norm=1.015680528132762, lr=0.45987994252412456
2023-12-12 20:23:57   INFO  epoch: 3/24, acc_iter=15186, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:54, time_cost(all): 4:39:30/23:07:06, loss=0.546931701239641, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.98(1.03), norm=1.5149098098219898, lr=0.45953804129802117
2023-12-12 20:24:52   INFO  epoch: 3/24, acc_iter=15236, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:45, time_cost(all): 4:40:25/1 day, 0:50:45, loss=0.546744656622552, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.724559001048328, lr=0.4591961400719178
2023-12-12 20:25:47   INFO  epoch: 3/24, acc_iter=15286, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:01, time_cost(all): 4:41:20/1 day, 0:34:19, loss=0.546557612005464, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=3.3305351786974366, lr=0.4588542388458144
2023-12-12 20:26:43   INFO  epoch: 3/24, acc_iter=15336, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:04, time_cost(all): 4:42:16/1 day, 0:46:11, loss=0.546370567388375, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=1.975128924264212, lr=0.458512337619711
2023-12-12 20:27:38   INFO  epoch: 3/24, acc_iter=15386, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:11, time_cost(all): 4:43:11/1 day, 0:50:22, loss=0.546183522771287, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=1.9328106473384121, lr=0.45817043639360766
2023-12-12 20:28:34   INFO  epoch: 3/24, acc_iter=15436, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:12, time_cost(all): 4:44:07/1 day, 0:20:17, loss=0.545996478154198, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.5487940243790366, lr=0.45782853516750427
2023-12-12 20:29:29   INFO  epoch: 4/24, acc_iter=15498, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:11:23, time_cost(all): 4:45:02/22:56:11, loss=0.545764542829008, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.976068190558701, lr=0.45740457764713605
2023-12-12 20:30:24   INFO  epoch: 4/24, acc_iter=15548, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:09:34, time_cost(all): 4:45:57/23:12:34, loss=0.54557749821192, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.93(1.03), norm=4.1250348853065635, lr=0.45706267642103265
2023-12-12 20:31:20   INFO  epoch: 4/24, acc_iter=15598, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:10:23, time_cost(all): 4:46:53/23:00:29, loss=0.545390453594831, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.8990901839594745, lr=0.45672077519492926
2023-12-12 20:32:15   INFO  epoch: 4/24, acc_iter=15648, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:06:57, time_cost(all): 4:47:48/23:12:04, loss=0.545203408977743, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=3.709857694477287, lr=0.4563788739688259
2023-12-12 20:33:10   INFO  epoch: 4/24, acc_iter=15698, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:07:17, time_cost(all): 4:48:43/22:55:18, loss=0.545016364360654, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=2.538762933114739, lr=0.45603697274272254
2023-12-12 20:34:06   INFO  epoch: 4/24, acc_iter=15748, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:58, time_cost(all): 4:49:39/22:46:30, loss=0.544829319743566, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=4.1342978738094445, lr=0.45569507151661914
2023-12-12 20:35:01   INFO  epoch: 4/24, acc_iter=15798, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:02:48, time_cost(all): 4:50:34/23:52:44, loss=0.544642275126477, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=2.8651432712666405, lr=0.45535317029051575
2023-12-12 20:35:56   INFO  epoch: 4/24, acc_iter=15848, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:06:08, time_cost(all): 4:51:29/23:22:21, loss=0.544455230509388, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=2.82180179896818, lr=0.45501126906441236
2023-12-12 20:36:52   INFO  epoch: 4/24, acc_iter=15898, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:03:30, time_cost(all): 4:52:25/1 day, 0:46:33, loss=0.5442681858923, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.01(1.03), norm=3.98837260087921, lr=0.45466936783830897
2023-12-12 20:37:47   INFO  epoch: 4/24, acc_iter=15948, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:05:02, time_cost(all): 4:53:20/23:47:12, loss=0.544081141275211, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.93(1.03), norm=4.334786550784736, lr=0.4543274666122056
2023-12-12 20:38:42   INFO  epoch: 4/24, acc_iter=15998, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:06, time_cost(all): 4:54:15/22:38:33, loss=0.543894096658123, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=0.8767738687900741, lr=0.45398556538610224
2023-12-12 20:39:38   INFO  epoch: 4/24, acc_iter=16048, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:00:46, time_cost(all): 4:55:11/1 day, 0:11:07, loss=0.543707052041034, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.2568885371198, lr=0.45364366415999885
2023-12-12 20:40:33   INFO  epoch: 4/24, acc_iter=16098, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:58:47, time_cost(all): 4:56:06/23:44:54, loss=0.543520007423946, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=1.233917136392651, lr=0.45330176293389546
2023-12-12 20:41:28   INFO  epoch: 4/24, acc_iter=16148, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:58:33, time_cost(all): 4:57:01/1 day, 0:39:43, loss=0.543332962806857, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.027789328532121, lr=0.45295986170779207
2023-12-12 20:42:24   INFO  epoch: 4/24, acc_iter=16198, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:37, time_cost(all): 4:57:57/23:44:32, loss=0.543145918189769, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=3.235915528784431, lr=0.4526179604816887
2023-12-12 20:43:19   INFO  epoch: 4/24, acc_iter=16248, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:57:41, time_cost(all): 4:58:52/1 day, 0:15:57, loss=0.54295887357268, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.17(1.03), norm=4.301682463013792, lr=0.4522760592555853
2023-12-12 20:44:14   INFO  epoch: 4/24, acc_iter=16298, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:56:53, time_cost(all): 4:59:47/23:27:31, loss=0.542771828955592, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.03(1.03), norm=2.5578458834701276, lr=0.4519341580294819
2023-12-12 20:45:10   INFO  epoch: 4/24, acc_iter=16348, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:52:33, time_cost(all): 5:00:43/23:48:32, loss=0.542584784338503, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.91(1.03), norm=2.643400728871668, lr=0.45159225680337856
2023-12-12 20:46:05   INFO  epoch: 4/24, acc_iter=16398, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:29, time_cost(all): 5:01:38/1 day, 0:09:36, loss=0.542397739721415, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=3.757144553445072, lr=0.45125035557727516
2023-12-12 20:47:00   INFO  epoch: 4/24, acc_iter=16448, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:53:01, time_cost(all): 5:02:33/23:51:45, loss=0.542210695104326, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=0.5704861638315397, lr=0.4509084543511718
2023-12-12 20:47:56   INFO  epoch: 4/24, acc_iter=16498, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:59, time_cost(all): 5:03:29/22:24:32, loss=0.542023650487238, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.85437250446776, lr=0.4505665531250684
2023-12-12 20:48:51   INFO  epoch: 4/24, acc_iter=16548, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:54, time_cost(all): 5:04:24/1 day, 0:25:21, loss=0.541836605870149, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=1.107245131497119, lr=0.450224651898965
2023-12-12 20:49:47   INFO  epoch: 4/24, acc_iter=16598, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:34, time_cost(all): 5:05:20/22:33:34, loss=0.54164956125306, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=3.918346064323592, lr=0.4498827506728616
2023-12-12 20:50:42   INFO  epoch: 4/24, acc_iter=16648, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:51:27, time_cost(all): 5:06:15/1 day, 0:01:13, loss=0.541462516635972, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=4.817182571785061, lr=0.4495408494467582
2023-12-12 20:51:37   INFO  epoch: 4/24, acc_iter=16698, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:46:02, time_cost(all): 5:07:10/1 day, 0:01:26, loss=0.541275472018883, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=4.315975208479234, lr=0.44919894822065487
2023-12-12 20:52:33   INFO  epoch: 4/24, acc_iter=16748, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:45:05, time_cost(all): 5:08:06/1 day, 0:02:17, loss=0.541088427401795, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=1.0325182318589978, lr=0.4488570469945515
2023-12-12 20:53:28   INFO  epoch: 4/24, acc_iter=16798, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:36, time_cost(all): 5:09:01/23:56:47, loss=0.540901382784706, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=3.8058249887345474, lr=0.4485151457684481
2023-12-12 20:54:23   INFO  epoch: 4/24, acc_iter=16848, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:47, time_cost(all): 5:09:56/23:12:18, loss=0.540714338167618, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=0.8673750077150673, lr=0.4481732445423447
2023-12-12 20:55:19   INFO  epoch: 4/24, acc_iter=16898, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:46:12, time_cost(all): 5:10:52/22:33:31, loss=0.540527293550529, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=4.761532317906218, lr=0.4478313433162413
2023-12-12 20:56:14   INFO  epoch: 4/24, acc_iter=16948, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:18, time_cost(all): 5:11:47/1 day, 0:20:08, loss=0.540340248933441, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.95(1.03), norm=4.839287783228107, lr=0.4474894420901379
2023-12-12 20:57:09   INFO  epoch: 4/24, acc_iter=16998, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:44:34, time_cost(all): 5:12:42/23:33:17, loss=0.540153204316352, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=1.6702867328728337, lr=0.4471475408640345
2023-12-12 20:58:05   INFO  epoch: 4/24, acc_iter=17048, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:47, time_cost(all): 5:13:38/22:58:58, loss=0.539966159699264, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=1.983040896140132, lr=0.4468056396379312
2023-12-12 20:59:00   INFO  epoch: 4/24, acc_iter=17098, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:57, time_cost(all): 5:14:33/23:14:55, loss=0.539779115082175, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.6456477431242136, lr=0.4464637384118278
2023-12-12 20:59:55   INFO  epoch: 4/24, acc_iter=17148, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:48, time_cost(all): 5:15:28/23:12:19, loss=0.539592070465087, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=1.8109133707426848, lr=0.4461218371857244
2023-12-12 21:00:51   INFO  epoch: 4/24, acc_iter=17198, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:39:34, time_cost(all): 5:16:24/22:44:31, loss=0.539405025847998, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=4.577251327899645, lr=0.445779935959621
2023-12-12 21:01:46   INFO  epoch: 4/24, acc_iter=17248, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:38:17, time_cost(all): 5:17:19/23:40:08, loss=0.539217981230909, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.23(1.03), norm=2.7164588953346653, lr=0.4454380347335176
2023-12-12 21:02:41   INFO  epoch: 4/24, acc_iter=17298, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:36, time_cost(all): 5:18:14/23:35:39, loss=0.539030936613821, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.12(1.03), norm=4.433528779188822, lr=0.4450961335074142
2023-12-12 21:03:37   INFO  epoch: 4/24, acc_iter=17348, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:46, time_cost(all): 5:19:10/23:39:29, loss=0.538843891996732, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=1.2902249076679826, lr=0.44475423228131084
2023-12-12 21:04:32   INFO  epoch: 4/24, acc_iter=17398, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:02, time_cost(all): 5:20:05/1 day, 0:03:57, loss=0.538656847379644, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.14(1.03), norm=2.3640743191218565, lr=0.4444123310552075
2023-12-12 21:05:27   INFO  epoch: 4/24, acc_iter=17448, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:14, time_cost(all): 5:21:00/23:37:25, loss=0.538469802762555, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=4.922566210767817, lr=0.4440704298291041
2023-12-12 21:06:23   INFO  epoch: 4/24, acc_iter=17498, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:14, time_cost(all): 5:21:56/23:21:50, loss=0.538282758145467, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=0.9656922071070289, lr=0.4437285286030007
2023-12-12 21:07:18   INFO  epoch: 4/24, acc_iter=17548, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:46, time_cost(all): 5:22:51/22:33:41, loss=0.538095713528378, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.4151725182101034, lr=0.4433866273768973
2023-12-12 21:08:13   INFO  epoch: 4/24, acc_iter=17598, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:57, time_cost(all): 5:23:46/22:39:45, loss=0.53790866891129, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=0.5865982140026622, lr=0.44304472615079393
2023-12-12 21:09:09   INFO  epoch: 4/24, acc_iter=17648, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:06, time_cost(all): 5:24:42/22:44:17, loss=0.537721624294201, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.11(1.03), norm=3.245802319283371, lr=0.44270282492469054
2023-12-12 21:10:04   INFO  epoch: 4/24, acc_iter=17698, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:50, time_cost(all): 5:25:37/21:59:40, loss=0.537534579677113, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=1.0100365501181887, lr=0.44236092369858715
2023-12-12 21:11:00   INFO  epoch: 4/24, acc_iter=17748, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:06, time_cost(all): 5:26:33/22:36:56, loss=0.537347535060024, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.182364448412158, lr=0.4420190224724838
2023-12-12 21:11:55   INFO  epoch: 4/24, acc_iter=17798, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:44, time_cost(all): 5:27:28/23:23:59, loss=0.537160490442936, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.18(1.03), norm=4.614950458809399, lr=0.4416771212463804
2023-12-12 21:12:50   INFO  epoch: 4/24, acc_iter=17848, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:00, time_cost(all): 5:28:23/23:57:38, loss=0.536973445825847, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.726788724439049, lr=0.44133522002027703
2023-12-12 21:13:46   INFO  epoch: 4/24, acc_iter=17898, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:27:15, time_cost(all): 5:29:19/23:32:01, loss=0.536786401208759, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=3.9895661306236834, lr=0.44099331879417364
2023-12-12 21:14:41   INFO  epoch: 4/24, acc_iter=17948, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:00, time_cost(all): 5:30:14/23:00:31, loss=0.53659935659167, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.96(1.03), norm=2.451100954911798, lr=0.44065141756807025
2023-12-12 21:15:36   INFO  epoch: 4/24, acc_iter=17998, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:59, time_cost(all): 5:31:09/22:19:09, loss=0.536412311974581, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.23(1.03), norm=1.3191652587888467, lr=0.44030951634196686
2023-12-12 21:16:32   INFO  epoch: 4/24, acc_iter=18048, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:31, time_cost(all): 5:32:05/23:22:07, loss=0.536225267357493, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=2.4617896446070313, lr=0.43996761511586346
2023-12-12 21:17:27   INFO  epoch: 4/24, acc_iter=18098, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:57, time_cost(all): 5:33:00/23:58:58, loss=0.536038222740404, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.15(1.03), norm=1.1672940725552625, lr=0.43962571388976013
2023-12-12 21:18:22   INFO  epoch: 4/24, acc_iter=18148, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:30, time_cost(all): 5:33:55/1 day, 0:01:11, loss=0.535851178123316, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.21(1.03), norm=3.052885965651858, lr=0.43928381266365674
2023-12-12 21:19:18   INFO  epoch: 4/24, acc_iter=18198, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:31, time_cost(all): 5:34:51/21:54:35, loss=0.535664133506227, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=2.646908572454943, lr=0.43894191143755334
2023-12-12 21:20:13   INFO  epoch: 4/24, acc_iter=18248, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:09, time_cost(all): 5:35:46/22:17:07, loss=0.535477088889139, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.0(1.03), norm=4.29944677942428, lr=0.43860001021144995
2023-12-12 21:21:08   INFO  epoch: 4/24, acc_iter=18298, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:25, time_cost(all): 5:36:41/22:27:20, loss=0.53529004427205, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=1.6322327829895333, lr=0.43825810898534656
2023-12-12 21:22:04   INFO  epoch: 4/24, acc_iter=18348, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:21, time_cost(all): 5:37:37/22:40:22, loss=0.535102999654962, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.9(1.03), norm=3.6177862162168104, lr=0.43791620775924317
2023-12-12 21:22:59   INFO  epoch: 4/24, acc_iter=18398, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:26, time_cost(all): 5:38:32/23:26:31, loss=0.534915955037873, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.23(1.03), norm=3.213982694610674, lr=0.4375743065331398
2023-12-12 21:23:54   INFO  epoch: 4/24, acc_iter=18448, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:41, time_cost(all): 5:39:27/22:07:59, loss=0.534728910420785, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.92(1.03), norm=3.372818795313952, lr=0.43723240530703644
2023-12-12 21:24:50   INFO  epoch: 4/24, acc_iter=18498, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:04, time_cost(all): 5:40:23/23:10:39, loss=0.534541865803696, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.1750220560962124, lr=0.43689050408093305
2023-12-12 21:25:45   INFO  epoch: 4/24, acc_iter=18548, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:42, time_cost(all): 5:41:18/21:52:24, loss=0.534354821186608, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=3.5402748055311153, lr=0.43654860285482966
2023-12-12 21:26:40   INFO  epoch: 4/24, acc_iter=18598, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:52, time_cost(all): 5:42:13/23:21:32, loss=0.534167776569519, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=4.677911376575897, lr=0.43620670162872627
2023-12-12 21:27:36   INFO  epoch: 4/24, acc_iter=18648, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:16, time_cost(all): 5:43:09/22:11:19, loss=0.53398073195243, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.92(1.03), norm=4.585985451173849, lr=0.4358648004026229
2023-12-12 21:28:31   INFO  epoch: 4/24, acc_iter=18698, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:30, time_cost(all): 5:44:04/22:26:41, loss=0.533793687335342, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.12(1.03), norm=4.121468017970216, lr=0.43552289917651954
2023-12-12 21:29:26   INFO  epoch: 4/24, acc_iter=18748, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:39, time_cost(all): 5:44:59/22:40:13, loss=0.533606642718253, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.9701260642379284, lr=0.4351809979504161
2023-12-12 21:30:22   INFO  epoch: 4/24, acc_iter=18798, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:43, time_cost(all): 5:45:55/22:56:30, loss=0.533419598101165, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.94(1.03), norm=3.1168597160250964, lr=0.43483909672431276
2023-12-12 21:31:17   INFO  epoch: 4/24, acc_iter=18848, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:55, time_cost(all): 5:46:50/22:09:21, loss=0.533232553484076, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.1(1.03), norm=2.9508922906636332, lr=0.43449719549820937
2023-12-12 21:32:13   INFO  epoch: 4/24, acc_iter=18898, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:24, time_cost(all): 5:47:46/23:09:40, loss=0.533045508866988, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.99(1.03), norm=1.294863296816813, lr=0.434155294272106
2023-12-12 21:33:08   INFO  epoch: 4/24, acc_iter=18948, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:54, time_cost(all): 5:48:41/23:40:55, loss=0.532858464249899, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.93(1.03), norm=4.843382634591476, lr=0.4338133930460026
2023-12-12 21:34:03   INFO  epoch: 4/24, acc_iter=18998, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:41, time_cost(all): 5:49:36/23:14:16, loss=0.532671419632811, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=1.90581595115725, lr=0.4334714918198992
2023-12-12 21:34:59   INFO  epoch: 4/24, acc_iter=19048, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:37, time_cost(all): 5:50:32/23:05:24, loss=0.532484375015722, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=0.7765761173675654, lr=0.4331295905937958
2023-12-12 21:35:54   INFO  epoch: 4/24, acc_iter=19098, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:59, time_cost(all): 5:51:27/22:25:59, loss=0.532297330398634, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.08(1.03), norm=4.587040656241893, lr=0.4327876893676924
2023-12-12 21:36:49   INFO  epoch: 4/24, acc_iter=19148, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:57, time_cost(all): 5:52:22/23:29:43, loss=0.532110285781545, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.99(1.03), norm=1.1720094369618779, lr=0.43244578814158907
2023-12-12 21:37:45   INFO  epoch: 4/24, acc_iter=19198, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 5:53:18/21:49:36, loss=0.531923241164457, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=2.9465860133103434, lr=0.4321038869154857
2023-12-12 21:38:40   INFO  epoch: 4/24, acc_iter=19248, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:10, time_cost(all): 5:54:13/22:34:57, loss=0.531736196547368, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=3.422227999929924, lr=0.4317619856893823
2023-12-12 21:39:35   INFO  epoch: 4/24, acc_iter=19298, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 5:55:08/23:38:11, loss=0.53154915193028, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.17(1.03), norm=1.9650172391654532, lr=0.4314200844632789
2023-12-12 21:40:31   INFO  epoch: 5/24, acc_iter=19360, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:13:21, time_cost(all): 5:56:04/23:05:41, loss=0.53131721660509, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=0.7186656510556461, lr=0.4309961269429107
2023-12-12 21:41:26   INFO  epoch: 5/24, acc_iter=19410, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:50, time_cost(all): 5:56:59/23:18:28, loss=0.531130171988001, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=1.7612038640379681, lr=0.43065422571680734
2023-12-12 21:42:21   INFO  epoch: 5/24, acc_iter=19460, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:05:40, time_cost(all): 5:57:54/22:59:13, loss=0.530943127370913, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=2.7513149315957026, lr=0.43031232449070395
2023-12-12 21:43:17   INFO  epoch: 5/24, acc_iter=19510, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:10:36, time_cost(all): 5:58:50/21:27:18, loss=0.530756082753824, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=1.73629963398175, lr=0.42997042326460055
2023-12-12 21:44:12   INFO  epoch: 5/24, acc_iter=19560, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:05:38, time_cost(all): 5:59:45/23:12:59, loss=0.530569038136736, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=1.7376066374678596, lr=0.42962852203849716
2023-12-12 21:45:07   INFO  epoch: 5/24, acc_iter=19610, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:03, time_cost(all): 6:00:40/21:47:51, loss=0.530381993519647, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.06(1.03), norm=1.2476225948721225, lr=0.42928662081239377
2023-12-12 21:46:03   INFO  epoch: 5/24, acc_iter=19660, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:06:36, time_cost(all): 6:01:36/21:21:33, loss=0.530194948902559, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=4.849814835037962, lr=0.42894471958629043
2023-12-12 21:46:58   INFO  epoch: 5/24, acc_iter=19710, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:06, time_cost(all): 6:02:31/22:02:24, loss=0.53000790428547, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.14(1.03), norm=0.9780237612535088, lr=0.428602818360187
2023-12-12 21:47:53   INFO  epoch: 5/24, acc_iter=19760, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:16, time_cost(all): 6:03:26/22:43:15, loss=0.529820859668381, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.89(1.03), norm=4.626819280052677, lr=0.42826091713408365
2023-12-12 21:48:49   INFO  epoch: 5/24, acc_iter=19810, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/0:59:07, time_cost(all): 6:04:22/22:34:33, loss=0.529633815051293, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.07(1.03), norm=2.3203052351573943, lr=0.42791901590798026
2023-12-12 21:49:44   INFO  epoch: 5/24, acc_iter=19860, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:43, time_cost(all): 6:05:17/21:57:36, loss=0.529446770434204, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.1(1.03), norm=4.407207405306158, lr=0.42757711468187687
2023-12-12 21:50:39   INFO  epoch: 5/24, acc_iter=19910, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:00:01, time_cost(all): 6:06:12/21:55:55, loss=0.529259725817116, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.87(1.03), norm=3.1370758624957746, lr=0.4272352134557735
2023-12-12 21:51:35   INFO  epoch: 5/24, acc_iter=19960, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:57:19, time_cost(all): 6:07:08/21:33:41, loss=0.529072681200027, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.85(1.03), norm=2.9789446642786284, lr=0.4268933122296701
2023-12-12 21:52:30   INFO  epoch: 5/24, acc_iter=20010, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/1:00:44, time_cost(all): 6:08:03/22:09:09, loss=0.528885636582939, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.2853654228739226, lr=0.4265514110035667
2023-12-12 21:53:26   INFO  epoch: 5/24, acc_iter=20060, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:55:19, time_cost(all): 6:08:59/21:16:26, loss=0.52869859196585, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=1.1887848221327435, lr=0.4262095097774633
2023-12-12 21:54:21   INFO  epoch: 5/24, acc_iter=20110, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:59:15, time_cost(all): 6:09:54/21:27:41, loss=0.528511547348762, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=3.684656750169349, lr=0.42586760855135997
2023-12-12 21:55:16   INFO  epoch: 5/24, acc_iter=20160, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:55:12, time_cost(all): 6:10:49/21:30:58, loss=0.528324502731673, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.01(1.03), norm=3.648658894857483, lr=0.4255257073252566
2023-12-12 21:56:12   INFO  epoch: 5/24, acc_iter=20210, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:55:53, time_cost(all): 6:11:45/22:50:28, loss=0.528137458114585, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=4.945033882117419, lr=0.4251838060991532
2023-12-12 21:57:07   INFO  epoch: 5/24, acc_iter=20260, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:42, time_cost(all): 6:12:40/23:10:26, loss=0.527950413497496, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.0832058391963977, lr=0.4248419048730498
2023-12-12 21:58:02   INFO  epoch: 5/24, acc_iter=20310, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:33, time_cost(all): 6:13:35/21:30:37, loss=0.527763368880408, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=0.8842525300595272, lr=0.4245000036469464
2023-12-12 21:58:58   INFO  epoch: 5/24, acc_iter=20360, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:51:29, time_cost(all): 6:14:31/22:02:12, loss=0.527576324263319, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=3.135721301228913, lr=0.424158102420843
2023-12-12 21:59:53   INFO  epoch: 5/24, acc_iter=20410, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:48:55, time_cost(all): 6:15:26/21:13:56, loss=0.527389279646231, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=1.4456157678470687, lr=0.4238162011947396
2023-12-12 22:00:48   INFO  epoch: 5/24, acc_iter=20460, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:47:39, time_cost(all): 6:16:21/21:41:09, loss=0.527202235029142, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=1.3769187238921077, lr=0.4234742999686363
2023-12-12 22:01:44   INFO  epoch: 5/24, acc_iter=20510, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:48:56, time_cost(all): 6:17:17/22:19:56, loss=0.527015190412053, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=3.493768616821318, lr=0.4231323987425329
2023-12-12 22:02:39   INFO  epoch: 5/24, acc_iter=20560, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:58, time_cost(all): 6:18:12/21:59:44, loss=0.526828145794965, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.11(1.03), norm=3.380801669407491, lr=0.4227904975164295
2023-12-12 22:03:34   INFO  epoch: 5/24, acc_iter=20610, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:46:26, time_cost(all): 6:19:07/21:58:47, loss=0.526641101177876, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.89(1.03), norm=1.6070059805514552, lr=0.4224485962903261
2023-12-12 22:04:30   INFO  epoch: 5/24, acc_iter=20660, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:43, time_cost(all): 6:20:03/22:06:47, loss=0.526454056560788, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.96(1.03), norm=1.1399142315359627, lr=0.4221066950642227
2023-12-12 22:05:25   INFO  epoch: 5/24, acc_iter=20710, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:13, time_cost(all): 6:20:58/22:18:29, loss=0.526267011943699, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=2.5561378384972375, lr=0.4217647938381193
2023-12-12 22:06:20   INFO  epoch: 5/24, acc_iter=20760, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:42:55, time_cost(all): 6:21:53/21:23:18, loss=0.526079967326611, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.01(1.03), norm=4.3224399886617215, lr=0.42142289261201593
2023-12-12 22:07:16   INFO  epoch: 5/24, acc_iter=20810, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:09, time_cost(all): 6:22:49/21:48:56, loss=0.525892922709522, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=1.3548654314936077, lr=0.4210809913859126
2023-12-12 22:08:11   INFO  epoch: 5/24, acc_iter=20860, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:53, time_cost(all): 6:23:44/21:49:48, loss=0.525705878092434, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=3.9110283781624577, lr=0.4207390901598092
2023-12-12 22:09:06   INFO  epoch: 5/24, acc_iter=20910, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:40:56, time_cost(all): 6:24:39/22:02:32, loss=0.525518833475345, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=1.715092319980179, lr=0.4203971889337058
2023-12-12 22:10:02   INFO  epoch: 5/24, acc_iter=20960, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:39:33, time_cost(all): 6:25:35/22:55:08, loss=0.525331788858257, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=2.884053142280435, lr=0.4200552877076024
2023-12-12 22:10:57   INFO  epoch: 5/24, acc_iter=21010, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:43, time_cost(all): 6:26:30/22:52:18, loss=0.525144744241168, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=4.041776964544978, lr=0.41971338648149903
2023-12-12 22:11:52   INFO  epoch: 5/24, acc_iter=21060, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:51, time_cost(all): 6:27:25/21:58:49, loss=0.52495769962408, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.312676971835819, lr=0.4193714852553957
2023-12-12 22:12:48   INFO  epoch: 5/24, acc_iter=21110, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:41, time_cost(all): 6:28:21/21:35:38, loss=0.524770655006991, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=1.8988117153954764, lr=0.41902958402929225
2023-12-12 22:13:43   INFO  epoch: 5/24, acc_iter=21160, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:59, time_cost(all): 6:29:16/22:14:51, loss=0.524583610389902, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=4.472717424259122, lr=0.4186876828031889
2023-12-12 22:14:38   INFO  epoch: 5/24, acc_iter=21210, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:50, time_cost(all): 6:30:11/21:19:47, loss=0.524396565772814, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.7469144926460087, lr=0.4183457815770855
2023-12-12 22:15:34   INFO  epoch: 5/24, acc_iter=21260, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:33:56, time_cost(all): 6:31:07/22:38:00, loss=0.524209521155725, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=4.342215090879749, lr=0.4180038803509821
2023-12-12 22:16:29   INFO  epoch: 5/24, acc_iter=21310, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:32:58, time_cost(all): 6:32:02/21:54:50, loss=0.524022476538637, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.03(1.03), norm=4.167489157218085, lr=0.41766197912487874
2023-12-12 22:17:25   INFO  epoch: 5/24, acc_iter=21360, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:29, time_cost(all): 6:32:58/21:58:21, loss=0.523835431921548, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=3.2987316505499935, lr=0.41732007789877534
2023-12-12 22:18:20   INFO  epoch: 5/24, acc_iter=21410, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:49, time_cost(all): 6:33:53/22:11:11, loss=0.52364838730446, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=2.817254807449971, lr=0.416978176672672
2023-12-12 22:19:15   INFO  epoch: 5/24, acc_iter=21460, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:16, time_cost(all): 6:34:48/22:26:09, loss=0.523461342687371, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=0.504354465247298, lr=0.41663627544656856
2023-12-12 22:20:11   INFO  epoch: 5/24, acc_iter=21510, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:14, time_cost(all): 6:35:44/21:04:58, loss=0.523274298070283, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=1.371787073019441, lr=0.4162943742204652
2023-12-12 22:21:06   INFO  epoch: 5/24, acc_iter=21560, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:50, time_cost(all): 6:36:39/22:23:52, loss=0.523087253453194, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.0(1.03), norm=0.6843157628267502, lr=0.41595247299436183
2023-12-12 22:22:01   INFO  epoch: 5/24, acc_iter=21610, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:30:05, time_cost(all): 6:37:34/20:47:52, loss=0.522900208836106, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=3.5711804191720025, lr=0.41561057176825844
2023-12-12 22:22:57   INFO  epoch: 5/24, acc_iter=21660, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:59, time_cost(all): 6:38:30/22:15:33, loss=0.522713164219017, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=4.230662473467192, lr=0.41526867054215505
2023-12-12 22:23:52   INFO  epoch: 5/24, acc_iter=21710, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:12, time_cost(all): 6:39:25/22:13:04, loss=0.522526119601929, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=3.8542039087207174, lr=0.41492676931605166
2023-12-12 22:24:47   INFO  epoch: 5/24, acc_iter=21760, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:31, time_cost(all): 6:40:20/21:52:37, loss=0.52233907498484, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=4.428022089474394, lr=0.41458486808994827
2023-12-12 22:25:43   INFO  epoch: 5/24, acc_iter=21810, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:45, time_cost(all): 6:41:16/22:27:04, loss=0.522152030367751, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=1.739998305951503, lr=0.4142429668638449
2023-12-12 22:26:38   INFO  epoch: 5/24, acc_iter=21860, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:00, time_cost(all): 6:42:11/21:37:43, loss=0.521964985750663, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=4.885007886693661, lr=0.41390106563774154
2023-12-12 22:27:33   INFO  epoch: 5/24, acc_iter=21910, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:34, time_cost(all): 6:43:06/20:47:04, loss=0.521777941133574, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.6437711666986905, lr=0.41355916441163815
2023-12-12 22:28:29   INFO  epoch: 5/24, acc_iter=21960, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:25, time_cost(all): 6:44:02/22:06:40, loss=0.521590896516486, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.13(1.03), norm=4.876117200688346, lr=0.41321726318553476
2023-12-12 22:29:24   INFO  epoch: 5/24, acc_iter=22010, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:22:00, time_cost(all): 6:44:57/22:34:11, loss=0.521403851899397, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.17(1.03), norm=4.815395798275248, lr=0.41287536195943136
2023-12-12 22:30:19   INFO  epoch: 5/24, acc_iter=22060, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:37, time_cost(all): 6:45:52/20:56:19, loss=0.521216807282309, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.84(1.03), norm=3.387779108387735, lr=0.412533460733328
2023-12-12 22:31:15   INFO  epoch: 5/24, acc_iter=22110, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:24, time_cost(all): 6:46:48/21:08:01, loss=0.52102976266522, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.13(1.03), norm=4.484833648114705, lr=0.4121915595072246
2023-12-12 22:32:10   INFO  epoch: 5/24, acc_iter=22160, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:14, time_cost(all): 6:47:43/21:55:25, loss=0.520842718048132, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=4.414107620301166, lr=0.4118496582811212
2023-12-12 22:33:05   INFO  epoch: 5/24, acc_iter=22210, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:30, time_cost(all): 6:48:38/20:46:04, loss=0.520655673431043, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=0.8719946069456515, lr=0.41150775705501785
2023-12-12 22:34:01   INFO  epoch: 5/24, acc_iter=22260, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:39, time_cost(all): 6:49:34/22:31:14, loss=0.520468628813955, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=2.319941166753959, lr=0.41116585582891446
2023-12-12 22:34:56   INFO  epoch: 5/24, acc_iter=22310, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:19, time_cost(all): 6:50:29/20:50:37, loss=0.520281584196866, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.97(1.03), norm=2.5091598460349287, lr=0.41082395460281107
2023-12-12 22:35:51   INFO  epoch: 5/24, acc_iter=22360, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:35, time_cost(all): 6:51:24/22:40:55, loss=0.520094539579778, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.03(1.03), norm=1.7076149345455123, lr=0.4104820533767077
2023-12-12 22:36:47   INFO  epoch: 5/24, acc_iter=22410, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:44, time_cost(all): 6:52:20/22:17:50, loss=0.519907494962689, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=2.7152787277584016, lr=0.4101401521506043
2023-12-12 22:37:42   INFO  epoch: 5/24, acc_iter=22460, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:00, time_cost(all): 6:53:15/21:56:49, loss=0.519720450345601, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=1.9483735477998037, lr=0.40979825092450095
2023-12-12 22:38:38   INFO  epoch: 5/24, acc_iter=22510, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:42, time_cost(all): 6:54:11/21:16:39, loss=0.519533405728512, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.0549397661272035, lr=0.4094563496983975
2023-12-12 22:39:33   INFO  epoch: 5/24, acc_iter=22560, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:45, time_cost(all): 6:55:06/21:41:57, loss=0.519346361111424, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.05(1.03), norm=0.8928844919134621, lr=0.40911444847229417
2023-12-12 22:40:28   INFO  epoch: 5/24, acc_iter=22610, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:18, time_cost(all): 6:56:01/21:06:51, loss=0.519159316494335, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=1.368873600389066, lr=0.4087725472461908
2023-12-12 22:41:24   INFO  epoch: 5/24, acc_iter=22660, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:11, time_cost(all): 6:56:57/20:44:31, loss=0.518972271877246, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=0.8351883830211573, lr=0.4084306460200874
2023-12-12 22:42:19   INFO  epoch: 5/24, acc_iter=22710, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:06, time_cost(all): 6:57:52/21:15:24, loss=0.518785227260158, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=0.7245026969527866, lr=0.408088744793984
2023-12-12 22:43:14   INFO  epoch: 5/24, acc_iter=22760, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:57, time_cost(all): 6:58:47/20:29:30, loss=0.518598182643069, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=1.5346750098664343, lr=0.4077468435678806
2023-12-12 22:44:10   INFO  epoch: 5/24, acc_iter=22810, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:59, time_cost(all): 6:59:43/21:46:56, loss=0.518411138025981, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.9(1.03), norm=1.8225370786326387, lr=0.40740494234177727
2023-12-12 22:45:05   INFO  epoch: 5/24, acc_iter=22860, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:58, time_cost(all): 7:00:38/22:28:38, loss=0.518224093408892, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=4.439293406888319, lr=0.4070630411156738
2023-12-12 22:46:00   INFO  epoch: 5/24, acc_iter=22910, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:51, time_cost(all): 7:01:33/20:26:09, loss=0.518037048791804, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=3.044547824174237, lr=0.4067211398895705
2023-12-12 22:46:56   INFO  epoch: 5/24, acc_iter=22960, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:04:02, time_cost(all): 7:02:29/21:39:09, loss=0.517850004174715, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=1.8293307103048886, lr=0.4063792386634671
2023-12-12 22:47:51   INFO  epoch: 5/24, acc_iter=23010, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:04, time_cost(all): 7:03:24/20:47:29, loss=0.517662959557627, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.1(1.03), norm=4.6605676157032425, lr=0.4060373374373637
2023-12-12 22:48:46   INFO  epoch: 5/24, acc_iter=23060, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:02, time_cost(all): 7:04:19/21:20:06, loss=0.517475914940538, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=3.2743411781345007, lr=0.4056954362112603
2023-12-12 22:49:42   INFO  epoch: 5/24, acc_iter=23110, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:05, time_cost(all): 7:05:15/20:34:37, loss=0.51728887032345, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.04(1.03), norm=3.546755876019399, lr=0.4053535349851569
2023-12-12 22:50:37   INFO  epoch: 5/24, acc_iter=23160, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 7:06:10/20:39:37, loss=0.517101825706361, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.83(1.03), norm=1.4590696445087596, lr=0.4050116337590535
2023-12-12 22:51:32   INFO  epoch: 6/24, acc_iter=23222, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:11:18, time_cost(all): 7:07:05/21:13:11, loss=0.516869890381171, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=1.1514464513744427, lr=0.40458767623868536
2023-12-12 22:52:28   INFO  epoch: 6/24, acc_iter=23272, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:22, time_cost(all): 7:08:01/20:53:25, loss=0.516682845764083, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=2.010991655791477, lr=0.40424577501258196
2023-12-12 22:53:23   INFO  epoch: 6/24, acc_iter=23322, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:06:23, time_cost(all): 7:08:56/21:50:43, loss=0.516495801146994, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.0282761269855945, lr=0.4039038737864786
2023-12-12 22:54:18   INFO  epoch: 6/24, acc_iter=23372, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:04:11, time_cost(all): 7:09:51/20:20:26, loss=0.516308756529906, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=2.5223783493102836, lr=0.4035619725603752
2023-12-12 22:55:14   INFO  epoch: 6/24, acc_iter=23422, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:07:41, time_cost(all): 7:10:47/20:38:14, loss=0.516121711912817, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=1.2417701401290833, lr=0.40322007133427185
2023-12-12 22:56:09   INFO  epoch: 6/24, acc_iter=23472, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:48, time_cost(all): 7:11:42/21:12:44, loss=0.515934667295729, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.23(1.03), norm=4.770324201142483, lr=0.4028781701081684
2023-12-12 22:57:04   INFO  epoch: 6/24, acc_iter=23522, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:04:50, time_cost(all): 7:12:37/21:10:04, loss=0.51574762267864, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.177417913561401, lr=0.40253626888206506
2023-12-12 22:58:00   INFO  epoch: 6/24, acc_iter=23572, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:05:28, time_cost(all): 7:13:33/21:44:14, loss=0.515560578061552, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.03(1.03), norm=3.2850001298502716, lr=0.40219436765596167
2023-12-12 22:58:55   INFO  epoch: 6/24, acc_iter=23622, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:55, time_cost(all): 7:14:28/20:51:23, loss=0.515373533444463, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=1.9449233877762155, lr=0.4018524664298583
2023-12-12 22:59:51   INFO  epoch: 6/24, acc_iter=23672, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/0:59:20, time_cost(all): 7:15:24/21:16:53, loss=0.515186488827374, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=4.103575369287395, lr=0.4015105652037549
2023-12-12 23:00:46   INFO  epoch: 6/24, acc_iter=23722, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:03:49, time_cost(all): 7:16:19/21:10:18, loss=0.514999444210286, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.15(1.03), norm=4.009742138082963, lr=0.4011686639776515
2023-12-12 23:01:41   INFO  epoch: 6/24, acc_iter=23772, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:57:16, time_cost(all): 7:17:14/20:44:27, loss=0.514812399593197, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.02(1.03), norm=2.0525875705027827, lr=0.40082676275154816
2023-12-12 23:02:37   INFO  epoch: 6/24, acc_iter=23822, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:56:47, time_cost(all): 7:18:10/21:17:28, loss=0.514625354976109, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=0.8084086940119908, lr=0.40048486152544477
2023-12-12 23:03:32   INFO  epoch: 6/24, acc_iter=23872, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/1:01:11, time_cost(all): 7:19:05/20:58:16, loss=0.51443831035902, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.08(1.03), norm=1.2658592479172752, lr=0.4001429602993414
2023-12-12 23:04:27   INFO  epoch: 6/24, acc_iter=23922, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:01, time_cost(all): 7:20:00/20:56:19, loss=0.514251265741932, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=2.288778007704639, lr=0.399801059073238
2023-12-12 23:05:23   INFO  epoch: 6/24, acc_iter=23972, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:58:44, time_cost(all): 7:20:56/21:37:35, loss=0.514064221124843, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.95(1.03), norm=3.4769039195692137, lr=0.3994591578471346
2023-12-12 23:06:18   INFO  epoch: 6/24, acc_iter=24022, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:08, time_cost(all): 7:21:51/21:51:51, loss=0.513877176507755, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=2.084105539960813, lr=0.3991172566210312
2023-12-12 23:07:13   INFO  epoch: 6/24, acc_iter=24072, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:38, time_cost(all): 7:22:46/20:26:25, loss=0.513690131890666, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=3.952320415071455, lr=0.3987753553949278
2023-12-12 23:08:09   INFO  epoch: 6/24, acc_iter=24122, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:54:11, time_cost(all): 7:23:42/20:54:36, loss=0.513503087273578, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.86(1.03), norm=1.70261065851481, lr=0.3984334541688245
2023-12-12 23:09:04   INFO  epoch: 6/24, acc_iter=24172, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:50:30, time_cost(all): 7:24:37/20:24:09, loss=0.513316042656489, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.95(1.03), norm=1.9778645424167138, lr=0.398091552942721
2023-12-12 23:09:59   INFO  epoch: 6/24, acc_iter=24222, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:43, time_cost(all): 7:25:32/21:23:11, loss=0.5131289980394, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.11(1.03), norm=4.539903819024235, lr=0.3977496517166177
2023-12-12 23:10:55   INFO  epoch: 6/24, acc_iter=24272, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:53:11, time_cost(all): 7:26:28/20:47:36, loss=0.512941953422312, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=3.7094312441874946, lr=0.3974077504905143
2023-12-12 23:11:50   INFO  epoch: 6/24, acc_iter=24322, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:51:36, time_cost(all): 7:27:23/20:27:57, loss=0.512754908805223, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.2197439545108555, lr=0.3970658492644109
2023-12-12 23:12:45   INFO  epoch: 6/24, acc_iter=24372, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:29, time_cost(all): 7:28:18/20:52:31, loss=0.512567864188135, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=1.6513147241962067, lr=0.3967239480383075
2023-12-12 23:13:41   INFO  epoch: 6/24, acc_iter=24422, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:08, time_cost(all): 7:29:14/21:56:23, loss=0.512380819571046, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.85(1.03), norm=4.712334925239459, lr=0.3963820468122041
2023-12-12 23:14:36   INFO  epoch: 6/24, acc_iter=24472, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:22, time_cost(all): 7:30:09/20:10:14, loss=0.512193774953958, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=4.396558984526868, lr=0.39604014558610073
2023-12-12 23:15:31   INFO  epoch: 6/24, acc_iter=24522, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:48:11, time_cost(all): 7:31:04/21:49:08, loss=0.512006730336869, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.8249549591190393, lr=0.39569824435999734
2023-12-12 23:16:27   INFO  epoch: 6/24, acc_iter=24572, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:10, time_cost(all): 7:32:00/20:15:28, loss=0.511819685719781, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=1.0264823546439772, lr=0.395356343133894
2023-12-12 23:17:22   INFO  epoch: 6/24, acc_iter=24622, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:44:48, time_cost(all): 7:32:55/21:49:11, loss=0.511632641102692, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=4.411383542172709, lr=0.3950144419077906
2023-12-12 23:18:17   INFO  epoch: 6/24, acc_iter=24672, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:43:45, time_cost(all): 7:33:50/21:39:03, loss=0.511445596485604, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=1.752626613477391, lr=0.3946725406816872
2023-12-12 23:19:13   INFO  epoch: 6/24, acc_iter=24722, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:41, time_cost(all): 7:34:46/20:00:37, loss=0.511258551868515, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=1.2944901268134488, lr=0.39433063945558383
2023-12-12 23:20:08   INFO  epoch: 6/24, acc_iter=24772, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:39:54, time_cost(all): 7:35:41/21:11:52, loss=0.511071507251427, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=0.7240514864093062, lr=0.39398873822948044
2023-12-12 23:21:04   INFO  epoch: 6/24, acc_iter=24822, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:29, time_cost(all): 7:36:37/19:55:56, loss=0.510884462634338, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=4.181344728337798, lr=0.3936468370033771
2023-12-12 23:21:59   INFO  epoch: 6/24, acc_iter=24872, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:02, time_cost(all): 7:37:32/20:28:51, loss=0.51069741801725, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=2.999839715323879, lr=0.39330493577727366
2023-12-12 23:22:54   INFO  epoch: 6/24, acc_iter=24922, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:40:42, time_cost(all): 7:38:27/20:43:21, loss=0.510510373400161, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=2.121086626494777, lr=0.3929630345511703
2023-12-12 23:23:50   INFO  epoch: 6/24, acc_iter=24972, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:17, time_cost(all): 7:39:23/21:29:19, loss=0.510323328783073, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=0.670392344559656, lr=0.39262113332506693
2023-12-12 23:24:45   INFO  epoch: 6/24, acc_iter=25022, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:36:47, time_cost(all): 7:40:18/21:51:41, loss=0.510136284165984, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.09(1.03), norm=3.2129833394565757, lr=0.39227923209896354
2023-12-12 23:25:40   INFO  epoch: 6/24, acc_iter=25072, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:15, time_cost(all): 7:41:13/20:23:17, loss=0.509949239548895, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=4.423515172674064, lr=0.39193733087286015
2023-12-12 23:26:36   INFO  epoch: 6/24, acc_iter=25122, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:19, time_cost(all): 7:42:09/20:47:04, loss=0.509762194931807, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=4.61560223023225, lr=0.39159542964675675
2023-12-12 23:27:31   INFO  epoch: 6/24, acc_iter=25172, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:32:57, time_cost(all): 7:43:04/21:05:37, loss=0.509575150314718, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=3.8521530884270754, lr=0.3912535284206534
2023-12-12 23:28:26   INFO  epoch: 6/24, acc_iter=25222, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:21, time_cost(all): 7:43:59/21:23:56, loss=0.50938810569763, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.08(1.03), norm=1.9023821397846297, lr=0.39091162719455
2023-12-12 23:29:22   INFO  epoch: 6/24, acc_iter=25272, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:56, time_cost(all): 7:44:55/19:43:08, loss=0.509201061080541, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=2.0295787972541928, lr=0.39056972596844663
2023-12-12 23:30:17   INFO  epoch: 6/24, acc_iter=25322, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:56, time_cost(all): 7:45:50/20:29:58, loss=0.509014016463453, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.2297295779601034, lr=0.39022782474234324
2023-12-12 23:31:12   INFO  epoch: 6/24, acc_iter=25372, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:51, time_cost(all): 7:46:45/20:32:48, loss=0.508826971846364, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.96(1.03), norm=1.7067355445062076, lr=0.38988592351623985
2023-12-12 23:32:08   INFO  epoch: 6/24, acc_iter=25422, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:20, time_cost(all): 7:47:41/20:18:25, loss=0.508639927229276, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=2.271695647376001, lr=0.38954402229013646
2023-12-12 23:33:03   INFO  epoch: 6/24, acc_iter=25472, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:43, time_cost(all): 7:48:36/20:45:06, loss=0.508452882612187, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.88(1.03), norm=4.6534784249996965, lr=0.38920212106403307
2023-12-12 23:33:58   INFO  epoch: 6/24, acc_iter=25522, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:29:10, time_cost(all): 7:49:31/21:41:10, loss=0.508265837995099, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.87(1.03), norm=2.737075903799901, lr=0.38886021983792973
2023-12-12 23:34:54   INFO  epoch: 6/24, acc_iter=25572, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:56, time_cost(all): 7:50:27/20:47:30, loss=0.50807879337801, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.18(1.03), norm=4.660571694080763, lr=0.3885183186118263
2023-12-12 23:35:49   INFO  epoch: 6/24, acc_iter=25622, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:49, time_cost(all): 7:51:22/20:54:32, loss=0.507891748760922, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=3.111566837760415, lr=0.38817641738572295
2023-12-12 23:36:44   INFO  epoch: 6/24, acc_iter=25672, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:28, time_cost(all): 7:52:17/19:39:17, loss=0.507704704143833, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=0.8144764406922623, lr=0.38783451615961956
2023-12-12 23:37:40   INFO  epoch: 6/24, acc_iter=25722, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:10, time_cost(all): 7:53:13/19:44:13, loss=0.507517659526744, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=0.6066449376505676, lr=0.38749261493351617
2023-12-12 23:38:35   INFO  epoch: 6/24, acc_iter=25772, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:17, time_cost(all): 7:54:08/20:41:39, loss=0.507330614909656, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=2.583284100236464, lr=0.38715071370741283
2023-12-12 23:39:30   INFO  epoch: 6/24, acc_iter=25822, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:37, time_cost(all): 7:55:03/21:09:51, loss=0.507143570292567, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=0.8259794751906123, lr=0.3868088124813094
2023-12-12 23:40:26   INFO  epoch: 6/24, acc_iter=25872, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:22:05, time_cost(all): 7:55:59/19:36:30, loss=0.506956525675479, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=4.400991618683136, lr=0.38646691125520605
2023-12-12 23:41:21   INFO  epoch: 6/24, acc_iter=25922, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:06, time_cost(all): 7:56:54/19:57:24, loss=0.50676948105839, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=2.4857594871494415, lr=0.3861250100291026
2023-12-12 23:42:17   INFO  epoch: 6/24, acc_iter=25972, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:24, time_cost(all): 7:57:50/20:16:42, loss=0.506582436441302, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.04(1.03), norm=2.282698240234825, lr=0.38578310880299926
2023-12-12 23:43:12   INFO  epoch: 6/24, acc_iter=26022, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:17:48, time_cost(all): 7:58:45/20:36:30, loss=0.506395391824213, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.06(1.03), norm=1.4527555551749702, lr=0.3854412075768959
2023-12-12 23:44:07   INFO  epoch: 6/24, acc_iter=26072, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:16:54, time_cost(all): 7:59:40/20:22:05, loss=0.506208347207125, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.9(1.03), norm=4.888139940442748, lr=0.3850993063507925
2023-12-12 23:45:03   INFO  epoch: 6/24, acc_iter=26122, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:49, time_cost(all): 8:00:36/20:43:00, loss=0.506021302590036, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=4.226501272339577, lr=0.3847574051246891
2023-12-12 23:45:58   INFO  epoch: 6/24, acc_iter=26172, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:20, time_cost(all): 8:01:31/20:08:32, loss=0.505834257972948, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=3.0995524823533542, lr=0.3844155038985857
2023-12-12 23:46:53   INFO  epoch: 6/24, acc_iter=26222, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:25, time_cost(all): 8:02:26/21:08:35, loss=0.505647213355859, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=1.8764561635891137, lr=0.38407360267248236
2023-12-12 23:47:49   INFO  epoch: 6/24, acc_iter=26272, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:52, time_cost(all): 8:03:22/20:10:00, loss=0.505460168738771, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.87(1.03), norm=4.012415436002421, lr=0.38373170144637897
2023-12-12 23:48:44   INFO  epoch: 6/24, acc_iter=26322, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:44, time_cost(all): 8:04:17/21:05:00, loss=0.505273124121682, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.2(1.03), norm=4.363793625408309, lr=0.3833898002202756
2023-12-12 23:49:39   INFO  epoch: 6/24, acc_iter=26372, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:45, time_cost(all): 8:05:12/19:32:57, loss=0.505086079504593, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=0.5654803255675362, lr=0.3830478989941722
2023-12-12 23:50:35   INFO  epoch: 6/24, acc_iter=26422, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:20, time_cost(all): 8:06:08/20:26:27, loss=0.504899034887505, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.92(1.03), norm=3.1349057480070046, lr=0.3827059977680688
2023-12-12 23:51:30   INFO  epoch: 6/24, acc_iter=26472, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:25, time_cost(all): 8:07:03/19:50:03, loss=0.504711990270416, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=3.4158328955571253, lr=0.3823640965419654
2023-12-12 23:52:25   INFO  epoch: 6/24, acc_iter=26522, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:02, time_cost(all): 8:07:58/20:33:46, loss=0.504524945653328, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.568495627632645, lr=0.382022195315862
2023-12-12 23:53:21   INFO  epoch: 6/24, acc_iter=26572, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:38, time_cost(all): 8:08:54/20:25:00, loss=0.504337901036239, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=2.833512761539873, lr=0.3816802940897587
2023-12-12 23:54:16   INFO  epoch: 6/24, acc_iter=26622, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:31, time_cost(all): 8:09:49/20:27:18, loss=0.504150856419151, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.23(1.03), norm=0.6053584804813055, lr=0.3813383928636553
2023-12-12 23:55:11   INFO  epoch: 6/24, acc_iter=26672, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:41, time_cost(all): 8:10:44/20:32:30, loss=0.503963811802062, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.84(1.03), norm=2.473391528216945, lr=0.3809964916375519
2023-12-12 23:56:07   INFO  epoch: 6/24, acc_iter=26722, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:49, time_cost(all): 8:11:40/20:52:37, loss=0.503776767184974, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.15(1.03), norm=1.675136787402766, lr=0.3806545904114485
2023-12-12 23:57:02   INFO  epoch: 6/24, acc_iter=26772, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:57, time_cost(all): 8:12:35/19:23:03, loss=0.503589722567885, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.05(1.03), norm=1.1623143655043169, lr=0.3803126891853451
2023-12-12 23:57:57   INFO  epoch: 6/24, acc_iter=26822, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:59, time_cost(all): 8:13:30/20:20:30, loss=0.503402677950797, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=3.362988095632145, lr=0.3799707879592417
2023-12-12 23:58:53   INFO  epoch: 6/24, acc_iter=26872, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:54, time_cost(all): 8:14:26/21:16:16, loss=0.503215633333708, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=2.1060167798299463, lr=0.3796288867331383
2023-12-12 23:59:48   INFO  epoch: 6/24, acc_iter=26922, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 8:15:21/19:24:47, loss=0.50302858871662, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.06(1.03), norm=2.8967365166805563, lr=0.379286985507035
2023-12-13 00:00:43   INFO  epoch: 6/24, acc_iter=26972, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:05, time_cost(all): 8:16:16/19:59:55, loss=0.502841544099531, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.431205730544292, lr=0.3789450842809316
2023-12-13 00:01:39   INFO  epoch: 6/24, acc_iter=27022, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:12, time_cost(all): 8:17:12/19:23:36, loss=0.502654499482443, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=2.9983656398148795, lr=0.3786031830548282
2023-12-13 00:02:34   INFO  epoch: 7/24, acc_iter=27084, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:11:52, time_cost(all): 8:18:07/20:53:31, loss=0.502422564157253, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.05(1.03), norm=4.191269962932777, lr=0.37817922553446004
2023-12-13 00:03:30   INFO  epoch: 7/24, acc_iter=27134, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:09:15, time_cost(all): 8:19:03/20:34:57, loss=0.502235519540164, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.06(1.03), norm=3.4253632648115744, lr=0.3778373243083566
2023-12-13 00:04:25   INFO  epoch: 7/24, acc_iter=27184, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:35, time_cost(all): 8:19:58/20:20:14, loss=0.502048474923076, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=1.8315698921217585, lr=0.37749542308225326
2023-12-13 00:05:20   INFO  epoch: 7/24, acc_iter=27234, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:07:22, time_cost(all): 8:20:53/21:01:06, loss=0.501861430305987, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=0.6521668103136057, lr=0.37715352185614986
2023-12-13 00:06:16   INFO  epoch: 7/24, acc_iter=27284, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:07:54, time_cost(all): 8:21:49/20:30:38, loss=0.501674385688899, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.92(1.03), norm=3.4072120377877666, lr=0.3768116206300465
2023-12-13 00:07:11   INFO  epoch: 7/24, acc_iter=27334, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:02:50, time_cost(all): 8:22:44/19:34:27, loss=0.50148734107181, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.18(1.03), norm=2.589629334966777, lr=0.3764697194039431
2023-12-13 00:08:06   INFO  epoch: 7/24, acc_iter=27384, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:01:51, time_cost(all): 8:23:39/20:01:08, loss=0.501300296454722, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=4.739032529899583, lr=0.3761278181778397
2023-12-13 00:09:02   INFO  epoch: 7/24, acc_iter=27434, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:02:06, time_cost(all): 8:24:35/20:56:12, loss=0.501113251837633, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.6822828329588493, lr=0.37578591695173635
2023-12-13 00:09:57   INFO  epoch: 7/24, acc_iter=27484, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:06:02, time_cost(all): 8:25:30/20:47:46, loss=0.500926207220544, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.9(1.03), norm=2.47391865462706, lr=0.37544401572563296
2023-12-13 00:10:52   INFO  epoch: 7/24, acc_iter=27534, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:04:50, time_cost(all): 8:26:25/19:37:06, loss=0.500739162603456, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.9278717437214015, lr=0.37510211449952957
2023-12-13 00:11:48   INFO  epoch: 7/24, acc_iter=27584, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:59:10, time_cost(all): 8:27:21/20:04:02, loss=0.500552117986367, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.97(1.03), norm=1.796104357294395, lr=0.3747602132734261
2023-12-13 00:12:43   INFO  epoch: 7/24, acc_iter=27634, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:01:30, time_cost(all): 8:28:16/20:08:46, loss=0.500365073369279, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.14(1.03), norm=0.993250311484735, lr=0.3744183120473228
2023-12-13 00:13:38   INFO  epoch: 7/24, acc_iter=27684, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:02:05, time_cost(all): 8:29:11/19:13:19, loss=0.50017802875219, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.99(1.03), norm=4.575875963003671, lr=0.3740764108212194
2023-12-13 00:14:34   INFO  epoch: 7/24, acc_iter=27734, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:32, time_cost(all): 8:30:07/20:05:03, loss=0.499990984135102, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.08(1.03), norm=2.7149448910933005, lr=0.373734509595116
2023-12-13 00:15:29   INFO  epoch: 7/24, acc_iter=27784, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:07, time_cost(all): 8:31:02/19:53:08, loss=0.499803939518013, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=1.7104221963275719, lr=0.37339260836901267
2023-12-13 00:16:24   INFO  epoch: 7/24, acc_iter=27834, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:53:41, time_cost(all): 8:31:57/20:47:15, loss=0.499616894900925, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=3.8838276817366033, lr=0.3730507071429092
2023-12-13 00:17:20   INFO  epoch: 7/24, acc_iter=27884, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:55:46, time_cost(all): 8:32:53/19:44:42, loss=0.499429850283836, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=4.378977252333555, lr=0.3727088059168059
2023-12-13 00:18:15   INFO  epoch: 7/24, acc_iter=27934, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:32, time_cost(all): 8:33:48/20:01:33, loss=0.499242805666748, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=1.5588023154461452, lr=0.37236690469070244
2023-12-13 00:19:10   INFO  epoch: 7/24, acc_iter=27984, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:51:36, time_cost(all): 8:34:43/20:42:56, loss=0.499055761049659, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.000436356613175, lr=0.3720250034645991
2023-12-13 00:20:06   INFO  epoch: 7/24, acc_iter=28034, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:34, time_cost(all): 8:35:39/20:32:53, loss=0.498868716432571, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.0(1.03), norm=1.1675604144561706, lr=0.3716831022384957
2023-12-13 00:21:01   INFO  epoch: 7/24, acc_iter=28084, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:51, time_cost(all): 8:36:34/20:06:12, loss=0.498681671815482, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=0.9747635323820216, lr=0.3713412010123923
2023-12-13 00:21:56   INFO  epoch: 7/24, acc_iter=28134, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:48:48, time_cost(all): 8:37:29/19:41:45, loss=0.498494627198394, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=4.181893346488534, lr=0.370999299786289
2023-12-13 00:22:52   INFO  epoch: 7/24, acc_iter=28184, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:52:29, time_cost(all): 8:38:25/20:40:24, loss=0.498307582581305, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=1.4149123719305041, lr=0.37065739856018554
2023-12-13 00:23:47   INFO  epoch: 7/24, acc_iter=28234, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:36, time_cost(all): 8:39:20/18:52:37, loss=0.498120537964216, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=3.3775255718195893, lr=0.3703154973340822
2023-12-13 00:24:42   INFO  epoch: 7/24, acc_iter=28284, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:44, time_cost(all): 8:40:15/20:07:10, loss=0.497933493347128, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=4.056920488048341, lr=0.36997359610797875
2023-12-13 00:25:38   INFO  epoch: 7/24, acc_iter=28334, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:49:19, time_cost(all): 8:41:11/20:23:53, loss=0.497746448730039, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.22(1.03), norm=0.6468535249432565, lr=0.3696316948818754
2023-12-13 00:26:33   INFO  epoch: 7/24, acc_iter=28384, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:50, time_cost(all): 8:42:06/20:01:15, loss=0.497559404112951, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.15(1.03), norm=2.865267440084111, lr=0.369289793655772
2023-12-13 00:27:29   INFO  epoch: 7/24, acc_iter=28434, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:44:00, time_cost(all): 8:43:02/19:30:54, loss=0.497372359495862, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.93(1.03), norm=4.813324737020404, lr=0.36894789242966863
2023-12-13 00:28:24   INFO  epoch: 7/24, acc_iter=28484, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:33, time_cost(all): 8:43:57/19:23:18, loss=0.497185314878774, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=3.4299191295660765, lr=0.3686059912035653
2023-12-13 00:29:19   INFO  epoch: 7/24, acc_iter=28534, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:22, time_cost(all): 8:44:52/19:45:34, loss=0.496998270261685, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=1.0743704023163776, lr=0.36826408997746185
2023-12-13 00:30:15   INFO  epoch: 7/24, acc_iter=28584, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:28, time_cost(all): 8:45:48/19:39:28, loss=0.496811225644597, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.13(1.03), norm=0.6453289116680133, lr=0.3679221887513585
2023-12-13 00:31:10   INFO  epoch: 7/24, acc_iter=28634, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:29, time_cost(all): 8:46:43/20:22:28, loss=0.496624181027508, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=3.0091102191495733, lr=0.3675802875252551
2023-12-13 00:32:05   INFO  epoch: 7/24, acc_iter=28684, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:18, time_cost(all): 8:47:38/20:37:06, loss=0.49643713641042, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.88(1.03), norm=3.684373576068648, lr=0.36723838629915173
2023-12-13 00:33:01   INFO  epoch: 7/24, acc_iter=28734, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:48, time_cost(all): 8:48:34/18:44:55, loss=0.496250091793331, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=4.3458334986922456, lr=0.36689648507304834
2023-12-13 00:33:56   INFO  epoch: 7/24, acc_iter=28784, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:39:19, time_cost(all): 8:49:29/20:15:32, loss=0.496063047176243, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.97(1.03), norm=1.8006228969172395, lr=0.36655458384694495
2023-12-13 00:34:51   INFO  epoch: 7/24, acc_iter=28834, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:55, time_cost(all): 8:50:24/20:08:50, loss=0.495876002559154, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=4.805621357113596, lr=0.3662126826208416
2023-12-13 00:35:47   INFO  epoch: 7/24, acc_iter=28884, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:20, time_cost(all): 8:51:20/19:34:48, loss=0.495688957942065, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.03(1.03), norm=4.248860163016101, lr=0.36587078139473816
2023-12-13 00:36:42   INFO  epoch: 7/24, acc_iter=28934, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:07, time_cost(all): 8:52:15/20:30:29, loss=0.495501913324977, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.01(1.03), norm=2.5177635878589077, lr=0.36552888016863483
2023-12-13 00:37:37   INFO  epoch: 7/24, acc_iter=28984, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:33:37, time_cost(all): 8:53:10/20:28:09, loss=0.495314868707888, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.4430744807331757, lr=0.36518697894253144
2023-12-13 00:38:33   INFO  epoch: 7/24, acc_iter=29034, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:45, time_cost(all): 8:54:06/19:36:32, loss=0.4951278240908, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=2.4823937456859184, lr=0.36484507771642805
2023-12-13 00:39:28   INFO  epoch: 7/24, acc_iter=29084, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:53, time_cost(all): 8:55:01/19:02:27, loss=0.494940779473711, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=3.2471237646879323, lr=0.36450317649032465
2023-12-13 00:40:23   INFO  epoch: 7/24, acc_iter=29134, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:28, time_cost(all): 8:55:56/19:15:28, loss=0.494753734856623, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.874614026959032, lr=0.36416127526422126
2023-12-13 00:41:19   INFO  epoch: 7/24, acc_iter=29184, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:22, time_cost(all): 8:56:52/19:39:49, loss=0.494566690239534, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=3.3438754435945253, lr=0.3638193740381179
2023-12-13 00:42:14   INFO  epoch: 7/24, acc_iter=29234, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:48, time_cost(all): 8:57:47/19:30:09, loss=0.494379645622446, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=4.903974046532584, lr=0.3634774728120145
2023-12-13 00:43:09   INFO  epoch: 7/24, acc_iter=29284, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:34, time_cost(all): 8:58:42/19:17:07, loss=0.494192601005357, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=0.5702445952484073, lr=0.36313557158591114
2023-12-13 00:44:05   INFO  epoch: 7/24, acc_iter=29334, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:18, time_cost(all): 8:59:38/19:42:55, loss=0.494005556388269, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.17(1.03), norm=0.6569435344813536, lr=0.36279367035980775
2023-12-13 00:45:00   INFO  epoch: 7/24, acc_iter=29384, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:33, time_cost(all): 9:00:33/18:32:15, loss=0.49381851177118, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=4.367705141830287, lr=0.36245176913370436
2023-12-13 00:45:55   INFO  epoch: 7/24, acc_iter=29434, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:09, time_cost(all): 9:01:28/19:04:35, loss=0.493631467154092, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=4.470992958056298, lr=0.362109867907601
2023-12-13 00:46:51   INFO  epoch: 7/24, acc_iter=29484, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:16, time_cost(all): 9:02:24/19:25:06, loss=0.493444422537003, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.6453044335905056, lr=0.3617679666814976
2023-12-13 00:47:46   INFO  epoch: 7/24, acc_iter=29534, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:00, time_cost(all): 9:03:19/20:16:27, loss=0.493257377919915, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=2.6232491761603014, lr=0.3614260654553942
2023-12-13 00:48:42   INFO  epoch: 7/24, acc_iter=29584, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:14, time_cost(all): 9:04:15/19:49:17, loss=0.493070333302826, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.02(1.03), norm=1.7405709943243006, lr=0.3610841642292908
2023-12-13 00:49:37   INFO  epoch: 7/24, acc_iter=29634, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:18, time_cost(all): 9:05:10/18:50:51, loss=0.492883288685737, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=2.646202048766424, lr=0.36074226300318746
2023-12-13 00:50:32   INFO  epoch: 7/24, acc_iter=29684, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:43, time_cost(all): 9:06:05/18:50:29, loss=0.492696244068649, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.0(1.03), norm=0.9799056762624949, lr=0.36040036177708407
2023-12-13 00:51:28   INFO  epoch: 7/24, acc_iter=29734, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:36, time_cost(all): 9:07:01/19:44:39, loss=0.49250919945156, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=2.174378982086309, lr=0.3600584605509807
2023-12-13 00:52:23   INFO  epoch: 7/24, acc_iter=29784, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:54, time_cost(all): 9:07:56/19:00:32, loss=0.492322154834472, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.02(1.03), norm=4.50014817052565, lr=0.3597165593248773
2023-12-13 00:53:18   INFO  epoch: 7/24, acc_iter=29834, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:18:49, time_cost(all): 9:08:51/20:15:52, loss=0.492135110217383, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.19(1.03), norm=1.9220890399562598, lr=0.3593746580987739
2023-12-13 00:54:14   INFO  epoch: 7/24, acc_iter=29884, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:04, time_cost(all): 9:09:47/19:57:51, loss=0.491948065600295, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.17(1.03), norm=3.872383022472935, lr=0.3590327568726705
2023-12-13 00:55:09   INFO  epoch: 7/24, acc_iter=29934, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:17, time_cost(all): 9:10:42/20:03:40, loss=0.491761020983206, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=3.5404169982343583, lr=0.3586908556465671
2023-12-13 00:56:04   INFO  epoch: 7/24, acc_iter=29984, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:25, time_cost(all): 9:11:37/19:32:11, loss=0.491573976366118, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.87(1.03), norm=3.958923280537015, lr=0.35834895442046377
2023-12-13 00:57:00   INFO  epoch: 7/24, acc_iter=30034, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:40, time_cost(all): 9:12:33/18:26:25, loss=0.491386931749029, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.23(1.03), norm=3.0857593828518723, lr=0.3580070531943604
2023-12-13 00:57:55   INFO  epoch: 7/24, acc_iter=30084, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:33, time_cost(all): 9:13:28/19:33:25, loss=0.491199887131941, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.83(1.03), norm=0.8303071351558762, lr=0.357665151968257
2023-12-13 00:58:50   INFO  epoch: 7/24, acc_iter=30134, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:45, time_cost(all): 9:14:23/19:50:47, loss=0.491012842514852, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.99(1.03), norm=0.5975283063186982, lr=0.3573232507421536
2023-12-13 00:59:46   INFO  epoch: 7/24, acc_iter=30184, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:50, time_cost(all): 9:15:19/18:30:42, loss=0.490825797897764, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=2.9666584998124144, lr=0.3569813495160502
2023-12-13 01:00:41   INFO  epoch: 7/24, acc_iter=30234, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:38, time_cost(all): 9:16:14/18:48:18, loss=0.490638753280675, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.83(1.03), norm=4.7710254834149595, lr=0.35663944828994687
2023-12-13 01:01:36   INFO  epoch: 7/24, acc_iter=30284, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:04, time_cost(all): 9:17:09/20:09:04, loss=0.490451708663586, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=0.933652197585226, lr=0.3562975470638434
2023-12-13 01:02:32   INFO  epoch: 7/24, acc_iter=30334, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:09:55, time_cost(all): 9:18:05/20:02:53, loss=0.490264664046498, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.2584621925836217, lr=0.3559556458377401
2023-12-13 01:03:27   INFO  epoch: 7/24, acc_iter=30384, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:16, time_cost(all): 9:19:00/18:24:39, loss=0.490077619429409, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.393431312465562, lr=0.35561374461163664
2023-12-13 01:04:22   INFO  epoch: 7/24, acc_iter=30434, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:20, time_cost(all): 9:19:55/18:31:06, loss=0.489890574812321, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.85(1.03), norm=1.1445936829954553, lr=0.3552718433855333
2023-12-13 01:05:18   INFO  epoch: 7/24, acc_iter=30484, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:43, time_cost(all): 9:20:51/18:43:18, loss=0.489703530195232, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.11(1.03), norm=2.818892727694885, lr=0.3549299421594299
2023-12-13 01:06:13   INFO  epoch: 7/24, acc_iter=30534, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:56, time_cost(all): 9:21:46/20:02:14, loss=0.489516485578144, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.14(1.03), norm=1.2544523346759981, lr=0.3545880409333265
2023-12-13 01:07:08   INFO  epoch: 7/24, acc_iter=30584, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:33, time_cost(all): 9:22:41/18:30:36, loss=0.489329440961055, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=2.010077771704342, lr=0.3542461397072232
2023-12-13 01:08:04   INFO  epoch: 7/24, acc_iter=30634, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:59, time_cost(all): 9:23:37/19:28:30, loss=0.489142396343967, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.1(1.03), norm=3.987001423595609, lr=0.35390423848111974
2023-12-13 01:08:59   INFO  epoch: 7/24, acc_iter=30684, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:43, time_cost(all): 9:24:32/19:38:23, loss=0.488955351726878, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.08(1.03), norm=1.6168087155434696, lr=0.3535623372550164
2023-12-13 01:09:55   INFO  epoch: 7/24, acc_iter=30734, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:59, time_cost(all): 9:25:28/19:47:11, loss=0.48876830710979, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=3.330801610808561, lr=0.35322043602891295
2023-12-13 01:10:50   INFO  epoch: 7/24, acc_iter=30784, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 9:26:23/18:11:31, loss=0.488581262492701, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=4.8486971726158385, lr=0.3528785348028096
2023-12-13 01:11:45   INFO  epoch: 7/24, acc_iter=30834, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:09, time_cost(all): 9:27:18/18:35:32, loss=0.488394217875613, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=1.2386719349480633, lr=0.3525366335767062
2023-12-13 01:12:41   INFO  epoch: 7/24, acc_iter=30884, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 9:28:14/18:23:47, loss=0.488207173258524, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=1.5390707808137192, lr=0.35219473235060283
2023-12-13 01:13:36   INFO  epoch: 8/24, acc_iter=30946, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:11:10, time_cost(all): 9:29:09/19:08:08, loss=0.487975237933334, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=3.832121943514302, lr=0.35177077483023467
2023-12-13 01:14:31   INFO  epoch: 8/24, acc_iter=30996, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:07:09, time_cost(all): 9:30:04/19:05:07, loss=0.487788193316246, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.28639114961781, lr=0.3514288736041313
2023-12-13 01:15:27   INFO  epoch: 8/24, acc_iter=31046, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:08:50, time_cost(all): 9:31:00/19:42:11, loss=0.487601148699157, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.12(1.03), norm=0.7428667961847543, lr=0.3510869723780279
2023-12-13 01:16:22   INFO  epoch: 8/24, acc_iter=31096, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:10:38, time_cost(all): 9:31:55/18:23:25, loss=0.487414104082069, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=4.664044552383951, lr=0.3507450711519245
2023-12-13 01:17:17   INFO  epoch: 8/24, acc_iter=31146, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:03:54, time_cost(all): 9:32:50/18:22:48, loss=0.48722705946498, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.95(1.03), norm=1.8405575242864196, lr=0.3504031699258211
2023-12-13 01:18:13   INFO  epoch: 8/24, acc_iter=31196, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:05:30, time_cost(all): 9:33:46/18:14:48, loss=0.487040014847892, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=0.7086757100812755, lr=0.35006126869971776
2023-12-13 01:19:08   INFO  epoch: 8/24, acc_iter=31246, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:02:51, time_cost(all): 9:34:41/18:34:56, loss=0.486852970230803, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.83(1.03), norm=2.9223664243139837, lr=0.3497193674736143
2023-12-13 01:20:03   INFO  epoch: 8/24, acc_iter=31296, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:05:27, time_cost(all): 9:35:36/19:05:22, loss=0.486665925613715, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=3.4312655955483597, lr=0.349377466247511
2023-12-13 01:20:59   INFO  epoch: 8/24, acc_iter=31346, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:56, time_cost(all): 9:36:32/18:16:15, loss=0.486478880996626, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=4.047774907466684, lr=0.3490355650214076
2023-12-13 01:21:54   INFO  epoch: 8/24, acc_iter=31396, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:04:01, time_cost(all): 9:37:27/19:24:48, loss=0.486291836379537, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.89(1.03), norm=4.374870206130618, lr=0.3486936637953042
2023-12-13 01:22:49   INFO  epoch: 8/24, acc_iter=31446, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:58:58, time_cost(all): 9:38:22/18:09:44, loss=0.486104791762449, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.7405541498285768, lr=0.3483517625692008
2023-12-13 01:23:45   INFO  epoch: 8/24, acc_iter=31496, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:16, time_cost(all): 9:39:18/19:30:11, loss=0.48591774714536, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.855108192331408, lr=0.3480098613430974
2023-12-13 01:24:40   INFO  epoch: 8/24, acc_iter=31546, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:02:11, time_cost(all): 9:40:13/19:40:06, loss=0.485730702528272, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.11(1.03), norm=4.37650281421003, lr=0.3476679601169941
2023-12-13 01:25:35   INFO  epoch: 8/24, acc_iter=31596, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/1:00:22, time_cost(all): 9:41:08/17:58:58, loss=0.485543657911183, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=3.857321332893814, lr=0.34732605889089063
2023-12-13 01:26:31   INFO  epoch: 8/24, acc_iter=31646, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:54:59, time_cost(all): 9:42:04/18:07:10, loss=0.485356613294095, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=1.8214917399727741, lr=0.3469841576647873
2023-12-13 01:27:26   INFO  epoch: 8/24, acc_iter=31696, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:58:27, time_cost(all): 9:42:59/19:02:16, loss=0.485169568677006, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=0.6928743834222867, lr=0.3466422564386839
2023-12-13 01:28:21   INFO  epoch: 8/24, acc_iter=31746, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:57:16, time_cost(all): 9:43:54/19:35:22, loss=0.484982524059918, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.11(1.03), norm=2.1307978245676185, lr=0.3463003552125805
2023-12-13 01:29:17   INFO  epoch: 8/24, acc_iter=31796, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:57:05, time_cost(all): 9:44:50/19:16:45, loss=0.484795479442829, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=3.819513099064766, lr=0.3459584539864772
2023-12-13 01:30:12   INFO  epoch: 8/24, acc_iter=31846, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:54:57, time_cost(all): 9:45:45/18:42:17, loss=0.484608434825741, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=4.480194861032082, lr=0.34561655276037373
2023-12-13 01:31:08   INFO  epoch: 8/24, acc_iter=31896, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:53, time_cost(all): 9:46:41/18:59:10, loss=0.484421390208652, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=0.5400500486304852, lr=0.3452746515342704
2023-12-13 01:32:03   INFO  epoch: 8/24, acc_iter=31946, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:54:05, time_cost(all): 9:47:36/18:27:22, loss=0.484234345591564, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=2.8877739783777776, lr=0.34493275030816695
2023-12-13 01:32:58   INFO  epoch: 8/24, acc_iter=31996, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:44, time_cost(all): 9:48:31/18:23:09, loss=0.484047300974475, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=2.5753815493510563, lr=0.3445908490820636
2023-12-13 01:33:54   INFO  epoch: 8/24, acc_iter=32046, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:50:38, time_cost(all): 9:49:27/18:12:21, loss=0.483860256357386, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=1.3682980375827207, lr=0.3442489478559602
2023-12-13 01:34:49   INFO  epoch: 8/24, acc_iter=32096, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:49, time_cost(all): 9:50:22/18:26:34, loss=0.483673211740298, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=1.6798538706313915, lr=0.3439070466298568
2023-12-13 01:35:44   INFO  epoch: 8/24, acc_iter=32146, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:36, time_cost(all): 9:51:17/19:32:35, loss=0.483486167123209, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=4.549724283217915, lr=0.3435651454037535
2023-12-13 01:36:40   INFO  epoch: 8/24, acc_iter=32196, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:01, time_cost(all): 9:52:13/19:32:30, loss=0.483299122506121, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.86(1.03), norm=2.2597333463912666, lr=0.34322324417765004
2023-12-13 01:37:35   INFO  epoch: 8/24, acc_iter=32246, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:44:30, time_cost(all): 9:53:08/19:28:06, loss=0.483112077889032, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=2.777041642859798, lr=0.34288134295154665
2023-12-13 01:38:30   INFO  epoch: 8/24, acc_iter=32296, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:13, time_cost(all): 9:54:03/17:49:08, loss=0.482925033271944, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.07(1.03), norm=1.7472208573265116, lr=0.34253944172544326
2023-12-13 01:39:26   INFO  epoch: 8/24, acc_iter=32346, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:58, time_cost(all): 9:54:59/17:53:19, loss=0.482737988654855, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=4.61234268226862, lr=0.3421975404993399
2023-12-13 01:40:21   INFO  epoch: 8/24, acc_iter=32396, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:58, time_cost(all): 9:55:54/19:21:25, loss=0.482550944037767, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=2.5152536420817007, lr=0.34185563927323653
2023-12-13 01:41:16   INFO  epoch: 8/24, acc_iter=32446, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:40:32, time_cost(all): 9:56:49/19:19:11, loss=0.482363899420678, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=0.5023564003337129, lr=0.34151373804713314
2023-12-13 01:42:12   INFO  epoch: 8/24, acc_iter=32496, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:40:32, time_cost(all): 9:57:45/17:54:41, loss=0.48217685480359, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.21(1.03), norm=3.1873727100612026, lr=0.34117183682102975
2023-12-13 01:43:07   INFO  epoch: 8/24, acc_iter=32546, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:06, time_cost(all): 9:58:40/18:48:39, loss=0.481989810186501, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=2.0562243730734586, lr=0.34082993559492636
2023-12-13 01:44:02   INFO  epoch: 8/24, acc_iter=32596, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:30, time_cost(all): 9:59:35/18:15:44, loss=0.481802765569413, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=1.8481658811094195, lr=0.340488034368823
2023-12-13 01:44:58   INFO  epoch: 8/24, acc_iter=32646, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:40:35, time_cost(all): 10:00:31/17:58:02, loss=0.481615720952324, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=2.133753705236114, lr=0.3401461331427196
2023-12-13 01:45:53   INFO  epoch: 8/24, acc_iter=32696, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:38:13, time_cost(all): 10:01:26/17:45:52, loss=0.481428676335236, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=1.7925141417606942, lr=0.33980423191661624
2023-12-13 01:46:48   INFO  epoch: 8/24, acc_iter=32746, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:21, time_cost(all): 10:02:21/18:55:33, loss=0.481241631718147, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.779858471881785, lr=0.33946233069051285
2023-12-13 01:47:44   INFO  epoch: 8/24, acc_iter=32796, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:26, time_cost(all): 10:03:17/18:32:18, loss=0.481054587101058, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.22(1.03), norm=3.3749112281043843, lr=0.33912042946440946
2023-12-13 01:48:39   INFO  epoch: 8/24, acc_iter=32846, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:36:17, time_cost(all): 10:04:12/18:57:47, loss=0.48086754248397, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.9081751784064522, lr=0.33877852823830606
2023-12-13 01:49:34   INFO  epoch: 8/24, acc_iter=32896, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:49, time_cost(all): 10:05:07/18:29:17, loss=0.480680497866881, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.14(1.03), norm=2.06524249050871, lr=0.3384366270122027
2023-12-13 01:50:30   INFO  epoch: 8/24, acc_iter=32946, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:25, time_cost(all): 10:06:03/17:33:34, loss=0.480493453249793, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.91(1.03), norm=3.0151057457293646, lr=0.33809472578609934
2023-12-13 01:51:25   INFO  epoch: 8/24, acc_iter=32996, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:56, time_cost(all): 10:06:58/18:44:45, loss=0.480306408632704, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.92(1.03), norm=3.982328778629463, lr=0.33775282455999595
2023-12-13 01:52:21   INFO  epoch: 8/24, acc_iter=33046, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:10, time_cost(all): 10:07:54/19:01:10, loss=0.480119364015616, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=2.8972951693932556, lr=0.33741092333389255
2023-12-13 01:53:16   INFO  epoch: 8/24, acc_iter=33096, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:59, time_cost(all): 10:08:49/17:36:56, loss=0.479932319398527, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=3.267297386355371, lr=0.3370690221077891
2023-12-13 01:54:11   INFO  epoch: 8/24, acc_iter=33146, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:19, time_cost(all): 10:09:44/17:44:59, loss=0.479745274781439, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=1.1379695133762715, lr=0.33672712088168577
2023-12-13 01:55:07   INFO  epoch: 8/24, acc_iter=33196, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:30:06, time_cost(all): 10:10:40/18:20:44, loss=0.47955823016435, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.13(1.03), norm=1.7601137455804006, lr=0.3363852196555824
2023-12-13 01:56:02   INFO  epoch: 8/24, acc_iter=33246, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:56, time_cost(all): 10:11:35/18:28:11, loss=0.479371185547262, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=2.9397594953309167, lr=0.336043318429479
2023-12-13 01:56:57   INFO  epoch: 8/24, acc_iter=33296, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:40, time_cost(all): 10:12:30/17:42:44, loss=0.479184140930173, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=1.9167243371637332, lr=0.33570141720337565
2023-12-13 01:57:53   INFO  epoch: 8/24, acc_iter=33346, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:53, time_cost(all): 10:13:26/17:48:29, loss=0.478997096313085, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=1.4649531839233867, lr=0.3353595159772722
2023-12-13 01:58:48   INFO  epoch: 8/24, acc_iter=33396, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:29, time_cost(all): 10:14:21/18:43:18, loss=0.478810051695996, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.04(1.03), norm=3.2517266554460704, lr=0.33501761475116887
2023-12-13 01:59:43   INFO  epoch: 8/24, acc_iter=33446, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:54, time_cost(all): 10:15:16/18:02:29, loss=0.478623007078907, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=2.698172469993458, lr=0.3346757135250654
2023-12-13 02:00:39   INFO  epoch: 8/24, acc_iter=33496, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:11, time_cost(all): 10:16:12/18:44:14, loss=0.478435962461819, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.1419900003768584, lr=0.3343338122989621
2023-12-13 02:01:34   INFO  epoch: 8/24, acc_iter=33546, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:47, time_cost(all): 10:17:07/18:36:11, loss=0.47824891784473, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.08(1.03), norm=4.266508767109035, lr=0.3339919110728587
2023-12-13 02:02:29   INFO  epoch: 8/24, acc_iter=33596, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:59, time_cost(all): 10:18:02/17:35:52, loss=0.478061873227642, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.06(1.03), norm=3.892281652044883, lr=0.3336500098467553
2023-12-13 02:03:25   INFO  epoch: 8/24, acc_iter=33646, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:00, time_cost(all): 10:18:58/17:59:54, loss=0.477874828610553, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.21(1.03), norm=2.3149779130924015, lr=0.33330810862065197
2023-12-13 02:04:20   INFO  epoch: 8/24, acc_iter=33696, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:29, time_cost(all): 10:19:53/17:34:04, loss=0.477687783993465, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.06(1.03), norm=4.1939815337614945, lr=0.3329662073945485
2023-12-13 02:05:15   INFO  epoch: 8/24, acc_iter=33746, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:08, time_cost(all): 10:20:48/17:32:40, loss=0.477500739376376, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.294519702881012, lr=0.3326243061684452
2023-12-13 02:06:11   INFO  epoch: 8/24, acc_iter=33796, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:37, time_cost(all): 10:21:44/18:41:57, loss=0.477313694759288, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=3.2652755071808723, lr=0.33228240494234174
2023-12-13 02:07:06   INFO  epoch: 8/24, acc_iter=33846, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:07, time_cost(all): 10:22:39/18:53:25, loss=0.477126650142199, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.15(1.03), norm=3.2711430836140254, lr=0.3319405037162384
2023-12-13 02:08:01   INFO  epoch: 8/24, acc_iter=33896, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:27, time_cost(all): 10:23:34/18:00:51, loss=0.476939605525111, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.9344234647335226, lr=0.331598602490135
2023-12-13 02:08:57   INFO  epoch: 8/24, acc_iter=33946, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:20, time_cost(all): 10:24:30/18:25:02, loss=0.476752560908022, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.07(1.03), norm=1.8043905225650825, lr=0.3312567012640316
2023-12-13 02:09:52   INFO  epoch: 8/24, acc_iter=33996, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:35, time_cost(all): 10:25:25/17:28:53, loss=0.476565516290934, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=0.7061415252220586, lr=0.3309148000379283
2023-12-13 02:10:47   INFO  epoch: 8/24, acc_iter=34046, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:43, time_cost(all): 10:26:20/17:24:04, loss=0.476378471673845, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=2.650755737603677, lr=0.33057289881182483
2023-12-13 02:11:43   INFO  epoch: 8/24, acc_iter=34096, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:47, time_cost(all): 10:27:16/17:53:38, loss=0.476191427056757, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.15(1.03), norm=3.0756958098021796, lr=0.3302309975857215
2023-12-13 02:12:38   INFO  epoch: 8/24, acc_iter=34146, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:14, time_cost(all): 10:28:11/17:11:51, loss=0.476004382439668, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.08(1.03), norm=4.468549872524054, lr=0.3298890963596181
2023-12-13 02:13:34   INFO  epoch: 8/24, acc_iter=34196, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:03, time_cost(all): 10:29:07/17:53:02, loss=0.475817337822579, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=1.5873991944015144, lr=0.3295471951335147
2023-12-13 02:14:29   INFO  epoch: 8/24, acc_iter=34246, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:14, time_cost(all): 10:30:02/17:21:18, loss=0.475630293205491, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=4.175905652950664, lr=0.3292052939074113
2023-12-13 02:15:24   INFO  epoch: 8/24, acc_iter=34296, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:07, time_cost(all): 10:30:57/18:10:16, loss=0.475443248588402, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.04(1.03), norm=4.853971603914335, lr=0.32886339268130793
2023-12-13 02:16:20   INFO  epoch: 8/24, acc_iter=34346, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:53, time_cost(all): 10:31:53/17:21:22, loss=0.475256203971314, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=4.227261882578476, lr=0.3285214914552046
2023-12-13 02:17:15   INFO  epoch: 8/24, acc_iter=34396, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:35, time_cost(all): 10:32:48/17:20:11, loss=0.475069159354225, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=0.7225623042754912, lr=0.32817959022910115
2023-12-13 02:18:10   INFO  epoch: 8/24, acc_iter=34446, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:34, time_cost(all): 10:33:43/17:41:01, loss=0.474882114737137, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=3.184067496040245, lr=0.3278376890029978
2023-12-13 02:19:06   INFO  epoch: 8/24, acc_iter=34496, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:45, time_cost(all): 10:34:39/18:02:40, loss=0.474695070120048, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=4.066918782166885, lr=0.3274957877768944
2023-12-13 02:20:01   INFO  epoch: 8/24, acc_iter=34546, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:53, time_cost(all): 10:35:34/17:14:24, loss=0.47450802550296, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=2.1486539649398773, lr=0.32715388655079103
2023-12-13 02:20:56   INFO  epoch: 8/24, acc_iter=34596, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:07, time_cost(all): 10:36:29/17:22:56, loss=0.474320980885871, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=1.1936442049296847, lr=0.32681198532468764
2023-12-13 02:21:52   INFO  epoch: 8/24, acc_iter=34646, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:05, time_cost(all): 10:37:25/17:46:06, loss=0.474133936268783, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.22(1.03), norm=1.9234237333192072, lr=0.32647008409858425
2023-12-13 02:22:47   INFO  epoch: 8/24, acc_iter=34696, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:05, time_cost(all): 10:38:20/17:40:05, loss=0.473946891651694, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=1.1258840158186323, lr=0.3261281828724809
2023-12-13 02:23:42   INFO  epoch: 8/24, acc_iter=34746, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 10:39:15/18:29:06, loss=0.473759847034606, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=3.3928910189459702, lr=0.32578628164637746
2023-12-13 02:24:38   INFO  epoch: 9/24, acc_iter=34808, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:10:32, time_cost(all): 10:40:11/17:43:27, loss=0.473527911709416, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=4.447012485458828, lr=0.3253623241260093
2023-12-13 02:25:33   INFO  epoch: 9/24, acc_iter=34858, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:11:28, time_cost(all): 10:41:06/17:47:09, loss=0.473340867092327, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.99(1.03), norm=4.157832456584702, lr=0.32502042289990596
2023-12-13 02:26:28   INFO  epoch: 9/24, acc_iter=34908, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:01, time_cost(all): 10:42:01/18:04:28, loss=0.473153822475239, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=2.104384598846607, lr=0.3246785216738025
2023-12-13 02:27:24   INFO  epoch: 9/24, acc_iter=34958, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:05:56, time_cost(all): 10:42:57/17:49:34, loss=0.47296677785815, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=3.5092326783996377, lr=0.3243366204476992
2023-12-13 02:28:19   INFO  epoch: 9/24, acc_iter=35008, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:08:36, time_cost(all): 10:43:52/17:14:48, loss=0.472779733241062, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.5988762216629575, lr=0.32399471922159573
2023-12-13 02:29:14   INFO  epoch: 9/24, acc_iter=35058, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:03:56, time_cost(all): 10:44:47/18:05:33, loss=0.472592688623973, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=4.345765493956145, lr=0.3236528179954924
2023-12-13 02:30:10   INFO  epoch: 9/24, acc_iter=35108, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:03:09, time_cost(all): 10:45:43/18:02:27, loss=0.472405644006885, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.103459793207343, lr=0.323310916769389
2023-12-13 02:31:05   INFO  epoch: 9/24, acc_iter=35158, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:02:30, time_cost(all): 10:46:38/17:05:15, loss=0.472218599389796, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=4.431583285726795, lr=0.3229690155432856
2023-12-13 02:32:00   INFO  epoch: 9/24, acc_iter=35208, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:52, time_cost(all): 10:47:33/17:53:18, loss=0.472031554772708, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=3.864336618181679, lr=0.3226271143171822
2023-12-13 02:32:56   INFO  epoch: 9/24, acc_iter=35258, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:00:37, time_cost(all): 10:48:29/18:11:37, loss=0.471844510155619, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=1.5290805604421254, lr=0.3222852130910788
2023-12-13 02:33:51   INFO  epoch: 9/24, acc_iter=35308, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:59:12, time_cost(all): 10:49:24/18:12:10, loss=0.47165746553853, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=3.2873188065108403, lr=0.3219433118649755
2023-12-13 02:34:46   INFO  epoch: 9/24, acc_iter=35358, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:57:48, time_cost(all): 10:50:19/18:02:32, loss=0.471470420921442, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.08(1.03), norm=3.268741056962855, lr=0.3216014106388721
2023-12-13 02:35:42   INFO  epoch: 9/24, acc_iter=35408, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:00:27, time_cost(all): 10:51:15/18:30:43, loss=0.471283376304353, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=4.133647722654699, lr=0.3212595094127687
2023-12-13 02:36:37   INFO  epoch: 9/24, acc_iter=35458, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:47, time_cost(all): 10:52:10/17:02:56, loss=0.471096331687265, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=0.8504709901551027, lr=0.3209176081866653
2023-12-13 02:37:33   INFO  epoch: 9/24, acc_iter=35508, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:55:59, time_cost(all): 10:53:06/17:54:10, loss=0.470909287070176, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=3.692742867839335, lr=0.3205757069605619
2023-12-13 02:38:28   INFO  epoch: 9/24, acc_iter=35558, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:57:21, time_cost(all): 10:54:01/17:52:19, loss=0.470722242453088, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.13(1.03), norm=4.5801626563914795, lr=0.32023380573445853
2023-12-13 02:39:23   INFO  epoch: 9/24, acc_iter=35608, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:56:49, time_cost(all): 10:54:56/17:07:44, loss=0.470535197835999, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=3.1013775498975504, lr=0.31989190450835514
2023-12-13 02:40:19   INFO  epoch: 9/24, acc_iter=35658, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:53, time_cost(all): 10:55:52/17:07:32, loss=0.470348153218911, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=4.368549346351352, lr=0.3195500032822518
2023-12-13 02:41:14   INFO  epoch: 9/24, acc_iter=35708, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:38, time_cost(all): 10:56:47/18:00:07, loss=0.470161108601822, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.1(1.03), norm=4.722018250514988, lr=0.3192081020561484
2023-12-13 02:42:09   INFO  epoch: 9/24, acc_iter=35758, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:50:45, time_cost(all): 10:57:42/16:48:16, loss=0.469974063984734, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=4.907730626697457, lr=0.318866200830045
2023-12-13 02:43:05   INFO  epoch: 9/24, acc_iter=35808, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:50:39, time_cost(all): 10:58:38/16:51:25, loss=0.469787019367645, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=3.0420097029906996, lr=0.3185242996039416
2023-12-13 02:44:00   INFO  epoch: 9/24, acc_iter=35858, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:50:46, time_cost(all): 10:59:33/18:04:47, loss=0.469599974750557, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=2.3464475241093106, lr=0.31818239837783824
2023-12-13 02:44:55   INFO  epoch: 9/24, acc_iter=35908, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:48:29, time_cost(all): 11:00:28/17:19:39, loss=0.469412930133468, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.11(1.03), norm=0.8347253491385347, lr=0.31784049715173485
2023-12-13 02:45:51   INFO  epoch: 9/24, acc_iter=35958, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:14, time_cost(all): 11:01:24/17:26:15, loss=0.469225885516379, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.1(1.03), norm=4.951288741806722, lr=0.31749859592563145
2023-12-13 02:46:46   INFO  epoch: 9/24, acc_iter=36008, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:46:50, time_cost(all): 11:02:19/17:57:15, loss=0.469038840899291, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=2.8563029557561816, lr=0.3171566946995281
2023-12-13 02:47:41   INFO  epoch: 9/24, acc_iter=36058, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:58, time_cost(all): 11:03:14/17:38:20, loss=0.468851796282202, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=3.7083632981381474, lr=0.31681479347342467
2023-12-13 02:48:37   INFO  epoch: 9/24, acc_iter=36108, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:44:24, time_cost(all): 11:04:10/16:41:19, loss=0.468664751665114, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.19(1.03), norm=2.205074819324143, lr=0.31647289224732134
2023-12-13 02:49:32   INFO  epoch: 9/24, acc_iter=36158, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:22, time_cost(all): 11:05:05/17:15:55, loss=0.468477707048025, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=2.8683594243326143, lr=0.3161309910212179
2023-12-13 02:50:27   INFO  epoch: 9/24, acc_iter=36208, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:51, time_cost(all): 11:06:00/17:49:03, loss=0.468290662430937, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=3.891711604769451, lr=0.31578908979511455
2023-12-13 02:51:23   INFO  epoch: 9/24, acc_iter=36258, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:38, time_cost(all): 11:06:56/17:33:02, loss=0.468103617813848, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=1.0869631661078207, lr=0.31544718856901116
2023-12-13 02:52:18   INFO  epoch: 9/24, acc_iter=36308, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:06, time_cost(all): 11:07:51/18:07:24, loss=0.46791657319676, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=1.9388705451558583, lr=0.31510528734290777
2023-12-13 02:53:13   INFO  epoch: 9/24, acc_iter=36358, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:54, time_cost(all): 11:08:46/17:24:49, loss=0.467729528579671, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=0.9197704580043746, lr=0.31476338611680443
2023-12-13 02:54:09   INFO  epoch: 9/24, acc_iter=36408, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:17, time_cost(all): 11:09:42/18:03:42, loss=0.467542483962583, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=3.667167370412542, lr=0.314421484890701
2023-12-13 02:55:04   INFO  epoch: 9/24, acc_iter=36458, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:39, time_cost(all): 11:10:37/17:09:53, loss=0.467355439345494, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=0.7294613292180141, lr=0.31407958366459765
2023-12-13 02:55:59   INFO  epoch: 9/24, acc_iter=36508, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:39:41, time_cost(all): 11:11:32/17:11:13, loss=0.467168394728406, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.16(1.03), norm=2.322940048109615, lr=0.31373768243849426
2023-12-13 02:56:55   INFO  epoch: 9/24, acc_iter=36558, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:05, time_cost(all): 11:12:28/17:55:10, loss=0.466981350111317, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=2.6309971560169862, lr=0.31339578121239087
2023-12-13 02:57:50   INFO  epoch: 9/24, acc_iter=36608, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:44, time_cost(all): 11:13:23/17:04:56, loss=0.466794305494229, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=2.7766340740494835, lr=0.3130538799862875
2023-12-13 02:58:46   INFO  epoch: 9/24, acc_iter=36658, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:37:35, time_cost(all): 11:14:19/17:52:42, loss=0.46660726087714, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=3.253020181334653, lr=0.3127119787601841
2023-12-13 02:59:41   INFO  epoch: 9/24, acc_iter=36708, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:36:00, time_cost(all): 11:15:14/16:56:54, loss=0.466420216260051, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.0(1.03), norm=3.3228710511250235, lr=0.31237007753408075
2023-12-13 03:00:36   INFO  epoch: 9/24, acc_iter=36758, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:32:50, time_cost(all): 11:16:09/16:33:24, loss=0.466233171642963, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=4.104648349551843, lr=0.3120281763079773
2023-12-13 03:01:32   INFO  epoch: 9/24, acc_iter=36808, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:25, time_cost(all): 11:17:05/18:01:30, loss=0.466046127025874, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=0.5826250002212332, lr=0.31168627508187396
2023-12-13 03:02:27   INFO  epoch: 9/24, acc_iter=36858, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:24, time_cost(all): 11:18:00/17:47:44, loss=0.465859082408786, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.83(1.03), norm=3.8764165699834923, lr=0.3113443738557706
2023-12-13 03:03:22   INFO  epoch: 9/24, acc_iter=36908, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:08, time_cost(all): 11:18:55/16:50:16, loss=0.465672037791697, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.06(1.03), norm=1.6011048131669903, lr=0.3110024726296672
2023-12-13 03:04:18   INFO  epoch: 9/24, acc_iter=36958, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:59, time_cost(all): 11:19:51/16:35:42, loss=0.465484993174609, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.88(1.03), norm=4.494255738290115, lr=0.3106605714035638
2023-12-13 03:05:13   INFO  epoch: 9/24, acc_iter=37008, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:28, time_cost(all): 11:20:46/16:37:02, loss=0.46529794855752, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=4.798784448711572, lr=0.3103186701774604
2023-12-13 03:06:08   INFO  epoch: 9/24, acc_iter=37058, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:28, time_cost(all): 11:21:41/17:23:12, loss=0.465110903940432, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.95(1.03), norm=2.2974551518157313, lr=0.30997676895135706
2023-12-13 03:07:04   INFO  epoch: 9/24, acc_iter=37108, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:47, time_cost(all): 11:22:37/16:38:54, loss=0.464923859323343, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=2.8633687325977406, lr=0.3096348677252536
2023-12-13 03:07:59   INFO  epoch: 9/24, acc_iter=37158, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:29, time_cost(all): 11:23:32/16:48:59, loss=0.464736814706255, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=1.4167641478938555, lr=0.3092929664991503
2023-12-13 03:08:54   INFO  epoch: 9/24, acc_iter=37208, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:15, time_cost(all): 11:24:27/16:34:30, loss=0.464549770089166, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=3.4074455500885663, lr=0.3089510652730469
2023-12-13 03:09:50   INFO  epoch: 9/24, acc_iter=37258, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:08, time_cost(all): 11:25:23/16:36:28, loss=0.464362725472078, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=0.9038732352415411, lr=0.3086091640469435
2023-12-13 03:10:45   INFO  epoch: 9/24, acc_iter=37308, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:28, time_cost(all): 11:26:18/16:48:41, loss=0.464175680854989, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=3.6234019558389345, lr=0.30826726282084016
2023-12-13 03:11:40   INFO  epoch: 9/24, acc_iter=37358, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:22:14, time_cost(all): 11:27:13/16:52:37, loss=0.4639886362379, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=0.5501515163852532, lr=0.3079253615947367
2023-12-13 03:12:36   INFO  epoch: 9/24, acc_iter=37408, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:54, time_cost(all): 11:28:09/17:35:22, loss=0.463801591620812, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=1.069136947309596, lr=0.3075834603686334
2023-12-13 03:13:31   INFO  epoch: 9/24, acc_iter=37458, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:51, time_cost(all): 11:29:04/17:17:46, loss=0.463614547003723, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=4.213387437516255, lr=0.30724155914252993
2023-12-13 03:14:26   INFO  epoch: 9/24, acc_iter=37508, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:33, time_cost(all): 11:29:59/16:21:32, loss=0.463427502386635, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.97(1.03), norm=4.508068999817444, lr=0.3068996579164266
2023-12-13 03:15:22   INFO  epoch: 9/24, acc_iter=37558, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:18:59, time_cost(all): 11:30:55/16:33:24, loss=0.463240457769546, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.99(1.03), norm=4.791919227309852, lr=0.3065577566903232
2023-12-13 03:16:17   INFO  epoch: 9/24, acc_iter=37608, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:59, time_cost(all): 11:31:50/17:26:22, loss=0.463053413152458, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=2.229065111796256, lr=0.3062158554642198
2023-12-13 03:17:12   INFO  epoch: 9/24, acc_iter=37658, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:13, time_cost(all): 11:32:45/17:15:44, loss=0.462866368535369, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.07(1.03), norm=1.839479094372896, lr=0.3058739542381165
2023-12-13 03:18:08   INFO  epoch: 9/24, acc_iter=37708, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:09, time_cost(all): 11:33:41/16:17:03, loss=0.462679323918281, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=4.466538604410564, lr=0.305532053012013
2023-12-13 03:19:03   INFO  epoch: 9/24, acc_iter=37758, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:37, time_cost(all): 11:34:36/16:23:50, loss=0.462492279301192, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.21(1.03), norm=2.825717524560641, lr=0.3051901517859097
2023-12-13 03:19:59   INFO  epoch: 9/24, acc_iter=37808, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:26, time_cost(all): 11:35:32/17:44:41, loss=0.462305234684104, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.7400050214697718, lr=0.30484825055980624
2023-12-13 03:20:54   INFO  epoch: 9/24, acc_iter=37858, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:56, time_cost(all): 11:36:27/16:55:42, loss=0.462118190067015, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.03(1.03), norm=2.3631723074975572, lr=0.3045063493337029
2023-12-13 03:21:49   INFO  epoch: 9/24, acc_iter=37908, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:20, time_cost(all): 11:37:22/17:17:15, loss=0.461931145449927, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.84(1.03), norm=2.179474836466609, lr=0.3041644481075995
2023-12-13 03:22:45   INFO  epoch: 9/24, acc_iter=37958, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:58, time_cost(all): 11:38:18/16:20:12, loss=0.461744100832838, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.0379860919839765, lr=0.3038225468814961
2023-12-13 03:23:40   INFO  epoch: 9/24, acc_iter=38008, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:10:48, time_cost(all): 11:39:13/16:13:26, loss=0.461557056215749, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=1.4698690189725871, lr=0.3034806456553928
2023-12-13 03:24:35   INFO  epoch: 9/24, acc_iter=38058, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:00, time_cost(all): 11:40:08/16:42:03, loss=0.461370011598661, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=2.335645042018057, lr=0.30313874442928934
2023-12-13 03:25:31   INFO  epoch: 9/24, acc_iter=38108, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:53, time_cost(all): 11:41:04/16:42:31, loss=0.461182966981572, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.1(1.03), norm=2.101171262644334, lr=0.302796843203186
2023-12-13 03:26:26   INFO  epoch: 9/24, acc_iter=38158, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:17, time_cost(all): 11:41:59/17:11:25, loss=0.460995922364484, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.2388946226631927, lr=0.30245494197708256
2023-12-13 03:27:21   INFO  epoch: 9/24, acc_iter=38208, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:27, time_cost(all): 11:42:54/17:10:00, loss=0.460808877747395, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=1.3472933284818502, lr=0.3021130407509792
2023-12-13 03:28:17   INFO  epoch: 9/24, acc_iter=38258, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:50, time_cost(all): 11:43:50/17:03:54, loss=0.460621833130307, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=3.874940924483053, lr=0.30177113952487583
2023-12-13 03:29:12   INFO  epoch: 9/24, acc_iter=38308, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:50, time_cost(all): 11:44:45/16:39:56, loss=0.460434788513218, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=0.9647617849728367, lr=0.30142923829877244
2023-12-13 03:30:07   INFO  epoch: 9/24, acc_iter=38358, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:47, time_cost(all): 11:45:40/17:03:01, loss=0.46024774389613, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=1.4237733105789112, lr=0.30108733707266905
2023-12-13 03:31:03   INFO  epoch: 9/24, acc_iter=38408, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:04:01, time_cost(all): 11:46:36/16:15:01, loss=0.460060699279041, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.21(1.03), norm=1.0938455321586917, lr=0.30074543584656566
2023-12-13 03:31:58   INFO  epoch: 9/24, acc_iter=38458, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:07, time_cost(all): 11:47:31/17:17:42, loss=0.459873654661953, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.01(1.03), norm=4.280774122305175, lr=0.3004035346204623
2023-12-13 03:32:53   INFO  epoch: 9/24, acc_iter=38508, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 11:48:26/16:51:10, loss=0.459686610044864, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=4.111998326710214, lr=0.30006163339435893
2023-12-13 03:33:49   INFO  epoch: 9/24, acc_iter=38558, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:10, time_cost(all): 11:49:22/17:16:51, loss=0.459499565427776, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.96(1.03), norm=4.116599478975448, lr=0.29971973216825554
2023-12-13 03:34:44   INFO  epoch: 9/24, acc_iter=38608, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 11:50:17/17:28:27, loss=0.459312520810687, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.547054805414418, lr=0.29937783094215215
2023-12-13 03:35:39   INFO  epoch: 10/24, acc_iter=38670, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:09:47, time_cost(all): 11:51:12/15:58:41, loss=0.459080585485497, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.15(1.03), norm=3.566447087308701, lr=0.2989538734217839
2023-12-13 03:36:35   INFO  epoch: 10/24, acc_iter=38720, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:43, time_cost(all): 11:52:08/16:26:46, loss=0.458893540868409, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=4.280445137917064, lr=0.2986119721956806
2023-12-13 03:37:30   INFO  epoch: 10/24, acc_iter=38770, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:08:04, time_cost(all): 11:53:03/15:54:42, loss=0.45870649625132, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.94(1.03), norm=4.256712683588731, lr=0.29827007096957714
2023-12-13 03:38:25   INFO  epoch: 10/24, acc_iter=38820, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:08:02, time_cost(all): 11:53:58/17:24:03, loss=0.458519451634232, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.1805451909623494, lr=0.2979281697434738
2023-12-13 03:39:21   INFO  epoch: 10/24, acc_iter=38870, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:09:37, time_cost(all): 11:54:54/16:07:06, loss=0.458332407017143, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.09(1.03), norm=4.511845429935316, lr=0.2975862685173704
2023-12-13 03:40:16   INFO  epoch: 10/24, acc_iter=38920, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:06:11, time_cost(all): 11:55:49/16:19:04, loss=0.458145362400055, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.0(1.03), norm=1.2693604379484154, lr=0.297244367291267
2023-12-13 03:41:12   INFO  epoch: 10/24, acc_iter=38970, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:07:00, time_cost(all): 11:56:45/16:37:38, loss=0.457958317782966, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.9(1.03), norm=4.780103561257477, lr=0.29690246606516363
2023-12-13 03:42:07   INFO  epoch: 10/24, acc_iter=39020, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:15, time_cost(all): 11:57:40/15:57:47, loss=0.457771273165878, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.92(1.03), norm=1.6010367943843105, lr=0.29656056483906024
2023-12-13 03:43:02   INFO  epoch: 10/24, acc_iter=39070, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:23, time_cost(all): 11:58:35/17:08:36, loss=0.457584228548789, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.09(1.03), norm=4.89009808929469, lr=0.2962186636129569
2023-12-13 03:43:58   INFO  epoch: 10/24, acc_iter=39120, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:01:45, time_cost(all): 11:59:31/17:05:59, loss=0.4573971839317, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=1.4269978958496063, lr=0.29587676238685345
2023-12-13 03:44:53   INFO  epoch: 10/24, acc_iter=39170, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:58:38, time_cost(all): 12:00:26/16:11:47, loss=0.457210139314612, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.1(1.03), norm=3.4452001837615627, lr=0.2955348611607501
2023-12-13 03:45:48   INFO  epoch: 10/24, acc_iter=39220, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:02:14, time_cost(all): 12:01:21/16:38:29, loss=0.457023094697523, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.06(1.03), norm=1.1823714858765828, lr=0.2951929599346467
2023-12-13 03:46:44   INFO  epoch: 10/24, acc_iter=39270, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:57:23, time_cost(all): 12:02:17/16:11:01, loss=0.456836050080435, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=2.4273599215522372, lr=0.29485105870854333
2023-12-13 03:47:39   INFO  epoch: 10/24, acc_iter=39320, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/1:01:15, time_cost(all): 12:03:12/17:07:12, loss=0.456649005463346, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.02(1.03), norm=1.4749728209012514, lr=0.29450915748243994
2023-12-13 03:48:34   INFO  epoch: 10/24, acc_iter=39370, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:28, time_cost(all): 12:04:07/16:28:23, loss=0.456461960846258, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=3.4640309881719915, lr=0.29416725625633655
2023-12-13 03:49:30   INFO  epoch: 10/24, acc_iter=39420, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:53:42, time_cost(all): 12:05:03/16:39:25, loss=0.456274916229169, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.86(1.03), norm=1.875934263102203, lr=0.2938253550302332
2023-12-13 03:50:25   INFO  epoch: 10/24, acc_iter=39470, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:57:07, time_cost(all): 12:05:58/16:39:59, loss=0.456087871612081, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.93(1.03), norm=2.553787456687677, lr=0.29348345380412977
2023-12-13 03:51:20   INFO  epoch: 10/24, acc_iter=39520, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:56:59, time_cost(all): 12:06:53/16:51:21, loss=0.455900826994992, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.21(1.03), norm=4.9564408958179165, lr=0.29314155257802643
2023-12-13 03:52:16   INFO  epoch: 10/24, acc_iter=39570, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:48, time_cost(all): 12:07:49/15:43:06, loss=0.455713782377904, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.23(1.03), norm=4.827496923332808, lr=0.29279965135192304
2023-12-13 03:53:11   INFO  epoch: 10/24, acc_iter=39620, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:57, time_cost(all): 12:08:44/16:50:43, loss=0.455526737760815, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.92(1.03), norm=1.2508987752171297, lr=0.29245775012581965
2023-12-13 03:54:06   INFO  epoch: 10/24, acc_iter=39670, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:14, time_cost(all): 12:09:39/16:26:14, loss=0.455339693143727, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.18(1.03), norm=2.4411890350607965, lr=0.2921158488997163
2023-12-13 03:55:02   INFO  epoch: 10/24, acc_iter=39720, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:49:56, time_cost(all): 12:10:35/16:10:12, loss=0.455152648526638, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=0.7773742245300359, lr=0.29177394767361287
2023-12-13 03:55:57   INFO  epoch: 10/24, acc_iter=39770, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:00, time_cost(all): 12:11:30/17:01:55, loss=0.45496560390955, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=2.0312324240976283, lr=0.29143204644750953
2023-12-13 03:56:52   INFO  epoch: 10/24, acc_iter=39820, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:48:36, time_cost(all): 12:12:25/15:57:25, loss=0.454778559292461, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.86(1.03), norm=3.780379989977949, lr=0.2910901452214061
2023-12-13 03:57:48   INFO  epoch: 10/24, acc_iter=39870, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:50:19, time_cost(all): 12:13:21/17:03:44, loss=0.454591514675372, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=0.7069094335984034, lr=0.29074824399530275
2023-12-13 03:58:43   INFO  epoch: 10/24, acc_iter=39920, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:48:27, time_cost(all): 12:14:16/15:53:00, loss=0.454404470058284, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=4.806839077548172, lr=0.29040634276919935
2023-12-13 03:59:38   INFO  epoch: 10/24, acc_iter=39970, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:45:34, time_cost(all): 12:15:11/16:55:46, loss=0.454217425441195, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.91(1.03), norm=3.180339989777533, lr=0.29006444154309596
2023-12-13 04:00:34   INFO  epoch: 10/24, acc_iter=40020, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:44:30, time_cost(all): 12:16:07/16:28:38, loss=0.454030380824107, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=1.2958439886380608, lr=0.2897225403169926
2023-12-13 04:01:29   INFO  epoch: 10/24, acc_iter=40070, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:44:44, time_cost(all): 12:17:02/15:29:09, loss=0.453843336207018, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=3.2162601040336805, lr=0.2893806390908892
2023-12-13 04:02:25   INFO  epoch: 10/24, acc_iter=40120, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:39, time_cost(all): 12:17:58/15:40:22, loss=0.45365629158993, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.102770403799651, lr=0.28903873786478584
2023-12-13 04:03:20   INFO  epoch: 10/24, acc_iter=40170, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:44, time_cost(all): 12:18:53/16:09:03, loss=0.453469246972841, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=0.869375699445327, lr=0.2886968366386824
2023-12-13 04:04:15   INFO  epoch: 10/24, acc_iter=40220, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:44, time_cost(all): 12:19:48/15:36:52, loss=0.453282202355753, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.85(1.03), norm=0.502259016784199, lr=0.28835493541257906
2023-12-13 04:05:11   INFO  epoch: 10/24, acc_iter=40270, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:25, time_cost(all): 12:20:44/16:01:46, loss=0.453095157738664, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=0.9142360466276105, lr=0.28801303418647567
2023-12-13 04:06:06   INFO  epoch: 10/24, acc_iter=40320, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:48, time_cost(all): 12:21:39/16:09:17, loss=0.452908113121576, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=4.264026208110849, lr=0.2876711329603723
2023-12-13 04:07:01   INFO  epoch: 10/24, acc_iter=40370, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:04, time_cost(all): 12:22:34/16:48:19, loss=0.452721068504487, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.576477685911575, lr=0.28732923173426894
2023-12-13 04:07:57   INFO  epoch: 10/24, acc_iter=40420, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:50, time_cost(all): 12:23:30/16:34:52, loss=0.452534023887399, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=1.1668180962509231, lr=0.2869873305081655
2023-12-13 04:08:52   INFO  epoch: 10/24, acc_iter=40470, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:21, time_cost(all): 12:24:25/15:41:37, loss=0.45234697927031, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=4.562605451628756, lr=0.28664542928206216
2023-12-13 04:09:47   INFO  epoch: 10/24, acc_iter=40520, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:37:53, time_cost(all): 12:25:20/15:38:41, loss=0.452159934653222, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=3.2855812509544133, lr=0.2863035280559587
2023-12-13 04:10:43   INFO  epoch: 10/24, acc_iter=40570, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:22, time_cost(all): 12:26:16/16:50:33, loss=0.451972890036133, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.89(1.03), norm=0.8455898671439717, lr=0.2859616268298554
2023-12-13 04:11:38   INFO  epoch: 10/24, acc_iter=40620, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:54, time_cost(all): 12:27:11/16:10:31, loss=0.451785845419044, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=4.498857465727227, lr=0.285619725603752
2023-12-13 04:12:33   INFO  epoch: 10/24, acc_iter=40670, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:03, time_cost(all): 12:28:06/15:17:26, loss=0.451598800801956, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=4.808644620677236, lr=0.2852778243776486
2023-12-13 04:13:29   INFO  epoch: 10/24, acc_iter=40720, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:58, time_cost(all): 12:29:02/15:42:06, loss=0.451411756184867, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.87(1.03), norm=3.698529103545397, lr=0.28493592315154526
2023-12-13 04:14:24   INFO  epoch: 10/24, acc_iter=40770, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:31:09, time_cost(all): 12:29:57/15:16:02, loss=0.451224711567779, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=4.239242245181709, lr=0.2845940219254418
2023-12-13 04:15:19   INFO  epoch: 10/24, acc_iter=40820, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:52, time_cost(all): 12:30:52/15:46:31, loss=0.45103766695069, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.22(1.03), norm=4.698077466878625, lr=0.2842521206993385
2023-12-13 04:16:15   INFO  epoch: 10/24, acc_iter=40870, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:06, time_cost(all): 12:31:48/15:21:12, loss=0.450850622333602, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.91(1.03), norm=1.2128207581588952, lr=0.2839102194732351
2023-12-13 04:17:10   INFO  epoch: 10/24, acc_iter=40920, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:45, time_cost(all): 12:32:43/16:07:06, loss=0.450663577716513, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.09(1.03), norm=1.0016629679310836, lr=0.2835683182471317
2023-12-13 04:18:05   INFO  epoch: 10/24, acc_iter=40970, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:48, time_cost(all): 12:33:38/16:32:39, loss=0.450476533099425, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=4.162884194210219, lr=0.2832264170210283
2023-12-13 04:19:01   INFO  epoch: 10/24, acc_iter=41020, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:25:52, time_cost(all): 12:34:34/15:34:16, loss=0.450289488482336, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=0.9640433128504754, lr=0.2828845157949249
2023-12-13 04:19:56   INFO  epoch: 10/24, acc_iter=41070, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:34, time_cost(all): 12:35:29/16:42:10, loss=0.450102443865248, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=4.68868063259371, lr=0.2825426145688215
2023-12-13 04:20:51   INFO  epoch: 10/24, acc_iter=41120, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:23:52, time_cost(all): 12:36:24/15:39:15, loss=0.449915399248159, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=1.270994733621897, lr=0.2822007133427181
2023-12-13 04:21:47   INFO  epoch: 10/24, acc_iter=41170, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:02, time_cost(all): 12:37:20/16:08:59, loss=0.449728354631071, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.91(1.03), norm=0.9763655266161704, lr=0.2818588121166148
2023-12-13 04:22:42   INFO  epoch: 10/24, acc_iter=41220, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:22:46, time_cost(all): 12:38:15/15:48:26, loss=0.449541310013982, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.536724593096819, lr=0.2815169108905114
2023-12-13 04:23:38   INFO  epoch: 10/24, acc_iter=41270, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:54, time_cost(all): 12:39:11/16:38:09, loss=0.449354265396893, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=4.82096901667908, lr=0.281175009664408
2023-12-13 04:24:33   INFO  epoch: 10/24, acc_iter=41320, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:18, time_cost(all): 12:40:06/15:13:34, loss=0.449167220779805, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=3.872066923534226, lr=0.2808331084383046
2023-12-13 04:25:28   INFO  epoch: 10/24, acc_iter=41370, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:43, time_cost(all): 12:41:01/16:06:49, loss=0.448980176162716, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=2.440901696321677, lr=0.2804912072122012
2023-12-13 04:26:24   INFO  epoch: 10/24, acc_iter=41420, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:00, time_cost(all): 12:41:57/15:42:30, loss=0.448793131545628, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=3.8585006006958054, lr=0.28014930598609783
2023-12-13 04:27:19   INFO  epoch: 10/24, acc_iter=41470, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:00, time_cost(all): 12:42:52/16:29:31, loss=0.448606086928539, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=4.223982299391134, lr=0.27980740475999444
2023-12-13 04:28:14   INFO  epoch: 10/24, acc_iter=41520, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:55, time_cost(all): 12:43:47/15:30:53, loss=0.448419042311451, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=3.882301147974055, lr=0.2794655035338911
2023-12-13 04:29:10   INFO  epoch: 10/24, acc_iter=41570, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:05, time_cost(all): 12:44:43/15:31:36, loss=0.448231997694362, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=2.5026705080747416, lr=0.2791236023077877
2023-12-13 04:30:05   INFO  epoch: 10/24, acc_iter=41620, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:12, time_cost(all): 12:45:38/16:26:00, loss=0.448044953077274, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=3.230342602862555, lr=0.2787817010816843
2023-12-13 04:31:00   INFO  epoch: 10/24, acc_iter=41670, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:54, time_cost(all): 12:46:33/15:43:28, loss=0.447857908460185, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.15(1.03), norm=4.007599209497611, lr=0.27843979985558087
2023-12-13 04:31:56   INFO  epoch: 10/24, acc_iter=41720, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:08, time_cost(all): 12:47:29/15:14:54, loss=0.447670863843097, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=4.843064028039291, lr=0.27809789862947754
2023-12-13 04:32:51   INFO  epoch: 10/24, acc_iter=41770, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:45, time_cost(all): 12:48:24/15:27:01, loss=0.447483819226008, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=4.684273371738811, lr=0.27775599740337414
2023-12-13 04:33:46   INFO  epoch: 10/24, acc_iter=41820, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:55, time_cost(all): 12:49:19/15:10:00, loss=0.44729677460892, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.7692117773433695, lr=0.27741409617727075
2023-12-13 04:34:42   INFO  epoch: 10/24, acc_iter=41870, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:29, time_cost(all): 12:50:15/16:21:55, loss=0.447109729991831, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=0.7908399276334726, lr=0.2770721949511674
2023-12-13 04:35:37   INFO  epoch: 10/24, acc_iter=41920, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:09:58, time_cost(all): 12:51:10/14:57:29, loss=0.446922685374743, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=3.6772044807506346, lr=0.27673029372506397
2023-12-13 04:36:32   INFO  epoch: 10/24, acc_iter=41970, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:15, time_cost(all): 12:52:05/15:24:34, loss=0.446735640757654, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=2.059769615599075, lr=0.27638839249896063
2023-12-13 04:37:28   INFO  epoch: 10/24, acc_iter=42020, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:37, time_cost(all): 12:53:01/14:53:22, loss=0.446548596140565, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=3.867391320213023, lr=0.27604649127285724
2023-12-13 04:38:23   INFO  epoch: 10/24, acc_iter=42070, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:18, time_cost(all): 12:53:56/15:40:04, loss=0.446361551523477, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.98(1.03), norm=1.5151132044091455, lr=0.27570459004675385
2023-12-13 04:39:18   INFO  epoch: 10/24, acc_iter=42120, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:38, time_cost(all): 12:54:51/14:53:17, loss=0.446174506906388, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.07(1.03), norm=4.289849151390182, lr=0.27536268882065046
2023-12-13 04:40:14   INFO  epoch: 10/24, acc_iter=42170, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:42, time_cost(all): 12:55:47/15:12:05, loss=0.4459874622893, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=0.6658524534140549, lr=0.27502078759454707
2023-12-13 04:41:09   INFO  epoch: 10/24, acc_iter=42220, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:46, time_cost(all): 12:56:42/15:46:03, loss=0.445800417672211, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=3.3464496724785144, lr=0.27467888636844373
2023-12-13 04:42:04   INFO  epoch: 10/24, acc_iter=42270, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:57, time_cost(all): 12:57:37/14:47:06, loss=0.445613373055123, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=4.1134629936519795, lr=0.2743369851423403
2023-12-13 04:43:00   INFO  epoch: 10/24, acc_iter=42320, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:51, time_cost(all): 12:58:33/14:56:28, loss=0.445426328438034, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.1(1.03), norm=2.737770795664099, lr=0.27399508391623695
2023-12-13 04:43:55   INFO  epoch: 10/24, acc_iter=42370, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:07, time_cost(all): 12:59:28/15:06:16, loss=0.445239283820946, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.94(1.03), norm=4.101912232778668, lr=0.2736531826901335
2023-12-13 04:44:50   INFO  epoch: 10/24, acc_iter=42420, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 13:00:23/15:46:56, loss=0.445052239203857, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.05(1.03), norm=2.3463074087606373, lr=0.27331128146403016
2023-12-13 04:45:46   INFO  epoch: 10/24, acc_iter=42470, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 13:01:19/14:59:33, loss=0.444865194586769, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.88(1.03), norm=4.087137006830012, lr=0.2729693802379268
2023-12-13 04:46:41   INFO  epoch: 11/24, acc_iter=42532, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:12:30, time_cost(all): 13:02:14/15:14:07, loss=0.444633259261579, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.21(1.03), norm=1.4636204345655754, lr=0.2725454227175586
2023-12-13 04:47:37   INFO  epoch: 11/24, acc_iter=42582, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:12:51, time_cost(all): 13:03:10/14:50:32, loss=0.44444621464449, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.04(1.03), norm=4.352167795703195, lr=0.2722035214914552
2023-12-13 04:48:32   INFO  epoch: 11/24, acc_iter=42632, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:06:28, time_cost(all): 13:04:05/15:58:03, loss=0.444259170027402, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.18(1.03), norm=3.905331286147514, lr=0.2718616202653519
2023-12-13 04:49:27   INFO  epoch: 11/24, acc_iter=42682, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:05:12, time_cost(all): 13:05:00/15:34:05, loss=0.444072125410313, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.12(1.03), norm=1.0763062278274118, lr=0.27151971903924843
2023-12-13 04:50:23   INFO  epoch: 11/24, acc_iter=42732, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:08:40, time_cost(all): 13:05:56/16:04:27, loss=0.443885080793225, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=1.8562091631280837, lr=0.27117781781314504
2023-12-13 04:51:18   INFO  epoch: 11/24, acc_iter=42782, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:53, time_cost(all): 13:06:51/15:51:48, loss=0.443698036176136, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.744374994594594, lr=0.27083591658704165
2023-12-13 04:52:13   INFO  epoch: 11/24, acc_iter=42832, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:03:38, time_cost(all): 13:07:46/15:17:53, loss=0.443510991559048, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=4.387427078384528, lr=0.2704940153609383
2023-12-13 04:53:09   INFO  epoch: 11/24, acc_iter=42882, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:25, time_cost(all): 13:08:42/15:03:18, loss=0.443323946941959, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.16(1.03), norm=4.075162635732741, lr=0.27015211413483486
2023-12-13 04:54:04   INFO  epoch: 11/24, acc_iter=42932, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:02:54, time_cost(all): 13:09:37/15:58:59, loss=0.443136902324871, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=1.4553759548756064, lr=0.26981021290873153
2023-12-13 04:54:59   INFO  epoch: 11/24, acc_iter=42982, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:00:04, time_cost(all): 13:10:32/15:33:43, loss=0.442949857707782, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=4.9893415772987435, lr=0.26946831168262814
2023-12-13 04:55:55   INFO  epoch: 11/24, acc_iter=43032, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:59:23, time_cost(all): 13:11:28/14:36:51, loss=0.442762813090693, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.1956645593557484, lr=0.26912641045652475
2023-12-13 04:56:50   INFO  epoch: 11/24, acc_iter=43082, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:00:02, time_cost(all): 13:12:23/14:34:29, loss=0.442575768473605, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.88(1.03), norm=3.932419831489117, lr=0.2687845092304214
2023-12-13 04:57:45   INFO  epoch: 11/24, acc_iter=43132, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:57:27, time_cost(all): 13:13:18/14:56:35, loss=0.442388723856516, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=2.723571376796041, lr=0.26844260800431796
2023-12-13 04:58:41   INFO  epoch: 11/24, acc_iter=43182, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:58:23, time_cost(all): 13:14:14/15:17:43, loss=0.442201679239428, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=0.5116122741041167, lr=0.2681007067782146
2023-12-13 04:59:36   INFO  epoch: 11/24, acc_iter=43232, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:55, time_cost(all): 13:15:09/15:29:55, loss=0.442014634622339, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=3.8512299317971377, lr=0.26775880555211123
2023-12-13 05:00:31   INFO  epoch: 11/24, acc_iter=43282, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:58:42, time_cost(all): 13:16:04/15:27:46, loss=0.441827590005251, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=2.3065351827595153, lr=0.26741690432600784
2023-12-13 05:01:27   INFO  epoch: 11/24, acc_iter=43332, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:54:08, time_cost(all): 13:17:00/15:36:32, loss=0.441640545388162, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=3.026709996530038, lr=0.2670750030999045
2023-12-13 05:02:22   INFO  epoch: 11/24, acc_iter=43382, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:57:01, time_cost(all): 13:17:55/15:02:38, loss=0.441453500771074, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=1.3944455487209855, lr=0.26673310187380106
2023-12-13 05:03:17   INFO  epoch: 11/24, acc_iter=43432, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:52:24, time_cost(all): 13:18:50/15:41:37, loss=0.441266456153985, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=0.9001900835132377, lr=0.2663912006476977
2023-12-13 05:04:13   INFO  epoch: 11/24, acc_iter=43482, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:55:21, time_cost(all): 13:19:46/14:55:10, loss=0.441079411536897, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.23(1.03), norm=4.925249971392399, lr=0.2660492994215943
2023-12-13 05:05:08   INFO  epoch: 11/24, acc_iter=43532, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:27, time_cost(all): 13:20:41/15:30:31, loss=0.440892366919808, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=1.5271553151193527, lr=0.2657073981954909
2023-12-13 05:06:03   INFO  epoch: 11/24, acc_iter=43582, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:45, time_cost(all): 13:21:36/14:42:36, loss=0.44070532230272, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=4.558209614261512, lr=0.2653654969693875
2023-12-13 05:06:59   INFO  epoch: 11/24, acc_iter=43632, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:47:36, time_cost(all): 13:22:32/15:26:38, loss=0.440518277685631, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.01(1.03), norm=3.7427981368982493, lr=0.26502359574328416
2023-12-13 05:07:54   INFO  epoch: 11/24, acc_iter=43682, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:37, time_cost(all): 13:23:27/15:24:36, loss=0.440331233068542, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.6239544158340875, lr=0.26468169451718077
2023-12-13 05:08:50   INFO  epoch: 11/24, acc_iter=43732, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:50:35, time_cost(all): 13:24:23/15:03:09, loss=0.440144188451454, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=0.5213167308607765, lr=0.2643397932910774
2023-12-13 05:09:45   INFO  epoch: 11/24, acc_iter=43782, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:48, time_cost(all): 13:25:18/15:15:39, loss=0.439957143834365, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.83(1.03), norm=3.1111005652552564, lr=0.263997892064974
2023-12-13 05:10:40   INFO  epoch: 11/24, acc_iter=43832, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:19, time_cost(all): 13:26:13/15:36:10, loss=0.439770099217277, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.12(1.03), norm=0.6823508116161958, lr=0.2636559908388706
2023-12-13 05:11:36   INFO  epoch: 11/24, acc_iter=43882, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:43:10, time_cost(all): 13:27:09/15:18:18, loss=0.439583054600188, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=0.6577613306585057, lr=0.26331408961276725
2023-12-13 05:12:31   INFO  epoch: 11/24, acc_iter=43932, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:43:05, time_cost(all): 13:28:04/14:19:37, loss=0.4393960099831, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=4.838086227862776, lr=0.26297218838666386
2023-12-13 05:13:26   INFO  epoch: 11/24, acc_iter=43982, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:37, time_cost(all): 13:28:59/15:25:30, loss=0.439208965366011, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.1(1.03), norm=4.095496044087838, lr=0.26263028716056047
2023-12-13 05:14:22   INFO  epoch: 11/24, acc_iter=44032, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:10, time_cost(all): 13:29:55/14:15:06, loss=0.439021920748923, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.02(1.03), norm=2.130045977149301, lr=0.2622883859344571
2023-12-13 05:15:17   INFO  epoch: 11/24, acc_iter=44082, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:13, time_cost(all): 13:30:50/15:32:25, loss=0.438834876131834, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.18(1.03), norm=4.082934640240365, lr=0.2619464847083537
2023-12-13 05:16:12   INFO  epoch: 11/24, acc_iter=44132, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:15, time_cost(all): 13:31:45/15:16:18, loss=0.438647831514746, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=4.7877422345570855, lr=0.26160458348225035
2023-12-13 05:17:08   INFO  epoch: 11/24, acc_iter=44182, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:06, time_cost(all): 13:32:41/14:34:12, loss=0.438460786897657, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.13(1.03), norm=1.006251727161489, lr=0.26126268225614696
2023-12-13 05:18:03   INFO  epoch: 11/24, acc_iter=44232, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:51, time_cost(all): 13:33:36/14:31:27, loss=0.438273742280569, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=0.5685472052616607, lr=0.2609207810300435
2023-12-13 05:18:58   INFO  epoch: 11/24, acc_iter=44282, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:14, time_cost(all): 13:34:31/15:05:46, loss=0.43808669766348, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=3.0488295910509082, lr=0.2605788798039401
2023-12-13 05:19:54   INFO  epoch: 11/24, acc_iter=44332, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:43, time_cost(all): 13:35:27/14:54:05, loss=0.437899653046392, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.13(1.03), norm=1.4374280059802993, lr=0.2602369785778368
2023-12-13 05:20:49   INFO  epoch: 11/24, acc_iter=44382, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:46, time_cost(all): 13:36:22/15:32:28, loss=0.437712608429303, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.84(1.03), norm=4.395099998931026, lr=0.25989507735173334
2023-12-13 05:21:44   INFO  epoch: 11/24, acc_iter=44432, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:00, time_cost(all): 13:37:17/15:19:03, loss=0.437525563812214, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.84(1.03), norm=0.6891699662269444, lr=0.25955317612563
2023-12-13 05:22:40   INFO  epoch: 11/24, acc_iter=44482, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:33:42, time_cost(all): 13:38:13/14:54:02, loss=0.437338519195126, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.12(1.03), norm=4.619242075104153, lr=0.2592112748995266
2023-12-13 05:23:35   INFO  epoch: 11/24, acc_iter=44532, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:19, time_cost(all): 13:39:08/14:39:13, loss=0.437151474578037, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.01(1.03), norm=1.1984153518563867, lr=0.2588693736734232
2023-12-13 05:24:30   INFO  epoch: 11/24, acc_iter=44582, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:08, time_cost(all): 13:40:03/14:53:16, loss=0.436964429960949, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.84(1.03), norm=1.4718628837565504, lr=0.2585274724473199
2023-12-13 05:25:26   INFO  epoch: 11/24, acc_iter=44632, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:42, time_cost(all): 13:40:59/14:28:39, loss=0.43677738534386, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=3.207303584874096, lr=0.25818557122121644
2023-12-13 05:26:21   INFO  epoch: 11/24, acc_iter=44682, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:43, time_cost(all): 13:41:54/14:11:49, loss=0.436590340726772, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.23(1.03), norm=0.686755690442701, lr=0.2578436699951131
2023-12-13 05:27:16   INFO  epoch: 11/24, acc_iter=44732, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:23, time_cost(all): 13:42:49/15:24:14, loss=0.436403296109683, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.06(1.03), norm=3.8477024291562953, lr=0.2575017687690097
2023-12-13 05:28:12   INFO  epoch: 11/24, acc_iter=44782, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:30:05, time_cost(all): 13:43:45/14:44:20, loss=0.436216251492595, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.23(1.03), norm=2.999334775809502, lr=0.2571598675429063
2023-12-13 05:29:07   INFO  epoch: 11/24, acc_iter=44832, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:22, time_cost(all): 13:44:40/14:38:26, loss=0.436029206875506, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.273727078954226, lr=0.256817966316803
2023-12-13 05:30:03   INFO  epoch: 11/24, acc_iter=44882, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:08, time_cost(all): 13:45:36/15:10:43, loss=0.435842162258418, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.22(1.03), norm=2.634431627728382, lr=0.25647606509069953
2023-12-13 05:30:58   INFO  epoch: 11/24, acc_iter=44932, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:03, time_cost(all): 13:46:31/14:09:20, loss=0.435655117641329, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=4.989427818196937, lr=0.2561341638645962
2023-12-13 05:31:53   INFO  epoch: 11/24, acc_iter=44982, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:26, time_cost(all): 13:47:26/15:12:53, loss=0.435468073024241, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.18(1.03), norm=4.441085736573275, lr=0.2557922626384928
2023-12-13 05:32:49   INFO  epoch: 11/24, acc_iter=45032, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:44, time_cost(all): 13:48:22/14:29:26, loss=0.435281028407152, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=1.5484837874642625, lr=0.2554503614123894
2023-12-13 05:33:44   INFO  epoch: 11/24, acc_iter=45082, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:20, time_cost(all): 13:49:17/14:03:21, loss=0.435093983790063, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.83(1.03), norm=0.6231563533223912, lr=0.25510846018628597
2023-12-13 05:34:39   INFO  epoch: 11/24, acc_iter=45132, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:10, time_cost(all): 13:50:12/15:10:00, loss=0.434906939172975, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=4.045576774697276, lr=0.25476655896018263
2023-12-13 05:35:35   INFO  epoch: 11/24, acc_iter=45182, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:31, time_cost(all): 13:51:08/14:44:39, loss=0.434719894555886, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=4.194837838617088, lr=0.25442465773407924
2023-12-13 05:36:30   INFO  epoch: 11/24, acc_iter=45232, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:58, time_cost(all): 13:52:03/15:18:20, loss=0.434532849938798, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=3.9920451735391076, lr=0.25408275650797585
2023-12-13 05:37:25   INFO  epoch: 11/24, acc_iter=45282, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:22, time_cost(all): 13:52:58/14:32:36, loss=0.434345805321709, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.2(1.03), norm=4.164376058647182, lr=0.2537408552818725
2023-12-13 05:38:21   INFO  epoch: 11/24, acc_iter=45332, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:48, time_cost(all): 13:53:54/14:54:47, loss=0.434158760704621, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=4.495331394005791, lr=0.25339895405576907
2023-12-13 05:39:16   INFO  epoch: 11/24, acc_iter=45382, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:03, time_cost(all): 13:54:49/14:52:20, loss=0.433971716087532, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=2.3457182088878588, lr=0.25305705282966573
2023-12-13 05:40:11   INFO  epoch: 11/24, acc_iter=45432, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:49, time_cost(all): 13:55:44/14:55:42, loss=0.433784671470444, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=2.1421956167640084, lr=0.25271515160356234
2023-12-13 05:41:07   INFO  epoch: 11/24, acc_iter=45482, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:14, time_cost(all): 13:56:40/14:48:16, loss=0.433597626853355, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=2.535168640511302, lr=0.25237325037745895
2023-12-13 05:42:02   INFO  epoch: 11/24, acc_iter=45532, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:33, time_cost(all): 13:57:35/13:54:47, loss=0.433410582236267, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=2.0131446115834057, lr=0.2520313491513556
2023-12-13 05:42:57   INFO  epoch: 11/24, acc_iter=45582, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:09, time_cost(all): 13:58:30/14:30:08, loss=0.433223537619178, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.02(1.03), norm=2.441827422500827, lr=0.25168944792525216
2023-12-13 05:43:53   INFO  epoch: 11/24, acc_iter=45632, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:51, time_cost(all): 13:59:26/15:01:59, loss=0.43303649300209, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.99(1.03), norm=3.143299811051166, lr=0.2513475466991488
2023-12-13 05:44:48   INFO  epoch: 11/24, acc_iter=45682, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:43, time_cost(all): 14:00:21/14:59:23, loss=0.432849448385001, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=1.6476065229166021, lr=0.25100564547304544
2023-12-13 05:45:43   INFO  epoch: 11/24, acc_iter=45732, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:18, time_cost(all): 14:01:16/14:30:12, loss=0.432662403767913, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.16(1.03), norm=1.8698060964088827, lr=0.25066374424694204
2023-12-13 05:46:39   INFO  epoch: 11/24, acc_iter=45782, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:51, time_cost(all): 14:02:12/14:29:41, loss=0.432475359150824, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.23(1.03), norm=4.216030679320291, lr=0.2503218430208387
2023-12-13 05:47:34   INFO  epoch: 11/24, acc_iter=45832, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:01, time_cost(all): 14:03:07/14:47:47, loss=0.432288314533736, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=2.363278486739901, lr=0.24997994179473526
2023-12-13 05:48:29   INFO  epoch: 11/24, acc_iter=45882, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:10, time_cost(all): 14:04:02/13:48:46, loss=0.432101269916647, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=1.9412864438381872, lr=0.24963804056863187
2023-12-13 05:49:25   INFO  epoch: 11/24, acc_iter=45932, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:26, time_cost(all): 14:04:58/15:06:54, loss=0.431914225299558, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.278934990410658, lr=0.24929613934252848
2023-12-13 05:50:20   INFO  epoch: 11/24, acc_iter=45982, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:27, time_cost(all): 14:05:53/14:50:59, loss=0.43172718068247, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.15(1.03), norm=0.810321743233102, lr=0.2489542381164251
2023-12-13 05:51:16   INFO  epoch: 11/24, acc_iter=46032, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:56, time_cost(all): 14:06:49/13:41:10, loss=0.431540136065381, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=2.9203151773003504, lr=0.2486123368903217
2023-12-13 05:52:11   INFO  epoch: 11/24, acc_iter=46082, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:05:01, time_cost(all): 14:07:44/14:48:32, loss=0.431353091448293, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.15(1.03), norm=4.066172347834127, lr=0.24827043566421836
2023-12-13 05:53:06   INFO  epoch: 11/24, acc_iter=46132, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:50, time_cost(all): 14:08:39/14:48:17, loss=0.431166046831204, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=1.9723776309157337, lr=0.24792853443811497
2023-12-13 05:54:02   INFO  epoch: 11/24, acc_iter=46182, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:50, time_cost(all): 14:09:35/14:28:38, loss=0.430979002214116, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.15(1.03), norm=4.051834775831789, lr=0.24758663321201158
2023-12-13 05:54:57   INFO  epoch: 11/24, acc_iter=46232, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 14:10:30/14:41:30, loss=0.430791957597027, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.3410471081436595, lr=0.24724473198590818
2023-12-13 05:55:52   INFO  epoch: 11/24, acc_iter=46282, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:05, time_cost(all): 14:11:25/14:01:41, loss=0.430604912979939, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.98(1.03), norm=3.8582602573464437, lr=0.2469028307598048
2023-12-13 05:56:48   INFO  epoch: 11/24, acc_iter=46332, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 14:12:21/14:48:45, loss=0.43041786836285, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.2(1.03), norm=1.7988740196986803, lr=0.24656092953370146
2023-12-13 05:57:43   INFO  epoch: 12/24, acc_iter=46394, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:09:04, time_cost(all): 14:13:16/14:33:11, loss=0.43018593303766, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=3.303164674256133, lr=0.24613697201333323
2023-12-13 05:58:38   INFO  epoch: 12/24, acc_iter=46444, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:42, time_cost(all): 14:14:11/13:34:53, loss=0.429998888420572, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=2.1227263887920915, lr=0.24579507078722984
2023-12-13 05:59:34   INFO  epoch: 12/24, acc_iter=46494, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:28, time_cost(all): 14:15:07/14:36:33, loss=0.429811843803483, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.93(1.03), norm=2.9105419144539386, lr=0.24545316956112645
2023-12-13 06:00:29   INFO  epoch: 12/24, acc_iter=46544, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:05:56, time_cost(all): 14:16:02/14:47:15, loss=0.429624799186395, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.04(1.03), norm=2.5318900317893513, lr=0.24511126833502306
2023-12-13 06:01:24   INFO  epoch: 12/24, acc_iter=46594, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:04:34, time_cost(all): 14:16:57/14:36:00, loss=0.429437754569306, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.08(1.03), norm=2.312844790881318, lr=0.24476936710891972
2023-12-13 06:02:20   INFO  epoch: 12/24, acc_iter=46644, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:07:50, time_cost(all): 14:17:53/14:54:24, loss=0.429250709952218, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.97(1.03), norm=2.1765695827547265, lr=0.24442746588281633
2023-12-13 06:03:15   INFO  epoch: 12/24, acc_iter=46694, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:05:39, time_cost(all): 14:18:48/13:34:04, loss=0.429063665335129, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=2.609888001249121, lr=0.24408556465671294
2023-12-13 06:04:10   INFO  epoch: 12/24, acc_iter=46744, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:01:31, time_cost(all): 14:19:43/14:32:30, loss=0.428876620718041, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=3.5790960500744635, lr=0.24374366343060955
2023-12-13 06:05:06   INFO  epoch: 12/24, acc_iter=46794, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:03:52, time_cost(all): 14:20:39/14:24:46, loss=0.428689576100952, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=0.7465644825438211, lr=0.24340176220450616
2023-12-13 06:06:01   INFO  epoch: 12/24, acc_iter=46844, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:01:37, time_cost(all): 14:21:34/14:12:39, loss=0.428502531483864, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=1.6780388655009695, lr=0.24305986097840282
2023-12-13 06:06:56   INFO  epoch: 12/24, acc_iter=46894, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/0:59:11, time_cost(all): 14:22:29/13:58:24, loss=0.428315486866775, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.18(1.03), norm=2.8360096935893875, lr=0.24271795975229943
2023-12-13 06:07:52   INFO  epoch: 12/24, acc_iter=46944, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:12, time_cost(all): 14:23:25/14:45:23, loss=0.428128442249686, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=1.8757514624936198, lr=0.24237605852619604
2023-12-13 06:08:47   INFO  epoch: 12/24, acc_iter=46994, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:56:39, time_cost(all): 14:24:20/13:36:11, loss=0.427941397632598, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.23(1.03), norm=1.0435844050287337, lr=0.2420341573000926
2023-12-13 06:09:42   INFO  epoch: 12/24, acc_iter=47044, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:58:43, time_cost(all): 14:25:15/13:31:55, loss=0.427754353015509, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.0(1.03), norm=2.5408945487663037, lr=0.2416922560739892
2023-12-13 06:10:38   INFO  epoch: 12/24, acc_iter=47094, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:55:11, time_cost(all): 14:26:11/14:36:16, loss=0.427567308398421, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.04(1.03), norm=2.371039764082273, lr=0.24135035484788586
2023-12-13 06:11:33   INFO  epoch: 12/24, acc_iter=47144, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:57:12, time_cost(all): 14:27:06/13:51:48, loss=0.427380263781332, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.1594292385668654, lr=0.24100845362178247
2023-12-13 06:12:29   INFO  epoch: 12/24, acc_iter=47194, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:39, time_cost(all): 14:28:02/14:25:08, loss=0.427193219164244, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.01(1.03), norm=4.299004550985765, lr=0.24066655239567908
2023-12-13 06:13:24   INFO  epoch: 12/24, acc_iter=47244, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:56:14, time_cost(all): 14:28:57/13:19:12, loss=0.427006174547155, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=0.5107317209676792, lr=0.2403246511695757
2023-12-13 06:14:19   INFO  epoch: 12/24, acc_iter=47294, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:32, time_cost(all): 14:29:52/13:49:08, loss=0.426819129930067, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=1.1433929983944495, lr=0.2399827499434723
2023-12-13 06:15:15   INFO  epoch: 12/24, acc_iter=47344, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:53:45, time_cost(all): 14:30:48/13:18:06, loss=0.426632085312978, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.84(1.03), norm=2.3078732445117183, lr=0.23964084871736896
2023-12-13 06:16:10   INFO  epoch: 12/24, acc_iter=47394, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:50:34, time_cost(all): 14:31:43/14:27:13, loss=0.42644504069589, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.14(1.03), norm=0.6571348408959758, lr=0.23929894749126557
2023-12-13 06:17:05   INFO  epoch: 12/24, acc_iter=47444, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:49:55, time_cost(all): 14:32:38/13:41:50, loss=0.426257996078801, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=3.2476592722566906, lr=0.23895704626516218
2023-12-13 06:18:01   INFO  epoch: 12/24, acc_iter=47494, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:12, time_cost(all): 14:33:34/13:31:16, loss=0.426070951461713, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.01(1.03), norm=3.803745285733193, lr=0.23861514503905878
2023-12-13 06:18:56   INFO  epoch: 12/24, acc_iter=47544, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:08, time_cost(all): 14:34:29/13:53:22, loss=0.425883906844624, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.11(1.03), norm=0.7516980687107877, lr=0.2382732438129554
2023-12-13 06:19:51   INFO  epoch: 12/24, acc_iter=47594, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:18, time_cost(all): 14:35:24/13:51:18, loss=0.425696862227535, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=3.4958922008640494, lr=0.23793134258685206
2023-12-13 06:20:47   INFO  epoch: 12/24, acc_iter=47644, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:46:15, time_cost(all): 14:36:20/13:14:38, loss=0.425509817610447, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.92(1.03), norm=0.7745170460483182, lr=0.23758944136074867
2023-12-13 06:21:42   INFO  epoch: 12/24, acc_iter=47694, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:45:15, time_cost(all): 14:37:15/13:41:31, loss=0.425322772993358, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.14(1.03), norm=4.155621256742064, lr=0.23724754013464527
2023-12-13 06:22:37   INFO  epoch: 12/24, acc_iter=47744, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:44:19, time_cost(all): 14:38:10/13:11:05, loss=0.42513572837627, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=1.023250410651447, lr=0.23690563890854188
2023-12-13 06:23:33   INFO  epoch: 12/24, acc_iter=47794, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:46:26, time_cost(all): 14:39:06/13:46:01, loss=0.424948683759181, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=4.227715968864536, lr=0.2365637376824385
2023-12-13 06:24:28   INFO  epoch: 12/24, acc_iter=47844, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:53, time_cost(all): 14:40:01/14:23:04, loss=0.424761639142093, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.84(1.03), norm=0.5184570919587652, lr=0.2362218364563351
2023-12-13 06:25:23   INFO  epoch: 12/24, acc_iter=47894, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:45, time_cost(all): 14:40:56/13:48:46, loss=0.424574594525004, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.12(1.03), norm=3.355212257071871, lr=0.2358799352302317
2023-12-13 06:26:19   INFO  epoch: 12/24, acc_iter=47944, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:59, time_cost(all): 14:41:52/13:13:27, loss=0.424387549907916, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.18(1.03), norm=3.3666183690290126, lr=0.23553803400412832
2023-12-13 06:27:14   INFO  epoch: 12/24, acc_iter=47994, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:31, time_cost(all): 14:42:47/13:51:29, loss=0.424200505290827, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.23(1.03), norm=2.5896631631518643, lr=0.23519613277802492
2023-12-13 06:28:09   INFO  epoch: 12/24, acc_iter=48044, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:07, time_cost(all): 14:43:42/14:03:06, loss=0.424013460673739, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=4.259081040979696, lr=0.2348542315519216
2023-12-13 06:29:05   INFO  epoch: 12/24, acc_iter=48094, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:09, time_cost(all): 14:44:38/14:05:55, loss=0.42382641605665, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.13(1.03), norm=2.222505541840349, lr=0.2345123303258182
2023-12-13 06:30:00   INFO  epoch: 12/24, acc_iter=48144, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:39:33, time_cost(all): 14:45:33/13:43:15, loss=0.423639371439562, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.4219395766109872, lr=0.2341704290997148
2023-12-13 06:30:55   INFO  epoch: 12/24, acc_iter=48194, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:36:23, time_cost(all): 14:46:28/14:07:43, loss=0.423452326822473, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=0.5339418553190819, lr=0.2338285278736114
2023-12-13 06:31:51   INFO  epoch: 12/24, acc_iter=48244, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:35:21, time_cost(all): 14:47:24/14:22:35, loss=0.423265282205385, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=4.409050073795582, lr=0.23348662664750802
2023-12-13 06:32:46   INFO  epoch: 12/24, acc_iter=48294, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:16, time_cost(all): 14:48:19/13:22:47, loss=0.423078237588296, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.02(1.03), norm=0.5030259767298908, lr=0.23314472542140469
2023-12-13 06:33:41   INFO  epoch: 12/24, acc_iter=48344, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:33:48, time_cost(all): 14:49:14/13:57:58, loss=0.422891192971207, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.7588435372129076, lr=0.2328028241953013
2023-12-13 06:34:37   INFO  epoch: 12/24, acc_iter=48394, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:25, time_cost(all): 14:50:10/13:05:13, loss=0.422704148354119, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.89(1.03), norm=1.8060353348053184, lr=0.2324609229691979
2023-12-13 06:35:32   INFO  epoch: 12/24, acc_iter=48444, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:30:53, time_cost(all): 14:51:05/13:29:15, loss=0.42251710373703, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.1875975897170274, lr=0.2321190217430945
2023-12-13 06:36:28   INFO  epoch: 12/24, acc_iter=48494, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:20, time_cost(all): 14:52:01/13:30:43, loss=0.422330059119942, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=2.713280235652066, lr=0.23177712051699112
2023-12-13 06:37:23   INFO  epoch: 12/24, acc_iter=48544, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:50, time_cost(all): 14:52:56/13:00:30, loss=0.422143014502853, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=0.8553111572303547, lr=0.23143521929088778
2023-12-13 06:38:18   INFO  epoch: 12/24, acc_iter=48594, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:16, time_cost(all): 14:53:51/13:54:16, loss=0.421955969885765, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=1.5114793050837663, lr=0.23109331806478434
2023-12-13 06:39:14   INFO  epoch: 12/24, acc_iter=48644, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:27:59, time_cost(all): 14:54:47/13:57:53, loss=0.421768925268676, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=1.502321987442262, lr=0.23075141683868094
2023-12-13 06:40:09   INFO  epoch: 12/24, acc_iter=48694, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:52, time_cost(all): 14:55:42/13:28:39, loss=0.421581880651588, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.97(1.03), norm=2.1272349598132934, lr=0.23040951561257755
2023-12-13 06:41:04   INFO  epoch: 12/24, acc_iter=48744, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:44, time_cost(all): 14:56:37/13:10:44, loss=0.421394836034499, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.84(1.03), norm=4.224493994749299, lr=0.23006761438647416
2023-12-13 06:42:00   INFO  epoch: 12/24, acc_iter=48794, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:56, time_cost(all): 14:57:33/12:57:14, loss=0.421207791417411, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.06(1.03), norm=1.6071389610796771, lr=0.22972571316037083
2023-12-13 06:42:55   INFO  epoch: 12/24, acc_iter=48844, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:50, time_cost(all): 14:58:28/14:02:53, loss=0.421020746800322, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.91(1.03), norm=3.8941425258183218, lr=0.22938381193426743
2023-12-13 06:43:50   INFO  epoch: 12/24, acc_iter=48894, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:40, time_cost(all): 14:59:23/13:34:26, loss=0.420833702183234, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=4.5900765218509685, lr=0.22904191070816404
2023-12-13 06:44:46   INFO  epoch: 12/24, acc_iter=48944, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:22:16, time_cost(all): 15:00:19/13:15:39, loss=0.420646657566145, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=3.2212246003163068, lr=0.22870000948206065
2023-12-13 06:45:41   INFO  epoch: 12/24, acc_iter=48994, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:23:05, time_cost(all): 15:01:14/13:44:53, loss=0.420459612949056, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=1.8158748231185111, lr=0.22835810825595726
2023-12-13 06:46:36   INFO  epoch: 12/24, acc_iter=49044, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:42, time_cost(all): 15:02:09/14:00:52, loss=0.420272568331968, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=3.5799601096765405, lr=0.22801620702985392
2023-12-13 06:47:32   INFO  epoch: 12/24, acc_iter=49094, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:15, time_cost(all): 15:03:05/13:22:29, loss=0.420085523714879, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=0.5705075423634236, lr=0.22767430580375053
2023-12-13 06:48:27   INFO  epoch: 12/24, acc_iter=49144, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:18:45, time_cost(all): 15:04:00/13:50:18, loss=0.419898479097791, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.86(1.03), norm=4.978734147261122, lr=0.22733240457764714
2023-12-13 06:49:22   INFO  epoch: 12/24, acc_iter=49194, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:33, time_cost(all): 15:04:55/13:43:45, loss=0.419711434480702, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.06(1.03), norm=1.8652844738201932, lr=0.22699050335154375
2023-12-13 06:50:18   INFO  epoch: 12/24, acc_iter=49244, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:04, time_cost(all): 15:05:51/13:34:30, loss=0.419524389863614, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.5386214015330624, lr=0.22664860212544036
2023-12-13 06:51:13   INFO  epoch: 12/24, acc_iter=49294, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:53, time_cost(all): 15:06:46/13:58:54, loss=0.419337345246525, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.09(1.03), norm=0.9425941614668699, lr=0.22630670089933702
2023-12-13 06:52:08   INFO  epoch: 12/24, acc_iter=49344, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:11, time_cost(all): 15:07:41/13:10:11, loss=0.419150300629437, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=2.4826245587358637, lr=0.22596479967323357
2023-12-13 06:53:04   INFO  epoch: 12/24, acc_iter=49394, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:43, time_cost(all): 15:08:37/13:18:39, loss=0.418963256012348, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=4.096203811488964, lr=0.22562289844713018
2023-12-13 06:53:59   INFO  epoch: 12/24, acc_iter=49444, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:36, time_cost(all): 15:09:32/13:20:26, loss=0.41877621139526, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.7095490350955316, lr=0.2252809972210268
2023-12-13 06:54:54   INFO  epoch: 12/24, acc_iter=49494, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:38, time_cost(all): 15:10:27/13:23:34, loss=0.418589166778171, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=3.238070379440776, lr=0.22493909599492345
2023-12-13 06:55:50   INFO  epoch: 12/24, acc_iter=49544, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:05, time_cost(all): 15:11:23/13:53:28, loss=0.418402122161083, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=3.459166707254149, lr=0.22459719476882006
2023-12-13 06:56:45   INFO  epoch: 12/24, acc_iter=49594, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:33, time_cost(all): 15:12:18/13:06:22, loss=0.418215077543994, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.03(1.03), norm=1.8517987395923488, lr=0.22425529354271667
2023-12-13 06:57:41   INFO  epoch: 12/24, acc_iter=49644, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:44, time_cost(all): 15:13:14/12:53:33, loss=0.418028032926906, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=3.420147867176425, lr=0.22391339231661328
2023-12-13 06:58:36   INFO  epoch: 12/24, acc_iter=49694, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:27, time_cost(all): 15:14:09/13:39:56, loss=0.417840988309817, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.742824565127767, lr=0.2235714910905099
2023-12-13 06:59:31   INFO  epoch: 12/24, acc_iter=49744, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:07, time_cost(all): 15:15:04/13:14:27, loss=0.417653943692728, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=4.902544586342906, lr=0.22322958986440655
2023-12-13 07:00:27   INFO  epoch: 12/24, acc_iter=49794, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:48, time_cost(all): 15:16:00/13:27:24, loss=0.41746689907564, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.83(1.03), norm=3.7087213483546666, lr=0.22288768863830316
2023-12-13 07:01:22   INFO  epoch: 12/24, acc_iter=49844, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:35, time_cost(all): 15:16:55/12:39:37, loss=0.417279854458551, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=2.5281267340318845, lr=0.22254578741219977
2023-12-13 07:02:17   INFO  epoch: 12/24, acc_iter=49894, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:55, time_cost(all): 15:17:50/12:36:56, loss=0.417092809841463, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=4.379113122702421, lr=0.22220388618609638
2023-12-13 07:03:13   INFO  epoch: 12/24, acc_iter=49944, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:38, time_cost(all): 15:18:46/12:52:45, loss=0.416905765224374, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=4.521328794794572, lr=0.22186198495999299
2023-12-13 07:04:08   INFO  epoch: 12/24, acc_iter=49994, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:04:04, time_cost(all): 15:19:41/13:45:37, loss=0.416718720607286, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=3.165312980934281, lr=0.22152008373388965
2023-12-13 07:05:03   INFO  epoch: 12/24, acc_iter=50044, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:53, time_cost(all): 15:20:36/13:07:47, loss=0.416531675990197, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=4.631329444218624, lr=0.22117818250778626
2023-12-13 07:05:59   INFO  epoch: 12/24, acc_iter=50094, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:07, time_cost(all): 15:21:32/12:49:20, loss=0.416344631373109, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.91(1.03), norm=2.697361227138437, lr=0.22083628128168287
2023-12-13 07:06:54   INFO  epoch: 12/24, acc_iter=50144, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 15:22:27/13:42:12, loss=0.41615758675602, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.97(1.03), norm=4.524506650028883, lr=0.22049438005557942
2023-12-13 07:07:49   INFO  epoch: 12/24, acc_iter=50194, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 15:23:22/12:40:50, loss=0.415970542138932, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=3.879950174883403, lr=0.22015247882947603
2023-12-13 07:08:45   INFO  epoch: 13/24, acc_iter=50256, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:07:28, time_cost(all): 15:24:18/13:16:27, loss=0.415738606813742, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=3.8966888289852046, lr=0.21972852130910786
2023-12-13 07:09:40   INFO  epoch: 13/24, acc_iter=50306, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:12:19, time_cost(all): 15:25:13/12:49:18, loss=0.415551562196653, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.7065794260443488, lr=0.21938662008300452
2023-12-13 07:10:35   INFO  epoch: 13/24, acc_iter=50356, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:35, time_cost(all): 15:26:08/12:41:16, loss=0.415364517579565, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.83(1.03), norm=4.687365979999177, lr=0.21904471885690113
2023-12-13 07:11:31   INFO  epoch: 13/24, acc_iter=50406, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:08:38, time_cost(all): 15:27:04/13:01:19, loss=0.415177472962476, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=0.7489159605853053, lr=0.21870281763079774
2023-12-13 07:12:26   INFO  epoch: 13/24, acc_iter=50456, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:03:23, time_cost(all): 15:27:59/12:24:23, loss=0.414990428345388, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=1.232952402961491, lr=0.21836091640469435
2023-12-13 07:13:21   INFO  epoch: 13/24, acc_iter=50506, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:02:56, time_cost(all): 15:28:54/13:35:46, loss=0.414803383728299, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.96(1.03), norm=4.589941837638712, lr=0.21801901517859096
2023-12-13 07:14:17   INFO  epoch: 13/24, acc_iter=50556, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:06:12, time_cost(all): 15:29:50/13:34:35, loss=0.414616339111211, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.05(1.03), norm=1.4457227137323998, lr=0.21767711395248757
2023-12-13 07:15:12   INFO  epoch: 13/24, acc_iter=50606, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:06:32, time_cost(all): 15:30:45/12:58:45, loss=0.414429294494122, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.07(1.03), norm=1.526915330969246, lr=0.21733521272638417
2023-12-13 07:16:07   INFO  epoch: 13/24, acc_iter=50656, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:01:10, time_cost(all): 15:31:40/13:31:08, loss=0.414242249877034, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=1.215669523422242, lr=0.21699331150028078
2023-12-13 07:17:03   INFO  epoch: 13/24, acc_iter=50706, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:03:30, time_cost(all): 15:32:36/12:31:15, loss=0.414055205259945, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.94(1.03), norm=3.1717986115482533, lr=0.2166514102741774
2023-12-13 07:17:58   INFO  epoch: 13/24, acc_iter=50756, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:01:40, time_cost(all): 15:33:31/12:19:32, loss=0.413868160642856, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.2(1.03), norm=1.9279092280647239, lr=0.21630950904807406
2023-12-13 07:18:54   INFO  epoch: 13/24, acc_iter=50806, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:01:01, time_cost(all): 15:34:27/12:35:33, loss=0.413681116025768, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=3.6009556680895356, lr=0.21596760782197066
2023-12-13 07:19:49   INFO  epoch: 13/24, acc_iter=50856, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:00:28, time_cost(all): 15:35:22/12:55:12, loss=0.413494071408679, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.22(1.03), norm=3.9476326312042893, lr=0.21562570659586727
2023-12-13 07:20:44   INFO  epoch: 13/24, acc_iter=50906, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:59:33, time_cost(all): 15:36:17/13:27:40, loss=0.413307026791591, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.85(1.03), norm=1.8707666265140004, lr=0.21528380536976388
2023-12-13 07:21:40   INFO  epoch: 13/24, acc_iter=50956, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:49, time_cost(all): 15:37:13/12:43:30, loss=0.413119982174502, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.87(1.03), norm=4.430238884213322, lr=0.2149419041436605
2023-12-13 07:22:35   INFO  epoch: 13/24, acc_iter=51006, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:14, time_cost(all): 15:38:08/12:22:34, loss=0.412932937557414, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.97(1.03), norm=4.168142358985484, lr=0.21460000291755715
2023-12-13 07:23:30   INFO  epoch: 13/24, acc_iter=51056, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:57:15, time_cost(all): 15:39:03/12:31:49, loss=0.412745892940325, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.98(1.03), norm=1.4859064682467986, lr=0.21425810169145376
2023-12-13 07:24:26   INFO  epoch: 13/24, acc_iter=51106, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:53:49, time_cost(all): 15:39:59/12:25:00, loss=0.412558848323237, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=2.111392287924839, lr=0.21391620046535037
2023-12-13 07:25:21   INFO  epoch: 13/24, acc_iter=51156, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:16, time_cost(all): 15:40:54/13:11:53, loss=0.412371803706148, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.19(1.03), norm=4.139680306348934, lr=0.21357429923924698
2023-12-13 07:26:16   INFO  epoch: 13/24, acc_iter=51206, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:54:03, time_cost(all): 15:41:49/12:41:24, loss=0.41218475908906, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=1.4650392806183195, lr=0.2132323980131436
2023-12-13 07:27:12   INFO  epoch: 13/24, acc_iter=51256, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:51:45, time_cost(all): 15:42:45/13:20:04, loss=0.411997714471971, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.17(1.03), norm=1.0734113421262381, lr=0.21289049678704025
2023-12-13 07:28:07   INFO  epoch: 13/24, acc_iter=51306, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:09, time_cost(all): 15:43:40/12:52:58, loss=0.411810669854883, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=2.8926375426875786, lr=0.21254859556093686
2023-12-13 07:29:02   INFO  epoch: 13/24, acc_iter=51356, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:51:36, time_cost(all): 15:44:35/12:32:49, loss=0.411623625237794, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.9(1.03), norm=1.2108487755212747, lr=0.2122066943348334
2023-12-13 07:29:58   INFO  epoch: 13/24, acc_iter=51406, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:27, time_cost(all): 15:45:31/12:35:48, loss=0.411436580620706, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=4.8859153895859695, lr=0.21186479310873002
2023-12-13 07:30:53   INFO  epoch: 13/24, acc_iter=51456, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:46:39, time_cost(all): 15:46:26/12:15:04, loss=0.411249536003617, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.0(1.03), norm=0.8118184726265127, lr=0.21152289188262663
2023-12-13 07:31:48   INFO  epoch: 13/24, acc_iter=51506, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:06, time_cost(all): 15:47:21/12:22:34, loss=0.411062491386528, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.96(1.03), norm=4.939154457375944, lr=0.2111809906565233
2023-12-13 07:32:44   INFO  epoch: 13/24, acc_iter=51556, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:48:30, time_cost(all): 15:48:17/12:47:05, loss=0.41087544676944, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=0.9765921914382836, lr=0.2108390894304199
2023-12-13 07:33:39   INFO  epoch: 13/24, acc_iter=51606, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:54, time_cost(all): 15:49:12/13:01:36, loss=0.410688402152351, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.05(1.03), norm=2.80245522364429, lr=0.2104971882043165
2023-12-13 07:34:34   INFO  epoch: 13/24, acc_iter=51656, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:44:47, time_cost(all): 15:50:07/13:15:13, loss=0.410501357535263, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.07(1.03), norm=3.0721494479750397, lr=0.21015528697821312
2023-12-13 07:35:30   INFO  epoch: 13/24, acc_iter=51706, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:43:45, time_cost(all): 15:51:03/13:02:17, loss=0.410314312918174, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.05(1.03), norm=2.0099127380418937, lr=0.20981338575210973
2023-12-13 07:36:25   INFO  epoch: 13/24, acc_iter=51756, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:44:40, time_cost(all): 15:51:58/13:03:42, loss=0.410127268301086, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.95(1.03), norm=1.6325389652748303, lr=0.2094714845260064
2023-12-13 07:37:20   INFO  epoch: 13/24, acc_iter=51806, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:51, time_cost(all): 15:52:53/13:10:25, loss=0.409940223683997, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.9(1.03), norm=1.1527096958169079, lr=0.209129583299903
2023-12-13 07:38:16   INFO  epoch: 13/24, acc_iter=51856, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:13, time_cost(all): 15:53:49/12:23:03, loss=0.409753179066909, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.0(1.03), norm=4.302650923462259, lr=0.2087876820737996
2023-12-13 07:39:11   INFO  epoch: 13/24, acc_iter=51906, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:18, time_cost(all): 15:54:44/12:24:55, loss=0.40956613444982, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=3.954059304713593, lr=0.20844578084769622
2023-12-13 07:40:07   INFO  epoch: 13/24, acc_iter=51956, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:40:03, time_cost(all): 15:55:40/12:21:12, loss=0.409379089832732, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=3.5624491711689106, lr=0.20810387962159282
2023-12-13 07:41:02   INFO  epoch: 13/24, acc_iter=52006, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:38:47, time_cost(all): 15:56:35/12:35:45, loss=0.409192045215643, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.2443824052104753, lr=0.2077619783954895
2023-12-13 07:41:57   INFO  epoch: 13/24, acc_iter=52056, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:17, time_cost(all): 15:57:30/12:17:36, loss=0.409005000598555, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.87(1.03), norm=4.0901153556541905, lr=0.2074200771693861
2023-12-13 07:42:53   INFO  epoch: 13/24, acc_iter=52106, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:53, time_cost(all): 15:58:26/12:45:57, loss=0.408817955981466, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.89(1.03), norm=3.477941444640424, lr=0.20707817594328265
2023-12-13 07:43:48   INFO  epoch: 13/24, acc_iter=52156, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:56, time_cost(all): 15:59:21/12:14:02, loss=0.408630911364378, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=2.2737405231474375, lr=0.20673627471717926
2023-12-13 07:44:43   INFO  epoch: 13/24, acc_iter=52206, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:36, time_cost(all): 16:00:16/13:00:04, loss=0.408443866747289, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.99(1.03), norm=1.0953840753651989, lr=0.20639437349107592
2023-12-13 07:45:39   INFO  epoch: 13/24, acc_iter=52256, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:20, time_cost(all): 16:01:12/13:06:10, loss=0.4082568221302, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.14(1.03), norm=2.887545001511582, lr=0.20605247226497253
2023-12-13 07:46:34   INFO  epoch: 13/24, acc_iter=52306, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:30, time_cost(all): 16:02:07/12:35:35, loss=0.408069777513112, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.91(1.03), norm=2.856884915720469, lr=0.20571057103886914
2023-12-13 07:47:29   INFO  epoch: 13/24, acc_iter=52356, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:18, time_cost(all): 16:03:02/12:16:43, loss=0.407882732896023, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=2.9132041025853908, lr=0.20536866981276575
2023-12-13 07:48:25   INFO  epoch: 13/24, acc_iter=52406, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:18, time_cost(all): 16:03:58/12:03:17, loss=0.407695688278935, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=1.2363720667838685, lr=0.20502676858666236
2023-12-13 07:49:20   INFO  epoch: 13/24, acc_iter=52456, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:27, time_cost(all): 16:04:53/11:52:58, loss=0.407508643661846, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.14(1.03), norm=0.9195492603096158, lr=0.20468486736055902
2023-12-13 07:50:15   INFO  epoch: 13/24, acc_iter=52506, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:45, time_cost(all): 16:05:48/12:44:31, loss=0.407321599044758, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.93(1.03), norm=2.134723174290445, lr=0.20434296613445563
2023-12-13 07:51:11   INFO  epoch: 13/24, acc_iter=52556, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:38, time_cost(all): 16:06:44/12:23:42, loss=0.407134554427669, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.84(1.03), norm=4.448163152295386, lr=0.20400106490835224
2023-12-13 07:52:06   INFO  epoch: 13/24, acc_iter=52606, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:01, time_cost(all): 16:07:39/12:34:43, loss=0.406947509810581, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=4.053734141960648, lr=0.20365916368224884
2023-12-13 07:53:01   INFO  epoch: 13/24, acc_iter=52656, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:27:14, time_cost(all): 16:08:34/12:01:29, loss=0.406760465193492, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.18(1.03), norm=4.63686911872959, lr=0.20331726245614545
2023-12-13 07:53:57   INFO  epoch: 13/24, acc_iter=52706, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:41, time_cost(all): 16:09:30/12:03:24, loss=0.406573420576404, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.94(1.03), norm=2.5910441164156133, lr=0.20297536123004212
2023-12-13 07:54:52   INFO  epoch: 13/24, acc_iter=52756, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:58, time_cost(all): 16:10:25/12:41:43, loss=0.406386375959315, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=3.6880315023134216, lr=0.20263346000393873
2023-12-13 07:55:47   INFO  epoch: 13/24, acc_iter=52806, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:38, time_cost(all): 16:11:20/12:51:46, loss=0.406199331342227, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.88(1.03), norm=2.5890544122195704, lr=0.20229155877783533
2023-12-13 07:56:43   INFO  epoch: 13/24, acc_iter=52856, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:44, time_cost(all): 16:12:16/12:06:41, loss=0.406012286725138, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.02(1.03), norm=2.967194440106931, lr=0.20194965755173194
2023-12-13 07:57:38   INFO  epoch: 13/24, acc_iter=52906, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:51, time_cost(all): 16:13:11/12:52:28, loss=0.405825242108049, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=3.5570250853558347, lr=0.2016077563256285
2023-12-13 07:58:33   INFO  epoch: 13/24, acc_iter=52956, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:23, time_cost(all): 16:14:06/12:03:35, loss=0.405638197490961, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=2.36958639558777, lr=0.20126585509952516
2023-12-13 07:59:29   INFO  epoch: 13/24, acc_iter=53006, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:01, time_cost(all): 16:15:02/11:41:41, loss=0.405451152873872, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=4.071523115039927, lr=0.20092395387342177
2023-12-13 08:00:24   INFO  epoch: 13/24, acc_iter=53056, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:57, time_cost(all): 16:15:57/12:09:59, loss=0.405264108256784, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=2.1434355911443843, lr=0.20058205264731838
2023-12-13 08:01:20   INFO  epoch: 13/24, acc_iter=53106, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:29, time_cost(all): 16:16:53/12:35:10, loss=0.405077063639695, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=2.428347007987186, lr=0.20024015142121498
2023-12-13 08:02:15   INFO  epoch: 13/24, acc_iter=53156, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:51, time_cost(all): 16:17:48/12:15:37, loss=0.404890019022607, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.92(1.03), norm=3.6977811293251244, lr=0.1998982501951116
2023-12-13 08:03:10   INFO  epoch: 13/24, acc_iter=53206, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:21, time_cost(all): 16:18:43/12:08:53, loss=0.404702974405518, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=3.1978526909784417, lr=0.19955634896900826
2023-12-13 08:04:06   INFO  epoch: 13/24, acc_iter=53256, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:30, time_cost(all): 16:19:39/12:00:27, loss=0.40451592978843, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.18(1.03), norm=2.91579429785523, lr=0.19921444774290487
2023-12-13 08:05:01   INFO  epoch: 13/24, acc_iter=53306, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:27, time_cost(all): 16:20:34/12:17:59, loss=0.404328885171341, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.03(1.03), norm=1.0942313604226341, lr=0.19887254651680147
2023-12-13 08:05:56   INFO  epoch: 13/24, acc_iter=53356, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:02, time_cost(all): 16:21:29/12:37:55, loss=0.404141840554253, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=2.8147464736273404, lr=0.19853064529069808
2023-12-13 08:06:52   INFO  epoch: 13/24, acc_iter=53406, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:04, time_cost(all): 16:22:25/12:00:37, loss=0.403954795937164, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=1.2719351013329472, lr=0.1981887440645947
2023-12-13 08:07:47   INFO  epoch: 13/24, acc_iter=53456, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:45, time_cost(all): 16:23:20/12:13:48, loss=0.403767751320076, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=1.5470156741232624, lr=0.19784684283849135
2023-12-13 08:08:42   INFO  epoch: 13/24, acc_iter=53506, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:20, time_cost(all): 16:24:15/12:03:27, loss=0.403580706702987, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.2(1.03), norm=4.905799272753722, lr=0.19750494161238796
2023-12-13 08:09:38   INFO  epoch: 13/24, acc_iter=53556, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:54, time_cost(all): 16:25:11/11:43:41, loss=0.403393662085899, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=0.9776763383803908, lr=0.19716304038628457
2023-12-13 08:10:33   INFO  epoch: 13/24, acc_iter=53606, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:34, time_cost(all): 16:26:06/12:04:05, loss=0.40320661746881, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=4.505803720532435, lr=0.19682113916018118
2023-12-13 08:11:28   INFO  epoch: 13/24, acc_iter=53656, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:35, time_cost(all): 16:27:01/11:52:01, loss=0.403019572851721, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.9(1.03), norm=2.537176741330434, lr=0.1964792379340778
2023-12-13 08:12:24   INFO  epoch: 13/24, acc_iter=53706, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:50, time_cost(all): 16:27:57/12:11:15, loss=0.402832528234633, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=1.6607630205946173, lr=0.1961373367079744
2023-12-13 08:13:19   INFO  epoch: 13/24, acc_iter=53756, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:06:01, time_cost(all): 16:28:52/11:47:47, loss=0.402645483617544, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.85(1.03), norm=2.3493622710665027, lr=0.195795435481871
2023-12-13 08:14:14   INFO  epoch: 13/24, acc_iter=53806, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:51, time_cost(all): 16:29:47/11:39:48, loss=0.402458439000456, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=2.6428946633597055, lr=0.1954535342557676
2023-12-13 08:15:10   INFO  epoch: 13/24, acc_iter=53856, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:54, time_cost(all): 16:30:43/11:41:12, loss=0.402271394383367, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.18(1.03), norm=1.9567952937387807, lr=0.19511163302966422
2023-12-13 08:16:05   INFO  epoch: 13/24, acc_iter=53906, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:00, time_cost(all): 16:31:38/11:38:21, loss=0.402084349766279, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.984170653731775, lr=0.19476973180356089
2023-12-13 08:17:00   INFO  epoch: 13/24, acc_iter=53956, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:04, time_cost(all): 16:32:33/11:48:03, loss=0.40189730514919, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=1.469424685105484, lr=0.1944278305774575
2023-12-13 08:17:56   INFO  epoch: 13/24, acc_iter=54006, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 16:33:29/12:05:36, loss=0.401710260532102, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.07(1.03), norm=3.6574426078150166, lr=0.1940859293513541
2023-12-13 08:18:51   INFO  epoch: 13/24, acc_iter=54056, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 16:34:24/11:20:15, loss=0.401523215915013, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=3.1324822116166566, lr=0.1937440281252507
2023-12-13 08:19:46   INFO  epoch: 14/24, acc_iter=54118, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:10:06, time_cost(all): 16:35:19/12:26:18, loss=0.401291280589823, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.23(1.03), norm=3.1884320882223767, lr=0.1933200706048825
2023-12-13 08:20:42   INFO  epoch: 14/24, acc_iter=54168, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:47, time_cost(all): 16:36:15/11:55:54, loss=0.401104235972735, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=4.941059999464316, lr=0.1929781693787791
2023-12-13 08:21:37   INFO  epoch: 14/24, acc_iter=54218, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:12, time_cost(all): 16:37:10/11:26:06, loss=0.400917191355646, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.91(1.03), norm=3.243838406278388, lr=0.19263626815267576
2023-12-13 08:22:33   INFO  epoch: 14/24, acc_iter=54268, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:04:15, time_cost(all): 16:38:06/12:03:20, loss=0.400730146738558, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=3.9307208315436206, lr=0.19229436692657237
2023-12-13 08:23:28   INFO  epoch: 14/24, acc_iter=54318, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:04:32, time_cost(all): 16:39:01/11:42:18, loss=0.400543102121469, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.97(1.03), norm=2.7584241582801394, lr=0.19195246570046898
2023-12-13 08:24:23   INFO  epoch: 14/24, acc_iter=54368, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:05:26, time_cost(all): 16:39:56/11:42:07, loss=0.400356057504381, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=3.7720845378391012, lr=0.19161056447436559
2023-12-13 08:25:19   INFO  epoch: 14/24, acc_iter=54418, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:01:53, time_cost(all): 16:40:52/11:25:07, loss=0.400169012887292, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=1.8881469624644367, lr=0.1912686632482622
2023-12-13 08:26:14   INFO  epoch: 14/24, acc_iter=54468, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:07:01, time_cost(all): 16:41:47/12:17:22, loss=0.399981968270204, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=2.1545586797043814, lr=0.19092676202215886
2023-12-13 08:27:09   INFO  epoch: 14/24, acc_iter=54518, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:03:09, time_cost(all): 16:42:42/12:21:12, loss=0.399794923653115, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=3.0939640652072145, lr=0.19058486079605547
2023-12-13 08:28:05   INFO  epoch: 14/24, acc_iter=54568, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:00:05, time_cost(all): 16:43:38/11:15:18, loss=0.399607879036027, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.0566414849813284, lr=0.19024295956995207
2023-12-13 08:29:00   INFO  epoch: 14/24, acc_iter=54618, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:00:46, time_cost(all): 16:44:33/12:10:26, loss=0.399420834418938, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.2(1.03), norm=2.450140585889136, lr=0.18990105834384868
2023-12-13 08:29:55   INFO  epoch: 14/24, acc_iter=54668, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:45, time_cost(all): 16:45:28/11:34:36, loss=0.399233789801849, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=2.9734649942028337, lr=0.1895591571177453
2023-12-13 08:30:51   INFO  epoch: 14/24, acc_iter=54718, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:57:26, time_cost(all): 16:46:24/11:14:26, loss=0.399046745184761, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.95(1.03), norm=3.467216098493994, lr=0.18921725589164196
2023-12-13 08:31:46   INFO  epoch: 14/24, acc_iter=54768, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:58:05, time_cost(all): 16:47:19/11:25:27, loss=0.398859700567672, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=0.6120031775783158, lr=0.18887535466553856
2023-12-13 08:32:41   INFO  epoch: 14/24, acc_iter=54818, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:56:56, time_cost(all): 16:48:14/11:21:30, loss=0.398672655950584, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=3.2281376767421928, lr=0.18853345343943517
2023-12-13 08:33:37   INFO  epoch: 14/24, acc_iter=54868, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:56:08, time_cost(all): 16:49:10/11:17:49, loss=0.398485611333495, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=2.662389851849454, lr=0.18819155221333173
2023-12-13 08:34:32   INFO  epoch: 14/24, acc_iter=54918, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:58:18, time_cost(all): 16:50:05/11:29:24, loss=0.398298566716407, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=4.88400215182294, lr=0.1878496509872284
2023-12-13 08:35:27   INFO  epoch: 14/24, acc_iter=54968, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:37, time_cost(all): 16:51:00/12:09:52, loss=0.398111522099318, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.16(1.03), norm=1.1513151952640617, lr=0.187507749761125
2023-12-13 08:36:23   INFO  epoch: 14/24, acc_iter=55018, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:39, time_cost(all): 16:51:56/12:00:38, loss=0.39792447748223, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.2(1.03), norm=3.4588696799996868, lr=0.1871658485350216
2023-12-13 08:37:18   INFO  epoch: 14/24, acc_iter=55068, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:51:27, time_cost(all): 16:52:51/11:55:36, loss=0.397737432865141, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.08(1.03), norm=0.8064577952986534, lr=0.18682394730891821
2023-12-13 08:38:13   INFO  epoch: 14/24, acc_iter=55118, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:49:39, time_cost(all): 16:53:46/11:07:32, loss=0.397550388248053, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.17(1.03), norm=4.677011064257716, lr=0.18648204608281482
2023-12-13 08:39:09   INFO  epoch: 14/24, acc_iter=55168, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:52:22, time_cost(all): 16:54:42/11:14:21, loss=0.397363343630964, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.9(1.03), norm=2.411324852002884, lr=0.1861401448567115
2023-12-13 08:40:04   INFO  epoch: 14/24, acc_iter=55218, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:14, time_cost(all): 16:55:37/11:02:03, loss=0.397176299013876, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.84(1.03), norm=4.852372082225884, lr=0.1857982436306081
2023-12-13 08:40:59   INFO  epoch: 14/24, acc_iter=55268, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:49, time_cost(all): 16:56:32/11:19:36, loss=0.396989254396787, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.18(1.03), norm=3.8713357255635343, lr=0.1854563424045047
2023-12-13 08:41:55   INFO  epoch: 14/24, acc_iter=55318, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:49:56, time_cost(all): 16:57:28/12:05:27, loss=0.396802209779698, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.02(1.03), norm=0.9042002539257981, lr=0.1851144411784013
2023-12-13 08:42:50   INFO  epoch: 14/24, acc_iter=55368, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:47:58, time_cost(all): 16:58:23/11:28:02, loss=0.39661516516261, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=4.617988741757323, lr=0.18477253995229792
2023-12-13 08:43:45   INFO  epoch: 14/24, acc_iter=55418, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:34, time_cost(all): 16:59:18/11:48:27, loss=0.396428120545521, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=3.4775497976736633, lr=0.18443063872619458
2023-12-13 08:44:41   INFO  epoch: 14/24, acc_iter=55468, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:48, time_cost(all): 17:00:14/11:55:11, loss=0.396241075928433, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.84(1.03), norm=0.9632062566178432, lr=0.1840887375000912
2023-12-13 08:45:36   INFO  epoch: 14/24, acc_iter=55518, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:42:26, time_cost(all): 17:01:09/12:02:56, loss=0.396054031311344, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.22(1.03), norm=4.050786543820745, lr=0.1837468362739878
2023-12-13 08:46:32   INFO  epoch: 14/24, acc_iter=55568, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:25, time_cost(all): 17:02:05/11:09:31, loss=0.395866986694256, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.88(1.03), norm=0.5041135071671606, lr=0.1834049350478844
2023-12-13 08:47:27   INFO  epoch: 14/24, acc_iter=55618, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:02, time_cost(all): 17:03:00/11:24:12, loss=0.395679942077167, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=4.699288976897695, lr=0.18306303382178102
2023-12-13 08:48:22   INFO  epoch: 14/24, acc_iter=55668, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:28, time_cost(all): 17:03:55/11:28:49, loss=0.395492897460079, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.04(1.03), norm=0.5840006327079692, lr=0.18272113259567763
2023-12-13 08:49:18   INFO  epoch: 14/24, acc_iter=55718, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:42:03, time_cost(all): 17:04:51/11:13:34, loss=0.39530585284299, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=2.8117472738963265, lr=0.18237923136957424
2023-12-13 08:50:13   INFO  epoch: 14/24, acc_iter=55768, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:52, time_cost(all): 17:05:46/11:11:37, loss=0.395118808225902, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=4.076689525211732, lr=0.18203733014347084
2023-12-13 08:51:08   INFO  epoch: 14/24, acc_iter=55818, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:39:29, time_cost(all): 17:06:41/11:28:57, loss=0.394931763608813, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.18(1.03), norm=4.082344319440961, lr=0.18169542891736745
2023-12-13 08:52:04   INFO  epoch: 14/24, acc_iter=55868, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:39:39, time_cost(all): 17:07:37/11:49:34, loss=0.394744718991725, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.84(1.03), norm=2.6896244611237448, lr=0.18135352769126406
2023-12-13 08:52:59   INFO  epoch: 14/24, acc_iter=55918, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:35:28, time_cost(all): 17:08:32/11:17:06, loss=0.394557674374636, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.15(1.03), norm=2.7047669533899334, lr=0.18101162646516072
2023-12-13 08:53:54   INFO  epoch: 14/24, acc_iter=55968, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:35:41, time_cost(all): 17:09:27/11:20:10, loss=0.394370629757548, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.17(1.03), norm=1.5953945179660491, lr=0.18066972523905733
2023-12-13 08:54:50   INFO  epoch: 14/24, acc_iter=56018, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:41, time_cost(all): 17:10:23/10:56:45, loss=0.394183585140459, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.99(1.03), norm=0.9131287267640225, lr=0.18032782401295394
2023-12-13 08:55:45   INFO  epoch: 14/24, acc_iter=56068, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:26, time_cost(all): 17:11:18/11:19:15, loss=0.393996540523371, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.0734248910669622, lr=0.17998592278685055
2023-12-13 08:56:40   INFO  epoch: 14/24, acc_iter=56118, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:16, time_cost(all): 17:12:13/11:01:52, loss=0.393809495906282, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.87(1.03), norm=0.7566295237479788, lr=0.17964402156074716
2023-12-13 08:57:36   INFO  epoch: 14/24, acc_iter=56168, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:19, time_cost(all): 17:13:09/10:49:24, loss=0.393622451289193, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.92(1.03), norm=2.6606865266903816, lr=0.17930212033464382
2023-12-13 08:58:31   INFO  epoch: 14/24, acc_iter=56218, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:21, time_cost(all): 17:14:04/10:49:33, loss=0.393435406672105, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.8278940957635617, lr=0.17896021910854043
2023-12-13 08:59:26   INFO  epoch: 14/24, acc_iter=56268, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:57, time_cost(all): 17:14:59/11:31:17, loss=0.393248362055016, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=2.7068075427037286, lr=0.17861831788243704
2023-12-13 09:00:22   INFO  epoch: 14/24, acc_iter=56318, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:42, time_cost(all): 17:15:55/11:20:43, loss=0.393061317437928, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.86740784566321, lr=0.17827641665633365
2023-12-13 09:01:17   INFO  epoch: 14/24, acc_iter=56368, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:55, time_cost(all): 17:16:50/11:41:50, loss=0.392874272820839, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.12(1.03), norm=3.688859777975713, lr=0.17793451543023026
2023-12-13 09:02:12   INFO  epoch: 14/24, acc_iter=56418, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:01, time_cost(all): 17:17:45/11:41:12, loss=0.392687228203751, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.11(1.03), norm=2.6253972625086646, lr=0.17759261420412692
2023-12-13 09:03:08   INFO  epoch: 14/24, acc_iter=56468, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:12, time_cost(all): 17:18:41/10:46:53, loss=0.392500183586662, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=4.084432581218605, lr=0.17725071297802347
2023-12-13 09:04:03   INFO  epoch: 14/24, acc_iter=56518, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:52, time_cost(all): 17:19:36/11:02:45, loss=0.392313138969574, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.19(1.03), norm=2.13075143419508, lr=0.17690881175192008
2023-12-13 09:04:58   INFO  epoch: 14/24, acc_iter=56568, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:48, time_cost(all): 17:20:31/11:09:25, loss=0.392126094352485, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.18(1.03), norm=0.9389367910928315, lr=0.1765669105258167
2023-12-13 09:05:54   INFO  epoch: 14/24, acc_iter=56618, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:04, time_cost(all): 17:21:27/11:23:54, loss=0.391939049735397, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.89(1.03), norm=1.1104590938905434, lr=0.17622500929971335
2023-12-13 09:06:49   INFO  epoch: 14/24, acc_iter=56668, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:00, time_cost(all): 17:22:22/11:18:27, loss=0.391752005118308, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.94(1.03), norm=3.3301919764764905, lr=0.17588310807360996
2023-12-13 09:07:45   INFO  epoch: 14/24, acc_iter=56718, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:23:07, time_cost(all): 17:23:18/11:04:01, loss=0.39156496050122, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=4.876606602814207, lr=0.17554120684750657
2023-12-13 09:08:40   INFO  epoch: 14/24, acc_iter=56768, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:37, time_cost(all): 17:24:13/10:54:18, loss=0.391377915884131, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.15(1.03), norm=0.8264346511417455, lr=0.17519930562140318
2023-12-13 09:09:35   INFO  epoch: 14/24, acc_iter=56818, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:27, time_cost(all): 17:25:08/11:28:11, loss=0.391190871267042, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=2.4589238373680984, lr=0.1748574043952998
2023-12-13 09:10:31   INFO  epoch: 14/24, acc_iter=56868, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:18:55, time_cost(all): 17:26:04/11:17:26, loss=0.391003826649954, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.94(1.03), norm=2.665010413842666, lr=0.17451550316919645
2023-12-13 09:11:26   INFO  epoch: 14/24, acc_iter=56918, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:18, time_cost(all): 17:26:59/11:08:37, loss=0.390816782032865, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=4.758941925632406, lr=0.17417360194309306
2023-12-13 09:12:21   INFO  epoch: 14/24, acc_iter=56968, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:59, time_cost(all): 17:27:54/10:50:15, loss=0.390629737415777, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=0.5935470641330214, lr=0.17383170071698967
2023-12-13 09:13:17   INFO  epoch: 14/24, acc_iter=57018, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:22, time_cost(all): 17:28:50/10:59:05, loss=0.390442692798688, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=1.0339287444206637, lr=0.17348979949088628
2023-12-13 09:14:12   INFO  epoch: 14/24, acc_iter=57068, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:41, time_cost(all): 17:29:45/11:28:45, loss=0.3902556481816, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.85(1.03), norm=1.645600560049195, lr=0.17314789826478288
2023-12-13 09:15:07   INFO  epoch: 14/24, acc_iter=57118, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:13, time_cost(all): 17:30:40/11:07:12, loss=0.390068603564511, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=0.959527490104515, lr=0.17280599703867955
2023-12-13 09:16:03   INFO  epoch: 14/24, acc_iter=57168, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:25, time_cost(all): 17:31:36/11:03:33, loss=0.389881558947423, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.11(1.03), norm=0.5615845515943412, lr=0.17246409581257616
2023-12-13 09:16:58   INFO  epoch: 14/24, acc_iter=57218, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:13:13, time_cost(all): 17:32:31/10:50:59, loss=0.389694514330334, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.88(1.03), norm=3.067184484227279, lr=0.17212219458647277
2023-12-13 09:17:53   INFO  epoch: 14/24, acc_iter=57268, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:49, time_cost(all): 17:33:26/11:09:08, loss=0.389507469713246, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.17(1.03), norm=1.7306439031523668, lr=0.17178029336036932
2023-12-13 09:18:49   INFO  epoch: 14/24, acc_iter=57318, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:07, time_cost(all): 17:34:22/10:30:30, loss=0.389320425096157, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=0.6241915327895171, lr=0.17143839213426593
2023-12-13 09:19:44   INFO  epoch: 14/24, acc_iter=57368, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:01, time_cost(all): 17:35:17/10:22:49, loss=0.389133380479069, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=4.1517862825487954, lr=0.1710964909081626
2023-12-13 09:20:39   INFO  epoch: 14/24, acc_iter=57418, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:46, time_cost(all): 17:36:12/11:08:52, loss=0.38894633586198, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=3.2628005275767493, lr=0.1707545896820592
2023-12-13 09:21:35   INFO  epoch: 14/24, acc_iter=57468, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:34, time_cost(all): 17:37:08/10:30:44, loss=0.388759291244892, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=4.0829420163494685, lr=0.1704126884559558
2023-12-13 09:22:30   INFO  epoch: 14/24, acc_iter=57518, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:48, time_cost(all): 17:38:03/10:45:25, loss=0.388572246627803, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.98(1.03), norm=0.7573036036651797, lr=0.17007078722985242
2023-12-13 09:23:25   INFO  epoch: 14/24, acc_iter=57568, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:30, time_cost(all): 17:38:58/10:38:13, loss=0.388385202010714, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=1.246051851229583, lr=0.16972888600374902
2023-12-13 09:24:21   INFO  epoch: 14/24, acc_iter=57618, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:45, time_cost(all): 17:39:54/11:19:19, loss=0.388198157393626, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=4.262148521873529, lr=0.1693869847776457
2023-12-13 09:25:16   INFO  epoch: 14/24, acc_iter=57668, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:39, time_cost(all): 17:40:49/11:05:40, loss=0.388011112776537, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.21(1.03), norm=2.8030948021682542, lr=0.1690450835515423
2023-12-13 09:26:11   INFO  epoch: 14/24, acc_iter=57718, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:58, time_cost(all): 17:41:44/11:05:40, loss=0.387824068159449, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.85(1.03), norm=2.37661032391264, lr=0.1687031823254389
2023-12-13 09:27:07   INFO  epoch: 14/24, acc_iter=57768, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:59, time_cost(all): 17:42:40/11:08:05, loss=0.38763702354236, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=4.749058410933488, lr=0.1683612810993355
2023-12-13 09:28:02   INFO  epoch: 14/24, acc_iter=57818, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:08, time_cost(all): 17:43:35/10:38:50, loss=0.387449978925272, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=4.550572593955921, lr=0.16801937987323212
2023-12-13 09:28:58   INFO  epoch: 14/24, acc_iter=57868, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:10, time_cost(all): 17:44:31/10:59:06, loss=0.387262934308183, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=1.9624582736445384, lr=0.16767747864712879
2023-12-13 09:29:53   INFO  epoch: 14/24, acc_iter=57918, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 17:45:26/10:21:19, loss=0.387075889691095, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.5459484366250242, lr=0.1673355774210254
2023-12-13 09:30:48   INFO  epoch: 15/24, acc_iter=57980, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:07:08, time_cost(all): 17:46:21/10:19:39, loss=0.386843954365905, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=2.184112846937684, lr=0.16691161990065717
2023-12-13 09:31:44   INFO  epoch: 15/24, acc_iter=58030, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:07:49, time_cost(all): 17:47:17/11:01:04, loss=0.386656909748816, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.85(1.03), norm=2.240597116663787, lr=0.16656971867455378
2023-12-13 09:32:39   INFO  epoch: 15/24, acc_iter=58080, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:35, time_cost(all): 17:48:12/10:30:56, loss=0.386469865131728, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.88(1.03), norm=2.5716774670985014, lr=0.1662278174484504
2023-12-13 09:33:34   INFO  epoch: 15/24, acc_iter=58130, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:06:07, time_cost(all): 17:49:07/10:23:03, loss=0.386282820514639, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.98(1.03), norm=3.542589245495322, lr=0.16588591622234705
2023-12-13 09:34:30   INFO  epoch: 15/24, acc_iter=58180, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:05:02, time_cost(all): 17:50:03/10:58:09, loss=0.386095775897551, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.05(1.03), norm=3.1200461922132865, lr=0.16554401499624366
2023-12-13 09:35:25   INFO  epoch: 15/24, acc_iter=58230, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:05:16, time_cost(all): 17:50:58/10:11:16, loss=0.385908731280462, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.12(1.03), norm=3.6586690600657032, lr=0.16520211377014027
2023-12-13 09:36:20   INFO  epoch: 15/24, acc_iter=58280, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:05:55, time_cost(all): 17:51:53/10:24:02, loss=0.385721686663374, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.08(1.03), norm=2.1353811197508863, lr=0.16486021254403688
2023-12-13 09:37:16   INFO  epoch: 15/24, acc_iter=58330, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:25, time_cost(all): 17:52:49/10:52:13, loss=0.385534642046285, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=0.7707400299226095, lr=0.16451831131793349
2023-12-13 09:38:11   INFO  epoch: 15/24, acc_iter=58380, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:55, time_cost(all): 17:53:44/10:29:54, loss=0.385347597429197, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.22(1.03), norm=2.500281812717007, lr=0.16417641009183015
2023-12-13 09:39:06   INFO  epoch: 15/24, acc_iter=58430, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:46, time_cost(all): 17:54:39/10:26:58, loss=0.385160552812108, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.97(1.03), norm=2.775094176663812, lr=0.1638345088657267
2023-12-13 09:40:02   INFO  epoch: 15/24, acc_iter=58480, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:09, time_cost(all): 17:55:35/10:50:55, loss=0.38497350819502, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=2.6596107536286366, lr=0.1634926076396233
2023-12-13 09:40:57   INFO  epoch: 15/24, acc_iter=58530, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:01:53, time_cost(all): 17:56:30/10:24:39, loss=0.384786463577931, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=4.370100821874997, lr=0.16315070641351992
2023-12-13 09:41:52   INFO  epoch: 15/24, acc_iter=58580, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:57:28, time_cost(all): 17:57:25/10:43:50, loss=0.384599418960842, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=0.8806588195740033, lr=0.16280880518741653
2023-12-13 09:42:48   INFO  epoch: 15/24, acc_iter=58630, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:58:41, time_cost(all): 17:58:21/10:28:01, loss=0.384412374343754, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.13(1.03), norm=4.669352849122655, lr=0.1624669039613132
2023-12-13 09:43:43   INFO  epoch: 15/24, acc_iter=58680, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:57:29, time_cost(all): 17:59:16/10:33:38, loss=0.384225329726665, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=0.6291102686260723, lr=0.1621250027352098
2023-12-13 09:44:38   INFO  epoch: 15/24, acc_iter=58730, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:59:14, time_cost(all): 18:00:11/10:38:55, loss=0.384038285109577, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.489640324046224, lr=0.1617831015091064
2023-12-13 09:45:34   INFO  epoch: 15/24, acc_iter=58780, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:20, time_cost(all): 18:01:07/10:35:12, loss=0.383851240492488, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=1.9424372717922012, lr=0.16144120028300302
2023-12-13 09:46:29   INFO  epoch: 15/24, acc_iter=58830, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:53:47, time_cost(all): 18:02:02/10:56:30, loss=0.3836641958754, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.03(1.03), norm=0.9726447086008181, lr=0.16109929905689963
2023-12-13 09:47:24   INFO  epoch: 15/24, acc_iter=58880, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:52:05, time_cost(all): 18:02:57/10:54:38, loss=0.383477151258311, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.15(1.03), norm=4.110147722435624, lr=0.1607573978307963
2023-12-13 09:48:20   INFO  epoch: 15/24, acc_iter=58930, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:51:57, time_cost(all): 18:03:53/10:12:11, loss=0.383290106641223, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.18(1.03), norm=0.8963391284859372, lr=0.1604154966046929
2023-12-13 09:49:15   INFO  epoch: 15/24, acc_iter=58980, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:22, time_cost(all): 18:04:48/10:25:51, loss=0.383103062024134, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=3.9368838286363173, lr=0.1600735953785895
2023-12-13 09:50:11   INFO  epoch: 15/24, acc_iter=59030, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:52:47, time_cost(all): 18:05:44/10:08:10, loss=0.382916017407046, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=4.176234979776376, lr=0.15973169415248611
2023-12-13 09:51:06   INFO  epoch: 15/24, acc_iter=59080, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:50:31, time_cost(all): 18:06:39/10:08:11, loss=0.382728972789957, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.16(1.03), norm=2.9288946944392427, lr=0.15938979292638272
2023-12-13 09:52:01   INFO  epoch: 15/24, acc_iter=59130, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:02, time_cost(all): 18:07:34/10:41:55, loss=0.382541928172869, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.89(1.03), norm=3.78968639410106, lr=0.1590478917002794
2023-12-13 09:52:57   INFO  epoch: 15/24, acc_iter=59180, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:34, time_cost(all): 18:08:30/10:17:17, loss=0.38235488355578, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=1.5866670899209554, lr=0.158705990474176
2023-12-13 09:53:52   INFO  epoch: 15/24, acc_iter=59230, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:46:07, time_cost(all): 18:09:25/9:51:36, loss=0.382167838938691, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.9(1.03), norm=1.39262253054295, lr=0.15836408924807255
2023-12-13 09:54:47   INFO  epoch: 15/24, acc_iter=59280, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:45:59, time_cost(all): 18:10:20/10:03:32, loss=0.381980794321603, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=3.1064301159857344, lr=0.15802218802196916
2023-12-13 09:55:43   INFO  epoch: 15/24, acc_iter=59330, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:44:20, time_cost(all): 18:11:16/10:42:27, loss=0.381793749704514, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.23(1.03), norm=2.8100724903228924, lr=0.15768028679586582
2023-12-13 09:56:38   INFO  epoch: 15/24, acc_iter=59380, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:07, time_cost(all): 18:12:11/10:08:00, loss=0.381606705087426, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.96(1.03), norm=1.0331149643075013, lr=0.15733838556976243
2023-12-13 09:57:33   INFO  epoch: 15/24, acc_iter=59430, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:49, time_cost(all): 18:13:06/10:27:43, loss=0.381419660470337, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.87(1.03), norm=3.132735488561059, lr=0.15699648434365904
2023-12-13 09:58:29   INFO  epoch: 15/24, acc_iter=59480, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:35, time_cost(all): 18:14:02/10:07:21, loss=0.381232615853249, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=4.107543129103655, lr=0.15665458311755565
2023-12-13 09:59:24   INFO  epoch: 15/24, acc_iter=59530, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:40:24, time_cost(all): 18:14:57/10:17:30, loss=0.38104557123616, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.21(1.03), norm=2.8915603429443486, lr=0.15631268189145225
2023-12-13 10:00:19   INFO  epoch: 15/24, acc_iter=59580, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:42:46, time_cost(all): 18:15:52/10:34:24, loss=0.380858526619072, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.12(1.03), norm=4.812732331608348, lr=0.15597078066534892
2023-12-13 10:01:15   INFO  epoch: 15/24, acc_iter=59630, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:21, time_cost(all): 18:16:48/10:12:31, loss=0.380671482001983, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=3.3966300977147266, lr=0.15562887943924553
2023-12-13 10:02:10   INFO  epoch: 15/24, acc_iter=59680, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:11, time_cost(all): 18:17:43/10:36:09, loss=0.380484437384895, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.15(1.03), norm=3.7471803025628287, lr=0.15528697821314214
2023-12-13 10:03:05   INFO  epoch: 15/24, acc_iter=59730, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:11, time_cost(all): 18:18:38/10:02:26, loss=0.380297392767806, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.29939359628907, lr=0.15494507698703874
2023-12-13 10:04:01   INFO  epoch: 15/24, acc_iter=59780, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:12, time_cost(all): 18:19:34/10:19:39, loss=0.380110348150718, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=2.4209333448629056, lr=0.15460317576093535
2023-12-13 10:04:56   INFO  epoch: 15/24, acc_iter=59830, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:27, time_cost(all): 18:20:29/10:11:19, loss=0.379923303533629, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.14(1.03), norm=3.5191691730486405, lr=0.15426127453483202
2023-12-13 10:05:51   INFO  epoch: 15/24, acc_iter=59880, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:36:09, time_cost(all): 18:21:24/9:49:28, loss=0.379736258916541, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=4.844595070993325, lr=0.15391937330872862
2023-12-13 10:06:47   INFO  epoch: 15/24, acc_iter=59930, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:33:43, time_cost(all): 18:22:20/9:49:19, loss=0.379549214299452, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=3.858648174124473, lr=0.15357747208262523
2023-12-13 10:07:42   INFO  epoch: 15/24, acc_iter=59980, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:44, time_cost(all): 18:23:15/10:28:51, loss=0.379362169682363, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.86(1.03), norm=2.6290434023709066, lr=0.15323557085652179
2023-12-13 10:08:37   INFO  epoch: 15/24, acc_iter=60030, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:59, time_cost(all): 18:24:10/10:26:55, loss=0.379175125065275, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.15(1.03), norm=3.3162868683338917, lr=0.1528936696304184
2023-12-13 10:09:33   INFO  epoch: 15/24, acc_iter=60080, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:31:43, time_cost(all): 18:25:06/10:24:33, loss=0.378988080448186, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.18(1.03), norm=0.8353428778574765, lr=0.15255176840431506
2023-12-13 10:10:28   INFO  epoch: 15/24, acc_iter=60130, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:58, time_cost(all): 18:26:01/9:41:59, loss=0.378801035831098, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.97(1.03), norm=1.999091364245295, lr=0.15220986717821167
2023-12-13 10:11:24   INFO  epoch: 15/24, acc_iter=60180, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:31:01, time_cost(all): 18:26:57/10:28:40, loss=0.378613991214009, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.19(1.03), norm=0.7108729053601255, lr=0.15186796595210827
2023-12-13 10:12:19   INFO  epoch: 15/24, acc_iter=60230, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:35, time_cost(all): 18:27:52/10:12:55, loss=0.378426946596921, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.22(1.03), norm=3.845214986536007, lr=0.15152606472600488
2023-12-13 10:13:14   INFO  epoch: 15/24, acc_iter=60280, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:57, time_cost(all): 18:28:47/10:22:52, loss=0.378239901979832, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.594883631905807, lr=0.1511841634999015
2023-12-13 10:14:10   INFO  epoch: 15/24, acc_iter=60330, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:32, time_cost(all): 18:29:43/10:28:05, loss=0.378052857362744, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.14(1.03), norm=2.6021907992635844, lr=0.15084226227379816
2023-12-13 10:15:05   INFO  epoch: 15/24, acc_iter=60380, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:02, time_cost(all): 18:30:38/9:40:31, loss=0.377865812745655, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=1.2921053348349942, lr=0.15050036104769476
2023-12-13 10:16:00   INFO  epoch: 15/24, acc_iter=60430, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:33, time_cost(all): 18:31:33/9:53:39, loss=0.377678768128567, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.1(1.03), norm=2.8151796309297974, lr=0.15015845982159137
2023-12-13 10:16:56   INFO  epoch: 15/24, acc_iter=60480, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:56, time_cost(all): 18:32:29/9:36:00, loss=0.377491723511478, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.03(1.03), norm=0.6841113383350683, lr=0.14981655859548798
2023-12-13 10:17:51   INFO  epoch: 15/24, acc_iter=60530, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:05, time_cost(all): 18:33:24/10:06:31, loss=0.37730467889439, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.03(1.03), norm=2.0214783083552934, lr=0.1494746573693846
2023-12-13 10:18:46   INFO  epoch: 15/24, acc_iter=60580, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:12, time_cost(all): 18:34:19/10:05:39, loss=0.377117634277301, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=2.2669684082241903, lr=0.14913275614328125
2023-12-13 10:19:42   INFO  epoch: 15/24, acc_iter=60630, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:30, time_cost(all): 18:35:15/10:12:12, loss=0.376930589660213, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.86(1.03), norm=2.2241589259879566, lr=0.14879085491717786
2023-12-13 10:20:37   INFO  epoch: 15/24, acc_iter=60680, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:57, time_cost(all): 18:36:10/9:50:55, loss=0.376743545043124, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=3.6433312868210757, lr=0.14844895369107447
2023-12-13 10:21:32   INFO  epoch: 15/24, acc_iter=60730, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:39, time_cost(all): 18:37:05/9:53:18, loss=0.376556500426035, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.92(1.03), norm=3.1946508127480198, lr=0.14810705246497108
2023-12-13 10:22:28   INFO  epoch: 15/24, acc_iter=60780, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:56, time_cost(all): 18:38:01/10:05:14, loss=0.376369455808947, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=2.3801735334675644, lr=0.1477651512388677
2023-12-13 10:23:23   INFO  epoch: 15/24, acc_iter=60830, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:08, time_cost(all): 18:38:56/10:08:02, loss=0.376182411191858, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.85(1.03), norm=2.49125959690518, lr=0.1474232500127643
2023-12-13 10:24:18   INFO  epoch: 15/24, acc_iter=60880, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:55, time_cost(all): 18:39:51/9:32:09, loss=0.37599536657477, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.19(1.03), norm=4.534446176881056, lr=0.1470813487866609
2023-12-13 10:25:14   INFO  epoch: 15/24, acc_iter=60930, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:26, time_cost(all): 18:40:47/9:39:23, loss=0.375808321957681, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=2.1507865496295535, lr=0.1467394475605575
2023-12-13 10:26:09   INFO  epoch: 15/24, acc_iter=60980, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:00, time_cost(all): 18:41:42/9:44:02, loss=0.375621277340593, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.84(1.03), norm=4.269431526119909, lr=0.14639754633445412
2023-12-13 10:27:04   INFO  epoch: 15/24, acc_iter=61030, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:26, time_cost(all): 18:42:37/9:24:09, loss=0.375434232723504, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=4.021435323516537, lr=0.14605564510835078
2023-12-13 10:28:00   INFO  epoch: 15/24, acc_iter=61080, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:29, time_cost(all): 18:43:33/9:37:39, loss=0.375247188106416, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=0.7327141926515587, lr=0.1457137438822474
2023-12-13 10:28:55   INFO  epoch: 15/24, acc_iter=61130, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:47, time_cost(all): 18:44:28/9:43:19, loss=0.375060143489327, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=0.8408025877511902, lr=0.145371842656144
2023-12-13 10:29:50   INFO  epoch: 15/24, acc_iter=61180, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:38, time_cost(all): 18:45:23/9:47:06, loss=0.374873098872239, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.08(1.03), norm=2.785803091576665, lr=0.1450299414300406
2023-12-13 10:30:46   INFO  epoch: 15/24, acc_iter=61230, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:26, time_cost(all): 18:46:19/9:21:53, loss=0.37468605425515, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.86(1.03), norm=4.598654553600944, lr=0.14468804020393722
2023-12-13 10:31:41   INFO  epoch: 15/24, acc_iter=61280, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:12, time_cost(all): 18:47:14/9:17:24, loss=0.374499009638062, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=3.5517849756571156, lr=0.14434613897783388
2023-12-13 10:32:37   INFO  epoch: 15/24, acc_iter=61330, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:13, time_cost(all): 18:48:10/9:57:28, loss=0.374311965020973, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.87(1.03), norm=3.0910056937796155, lr=0.1440042377517305
2023-12-13 10:33:32   INFO  epoch: 15/24, acc_iter=61380, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:23, time_cost(all): 18:49:05/9:35:23, loss=0.374124920403884, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=2.7890913160236224, lr=0.1436623365256271
2023-12-13 10:34:27   INFO  epoch: 15/24, acc_iter=61430, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:35, time_cost(all): 18:50:00/9:30:55, loss=0.373937875786796, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=3.6850421892674325, lr=0.1433204352995237
2023-12-13 10:35:23   INFO  epoch: 15/24, acc_iter=61480, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:33, time_cost(all): 18:50:56/9:30:55, loss=0.373750831169707, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.97(1.03), norm=4.196958258766666, lr=0.14297853407342032
2023-12-13 10:36:18   INFO  epoch: 15/24, acc_iter=61530, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:54, time_cost(all): 18:51:51/10:00:41, loss=0.373563786552619, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.05(1.03), norm=3.878414382468733, lr=0.14263663284731698
2023-12-13 10:37:13   INFO  epoch: 15/24, acc_iter=61580, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:52, time_cost(all): 18:52:46/9:39:24, loss=0.37337674193553, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=1.026094392539029, lr=0.14229473162121353
2023-12-13 10:38:09   INFO  epoch: 15/24, acc_iter=61630, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:51, time_cost(all): 18:53:42/9:45:14, loss=0.373189697318442, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=1.820426603805073, lr=0.14195283039511014
2023-12-13 10:39:04   INFO  epoch: 15/24, acc_iter=61680, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:04, time_cost(all): 18:54:37/9:25:17, loss=0.373002652701353, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.1(1.03), norm=1.3006769904179984, lr=0.14161092916900675
2023-12-13 10:39:59   INFO  epoch: 15/24, acc_iter=61730, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 18:55:32/9:12:16, loss=0.372815608084265, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.9(1.03), norm=0.6500248956543011, lr=0.14126902794290336
2023-12-13 10:40:55   INFO  epoch: 15/24, acc_iter=61780, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 18:56:28/9:19:17, loss=0.372628563467176, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.99(1.03), norm=4.97798464598879, lr=0.14092712671680002
2023-12-13 10:41:50   INFO  epoch: 16/24, acc_iter=61842, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:10:18, time_cost(all): 18:57:23/9:49:42, loss=0.372396628141986, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.96(1.03), norm=0.6547555408746997, lr=0.14050316919643185
2023-12-13 10:42:45   INFO  epoch: 16/24, acc_iter=61892, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:06:15, time_cost(all): 18:58:18/9:22:26, loss=0.372209583524898, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.04(1.03), norm=0.6144203141484019, lr=0.14016126797032846
2023-12-13 10:43:41   INFO  epoch: 16/24, acc_iter=61942, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:34, time_cost(all): 18:59:14/9:09:05, loss=0.372022538907809, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.17(1.03), norm=3.8462589872144908, lr=0.13981936674422507
2023-12-13 10:44:36   INFO  epoch: 16/24, acc_iter=61992, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:04:37, time_cost(all): 19:00:09/9:47:06, loss=0.371835494290721, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.66327845885099, lr=0.13947746551812162
2023-12-13 10:45:31   INFO  epoch: 16/24, acc_iter=62042, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:07:36, time_cost(all): 19:01:04/9:27:54, loss=0.371648449673632, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.95(1.03), norm=1.3335761403892392, lr=0.1391355642920183
2023-12-13 10:46:27   INFO  epoch: 16/24, acc_iter=62092, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:16, time_cost(all): 19:02:00/9:38:27, loss=0.371461405056544, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.1(1.03), norm=3.496053556877098, lr=0.1387936630659149
2023-12-13 10:47:22   INFO  epoch: 16/24, acc_iter=62142, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:07:37, time_cost(all): 19:02:55/9:40:35, loss=0.371274360439455, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=2.105548295579502, lr=0.1384517618398115
2023-12-13 10:48:17   INFO  epoch: 16/24, acc_iter=62192, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:02:05, time_cost(all): 19:03:50/9:34:46, loss=0.371087315822367, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.87(1.03), norm=1.226594184867373, lr=0.1381098606137081
2023-12-13 10:49:13   INFO  epoch: 16/24, acc_iter=62242, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:01:23, time_cost(all): 19:04:46/9:20:41, loss=0.370900271205278, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.95(1.03), norm=1.0390570604018845, lr=0.13776795938760472
2023-12-13 10:50:08   INFO  epoch: 16/24, acc_iter=62292, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:03:12, time_cost(all): 19:05:41/9:24:24, loss=0.37071322658819, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.86(1.03), norm=4.614183232656826, lr=0.13742605816150139
2023-12-13 10:51:03   INFO  epoch: 16/24, acc_iter=62342, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:41, time_cost(all): 19:06:36/8:57:14, loss=0.370526181971101, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.09(1.03), norm=1.2255707050349736, lr=0.137084156935398
2023-12-13 10:51:59   INFO  epoch: 16/24, acc_iter=62392, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:31, time_cost(all): 19:07:32/8:55:52, loss=0.370339137354012, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.95(1.03), norm=4.866282209557729, lr=0.1367422557092946
2023-12-13 10:52:54   INFO  epoch: 16/24, acc_iter=62442, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:00:00, time_cost(all): 19:08:27/9:41:42, loss=0.370152092736924, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.08(1.03), norm=1.7357240436370136, lr=0.1364003544831912
2023-12-13 10:53:49   INFO  epoch: 16/24, acc_iter=62492, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:55:27, time_cost(all): 19:09:22/9:06:36, loss=0.369965048119835, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.11(1.03), norm=3.6923681757223052, lr=0.13605845325708782
2023-12-13 10:54:45   INFO  epoch: 16/24, acc_iter=62542, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:57:31, time_cost(all): 19:10:18/9:00:19, loss=0.369778003502747, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=0.567139956282563, lr=0.13571655203098448
2023-12-13 10:55:40   INFO  epoch: 16/24, acc_iter=62592, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:37, time_cost(all): 19:11:13/9:14:25, loss=0.369590958885658, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.03(1.03), norm=2.8605088001387102, lr=0.1353746508048811
2023-12-13 10:56:36   INFO  epoch: 16/24, acc_iter=62642, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:58:03, time_cost(all): 19:12:09/8:53:12, loss=0.36940391426857, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.09(1.03), norm=1.0179602457815342, lr=0.1350327495787777
2023-12-13 10:57:31   INFO  epoch: 16/24, acc_iter=62692, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:55:40, time_cost(all): 19:13:04/9:16:04, loss=0.369216869651481, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.17(1.03), norm=0.5821085552597753, lr=0.1346908483526743
2023-12-13 10:58:26   INFO  epoch: 16/24, acc_iter=62742, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:54:59, time_cost(all): 19:13:59/8:54:29, loss=0.369029825034393, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.98(1.03), norm=1.393504812397044, lr=0.13434894712657086
2023-12-13 10:59:22   INFO  epoch: 16/24, acc_iter=62792, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:53:41, time_cost(all): 19:14:55/9:39:56, loss=0.368842780417304, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=1.1165829289329139, lr=0.13400704590046753
2023-12-13 11:00:17   INFO  epoch: 16/24, acc_iter=62842, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:50:04, time_cost(all): 19:15:50/9:32:59, loss=0.368655735800216, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.86(1.03), norm=3.5815939683300284, lr=0.13366514467436413
2023-12-13 11:01:12   INFO  epoch: 16/24, acc_iter=62892, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:50:29, time_cost(all): 19:16:45/9:21:36, loss=0.368468691183127, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=1.73009790841633, lr=0.13332324344826074
2023-12-13 11:02:08   INFO  epoch: 16/24, acc_iter=62942, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:47:34, time_cost(all): 19:17:41/9:19:51, loss=0.368281646566039, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.83(1.03), norm=0.902752941622747, lr=0.13298134222215735
2023-12-13 11:03:03   INFO  epoch: 16/24, acc_iter=62992, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:47:38, time_cost(all): 19:18:36/9:13:06, loss=0.36809460194895, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.17(1.03), norm=3.485172966967736, lr=0.13263944099605396
2023-12-13 11:03:58   INFO  epoch: 16/24, acc_iter=63042, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:08, time_cost(all): 19:19:31/8:50:41, loss=0.367907557331862, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.11(1.03), norm=2.012341922913018, lr=0.13229753976995062
2023-12-13 11:04:54   INFO  epoch: 16/24, acc_iter=63092, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:46:44, time_cost(all): 19:20:27/8:50:01, loss=0.367720512714773, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.93(1.03), norm=3.2782979444836897, lr=0.13195563854384723
2023-12-13 11:05:49   INFO  epoch: 16/24, acc_iter=63142, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:06, time_cost(all): 19:21:22/8:51:24, loss=0.367533468097685, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.61192718825663, lr=0.13161373731774384
2023-12-13 11:06:44   INFO  epoch: 16/24, acc_iter=63192, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:43:39, time_cost(all): 19:22:17/9:23:24, loss=0.367346423480596, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.96(1.03), norm=3.409387161530391, lr=0.13127183609164045
2023-12-13 11:07:40   INFO  epoch: 16/24, acc_iter=63242, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:43:01, time_cost(all): 19:23:13/8:50:48, loss=0.367159378863507, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.04(1.03), norm=1.8579308621233774, lr=0.13092993486553706
2023-12-13 11:08:35   INFO  epoch: 16/24, acc_iter=63292, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:02, time_cost(all): 19:24:08/9:19:57, loss=0.366972334246419, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.15(1.03), norm=4.632664582312012, lr=0.13058803363943372
2023-12-13 11:09:30   INFO  epoch: 16/24, acc_iter=63342, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:50, time_cost(all): 19:25:03/9:02:55, loss=0.36678528962933, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.252819382518591, lr=0.13024613241333033
2023-12-13 11:10:26   INFO  epoch: 16/24, acc_iter=63392, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:50, time_cost(all): 19:25:59/8:46:35, loss=0.366598245012242, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=2.544314585303626, lr=0.12990423118722694
2023-12-13 11:11:21   INFO  epoch: 16/24, acc_iter=63442, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:47, time_cost(all): 19:26:54/9:11:53, loss=0.366411200395153, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.06(1.03), norm=1.492167763585, lr=0.12956232996112355
2023-12-13 11:12:16   INFO  epoch: 16/24, acc_iter=63492, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:30, time_cost(all): 19:27:49/9:25:58, loss=0.366224155778065, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.86(1.03), norm=4.871722716913943, lr=0.12922042873502015
2023-12-13 11:13:12   INFO  epoch: 16/24, acc_iter=63542, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:40:05, time_cost(all): 19:28:45/9:09:03, loss=0.366037111160976, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.06(1.03), norm=2.3255774454196523, lr=0.12887852750891676
2023-12-13 11:14:07   INFO  epoch: 16/24, acc_iter=63592, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:38:31, time_cost(all): 19:29:40/9:09:44, loss=0.365850066543888, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.15(1.03), norm=2.8971031754569045, lr=0.12853662628281337
2023-12-13 11:15:02   INFO  epoch: 16/24, acc_iter=63642, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:35:56, time_cost(all): 19:30:35/9:24:14, loss=0.365663021926799, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.99(1.03), norm=1.5274945955744932, lr=0.12819472505670998
2023-12-13 11:15:58   INFO  epoch: 16/24, acc_iter=63692, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:21, time_cost(all): 19:31:31/8:46:02, loss=0.365475977309711, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.02(1.03), norm=1.1775973788589296, lr=0.1278528238306066
2023-12-13 11:16:53   INFO  epoch: 16/24, acc_iter=63742, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:32, time_cost(all): 19:32:26/9:07:15, loss=0.365288932692622, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.13(1.03), norm=2.0741520449789075, lr=0.12751092260450325
2023-12-13 11:17:49   INFO  epoch: 16/24, acc_iter=63792, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:25, time_cost(all): 19:33:22/8:34:01, loss=0.365101888075534, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=3.7779997250911497, lr=0.12716902137839986
2023-12-13 11:18:44   INFO  epoch: 16/24, acc_iter=63842, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:44, time_cost(all): 19:34:17/9:09:06, loss=0.364914843458445, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=1.7456760037207213, lr=0.12682712015229647
2023-12-13 11:19:39   INFO  epoch: 16/24, acc_iter=63892, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:56, time_cost(all): 19:35:12/9:16:52, loss=0.364727798841356, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.86(1.03), norm=1.3341760933116669, lr=0.12648521892619308
2023-12-13 11:20:35   INFO  epoch: 16/24, acc_iter=63942, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:12, time_cost(all): 19:36:08/8:45:25, loss=0.364540754224268, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.98(1.03), norm=3.6177575927559404, lr=0.12614331770008969
2023-12-13 11:21:30   INFO  epoch: 16/24, acc_iter=63992, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:31, time_cost(all): 19:37:03/9:01:41, loss=0.364353709607179, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=4.277380794171133, lr=0.12580141647398635
2023-12-13 11:22:25   INFO  epoch: 16/24, acc_iter=64042, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:59, time_cost(all): 19:37:58/8:47:17, loss=0.364166664990091, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.2(1.03), norm=2.9606770316018824, lr=0.12545951524788296
2023-12-13 11:23:21   INFO  epoch: 16/24, acc_iter=64092, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:27:32, time_cost(all): 19:38:54/8:43:31, loss=0.363979620373002, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.5135639899346964, lr=0.12511761402177957
2023-12-13 11:24:16   INFO  epoch: 16/24, acc_iter=64142, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:07, time_cost(all): 19:39:49/8:35:33, loss=0.363792575755914, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.95(1.03), norm=1.7716259834950718, lr=0.12477571279567617
2023-12-13 11:25:11   INFO  epoch: 16/24, acc_iter=64192, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:28:11, time_cost(all): 19:40:44/9:11:51, loss=0.363605531138825, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=3.9305950436521764, lr=0.12443381156957278
2023-12-13 11:26:07   INFO  epoch: 16/24, acc_iter=64242, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:09, time_cost(all): 19:41:40/8:45:33, loss=0.363418486521737, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=1.2566106330477829, lr=0.12409191034346945
2023-12-13 11:27:02   INFO  epoch: 16/24, acc_iter=64292, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:17, time_cost(all): 19:42:35/9:07:27, loss=0.363231441904648, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.15(1.03), norm=2.7441233119732886, lr=0.12375000911736606
2023-12-13 11:27:57   INFO  epoch: 16/24, acc_iter=64342, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:26, time_cost(all): 19:43:30/8:33:33, loss=0.36304439728756, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.16335816896306, lr=0.12340810789126261
2023-12-13 11:28:53   INFO  epoch: 16/24, acc_iter=64392, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:22:19, time_cost(all): 19:44:26/9:10:08, loss=0.362857352670471, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=4.351607547058918, lr=0.12306620666515922
2023-12-13 11:29:48   INFO  epoch: 16/24, acc_iter=64442, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:17, time_cost(all): 19:45:21/8:24:33, loss=0.362670308053383, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.06(1.03), norm=3.8141192141800335, lr=0.12272430543905583
2023-12-13 11:30:43   INFO  epoch: 16/24, acc_iter=64492, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:34, time_cost(all): 19:46:16/9:00:25, loss=0.362483263436294, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.15(1.03), norm=0.6956823339186158, lr=0.12238240421295249
2023-12-13 11:31:39   INFO  epoch: 16/24, acc_iter=64542, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:20:35, time_cost(all): 19:47:12/8:38:04, loss=0.362296218819205, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.85(1.03), norm=2.6384110311990305, lr=0.1220405029868491
2023-12-13 11:32:34   INFO  epoch: 16/24, acc_iter=64592, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:19, time_cost(all): 19:48:07/8:21:55, loss=0.362109174202117, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.02(1.03), norm=4.418942909276242, lr=0.1216986017607457
2023-12-13 11:33:29   INFO  epoch: 16/24, acc_iter=64642, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:25, time_cost(all): 19:49:02/8:35:23, loss=0.361922129585028, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.2877542869510086, lr=0.12135670053464231
2023-12-13 11:34:25   INFO  epoch: 16/24, acc_iter=64692, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:37, time_cost(all): 19:49:58/8:34:35, loss=0.36173508496794, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.93(1.03), norm=4.394969840744851, lr=0.12101479930853892
2023-12-13 11:35:20   INFO  epoch: 16/24, acc_iter=64742, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:06, time_cost(all): 19:50:53/8:26:38, loss=0.361548040350851, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.13(1.03), norm=2.8770469059788297, lr=0.12067289808243559
2023-12-13 11:36:15   INFO  epoch: 16/24, acc_iter=64792, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:25, time_cost(all): 19:51:48/8:21:25, loss=0.361360995733763, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=2.3700569960585853, lr=0.1203309968563322
2023-12-13 11:37:11   INFO  epoch: 16/24, acc_iter=64842, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:40, time_cost(all): 19:52:44/8:59:29, loss=0.361173951116674, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=2.0656790609199747, lr=0.1199890956302288
2023-12-13 11:38:06   INFO  epoch: 16/24, acc_iter=64892, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:32, time_cost(all): 19:53:39/8:35:41, loss=0.360986906499586, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.23(1.03), norm=0.825526115504627, lr=0.11964719440412541
2023-12-13 11:39:02   INFO  epoch: 16/24, acc_iter=64942, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:39, time_cost(all): 19:54:35/8:55:57, loss=0.360799861882497, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=2.4536406859809126, lr=0.11930529317802202
2023-12-13 11:39:57   INFO  epoch: 16/24, acc_iter=64992, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:33, time_cost(all): 19:55:30/8:48:16, loss=0.360612817265409, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=0.840404316993502, lr=0.11896339195191868
2023-12-13 11:40:52   INFO  epoch: 16/24, acc_iter=65042, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:08, time_cost(all): 19:56:25/8:46:55, loss=0.36042577264832, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=3.4783634479649566, lr=0.11862149072581529
2023-12-13 11:41:48   INFO  epoch: 16/24, acc_iter=65092, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:00, time_cost(all): 19:57:21/8:32:10, loss=0.360238728031232, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.13(1.03), norm=1.9283837598052262, lr=0.1182795894997119
2023-12-13 11:42:43   INFO  epoch: 16/24, acc_iter=65142, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:40, time_cost(all): 19:58:16/8:46:55, loss=0.360051683414143, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.19(1.03), norm=2.2766724505949494, lr=0.11793768827360845
2023-12-13 11:43:38   INFO  epoch: 16/24, acc_iter=65192, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:39, time_cost(all): 19:59:11/8:41:31, loss=0.359864638797055, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.88(1.03), norm=2.523333453050555, lr=0.11759578704750512
2023-12-13 11:44:34   INFO  epoch: 16/24, acc_iter=65242, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:46, time_cost(all): 20:00:07/8:41:25, loss=0.359677594179966, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=4.312201479958414, lr=0.11725388582140173
2023-12-13 11:45:29   INFO  epoch: 16/24, acc_iter=65292, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:55, time_cost(all): 20:01:02/8:38:32, loss=0.359490549562877, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.07(1.03), norm=3.6275514395579798, lr=0.11691198459529833
2023-12-13 11:46:24   INFO  epoch: 16/24, acc_iter=65342, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:47, time_cost(all): 20:01:57/8:20:16, loss=0.359303504945789, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.0(1.03), norm=1.513926639704878, lr=0.11657008336919494
2023-12-13 11:47:20   INFO  epoch: 16/24, acc_iter=65392, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:36, time_cost(all): 20:02:53/8:28:06, loss=0.3591164603287, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.93(1.03), norm=0.871005874917353, lr=0.11622818214309155
2023-12-13 11:48:15   INFO  epoch: 16/24, acc_iter=65442, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:47, time_cost(all): 20:03:48/8:37:36, loss=0.358929415711612, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.9618562924310288, lr=0.11588628091698822
2023-12-13 11:49:10   INFO  epoch: 16/24, acc_iter=65492, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:53, time_cost(all): 20:04:43/8:18:03, loss=0.358742371094523, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.84(1.03), norm=4.594808146856004, lr=0.11554437969088482
2023-12-13 11:50:06   INFO  epoch: 16/24, acc_iter=65542, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:00, time_cost(all): 20:05:39/8:42:18, loss=0.358555326477435, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=1.8570797095987246, lr=0.11520247846478143
2023-12-13 11:51:01   INFO  epoch: 16/24, acc_iter=65592, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:11, time_cost(all): 20:06:34/8:14:39, loss=0.358368281860346, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=3.2436342015250914, lr=0.11486057723867804
2023-12-13 11:51:56   INFO  epoch: 16/24, acc_iter=65642, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 20:07:29/8:44:40, loss=0.358181237243258, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=0.7027226066813949, lr=0.11451867601257465
2023-12-13 11:52:52   INFO  epoch: 17/24, acc_iter=65704, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:13:28, time_cost(all): 20:08:25/8:19:50, loss=0.357949301918068, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.2(1.03), norm=3.17367294361098, lr=0.11409471849220643
2023-12-13 11:53:47   INFO  epoch: 17/24, acc_iter=65754, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:10:19, time_cost(all): 20:09:20/8:15:45, loss=0.357762257300979, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.17(1.03), norm=0.637457190150759, lr=0.11375281726610309
2023-12-13 11:54:42   INFO  epoch: 17/24, acc_iter=65804, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:08:17, time_cost(all): 20:10:15/8:07:45, loss=0.357575212683891, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.92(1.03), norm=4.2147416744071995, lr=0.1134109160399997
2023-12-13 11:55:38   INFO  epoch: 17/24, acc_iter=65854, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:07:07, time_cost(all): 20:11:11/8:26:22, loss=0.357388168066802, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.09(1.03), norm=1.7995122148860356, lr=0.1130690148138963
2023-12-13 11:56:33   INFO  epoch: 17/24, acc_iter=65904, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:06:21, time_cost(all): 20:12:06/7:55:06, loss=0.357201123449714, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.13(1.03), norm=1.4809321668537374, lr=0.11272711358779292
2023-12-13 11:57:28   INFO  epoch: 17/24, acc_iter=65954, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:04:18, time_cost(all): 20:13:01/8:17:04, loss=0.357014078832625, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.05(1.03), norm=4.146857870812132, lr=0.11238521236168952
2023-12-13 11:58:24   INFO  epoch: 17/24, acc_iter=66004, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:02:04, time_cost(all): 20:13:57/7:59:09, loss=0.356827034215537, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=0.8888516605881355, lr=0.11204331113558619
2023-12-13 11:59:19   INFO  epoch: 17/24, acc_iter=66054, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:04:39, time_cost(all): 20:14:52/8:00:17, loss=0.356639989598448, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=0.6805745042393114, lr=0.1117014099094828
2023-12-13 12:00:15   INFO  epoch: 17/24, acc_iter=66104, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:01:17, time_cost(all): 20:15:48/8:32:36, loss=0.35645294498136, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.88(1.03), norm=3.260732561832338, lr=0.1113595086833794
2023-12-13 12:01:10   INFO  epoch: 17/24, acc_iter=66154, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:45, time_cost(all): 20:16:43/8:07:34, loss=0.356265900364271, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.91(1.03), norm=4.931661890823333, lr=0.11101760745727601
2023-12-13 12:02:05   INFO  epoch: 17/24, acc_iter=66204, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:04:03, time_cost(all): 20:17:38/8:28:42, loss=0.356078855747183, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=2.938449219123353, lr=0.11067570623117262
2023-12-13 12:03:01   INFO  epoch: 17/24, acc_iter=66254, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:57:33, time_cost(all): 20:18:34/8:14:26, loss=0.355891811130094, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.91(1.03), norm=3.6831945214014765, lr=0.11033380500506929
2023-12-13 12:03:56   INFO  epoch: 17/24, acc_iter=66304, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:01:01, time_cost(all): 20:19:29/8:12:18, loss=0.355704766513005, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.91(1.03), norm=0.6079828920129926, lr=0.10999190377896584
2023-12-13 12:04:51   INFO  epoch: 17/24, acc_iter=66354, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:30, time_cost(all): 20:20:24/8:10:24, loss=0.355517721895917, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.23(1.03), norm=4.561577368647919, lr=0.10965000255286245
2023-12-13 12:05:47   INFO  epoch: 17/24, acc_iter=66404, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:56:23, time_cost(all): 20:21:20/8:08:18, loss=0.355330677278828, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.1(1.03), norm=2.4836617699655688, lr=0.10930810132675906
2023-12-13 12:06:42   INFO  epoch: 17/24, acc_iter=66454, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:56, time_cost(all): 20:22:15/8:15:51, loss=0.35514363266174, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.02(1.03), norm=4.193297813849618, lr=0.10896620010065572
2023-12-13 12:07:37   INFO  epoch: 17/24, acc_iter=66504, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:03, time_cost(all): 20:23:10/7:52:07, loss=0.354956588044651, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=1.9644346212616168, lr=0.10862429887455233
2023-12-13 12:08:33   INFO  epoch: 17/24, acc_iter=66554, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:56:32, time_cost(all): 20:24:06/7:42:51, loss=0.354769543427563, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.88(1.03), norm=1.5622733463400442, lr=0.10828239764844894
2023-12-13 12:09:28   INFO  epoch: 17/24, acc_iter=66604, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:46, time_cost(all): 20:25:01/7:58:00, loss=0.354582498810474, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.16(1.03), norm=4.669074882507877, lr=0.10794049642234554
2023-12-13 12:10:23   INFO  epoch: 17/24, acc_iter=66654, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:50:11, time_cost(all): 20:25:56/7:53:36, loss=0.354395454193386, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.99(1.03), norm=4.884277186383018, lr=0.10759859519624215
2023-12-13 12:11:19   INFO  epoch: 17/24, acc_iter=66704, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:50:02, time_cost(all): 20:26:52/8:00:20, loss=0.354208409576297, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=4.249922454585658, lr=0.10725669397013882
2023-12-13 12:12:14   INFO  epoch: 17/24, acc_iter=66754, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:53:09, time_cost(all): 20:27:47/7:49:50, loss=0.354021364959209, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.07(1.03), norm=1.2115650315576658, lr=0.10691479274403543
2023-12-13 12:13:09   INFO  epoch: 17/24, acc_iter=66804, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:47:49, time_cost(all): 20:28:42/8:22:55, loss=0.35383432034212, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.88(1.03), norm=0.7618434914284187, lr=0.10657289151793203
2023-12-13 12:14:05   INFO  epoch: 17/24, acc_iter=66854, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:57, time_cost(all): 20:29:38/7:52:51, loss=0.353647275725032, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.19(1.03), norm=2.5977840799064627, lr=0.10623099029182864
2023-12-13 12:15:00   INFO  epoch: 17/24, acc_iter=66904, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:48:27, time_cost(all): 20:30:33/7:43:34, loss=0.353460231107943, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=4.553243394278411, lr=0.10588908906572525
2023-12-13 12:15:55   INFO  epoch: 17/24, acc_iter=66954, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:49:16, time_cost(all): 20:31:28/8:18:58, loss=0.353273186490855, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.22(1.03), norm=3.2607440834540045, lr=0.10554718783962191
2023-12-13 12:16:51   INFO  epoch: 17/24, acc_iter=67004, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:44:54, time_cost(all): 20:32:24/7:57:02, loss=0.353086141873766, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=1.2632953960395013, lr=0.10520528661351852
2023-12-13 12:17:46   INFO  epoch: 17/24, acc_iter=67054, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:47:14, time_cost(all): 20:33:19/7:55:46, loss=0.352899097256677, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.03(1.03), norm=0.869568906932491, lr=0.10486338538741513
2023-12-13 12:18:41   INFO  epoch: 17/24, acc_iter=67104, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:44:16, time_cost(all): 20:34:14/7:56:36, loss=0.352712052639589, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.04(1.03), norm=4.377620239403294, lr=0.10452148416131168
2023-12-13 12:19:37   INFO  epoch: 17/24, acc_iter=67154, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:44:09, time_cost(all): 20:35:10/7:36:38, loss=0.3525250080225, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.99(1.03), norm=2.8201625882330625, lr=0.10417958293520829
2023-12-13 12:20:32   INFO  epoch: 17/24, acc_iter=67204, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:44:42, time_cost(all): 20:36:05/7:51:02, loss=0.352337963405412, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.88(1.03), norm=2.4655445518119175, lr=0.10383768170910496
2023-12-13 12:21:28   INFO  epoch: 17/24, acc_iter=67254, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:43:16, time_cost(all): 20:37:01/8:13:25, loss=0.352150918788323, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.12(1.03), norm=1.2140660987518812, lr=0.10349578048300156
2023-12-13 12:22:23   INFO  epoch: 17/24, acc_iter=67304, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:39:31, time_cost(all): 20:37:56/8:10:22, loss=0.351963874171235, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=1.6841582634614958, lr=0.10315387925689817
2023-12-13 12:23:18   INFO  epoch: 17/24, acc_iter=67354, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:40:39, time_cost(all): 20:38:51/7:39:00, loss=0.351776829554146, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=4.028882298798796, lr=0.10281197803079478
2023-12-13 12:24:14   INFO  epoch: 17/24, acc_iter=67404, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:37, time_cost(all): 20:39:47/8:10:50, loss=0.351589784937058, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.0(1.03), norm=2.6263099318623024, lr=0.10247007680469139
2023-12-13 12:25:09   INFO  epoch: 17/24, acc_iter=67454, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:56, time_cost(all): 20:40:42/7:33:08, loss=0.351402740319969, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.85(1.03), norm=1.138759432698248, lr=0.10212817557858805
2023-12-13 12:26:04   INFO  epoch: 17/24, acc_iter=67504, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:35:47, time_cost(all): 20:41:37/7:40:22, loss=0.351215695702881, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.88(1.03), norm=1.854209250506024, lr=0.10178627435248466
2023-12-13 12:27:00   INFO  epoch: 17/24, acc_iter=67554, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:59, time_cost(all): 20:42:33/7:59:20, loss=0.351028651085792, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.17(1.03), norm=4.597875984717182, lr=0.10144437312638127
2023-12-13 12:27:55   INFO  epoch: 17/24, acc_iter=67604, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:36:55, time_cost(all): 20:43:28/7:45:31, loss=0.350841606468704, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=2.3462179651381243, lr=0.10110247190027788
2023-12-13 12:28:50   INFO  epoch: 17/24, acc_iter=67654, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:19, time_cost(all): 20:44:23/7:37:00, loss=0.350654561851615, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.94(1.03), norm=3.0955332958723094, lr=0.10076057067417449
2023-12-13 12:29:46   INFO  epoch: 17/24, acc_iter=67704, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:04, time_cost(all): 20:45:19/8:02:38, loss=0.350467517234527, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=3.5738573534578917, lr=0.10041866944807115
2023-12-13 12:30:41   INFO  epoch: 17/24, acc_iter=67754, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:33, time_cost(all): 20:46:14/7:39:19, loss=0.350280472617438, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.7053006324607205, lr=0.10007676822196776
2023-12-13 12:31:36   INFO  epoch: 17/24, acc_iter=67804, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:54, time_cost(all): 20:47:09/7:45:46, loss=0.350093428000349, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.92(1.03), norm=2.2382356862896655, lr=0.09973486699586437
2023-12-13 12:32:32   INFO  epoch: 17/24, acc_iter=67854, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:12, time_cost(all): 20:48:05/7:25:59, loss=0.349906383383261, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.09(1.03), norm=3.6391955809153846, lr=0.09939296576976098
2023-12-13 12:33:27   INFO  epoch: 17/24, acc_iter=67904, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:38, time_cost(all): 20:49:00/7:44:52, loss=0.349719338766172, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.0(1.03), norm=2.459009565145491, lr=0.09905106454365759
2023-12-13 12:34:22   INFO  epoch: 17/24, acc_iter=67954, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:31, time_cost(all): 20:49:55/7:49:48, loss=0.349532294149084, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.86(1.03), norm=3.9035030137450613, lr=0.0987091633175542
2023-12-13 12:35:18   INFO  epoch: 17/24, acc_iter=68004, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:12, time_cost(all): 20:50:51/7:40:21, loss=0.349345249531995, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.88(1.03), norm=3.6866191321558, lr=0.0983672620914508
2023-12-13 12:36:13   INFO  epoch: 17/24, acc_iter=68054, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:19, time_cost(all): 20:51:46/7:15:48, loss=0.349158204914907, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=1.7746140797392553, lr=0.09802536086534741
2023-12-13 12:37:08   INFO  epoch: 17/24, acc_iter=68104, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:58, time_cost(all): 20:52:41/7:24:11, loss=0.348971160297818, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.01(1.03), norm=3.1136710325018404, lr=0.09768345963924402
2023-12-13 12:38:04   INFO  epoch: 17/24, acc_iter=68154, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:26:20, time_cost(all): 20:53:37/7:25:47, loss=0.34878411568073, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.02(1.03), norm=0.8723342625999746, lr=0.09734155841314068
2023-12-13 12:38:59   INFO  epoch: 17/24, acc_iter=68204, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:05, time_cost(all): 20:54:32/7:33:11, loss=0.348597071063641, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.98(1.03), norm=3.6564680445308455, lr=0.09699965718703729
2023-12-13 12:39:54   INFO  epoch: 17/24, acc_iter=68254, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:08, time_cost(all): 20:55:27/7:31:24, loss=0.348410026446553, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.0(1.03), norm=1.5600783826472082, lr=0.0966577559609339
2023-12-13 12:40:50   INFO  epoch: 17/24, acc_iter=68304, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:21:58, time_cost(all): 20:56:23/7:40:54, loss=0.348222981829464, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.96(1.03), norm=4.010555879196078, lr=0.09631585473483051
2023-12-13 12:41:45   INFO  epoch: 17/24, acc_iter=68354, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:37, time_cost(all): 20:57:18/7:51:19, loss=0.348035937212376, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=3.4924144200041862, lr=0.09597395350872712
2023-12-13 12:42:41   INFO  epoch: 17/24, acc_iter=68404, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:27, time_cost(all): 20:58:14/7:44:06, loss=0.347848892595287, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.2(1.03), norm=4.006724697909428, lr=0.09563205228262378
2023-12-13 12:43:36   INFO  epoch: 17/24, acc_iter=68454, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:31, time_cost(all): 20:59:09/7:44:25, loss=0.347661847978198, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.05(1.03), norm=3.917641948661617, lr=0.09529015105652039
2023-12-13 12:44:31   INFO  epoch: 17/24, acc_iter=68504, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:26, time_cost(all): 21:00:04/7:14:32, loss=0.34747480336111, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=0.584796288765177, lr=0.094948249830417
2023-12-13 12:45:27   INFO  epoch: 17/24, acc_iter=68554, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:20, time_cost(all): 21:01:00/7:08:01, loss=0.347287758744021, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=2.6012202877110346, lr=0.0946063486043136
2023-12-13 12:46:22   INFO  epoch: 17/24, acc_iter=68604, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:37, time_cost(all): 21:01:55/7:44:48, loss=0.347100714126933, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.16(1.03), norm=4.904367661072339, lr=0.09426444737821021
2023-12-13 12:47:17   INFO  epoch: 17/24, acc_iter=68654, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:40, time_cost(all): 21:02:50/7:21:24, loss=0.346913669509844, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=3.879784974451494, lr=0.09392254615210682
2023-12-13 12:48:13   INFO  epoch: 17/24, acc_iter=68704, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:11, time_cost(all): 21:03:46/7:19:39, loss=0.346726624892756, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=2.106398996547921, lr=0.09358064492600343
2023-12-13 12:49:08   INFO  epoch: 17/24, acc_iter=68754, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:33, time_cost(all): 21:04:41/7:07:39, loss=0.346539580275667, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=3.067143735594749, lr=0.09323874369990004
2023-12-13 12:50:03   INFO  epoch: 17/24, acc_iter=68804, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:39, time_cost(all): 21:05:36/7:36:01, loss=0.346352535658579, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.05(1.03), norm=1.3661057487219255, lr=0.09289684247379665
2023-12-13 12:50:59   INFO  epoch: 17/24, acc_iter=68854, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:42, time_cost(all): 21:06:32/7:04:43, loss=0.34616549104149, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.04(1.03), norm=4.195559446403663, lr=0.09255494124769326
2023-12-13 12:51:54   INFO  epoch: 17/24, acc_iter=68904, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:22, time_cost(all): 21:07:27/7:24:38, loss=0.345978446424402, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=2.889473577609066, lr=0.09221304002158992
2023-12-13 12:52:49   INFO  epoch: 17/24, acc_iter=68954, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:09:51, time_cost(all): 21:08:22/7:09:37, loss=0.345791401807313, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=4.820342661801161, lr=0.09187113879548653
2023-12-13 12:53:45   INFO  epoch: 17/24, acc_iter=69004, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:07, time_cost(all): 21:09:18/7:23:44, loss=0.345604357190225, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.95(1.03), norm=2.6536693516099943, lr=0.09152923756938314
2023-12-13 12:54:40   INFO  epoch: 17/24, acc_iter=69054, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:51, time_cost(all): 21:10:13/7:04:54, loss=0.345417312573136, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.17(1.03), norm=3.2819738473853617, lr=0.09118733634327975
2023-12-13 12:55:35   INFO  epoch: 17/24, acc_iter=69104, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:48, time_cost(all): 21:11:08/7:14:17, loss=0.345230267956047, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.98(1.03), norm=4.029756251511948, lr=0.09084543511717635
2023-12-13 12:56:31   INFO  epoch: 17/24, acc_iter=69154, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:39, time_cost(all): 21:12:04/7:05:06, loss=0.345043223338959, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.19(1.03), norm=0.7901298768300995, lr=0.09050353389107302
2023-12-13 12:57:26   INFO  epoch: 17/24, acc_iter=69204, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:36, time_cost(all): 21:12:59/7:28:55, loss=0.34485617872187, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.23(1.03), norm=1.4063152569374056, lr=0.09016163266496963
2023-12-13 12:58:21   INFO  epoch: 17/24, acc_iter=69254, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:57, time_cost(all): 21:13:54/7:11:10, loss=0.344669134104782, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.87(1.03), norm=3.4878922229735405, lr=0.08981973143886623
2023-12-13 12:59:17   INFO  epoch: 17/24, acc_iter=69304, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:45, time_cost(all): 21:14:50/7:09:11, loss=0.344482089487693, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.85(1.03), norm=1.3029108027297398, lr=0.08947783021276284
2023-12-13 13:00:12   INFO  epoch: 17/24, acc_iter=69354, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:07, time_cost(all): 21:15:45/7:28:07, loss=0.344295044870605, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.0(1.03), norm=1.7541262735140997, lr=0.08913592898665945
2023-12-13 13:01:07   INFO  epoch: 17/24, acc_iter=69404, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:05, time_cost(all): 21:16:40/7:00:00, loss=0.344108000253516, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.19(1.03), norm=0.9150552378600619, lr=0.08879402776055612
2023-12-13 13:02:03   INFO  epoch: 17/24, acc_iter=69454, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 21:17:36/7:03:12, loss=0.343920955636428, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=3.467426898342416, lr=0.08845212653445267
2023-12-13 13:02:58   INFO  epoch: 17/24, acc_iter=69504, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 21:18:31/7:26:36, loss=0.343733911019339, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.95(1.03), norm=4.5147092698055635, lr=0.08811022530834928
2023-12-13 13:03:53   INFO  epoch: 18/24, acc_iter=69566, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:09:27, time_cost(all): 21:19:26/7:00:38, loss=0.343501975694149, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.96(1.03), norm=1.5119748693539965, lr=0.08768626778798111
2023-12-13 13:04:49   INFO  epoch: 18/24, acc_iter=69616, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:06:44, time_cost(all): 21:20:22/6:49:19, loss=0.343314931077061, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.21(1.03), norm=3.496417820069365, lr=0.08734436656187772
2023-12-13 13:05:44   INFO  epoch: 18/24, acc_iter=69666, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:36, time_cost(all): 21:21:17/6:49:39, loss=0.343127886459972, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.93(1.03), norm=2.758853854727875, lr=0.08700246533577438
2023-12-13 13:06:40   INFO  epoch: 18/24, acc_iter=69716, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:09:06, time_cost(all): 21:22:13/7:04:11, loss=0.342940841842884, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.87(1.03), norm=4.386611704763924, lr=0.08666056410967099
2023-12-13 13:07:35   INFO  epoch: 18/24, acc_iter=69766, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:06:26, time_cost(all): 21:23:08/6:56:19, loss=0.342753797225795, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.11(1.03), norm=1.262087272402277, lr=0.0863186628835676
2023-12-13 13:08:30   INFO  epoch: 18/24, acc_iter=69816, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:07:49, time_cost(all): 21:24:03/7:15:53, loss=0.342566752608707, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.2(1.03), norm=3.852588411045432, lr=0.0859767616574642
2023-12-13 13:09:26   INFO  epoch: 18/24, acc_iter=69866, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:05:59, time_cost(all): 21:24:59/7:09:43, loss=0.342379707991618, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=2.9897229013908557, lr=0.08563486043136076
2023-12-13 13:10:21   INFO  epoch: 18/24, acc_iter=69916, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:52, time_cost(all): 21:25:54/7:11:46, loss=0.34219266337453, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.23(1.03), norm=4.227424514168807, lr=0.08529295920525742
2023-12-13 13:11:16   INFO  epoch: 18/24, acc_iter=69966, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/0:59:52, time_cost(all): 21:26:49/7:13:13, loss=0.342005618757441, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.04(1.03), norm=3.0246739902687025, lr=0.08495105797915403
2023-12-13 13:12:12   INFO  epoch: 18/24, acc_iter=70016, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:03:40, time_cost(all): 21:27:45/7:03:03, loss=0.341818574140353, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=1.3953760995712146, lr=0.08460915675305064
2023-12-13 13:13:07   INFO  epoch: 18/24, acc_iter=70066, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:00:20, time_cost(all): 21:28:40/6:53:33, loss=0.341631529523264, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.84(1.03), norm=2.9850146118942025, lr=0.08426725552694725
2023-12-13 13:14:02   INFO  epoch: 18/24, acc_iter=70116, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:59:33, time_cost(all): 21:29:35/6:59:13, loss=0.341444484906176, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.02(1.03), norm=1.0414588016222506, lr=0.08392535430084386
2023-12-13 13:14:58   INFO  epoch: 18/24, acc_iter=70166, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:58:38, time_cost(all): 21:30:31/6:52:52, loss=0.341257440289087, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=1.4270592292532818, lr=0.08358345307474052
2023-12-13 13:15:53   INFO  epoch: 18/24, acc_iter=70216, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:23, time_cost(all): 21:31:26/6:47:52, loss=0.341070395671998, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.97(1.03), norm=1.334609591007529, lr=0.08324155184863713
2023-12-13 13:16:48   INFO  epoch: 18/24, acc_iter=70266, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:59:07, time_cost(all): 21:32:21/6:56:28, loss=0.34088335105491, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.12(1.03), norm=1.6378817569925035, lr=0.08289965062253374
2023-12-13 13:17:44   INFO  epoch: 18/24, acc_iter=70316, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:58:06, time_cost(all): 21:33:17/7:17:33, loss=0.340696306437821, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.7942862835132651, lr=0.08255774939643035
2023-12-13 13:18:39   INFO  epoch: 18/24, acc_iter=70366, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:58:09, time_cost(all): 21:34:12/7:05:56, loss=0.340509261820733, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.19(1.03), norm=2.431936834681613, lr=0.08221584817032696
2023-12-13 13:19:34   INFO  epoch: 18/24, acc_iter=70416, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:52:28, time_cost(all): 21:35:07/6:39:06, loss=0.340322217203644, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=3.0758866380060494, lr=0.08187394694422362
2023-12-13 13:20:30   INFO  epoch: 18/24, acc_iter=70466, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:00, time_cost(all): 21:36:03/7:12:03, loss=0.340135172586556, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=0.5805878095519954, lr=0.08153204571812023
2023-12-13 13:21:25   INFO  epoch: 18/24, acc_iter=70516, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:53:57, time_cost(all): 21:36:58/6:32:55, loss=0.339948127969467, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.39299522692491, lr=0.08119014449201684
2023-12-13 13:22:20   INFO  epoch: 18/24, acc_iter=70566, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:49:47, time_cost(all): 21:37:53/6:39:40, loss=0.339761083352379, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=2.399223630458446, lr=0.08084824326591344
2023-12-13 13:23:16   INFO  epoch: 18/24, acc_iter=70616, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:48:55, time_cost(all): 21:38:49/6:51:19, loss=0.33957403873529, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=2.4891641768546076, lr=0.08050634203981005
2023-12-13 13:24:11   INFO  epoch: 18/24, acc_iter=70666, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:39, time_cost(all): 21:39:44/6:56:15, loss=0.339386994118202, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.14(1.03), norm=1.3324053595046061, lr=0.08016444081370666
2023-12-13 13:25:06   INFO  epoch: 18/24, acc_iter=70716, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:39, time_cost(all): 21:40:39/6:37:46, loss=0.339199949501113, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.21(1.03), norm=1.103663548257121, lr=0.07982253958760327
2023-12-13 13:26:02   INFO  epoch: 18/24, acc_iter=70766, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:48:52, time_cost(all): 21:41:35/6:43:03, loss=0.339012904884025, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.995646699709414, lr=0.07948063836149988
2023-12-13 13:26:57   INFO  epoch: 18/24, acc_iter=70816, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:48:12, time_cost(all): 21:42:30/6:45:13, loss=0.338825860266936, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.18(1.03), norm=1.613873215197935, lr=0.07913873713539649
2023-12-13 13:27:53   INFO  epoch: 18/24, acc_iter=70866, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:47:52, time_cost(all): 21:43:26/7:01:03, loss=0.338638815649847, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=0.84897275822924, lr=0.07879683590929315
2023-12-13 13:28:48   INFO  epoch: 18/24, acc_iter=70916, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:23, time_cost(all): 21:44:21/6:59:34, loss=0.338451771032759, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.14(1.03), norm=1.4698125295219069, lr=0.07845493468318976
2023-12-13 13:29:43   INFO  epoch: 18/24, acc_iter=70966, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:45:25, time_cost(all): 21:45:16/6:25:02, loss=0.33826472641567, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.02(1.03), norm=1.778643940973839, lr=0.07811303345708637
2023-12-13 13:30:39   INFO  epoch: 18/24, acc_iter=71016, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:36, time_cost(all): 21:46:12/6:43:20, loss=0.338077681798582, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.95(1.03), norm=4.617634145543168, lr=0.07777113223098298
2023-12-13 13:31:34   INFO  epoch: 18/24, acc_iter=71066, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:57, time_cost(all): 21:47:07/6:57:26, loss=0.337890637181493, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.17(1.03), norm=4.6881347189312725, lr=0.07742923100487958
2023-12-13 13:32:29   INFO  epoch: 18/24, acc_iter=71116, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:03, time_cost(all): 21:48:02/6:56:27, loss=0.337703592564405, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.0(1.03), norm=2.8522031995725334, lr=0.07708732977877625
2023-12-13 13:33:25   INFO  epoch: 18/24, acc_iter=71166, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:30, time_cost(all): 21:48:58/6:55:15, loss=0.337516547947316, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.91(1.03), norm=1.6241498102681593, lr=0.07674542855267286
2023-12-13 13:34:20   INFO  epoch: 18/24, acc_iter=71216, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:14, time_cost(all): 21:49:53/6:25:56, loss=0.337329503330228, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.22(1.03), norm=4.98190519596691, lr=0.07640352732656946
2023-12-13 13:35:15   INFO  epoch: 18/24, acc_iter=71266, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:13, time_cost(all): 21:50:48/6:20:43, loss=0.337142458713139, d_time=0.00(0.00), f_time=1.0(1.01), b_time=0.85(1.03), norm=1.4554598359064297, lr=0.07606162610046607
2023-12-13 13:36:11   INFO  epoch: 18/24, acc_iter=71316, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:31, time_cost(all): 21:51:44/6:36:23, loss=0.336955414096051, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.01(1.03), norm=1.8486199297244312, lr=0.07571972487436268
2023-12-13 13:37:06   INFO  epoch: 18/24, acc_iter=71366, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:02, time_cost(all): 21:52:39/6:36:23, loss=0.336768369478962, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=4.574260421766112, lr=0.07537782364825935
2023-12-13 13:38:01   INFO  epoch: 18/24, acc_iter=71416, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:46, time_cost(all): 21:53:34/6:38:40, loss=0.336581324861874, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.85(1.03), norm=2.339020369387372, lr=0.0750359224221559
2023-12-13 13:38:57   INFO  epoch: 18/24, acc_iter=71466, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:54, time_cost(all): 21:54:30/6:24:14, loss=0.336394280244785, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=1.033817259303286, lr=0.0746940211960525
2023-12-13 13:39:52   INFO  epoch: 18/24, acc_iter=71516, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:35:16, time_cost(all): 21:55:25/6:51:04, loss=0.336207235627697, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.89(1.03), norm=4.3536141820378464, lr=0.07435211996994912
2023-12-13 13:40:47   INFO  epoch: 18/24, acc_iter=71566, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:34:50, time_cost(all): 21:56:20/6:14:18, loss=0.336020191010608, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.12(1.03), norm=4.659214285947334, lr=0.07401021874384572
2023-12-13 13:41:43   INFO  epoch: 18/24, acc_iter=71616, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:23, time_cost(all): 21:57:16/6:47:22, loss=0.335833146393519, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=4.181435780599683, lr=0.07366831751774239
2023-12-13 13:42:38   INFO  epoch: 18/24, acc_iter=71666, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:30:28, time_cost(all): 21:58:11/6:21:38, loss=0.335646101776431, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=2.5591739040975896, lr=0.073326416291639
2023-12-13 13:43:33   INFO  epoch: 18/24, acc_iter=71716, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:31:34, time_cost(all): 21:59:06/6:50:00, loss=0.335459057159342, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.05(1.03), norm=1.3276956952933072, lr=0.0729845150655356
2023-12-13 13:44:29   INFO  epoch: 18/24, acc_iter=71766, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:17, time_cost(all): 22:00:02/6:36:38, loss=0.335272012542254, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.1(1.03), norm=0.609362051237386, lr=0.07264261383943221
2023-12-13 13:45:24   INFO  epoch: 18/24, acc_iter=71816, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:30, time_cost(all): 22:00:57/6:17:33, loss=0.335084967925165, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=2.9311830528747764, lr=0.07230071261332882
2023-12-13 13:46:19   INFO  epoch: 18/24, acc_iter=71866, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:46, time_cost(all): 22:01:52/6:20:50, loss=0.334897923308077, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.98(1.03), norm=2.9341951545378975, lr=0.07195881138722549
2023-12-13 13:47:15   INFO  epoch: 18/24, acc_iter=71916, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:02, time_cost(all): 22:02:48/6:27:04, loss=0.334710878690988, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.02(1.03), norm=1.6718079582840315, lr=0.0716169101611221
2023-12-13 13:48:10   INFO  epoch: 18/24, acc_iter=71966, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:27:16, time_cost(all): 22:03:43/6:22:37, loss=0.3345238340739, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=4.723364679837301, lr=0.0712750089350187
2023-12-13 13:49:06   INFO  epoch: 18/24, acc_iter=72016, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:43, time_cost(all): 22:04:39/6:28:31, loss=0.334336789456811, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.05(1.03), norm=3.0086403053418804, lr=0.07093310770891531
2023-12-13 13:50:01   INFO  epoch: 18/24, acc_iter=72066, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:25:21, time_cost(all): 22:05:34/6:35:17, loss=0.334149744839723, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.08(1.03), norm=2.609710578123649, lr=0.07059120648281192
2023-12-13 13:50:56   INFO  epoch: 18/24, acc_iter=72116, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:54, time_cost(all): 22:06:29/6:33:44, loss=0.333962700222634, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.2(1.03), norm=1.610380165450874, lr=0.07024930525670858
2023-12-13 13:51:52   INFO  epoch: 18/24, acc_iter=72166, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:26, time_cost(all): 22:07:25/6:41:16, loss=0.333775655605546, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.01(1.03), norm=3.856976031384444, lr=0.06990740403060519
2023-12-13 13:52:47   INFO  epoch: 18/24, acc_iter=72216, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:31, time_cost(all): 22:08:20/6:11:21, loss=0.333588610988457, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=1.6319825149796974, lr=0.06956550280450174
2023-12-13 13:53:42   INFO  epoch: 18/24, acc_iter=72266, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:36, time_cost(all): 22:09:15/6:05:49, loss=0.333401566371369, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.9(1.03), norm=1.0604544460518222, lr=0.06922360157839835
2023-12-13 13:54:38   INFO  epoch: 18/24, acc_iter=72316, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:18:45, time_cost(all): 22:10:11/6:29:21, loss=0.33321452175428, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.97(1.03), norm=4.102677309692191, lr=0.06888170035229502
2023-12-13 13:55:33   INFO  epoch: 18/24, acc_iter=72366, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:17:44, time_cost(all): 22:11:06/6:29:08, loss=0.333027477137191, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=4.679413708773086, lr=0.06853979912619163
2023-12-13 13:56:28   INFO  epoch: 18/24, acc_iter=72416, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:24, time_cost(all): 22:12:01/6:10:44, loss=0.332840432520103, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.84(1.03), norm=1.9341256793399935, lr=0.06819789790008823
2023-12-13 13:57:24   INFO  epoch: 18/24, acc_iter=72466, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:37, time_cost(all): 22:12:57/6:08:39, loss=0.332653387903014, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.01(1.03), norm=3.1481030018033627, lr=0.06785599667398484
2023-12-13 13:58:19   INFO  epoch: 18/24, acc_iter=72516, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:15, time_cost(all): 22:13:52/6:03:34, loss=0.332466343285926, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.18(1.03), norm=4.876670659957979, lr=0.06751409544788145
2023-12-13 13:59:14   INFO  epoch: 18/24, acc_iter=72566, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:15, time_cost(all): 22:14:47/6:27:55, loss=0.332279298668837, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.95(1.03), norm=4.82315115974228, lr=0.06717219422177811
2023-12-13 14:00:10   INFO  epoch: 18/24, acc_iter=72616, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:39, time_cost(all): 22:15:43/6:19:39, loss=0.332092254051749, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=1.168530687803433, lr=0.06683029299567472
2023-12-13 14:01:05   INFO  epoch: 18/24, acc_iter=72666, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:34, time_cost(all): 22:16:38/6:11:13, loss=0.33190520943466, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.9(1.03), norm=2.7805877108940855, lr=0.06648839176957133
2023-12-13 14:02:00   INFO  epoch: 18/24, acc_iter=72716, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:20, time_cost(all): 22:17:33/6:14:37, loss=0.331718164817572, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.89(1.03), norm=3.3101209756750327, lr=0.06614649054346794
2023-12-13 14:02:56   INFO  epoch: 18/24, acc_iter=72766, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:02, time_cost(all): 22:18:29/6:05:49, loss=0.331531120200483, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=4.873366020016093, lr=0.06580458931736455
2023-12-13 14:03:51   INFO  epoch: 18/24, acc_iter=72816, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:04, time_cost(all): 22:19:24/6:22:30, loss=0.331344075583395, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.05(1.03), norm=0.9066221828105088, lr=0.06546268809126121
2023-12-13 14:04:46   INFO  epoch: 18/24, acc_iter=72866, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:41, time_cost(all): 22:20:19/6:05:53, loss=0.331157030966306, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.01(1.03), norm=2.3639528350805676, lr=0.06512078686515782
2023-12-13 14:05:42   INFO  epoch: 18/24, acc_iter=72916, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:13, time_cost(all): 22:21:15/6:07:48, loss=0.330969986349218, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.95(1.03), norm=3.613501563953734, lr=0.06477888563905443
2023-12-13 14:06:37   INFO  epoch: 18/24, acc_iter=72966, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:48, time_cost(all): 22:22:10/6:24:19, loss=0.330782941732129, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=4.655620465396023, lr=0.06443698441295104
2023-12-13 14:07:32   INFO  epoch: 18/24, acc_iter=73016, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:30, time_cost(all): 22:23:05/6:21:03, loss=0.330595897115041, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.87(1.03), norm=0.8278905645858974, lr=0.06409508318684759
2023-12-13 14:08:28   INFO  epoch: 18/24, acc_iter=73066, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:52, time_cost(all): 22:24:01/6:15:49, loss=0.330408852497952, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=2.4784027095857564, lr=0.06375318196074425
2023-12-13 14:09:23   INFO  epoch: 18/24, acc_iter=73116, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:52, time_cost(all): 22:24:56/6:06:31, loss=0.330221807880863, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.02(1.03), norm=4.779678839416345, lr=0.06341128073464086
2023-12-13 14:10:19   INFO  epoch: 18/24, acc_iter=73166, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:04:05, time_cost(all): 22:25:52/5:47:48, loss=0.330034763263775, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.163630324620591, lr=0.06306937950853747
2023-12-13 14:11:14   INFO  epoch: 18/24, acc_iter=73216, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:02, time_cost(all): 22:26:47/5:59:13, loss=0.329847718646686, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.70181236052919, lr=0.06272747828243408
2023-12-13 14:12:09   INFO  epoch: 18/24, acc_iter=73266, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:03, time_cost(all): 22:27:42/6:18:05, loss=0.329660674029598, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.13(1.03), norm=3.6867272962388373, lr=0.06238557705633069
2023-12-13 14:13:05   INFO  epoch: 18/24, acc_iter=73316, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:06, time_cost(all): 22:28:38/6:18:05, loss=0.329473629412509, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.2(1.03), norm=1.3633474778705583, lr=0.06204367583022735
2023-12-13 14:14:00   INFO  epoch: 18/24, acc_iter=73366, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 22:29:33/6:01:41, loss=0.329286584795421, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.16(1.03), norm=1.7250598387950822, lr=0.06170177460412396
2023-12-13 14:14:55   INFO  epoch: 19/24, acc_iter=73428, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:12:03, time_cost(all): 22:30:28/5:53:05, loss=0.329054649470231, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.98(1.03), norm=0.5014255400679247, lr=0.06127781708375574
2023-12-13 14:15:51   INFO  epoch: 19/24, acc_iter=73478, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:42, time_cost(all): 22:31:24/5:52:46, loss=0.328867604853142, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.89(1.03), norm=4.8061590427996865, lr=0.060935915857652345
2023-12-13 14:16:46   INFO  epoch: 19/24, acc_iter=73528, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:08:26, time_cost(all): 22:32:19/5:43:19, loss=0.328680560236054, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.13(1.03), norm=0.6754603353033525, lr=0.060594014631548954
2023-12-13 14:17:41   INFO  epoch: 19/24, acc_iter=73578, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:10:16, time_cost(all): 22:33:14/5:50:46, loss=0.328493515618965, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.0(1.03), norm=3.980715165305869, lr=0.06025211340544562
2023-12-13 14:18:37   INFO  epoch: 19/24, acc_iter=73628, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:08:19, time_cost(all): 22:34:10/5:52:20, loss=0.328306471001877, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.12(1.03), norm=4.314658547887496, lr=0.059910212179342226
2023-12-13 14:19:32   INFO  epoch: 19/24, acc_iter=73678, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:02:35, time_cost(all): 22:35:05/6:11:09, loss=0.328119426384788, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=3.759703335924653, lr=0.059568310953238834
2023-12-13 14:20:27   INFO  epoch: 19/24, acc_iter=73728, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:02:44, time_cost(all): 22:36:00/5:37:30, loss=0.3279323817677, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.88(1.03), norm=3.385186486646977, lr=0.05922640972713544
2023-12-13 14:21:23   INFO  epoch: 19/24, acc_iter=73778, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:06:50, time_cost(all): 22:36:56/5:36:15, loss=0.327745337150611, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.08(1.03), norm=3.837863427078313, lr=0.05888450850103205
2023-12-13 14:22:18   INFO  epoch: 19/24, acc_iter=73828, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:49, time_cost(all): 22:37:51/5:44:51, loss=0.327558292533523, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=4.2362714014802965, lr=0.058542607274928715
2023-12-13 14:23:13   INFO  epoch: 19/24, acc_iter=73878, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:29, time_cost(all): 22:38:46/5:54:02, loss=0.327371247916434, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.23(1.03), norm=1.316505909605095, lr=0.058200706048825324
2023-12-13 14:24:09   INFO  epoch: 19/24, acc_iter=73928, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:00:00, time_cost(all): 22:39:42/5:49:57, loss=0.327184203299346, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=4.051003688293747, lr=0.05785880482272193
2023-12-13 14:25:04   INFO  epoch: 19/24, acc_iter=73978, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:59:27, time_cost(all): 22:40:37/6:03:16, loss=0.326997158682257, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.85(1.03), norm=4.520221358276207, lr=0.05751690359661854
2023-12-13 14:25:59   INFO  epoch: 19/24, acc_iter=74028, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:02:12, time_cost(all): 22:41:32/5:54:11, loss=0.326810114065169, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.16(1.03), norm=2.662347025560391, lr=0.05717500237051515
2023-12-13 14:26:55   INFO  epoch: 19/24, acc_iter=74078, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:56:21, time_cost(all): 22:42:28/5:56:10, loss=0.32662306944808, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.93(1.03), norm=1.991825761703339, lr=0.05683310114441181
2023-12-13 14:27:50   INFO  epoch: 19/24, acc_iter=74128, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:55:20, time_cost(all): 22:43:23/6:02:35, loss=0.326436024830991, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.22(1.03), norm=2.158324913427742, lr=0.05649119991830842
2023-12-13 14:28:45   INFO  epoch: 19/24, acc_iter=74178, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:15, time_cost(all): 22:44:18/5:46:17, loss=0.326248980213903, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.87(1.03), norm=3.3577043797794714, lr=0.056149298692204974
2023-12-13 14:29:41   INFO  epoch: 19/24, acc_iter=74228, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:55:02, time_cost(all): 22:45:14/5:42:46, loss=0.326061935596814, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.07(1.03), norm=4.879645776608991, lr=0.05580739746610158
2023-12-13 14:30:36   INFO  epoch: 19/24, acc_iter=74278, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:54:43, time_cost(all): 22:46:09/5:44:37, loss=0.325874890979726, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.84(1.03), norm=2.547153828885689, lr=0.05546549623999819
2023-12-13 14:31:32   INFO  epoch: 19/24, acc_iter=74328, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:54:54, time_cost(all): 22:47:05/5:42:35, loss=0.325687846362637, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.94(1.03), norm=3.8589057921277807, lr=0.055123595013894855
2023-12-13 14:32:27   INFO  epoch: 19/24, acc_iter=74378, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:55:20, time_cost(all): 22:48:00/5:58:19, loss=0.325500801745549, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.87(1.03), norm=4.878493604002546, lr=0.05478169378779146
2023-12-13 14:33:22   INFO  epoch: 19/24, acc_iter=74428, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:14, time_cost(all): 22:48:55/5:24:49, loss=0.32531375712846, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.99(1.03), norm=1.2796453463690516, lr=0.05443979256168807
2023-12-13 14:34:18   INFO  epoch: 19/24, acc_iter=74478, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:00, time_cost(all): 22:49:51/5:54:42, loss=0.325126712511372, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.1(1.03), norm=3.7709192415534507, lr=0.05409789133558468
2023-12-13 14:35:13   INFO  epoch: 19/24, acc_iter=74528, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:49:05, time_cost(all): 22:50:46/5:47:27, loss=0.324939667894283, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.13(1.03), norm=1.3933954699304345, lr=0.05375599010948129
2023-12-13 14:36:08   INFO  epoch: 19/24, acc_iter=74578, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:51, time_cost(all): 22:51:41/5:41:39, loss=0.324752623277195, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.09(1.03), norm=1.0699680216293281, lr=0.05341408888337795
2023-12-13 14:37:04   INFO  epoch: 19/24, acc_iter=74628, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:50:25, time_cost(all): 22:52:37/5:39:35, loss=0.324565578660106, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.17(1.03), norm=0.9059535290393947, lr=0.05307218765727456
2023-12-13 14:37:59   INFO  epoch: 19/24, acc_iter=74678, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:49:29, time_cost(all): 22:53:32/5:45:49, loss=0.324378534043018, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.91(1.03), norm=0.6114846007853606, lr=0.05273028643117117
2023-12-13 14:38:54   INFO  epoch: 19/24, acc_iter=74728, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:44:09, time_cost(all): 22:54:27/5:23:15, loss=0.324191489425929, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.89(1.03), norm=3.1409112521672, lr=0.05238838520506778
2023-12-13 14:39:50   INFO  epoch: 19/24, acc_iter=74778, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:16, time_cost(all): 22:55:23/5:23:06, loss=0.32400444480884, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.92(1.03), norm=1.394705949935985, lr=0.052046483978964386
2023-12-13 14:40:45   INFO  epoch: 19/24, acc_iter=74828, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:44:02, time_cost(all): 22:56:18/5:27:24, loss=0.323817400191752, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.03(1.03), norm=4.04087299228676, lr=0.05170458275286105
2023-12-13 14:41:40   INFO  epoch: 19/24, acc_iter=74878, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:45:06, time_cost(all): 22:57:13/5:44:27, loss=0.323630355574663, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.92(1.03), norm=0.5858875395266736, lr=0.05136268152675766
2023-12-13 14:42:36   INFO  epoch: 19/24, acc_iter=74928, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:57, time_cost(all): 22:58:09/5:15:40, loss=0.323443310957575, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=1.8458604581864508, lr=0.05102078030065427
2023-12-13 14:43:31   INFO  epoch: 19/24, acc_iter=74978, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:43:03, time_cost(all): 22:59:04/5:37:50, loss=0.323256266340486, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.19(1.03), norm=4.649210289566989, lr=0.05067887907455082
2023-12-13 14:44:26   INFO  epoch: 19/24, acc_iter=75028, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:57, time_cost(all): 22:59:59/5:24:54, loss=0.323069221723398, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.92(1.03), norm=1.5514246742169795, lr=0.050336977848447484
2023-12-13 14:45:22   INFO  epoch: 19/24, acc_iter=75078, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:41:08, time_cost(all): 23:00:55/5:38:23, loss=0.322882177106309, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=3.3234511208810384, lr=0.049997429707253176
2023-12-13 14:46:17   INFO  epoch: 19/24, acc_iter=75128, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:42, time_cost(all): 23:01:50/5:19:09, loss=0.322695132489221, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=2.0801583954810443, lr=0.04981893715539038
2023-12-13 14:47:12   INFO  epoch: 19/24, acc_iter=75178, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:29, time_cost(all): 23:02:45/5:25:41, loss=0.322508087872132, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.14(1.03), norm=3.9111864403241006, lr=0.04964044460352758
2023-12-13 14:48:08   INFO  epoch: 19/24, acc_iter=75228, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:37:21, time_cost(all): 23:03:41/5:31:50, loss=0.322321043255044, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.96(1.03), norm=0.5447409115397606, lr=0.04946195205166479
2023-12-13 14:49:03   INFO  epoch: 19/24, acc_iter=75278, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:34:44, time_cost(all): 23:04:36/5:31:12, loss=0.322133998637955, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.87(1.03), norm=1.964115914621456, lr=0.04928345949980199
2023-12-13 14:49:58   INFO  epoch: 19/24, acc_iter=75328, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:35:08, time_cost(all): 23:05:31/5:08:36, loss=0.321946954020867, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=0.6365722498002905, lr=0.04910496694793919
2023-12-13 14:50:54   INFO  epoch: 19/24, acc_iter=75378, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:21, time_cost(all): 23:06:27/5:14:08, loss=0.321759909403778, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.18(1.03), norm=3.853783125985516, lr=0.0489264743960764
2023-12-13 14:51:49   INFO  epoch: 19/24, acc_iter=75428, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:31, time_cost(all): 23:07:22/5:12:01, loss=0.32157286478669, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=4.4575962880922315, lr=0.048747981844213605
2023-12-13 14:52:45   INFO  epoch: 19/24, acc_iter=75478, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:31, time_cost(all): 23:08:18/5:21:36, loss=0.321385820169601, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.13(1.03), norm=3.69549029904596, lr=0.048569489292350804
2023-12-13 14:53:40   INFO  epoch: 19/24, acc_iter=75528, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:40, time_cost(all): 23:09:13/5:13:47, loss=0.321198775552512, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.85(1.03), norm=1.5575950512849928, lr=0.04839099674048801
2023-12-13 14:54:35   INFO  epoch: 19/24, acc_iter=75578, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:32:10, time_cost(all): 23:10:08/5:11:53, loss=0.321011730935424, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.01(1.03), norm=1.5203988584228634, lr=0.048212504188625216
2023-12-13 14:55:31   INFO  epoch: 19/24, acc_iter=75628, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:10, time_cost(all): 23:11:04/5:09:42, loss=0.320824686318335, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=0.8638752398208229, lr=0.04803401163676242
2023-12-13 14:56:26   INFO  epoch: 19/24, acc_iter=75678, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:30:11, time_cost(all): 23:11:59/5:10:03, loss=0.320637641701247, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=3.7192816333846515, lr=0.04785551908489962
2023-12-13 14:57:21   INFO  epoch: 19/24, acc_iter=75728, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:28:22, time_cost(all): 23:12:54/5:23:45, loss=0.320450597084158, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.11(1.03), norm=3.2626522462181566, lr=0.04767702653303683
2023-12-13 14:58:17   INFO  epoch: 19/24, acc_iter=75778, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:48, time_cost(all): 23:13:50/5:03:48, loss=0.32026355246707, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=3.277002444929213, lr=0.04749853398117403
2023-12-13 14:59:12   INFO  epoch: 19/24, acc_iter=75828, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:26:06, time_cost(all): 23:14:45/5:00:15, loss=0.320076507849981, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=2.7601013988668313, lr=0.04732004142931123
2023-12-13 15:00:07   INFO  epoch: 19/24, acc_iter=75878, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:27, time_cost(all): 23:15:40/5:23:22, loss=0.319889463232893, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=1.8866762434644135, lr=0.04714154887744844
2023-12-13 15:01:03   INFO  epoch: 19/24, acc_iter=75928, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:59, time_cost(all): 23:16:36/5:10:43, loss=0.319702418615804, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.9(1.03), norm=0.6177598051211626, lr=0.046963056325585645
2023-12-13 15:01:58   INFO  epoch: 19/24, acc_iter=75978, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:25, time_cost(all): 23:17:31/5:13:23, loss=0.319515373998716, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=2.7072490233997435, lr=0.04678456377372285
2023-12-13 15:02:53   INFO  epoch: 19/24, acc_iter=76028, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:23:23, time_cost(all): 23:18:26/5:19:27, loss=0.319328329381627, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=0.9813255141893305, lr=0.04660607122186005
2023-12-13 15:03:49   INFO  epoch: 19/24, acc_iter=76078, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:05, time_cost(all): 23:19:22/5:12:22, loss=0.319141284764539, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=4.893219765428436, lr=0.046427578669997256
2023-12-13 15:04:44   INFO  epoch: 19/24, acc_iter=76128, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:21:14, time_cost(all): 23:20:17/4:55:48, loss=0.31895424014745, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.868602347020069, lr=0.04624908611813446
2023-12-13 15:05:39   INFO  epoch: 19/24, acc_iter=76178, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:44, time_cost(all): 23:21:12/5:08:01, loss=0.318767195530361, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.15(1.03), norm=3.7249045585622507, lr=0.04607059356627166
2023-12-13 15:06:35   INFO  epoch: 19/24, acc_iter=76228, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:26, time_cost(all): 23:22:08/5:01:59, loss=0.318580150913273, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.96(1.03), norm=2.483534688238922, lr=0.04589210101440887
2023-12-13 15:07:30   INFO  epoch: 19/24, acc_iter=76278, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:58, time_cost(all): 23:23:03/4:56:00, loss=0.318393106296184, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.98(1.03), norm=4.047249118980325, lr=0.045713608462546074
2023-12-13 15:08:25   INFO  epoch: 19/24, acc_iter=76328, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:12, time_cost(all): 23:23:58/5:17:24, loss=0.318206061679096, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=2.3331118785293103, lr=0.04553511591068328
2023-12-13 15:09:21   INFO  epoch: 19/24, acc_iter=76378, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:35, time_cost(all): 23:24:54/5:04:09, loss=0.318019017062007, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.91(1.03), norm=4.275925736345062, lr=0.04535662335882048
2023-12-13 15:10:16   INFO  epoch: 19/24, acc_iter=76428, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:07, time_cost(all): 23:25:49/5:16:12, loss=0.317831972444919, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.98(1.03), norm=1.0319463089927923, lr=0.045178130806957685
2023-12-13 15:11:11   INFO  epoch: 19/24, acc_iter=76478, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:55, time_cost(all): 23:26:44/5:16:05, loss=0.31764492782783, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.85(1.03), norm=1.8488803544771635, lr=0.04499963825509489
2023-12-13 15:12:07   INFO  epoch: 19/24, acc_iter=76528, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:44, time_cost(all): 23:27:40/4:53:38, loss=0.317457883210742, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=2.076613582630956, lr=0.04482114570323209
2023-12-13 15:13:02   INFO  epoch: 19/24, acc_iter=76578, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:45, time_cost(all): 23:28:35/4:54:22, loss=0.317270838593653, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=1.574817829447357, lr=0.044642653151369296
2023-12-13 15:13:57   INFO  epoch: 19/24, acc_iter=76628, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:13, time_cost(all): 23:29:30/4:54:46, loss=0.317083793976565, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.97(1.03), norm=1.150322599468943, lr=0.0444641605995065
2023-12-13 15:14:53   INFO  epoch: 19/24, acc_iter=76678, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:09:57, time_cost(all): 23:30:26/4:52:55, loss=0.316896749359476, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.85(1.03), norm=3.740544467862425, lr=0.04428566804764371
2023-12-13 15:15:48   INFO  epoch: 19/24, acc_iter=76728, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:25, time_cost(all): 23:31:21/4:59:45, loss=0.316709704742388, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.06(1.03), norm=3.869986898681348, lr=0.04410717549578091
2023-12-13 15:16:44   INFO  epoch: 19/24, acc_iter=76778, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:41, time_cost(all): 23:32:17/4:45:48, loss=0.316522660125299, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.12(1.03), norm=4.683165871563, lr=0.043928682943918114
2023-12-13 15:17:39   INFO  epoch: 19/24, acc_iter=76828, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:35, time_cost(all): 23:33:12/5:09:55, loss=0.316335615508211, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.13(1.03), norm=1.1927796858832211, lr=0.04375019039205532
2023-12-13 15:18:34   INFO  epoch: 19/24, acc_iter=76878, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:34, time_cost(all): 23:34:07/4:51:53, loss=0.316148570891122, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.9(1.03), norm=2.19978024960466, lr=0.04357169784019252
2023-12-13 15:19:30   INFO  epoch: 19/24, acc_iter=76928, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:28, time_cost(all): 23:35:03/4:56:51, loss=0.315961526274033, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.91(1.03), norm=4.8414955975962535, lr=0.043393205288329725
2023-12-13 15:20:25   INFO  epoch: 19/24, acc_iter=76978, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:57, time_cost(all): 23:35:58/4:56:36, loss=0.315774481656945, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.13(1.03), norm=4.502797848372258, lr=0.04321471273646693
2023-12-13 15:21:20   INFO  epoch: 19/24, acc_iter=77028, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:52, time_cost(all): 23:36:53/4:46:24, loss=0.315587437039856, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.91(1.03), norm=2.2828634761661624, lr=0.04303622018460414
2023-12-13 15:22:16   INFO  epoch: 19/24, acc_iter=77078, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:05, time_cost(all): 23:37:49/5:00:20, loss=0.315400392422768, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.07(1.03), norm=0.5513980191313659, lr=0.04285772763274134
2023-12-13 15:23:11   INFO  epoch: 19/24, acc_iter=77128, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:07, time_cost(all): 23:38:44/4:43:10, loss=0.315213347805679, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.84(1.03), norm=1.3068199486179857, lr=0.04267923508087854
2023-12-13 15:24:06   INFO  epoch: 19/24, acc_iter=77178, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 23:39:39/4:54:38, loss=0.315026303188591, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.07(1.03), norm=4.05094754196924, lr=0.04250074252901575
2023-12-13 15:25:02   INFO  epoch: 19/24, acc_iter=77228, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 23:40:35/4:56:23, loss=0.314839258571502, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.19(1.03), norm=3.5955120698825174, lr=0.04232224997715295
2023-12-13 15:25:57   INFO  epoch: 20/24, acc_iter=77290, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:08:05, time_cost(all): 23:41:30/4:48:36, loss=0.314607323246312, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.92(1.03), norm=1.629308219687185, lr=0.042100919212843084
2023-12-13 15:26:52   INFO  epoch: 20/24, acc_iter=77340, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:31, time_cost(all): 23:42:25/4:36:33, loss=0.314420278629224, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.16(1.03), norm=4.449402914039432, lr=0.04192242666098028
2023-12-13 15:27:48   INFO  epoch: 20/24, acc_iter=77390, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:07:22, time_cost(all): 23:43:21/4:36:50, loss=0.314233234012135, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=4.842299487897347, lr=0.04174393410911749
2023-12-13 15:28:43   INFO  epoch: 20/24, acc_iter=77440, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:09:46, time_cost(all): 23:44:16/4:43:37, loss=0.314046189395047, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=2.836534702846736, lr=0.041565441557254695
2023-12-13 15:29:38   INFO  epoch: 20/24, acc_iter=77490, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:06:01, time_cost(all): 23:45:11/4:50:03, loss=0.313859144777958, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.01(1.03), norm=2.579396334342511, lr=0.0413869490053919
2023-12-13 15:30:34   INFO  epoch: 20/24, acc_iter=77540, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:08:29, time_cost(all): 23:46:07/4:32:44, loss=0.31367210016087, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.92(1.03), norm=4.311422303824449, lr=0.04120845645352911
2023-12-13 15:31:29   INFO  epoch: 20/24, acc_iter=77590, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:05:33, time_cost(all): 23:47:02/4:40:19, loss=0.313485055543781, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.98(1.03), norm=4.300431414225738, lr=0.04102996390166631
2023-12-13 15:32:24   INFO  epoch: 20/24, acc_iter=77640, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:03:43, time_cost(all): 23:47:57/4:48:20, loss=0.313298010926693, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=3.3355296263335457, lr=0.04085147134980351
2023-12-13 15:33:20   INFO  epoch: 20/24, acc_iter=77690, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:00:00, time_cost(all): 23:48:53/4:45:59, loss=0.313110966309604, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.93(1.03), norm=3.4078580377238312, lr=0.04067297879794072
2023-12-13 15:34:15   INFO  epoch: 20/24, acc_iter=77740, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:02:29, time_cost(all): 23:49:48/4:48:37, loss=0.312923921692516, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.08(1.03), norm=0.692313662135197, lr=0.04049448624607792
2023-12-13 15:35:10   INFO  epoch: 20/24, acc_iter=77790, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:33, time_cost(all): 23:50:43/4:27:51, loss=0.312736877075427, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.19(1.03), norm=0.5903284186205553, lr=0.040315993694215124
2023-12-13 15:36:06   INFO  epoch: 20/24, acc_iter=77840, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:58:38, time_cost(all): 23:51:39/4:35:34, loss=0.312549832458339, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.98(1.03), norm=3.057125949589068, lr=0.04013750114235233
2023-12-13 15:37:01   INFO  epoch: 20/24, acc_iter=77890, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:59:21, time_cost(all): 23:52:34/4:50:22, loss=0.31236278784125, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=4.3383496303573015, lr=0.039959008590489536
2023-12-13 15:37:57   INFO  epoch: 20/24, acc_iter=77940, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:55:44, time_cost(all): 23:53:30/4:26:09, loss=0.312175743224162, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=4.59042971442774, lr=0.039780516038626736
2023-12-13 15:38:52   INFO  epoch: 20/24, acc_iter=77990, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:58:18, time_cost(all): 23:54:25/4:44:46, loss=0.311988698607073, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=0.6326345669778984, lr=0.03960202348676394
2023-12-13 15:39:47   INFO  epoch: 20/24, acc_iter=78040, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:58:44, time_cost(all): 23:55:20/4:26:35, loss=0.311801653989984, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=1.0158645489512914, lr=0.03942353093490115
2023-12-13 15:40:43   INFO  epoch: 20/24, acc_iter=78090, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:52:49, time_cost(all): 23:56:16/4:34:31, loss=0.311614609372896, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.16(1.03), norm=3.9697151277173317, lr=0.03924503838303835
2023-12-13 15:41:38   INFO  epoch: 20/24, acc_iter=78140, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:53:41, time_cost(all): 23:57:11/4:27:30, loss=0.311427564755807, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.89(1.03), norm=3.0422318765813046, lr=0.03906654583117555
2023-12-13 15:42:33   INFO  epoch: 20/24, acc_iter=78190, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:02, time_cost(all): 23:58:06/4:39:54, loss=0.311240520138719, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.09(1.03), norm=3.8138467315371574, lr=0.03888805327931276
2023-12-13 15:43:29   INFO  epoch: 20/24, acc_iter=78240, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:14, time_cost(all): 23:59:02/4:42:54, loss=0.31105347552163, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.9(1.03), norm=1.0278008210437894, lr=0.038709560727449965
2023-12-13 15:44:24   INFO  epoch: 20/24, acc_iter=78290, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:38, time_cost(all): 23:59:57/4:42:57, loss=0.310866430904542, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.06(1.03), norm=3.2684938036653066, lr=0.038531068175587165
2023-12-13 15:45:19   INFO  epoch: 20/24, acc_iter=78340, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:50:15, time_cost(all): 1 day, 0:00:52/4:41:04, loss=0.310679386287453, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.0(1.03), norm=4.2111000752309895, lr=0.03835257562372437
2023-12-13 15:46:15   INFO  epoch: 20/24, acc_iter=78390, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:48:23, time_cost(all): 1 day, 0:01:48/4:26:37, loss=0.310492341670365, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.19(1.03), norm=4.50361685484783, lr=0.03817408307186158
2023-12-13 15:47:10   INFO  epoch: 20/24, acc_iter=78440, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:29, time_cost(all): 1 day, 0:02:43/4:28:54, loss=0.310305297053276, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.13(1.03), norm=2.7892521306315556, lr=0.037995590519998776
2023-12-13 15:48:05   INFO  epoch: 20/24, acc_iter=78490, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:50:20, time_cost(all): 1 day, 0:03:38/4:14:40, loss=0.310118252436188, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.807285178987018, lr=0.03781709796813598
2023-12-13 15:49:01   INFO  epoch: 20/24, acc_iter=78540, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:46:27, time_cost(all): 1 day, 0:04:34/4:33:37, loss=0.309931207819099, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.17(1.03), norm=1.3782087596067147, lr=0.03763860541627319
2023-12-13 15:49:56   INFO  epoch: 20/24, acc_iter=78590, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:48:14, time_cost(all): 1 day, 0:05:29/4:20:07, loss=0.309744163202011, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.9(1.03), norm=4.275832861157827, lr=0.037460112864410394
2023-12-13 15:50:51   INFO  epoch: 20/24, acc_iter=78640, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:43:22, time_cost(all): 1 day, 0:06:24/4:35:37, loss=0.309557118584922, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.21(1.03), norm=3.664529590614372, lr=0.03728162031254759
2023-12-13 15:51:47   INFO  epoch: 20/24, acc_iter=78690, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:46:25, time_cost(all): 1 day, 0:07:20/4:17:44, loss=0.309370073967833, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.0(1.03), norm=1.4371391547863708, lr=0.0371031277606848
2023-12-13 15:52:42   INFO  epoch: 20/24, acc_iter=78740, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:43, time_cost(all): 1 day, 0:08:15/4:18:34, loss=0.309183029350745, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=0.5618246975979719, lr=0.036924635208822006
2023-12-13 15:53:37   INFO  epoch: 20/24, acc_iter=78790, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:41:23, time_cost(all): 1 day, 0:09:10/4:31:03, loss=0.308995984733656, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.98(1.03), norm=3.452954535427128, lr=0.036746142656959205
2023-12-13 15:54:33   INFO  epoch: 20/24, acc_iter=78840, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:52, time_cost(all): 1 day, 0:10:06/4:30:33, loss=0.308808940116568, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.22(1.03), norm=3.9724308896389067, lr=0.03656765010509641
2023-12-13 15:55:28   INFO  epoch: 20/24, acc_iter=78890, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:12, time_cost(all): 1 day, 0:11:01/4:12:22, loss=0.308621895499479, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.09(1.03), norm=3.8596485436678054, lr=0.03638915755323362
2023-12-13 15:56:23   INFO  epoch: 20/24, acc_iter=78940, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:17, time_cost(all): 1 day, 0:11:56/4:21:15, loss=0.308434850882391, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.22(1.03), norm=1.9501273658299212, lr=0.03621066500137082
2023-12-13 15:57:19   INFO  epoch: 20/24, acc_iter=78990, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:58, time_cost(all): 1 day, 0:12:52/4:04:46, loss=0.308247806265302, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.05(1.03), norm=1.493724245601771, lr=0.03603217244950802
2023-12-13 15:58:14   INFO  epoch: 20/24, acc_iter=79040, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:38:20, time_cost(all): 1 day, 0:13:47/4:25:20, loss=0.308060761648214, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.02(1.03), norm=3.2842265019533055, lr=0.03585367989764523
2023-12-13 15:59:10   INFO  epoch: 20/24, acc_iter=79090, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:36:08, time_cost(all): 1 day, 0:14:43/4:21:00, loss=0.307873717031125, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.04(1.03), norm=1.6602831069566428, lr=0.035675187345782435
2023-12-13 16:00:05   INFO  epoch: 20/24, acc_iter=79140, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:37:46, time_cost(all): 1 day, 0:15:38/4:11:40, loss=0.307686672414037, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.2(1.03), norm=3.778303807259126, lr=0.035496694793919634
2023-12-13 16:01:00   INFO  epoch: 20/24, acc_iter=79190, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:29, time_cost(all): 1 day, 0:16:33/4:03:09, loss=0.307499627796948, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=2.6528977572310133, lr=0.03531820224205684
2023-12-13 16:01:56   INFO  epoch: 20/24, acc_iter=79240, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:19, time_cost(all): 1 day, 0:17:29/4:08:55, loss=0.30731258317986, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.91(1.03), norm=0.5220096519957859, lr=0.035139709690194046
2023-12-13 16:02:51   INFO  epoch: 20/24, acc_iter=79290, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:23, time_cost(all): 1 day, 0:18:24/4:02:56, loss=0.307125538562771, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=2.6758522511735388, lr=0.034961217138331245
2023-12-13 16:03:46   INFO  epoch: 20/24, acc_iter=79340, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:04, time_cost(all): 1 day, 0:19:19/4:18:37, loss=0.306938493945683, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.96(1.03), norm=1.2901816165757864, lr=0.03478272458646845
2023-12-13 16:04:42   INFO  epoch: 20/24, acc_iter=79390, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:31:56, time_cost(all): 1 day, 0:20:15/4:13:10, loss=0.306751449328594, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.13(1.03), norm=3.7281057088966336, lr=0.03460423203460566
2023-12-13 16:05:37   INFO  epoch: 20/24, acc_iter=79440, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:50, time_cost(all): 1 day, 0:21:10/4:01:08, loss=0.306564404711505, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.03(1.03), norm=4.431233806329225, lr=0.034425739482742856
2023-12-13 16:06:32   INFO  epoch: 20/24, acc_iter=79490, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:29:54, time_cost(all): 1 day, 0:22:05/3:55:51, loss=0.306377360094417, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.21(1.03), norm=4.04794952757801, lr=0.03424724693088006
2023-12-13 16:07:28   INFO  epoch: 20/24, acc_iter=79540, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:29:22, time_cost(all): 1 day, 0:23:01/4:11:27, loss=0.306190315477328, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.19(1.03), norm=2.9435796183321337, lr=0.03406875437901727
2023-12-13 16:08:23   INFO  epoch: 20/24, acc_iter=79590, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:26:37, time_cost(all): 1 day, 0:23:56/4:08:51, loss=0.30600327086024, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.12(1.03), norm=3.808833411693084, lr=0.033890261827154475
2023-12-13 16:09:18   INFO  epoch: 20/24, acc_iter=79640, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:28:00, time_cost(all): 1 day, 0:24:51/3:57:46, loss=0.305816226243151, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.19(1.03), norm=1.740467495198933, lr=0.033711769275291674
2023-12-13 16:10:14   INFO  epoch: 20/24, acc_iter=79690, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:49, time_cost(all): 1 day, 0:25:47/3:57:51, loss=0.305629181626063, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.87(1.03), norm=0.5727819506175427, lr=0.03353327672342888
2023-12-13 16:11:09   INFO  epoch: 20/24, acc_iter=79740, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:25:26, time_cost(all): 1 day, 0:26:42/4:01:52, loss=0.305442137008974, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.91(1.03), norm=1.2426716574675745, lr=0.03335478417156608
2023-12-13 16:12:04   INFO  epoch: 20/24, acc_iter=79790, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:28, time_cost(all): 1 day, 0:27:37/4:06:30, loss=0.305255092391886, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.23(1.03), norm=3.227493010431423, lr=0.033176291619703285
2023-12-13 16:13:00   INFO  epoch: 20/24, acc_iter=79840, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:23:48, time_cost(all): 1 day, 0:28:33/4:00:28, loss=0.305068047774797, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.99(1.03), norm=1.5809418337968875, lr=0.03299779906784049
2023-12-13 16:13:55   INFO  epoch: 20/24, acc_iter=79890, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:37, time_cost(all): 1 day, 0:29:28/3:59:08, loss=0.304881003157709, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.88(1.03), norm=2.732653297479301, lr=0.0328193065159777
2023-12-13 16:14:50   INFO  epoch: 20/24, acc_iter=79940, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:23, time_cost(all): 1 day, 0:30:23/3:58:39, loss=0.30469395854062, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.14(1.03), norm=2.7975962957062, lr=0.032640813964114904
2023-12-13 16:15:46   INFO  epoch: 20/24, acc_iter=79990, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:34, time_cost(all): 1 day, 0:31:19/3:59:00, loss=0.304506913923532, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.0415017948306735, lr=0.03246232141225211
2023-12-13 16:16:41   INFO  epoch: 20/24, acc_iter=80040, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:19, time_cost(all): 1 day, 0:32:14/3:49:14, loss=0.304319869306443, d_time=0.00(0.00), f_time=0.91(1.01), b_time=0.97(1.03), norm=4.450026070412171, lr=0.03228382886038931
2023-12-13 16:17:36   INFO  epoch: 20/24, acc_iter=80090, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:21, time_cost(all): 1 day, 0:33:09/3:49:26, loss=0.304132824689354, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=2.956916385621705, lr=0.03210533630852651
2023-12-13 16:18:32   INFO  epoch: 20/24, acc_iter=80140, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:17, time_cost(all): 1 day, 0:34:05/3:59:14, loss=0.303945780072266, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.07(1.03), norm=3.083166459211104, lr=0.031926843756663714
2023-12-13 16:19:27   INFO  epoch: 20/24, acc_iter=80190, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:10, time_cost(all): 1 day, 0:35:00/3:46:12, loss=0.303758735455177, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=0.7367329163063925, lr=0.03174835120480092
2023-12-13 16:20:23   INFO  epoch: 20/24, acc_iter=80240, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:15:09, time_cost(all): 1 day, 0:35:56/3:53:55, loss=0.303571690838089, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.19(1.03), norm=1.8034710204693303, lr=0.031569858652938126
2023-12-13 16:21:18   INFO  epoch: 20/24, acc_iter=80290, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:30, time_cost(all): 1 day, 0:36:51/4:03:16, loss=0.303384646221, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=2.120478257924339, lr=0.03139136610107533
2023-12-13 16:22:13   INFO  epoch: 20/24, acc_iter=80340, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:59, time_cost(all): 1 day, 0:37:46/3:45:26, loss=0.303197601603912, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.06(1.03), norm=0.7623133779621862, lr=0.031212873549212535
2023-12-13 16:23:09   INFO  epoch: 20/24, acc_iter=80390, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:45, time_cost(all): 1 day, 0:38:42/4:01:47, loss=0.303010556986823, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=0.7092173933093154, lr=0.031034380997349738
2023-12-13 16:24:04   INFO  epoch: 20/24, acc_iter=80440, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:29, time_cost(all): 1 day, 0:39:37/3:53:22, loss=0.302823512369735, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.2(1.03), norm=4.515613247558661, lr=0.03085588844548694
2023-12-13 16:24:59   INFO  epoch: 20/24, acc_iter=80490, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:10:57, time_cost(all): 1 day, 0:40:32/3:42:56, loss=0.302636467752646, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.97(1.03), norm=1.7193836366155555, lr=0.030677395893624147
2023-12-13 16:25:55   INFO  epoch: 20/24, acc_iter=80540, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:50, time_cost(all): 1 day, 0:41:28/3:37:52, loss=0.302449423135558, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.1513843205680836, lr=0.03049890334176135
2023-12-13 16:26:50   INFO  epoch: 20/24, acc_iter=80590, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:08:59, time_cost(all): 1 day, 0:42:23/3:44:04, loss=0.302262378518469, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.2(1.03), norm=1.515717675842832, lr=0.030320410789898555
2023-12-13 16:27:45   INFO  epoch: 20/24, acc_iter=80640, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:20, time_cost(all): 1 day, 0:43:18/3:43:57, loss=0.302075333901381, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.11(1.03), norm=2.2765450470690007, lr=0.030141918238035758
2023-12-13 16:28:41   INFO  epoch: 20/24, acc_iter=80690, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:25, time_cost(all): 1 day, 0:44:14/3:48:22, loss=0.301888289284292, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=2.6581693522665457, lr=0.029963425686172964
2023-12-13 16:29:36   INFO  epoch: 20/24, acc_iter=80740, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:45, time_cost(all): 1 day, 0:45:09/3:53:37, loss=0.301701244667203, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.14(1.03), norm=1.9306101092256123, lr=0.029784933134310167
2023-12-13 16:30:31   INFO  epoch: 20/24, acc_iter=80790, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:50, time_cost(all): 1 day, 0:46:04/3:44:36, loss=0.301514200050115, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.0(1.03), norm=2.8488285766184727, lr=0.02960644058244737
2023-12-13 16:31:27   INFO  epoch: 20/24, acc_iter=80840, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:48, time_cost(all): 1 day, 0:47:00/3:37:32, loss=0.301327155433026, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.94(1.03), norm=3.412518008048405, lr=0.029427948030584575
2023-12-13 16:32:22   INFO  epoch: 20/24, acc_iter=80890, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:55, time_cost(all): 1 day, 0:47:55/3:49:07, loss=0.301140110815938, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.22(1.03), norm=2.131908432194926, lr=0.029249455478721778
2023-12-13 16:33:17   INFO  epoch: 20/24, acc_iter=80940, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:02, time_cost(all): 1 day, 0:48:50/3:42:13, loss=0.300953066198849, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.83(1.03), norm=3.0317145517207833, lr=0.029070962926858984
2023-12-13 16:34:13   INFO  epoch: 20/24, acc_iter=80990, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:07, time_cost(all): 1 day, 0:49:46/3:30:51, loss=0.300766021581761, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.83(1.03), norm=2.851095898163033, lr=0.028892470374996187
2023-12-13 16:35:08   INFO  epoch: 20/24, acc_iter=81040, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:06, time_cost(all): 1 day, 0:50:41/3:39:02, loss=0.300578976964672, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=0.7770748737381472, lr=0.028713977823133393
2023-12-13 16:36:03   INFO  epoch: 20/24, acc_iter=81090, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 1 day, 0:51:36/3:43:36, loss=0.300391932347584, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=3.1199565673336456, lr=0.028535485271270596
2023-12-13 16:36:59   INFO  epoch: 21/24, acc_iter=81152, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:13:47, time_cost(all): 1 day, 0:52:32/3:33:24, loss=0.300159997022394, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.06(1.03), norm=3.1416499802448707, lr=0.028314154506960728
2023-12-13 16:37:54   INFO  epoch: 21/24, acc_iter=81202, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:18, time_cost(all): 1 day, 0:53:27/3:28:26, loss=0.299972952405305, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.02(1.03), norm=3.904127688598683, lr=0.02813566195509793
2023-12-13 16:38:49   INFO  epoch: 21/24, acc_iter=81252, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:11:11, time_cost(all): 1 day, 0:54:22/3:41:19, loss=0.299785907788217, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.85(1.03), norm=2.8387725677592295, lr=0.027957169403235137
2023-12-13 16:39:45   INFO  epoch: 21/24, acc_iter=81302, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:09:55, time_cost(all): 1 day, 0:55:18/3:43:39, loss=0.299598863171128, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.09(1.03), norm=3.0550844159157915, lr=0.02777867685137234
2023-12-13 16:40:40   INFO  epoch: 21/24, acc_iter=81352, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:08:31, time_cost(all): 1 day, 0:56:13/3:24:36, loss=0.29941181855404, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.1(1.03), norm=2.7342041069428644, lr=0.027600184299509545
2023-12-13 16:41:36   INFO  epoch: 21/24, acc_iter=81402, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:06:46, time_cost(all): 1 day, 0:57:09/3:27:30, loss=0.299224773936951, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=3.0967619751059288, lr=0.02742169174764675
2023-12-13 16:42:31   INFO  epoch: 21/24, acc_iter=81452, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:06:15, time_cost(all): 1 day, 0:58:04/3:36:52, loss=0.299037729319863, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.07(1.03), norm=1.1983118843711265, lr=0.027243199195783954
2023-12-13 16:43:26   INFO  epoch: 21/24, acc_iter=81502, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:06:49, time_cost(all): 1 day, 0:58:59/3:37:52, loss=0.298850684702774, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.88(1.03), norm=3.777599821127573, lr=0.02706470664392116
2023-12-13 16:44:22   INFO  epoch: 21/24, acc_iter=81552, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:58, time_cost(all): 1 day, 0:59:55/3:29:56, loss=0.298663640085686, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.17(1.03), norm=3.921311848979444, lr=0.02688621409205836
2023-12-13 16:45:17   INFO  epoch: 21/24, acc_iter=81602, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:00:53, time_cost(all): 1 day, 1:00:50/3:39:01, loss=0.298476595468597, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.21(1.03), norm=1.14382818901223, lr=0.02670772154019557
2023-12-13 16:46:12   INFO  epoch: 21/24, acc_iter=81652, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:03:17, time_cost(all): 1 day, 1:01:45/3:26:25, loss=0.298289550851509, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.89(1.03), norm=2.47117277975435, lr=0.026529228988332768
2023-12-13 16:47:08   INFO  epoch: 21/24, acc_iter=81702, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:59:15, time_cost(all): 1 day, 1:02:41/3:32:07, loss=0.29810250623442, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.83(1.03), norm=2.1579983605724453, lr=0.026350736436469974
2023-12-13 16:48:03   INFO  epoch: 21/24, acc_iter=81752, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:00:02, time_cost(all): 1 day, 1:03:36/3:20:45, loss=0.297915461617332, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.84(1.03), norm=2.498683339039604, lr=0.026172243884607177
2023-12-13 16:48:58   INFO  epoch: 21/24, acc_iter=81802, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:56:53, time_cost(all): 1 day, 1:04:31/3:27:16, loss=0.297728417000243, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.93(1.03), norm=4.842237144294335, lr=0.025993751332744383
2023-12-13 16:49:54   INFO  epoch: 21/24, acc_iter=81852, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:56:02, time_cost(all): 1 day, 1:05:27/3:16:44, loss=0.297541372383154, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.96(1.03), norm=2.456578399757486, lr=0.025815258780881586
2023-12-13 16:50:49   INFO  epoch: 21/24, acc_iter=81902, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:56:12, time_cost(all): 1 day, 1:06:22/3:32:17, loss=0.297354327766066, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.89(1.03), norm=3.2616619887001654, lr=0.02563676622901879
2023-12-13 16:51:44   INFO  epoch: 21/24, acc_iter=81952, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:57:57, time_cost(all): 1 day, 1:07:17/3:14:07, loss=0.297167283148977, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.13(1.03), norm=1.8574046412109049, lr=0.025458273677155994
2023-12-13 16:52:40   INFO  epoch: 21/24, acc_iter=82002, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:53:31, time_cost(all): 1 day, 1:08:13/3:23:57, loss=0.296980238531889, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.0(1.03), norm=2.504262526155291, lr=0.025279781125293197
2023-12-13 16:53:35   INFO  epoch: 21/24, acc_iter=82052, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:55:49, time_cost(all): 1 day, 1:09:08/3:25:50, loss=0.2967931939148, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.08(1.03), norm=4.482226196966984, lr=0.025101288573430403
2023-12-13 16:54:30   INFO  epoch: 21/24, acc_iter=82102, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:52:23, time_cost(all): 1 day, 1:10:03/3:21:56, loss=0.296606149297712, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.22(1.03), norm=1.094229620830225, lr=0.024922796021567602
2023-12-13 16:55:26   INFO  epoch: 21/24, acc_iter=82152, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:53:32, time_cost(all): 1 day, 1:10:59/3:15:23, loss=0.296419104680623, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.99(1.03), norm=3.3339283084796465, lr=0.024744303469704812
2023-12-13 16:56:21   INFO  epoch: 21/24, acc_iter=82202, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:51:04, time_cost(all): 1 day, 1:11:54/3:21:02, loss=0.296232060063535, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.15(1.03), norm=0.5233240403314099, lr=0.02456581091784201
2023-12-13 16:57:16   INFO  epoch: 21/24, acc_iter=82252, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:51:06, time_cost(all): 1 day, 1:12:49/3:08:01, loss=0.296045015446446, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.88(1.03), norm=3.2353313299703643, lr=0.024387318365979217
2023-12-13 16:58:12   INFO  epoch: 21/24, acc_iter=82302, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:48:18, time_cost(all): 1 day, 1:13:45/3:19:29, loss=0.295857970829358, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.92(1.03), norm=3.6414950902322416, lr=0.02420882581411642
2023-12-13 16:59:07   INFO  epoch: 21/24, acc_iter=82352, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:46:32, time_cost(all): 1 day, 1:14:40/3:09:37, loss=0.295670926212269, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.96(1.03), norm=1.6595357366352852, lr=0.024030333262253626
2023-12-13 17:00:02   INFO  epoch: 21/24, acc_iter=82402, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:45:25, time_cost(all): 1 day, 1:15:35/3:19:07, loss=0.295483881595181, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.0(1.03), norm=1.6189742957019162, lr=0.02385184071039083
2023-12-13 17:00:58   INFO  epoch: 21/24, acc_iter=82452, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:47, time_cost(all): 1 day, 1:16:31/3:06:51, loss=0.295296836978092, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.08(1.03), norm=3.2937623856815517, lr=0.023673348158528035
2023-12-13 17:01:53   INFO  epoch: 21/24, acc_iter=82502, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:46:17, time_cost(all): 1 day, 1:17:26/3:21:20, loss=0.295109792361004, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.14(1.03), norm=4.303326849034438, lr=0.023494855606665237
2023-12-13 17:02:49   INFO  epoch: 21/24, acc_iter=82552, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:46:04, time_cost(all): 1 day, 1:18:22/3:03:17, loss=0.294922747743915, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.88(1.03), norm=0.9701112649279371, lr=0.023316363054802444
2023-12-13 17:03:44   INFO  epoch: 21/24, acc_iter=82602, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:43:18, time_cost(all): 1 day, 1:19:17/3:13:49, loss=0.294735703126826, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.85(1.03), norm=4.702453924542957, lr=0.023137870502939646
2023-12-13 17:04:39   INFO  epoch: 21/24, acc_iter=82652, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:42:25, time_cost(all): 1 day, 1:20:12/3:16:25, loss=0.294548658509738, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.16(1.03), norm=2.2124844740719274, lr=0.022959377951076852
2023-12-13 17:05:35   INFO  epoch: 21/24, acc_iter=82702, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:41:24, time_cost(all): 1 day, 1:21:08/3:10:43, loss=0.294361613892649, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.17(1.03), norm=2.0426364149904144, lr=0.022780885399214055
2023-12-13 17:06:30   INFO  epoch: 21/24, acc_iter=82752, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:41:25, time_cost(all): 1 day, 1:22:03/3:11:37, loss=0.294174569275561, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.17(1.03), norm=1.6410290165168613, lr=0.02260239284735126
2023-12-13 17:07:25   INFO  epoch: 21/24, acc_iter=82802, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:18, time_cost(all): 1 day, 1:22:58/3:00:32, loss=0.293987524658472, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=3.7172619999172714, lr=0.022423900295488464
2023-12-13 17:08:21   INFO  epoch: 21/24, acc_iter=82852, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:25, time_cost(all): 1 day, 1:23:54/3:02:02, loss=0.293800480041384, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.89(1.03), norm=1.8510498147098893, lr=0.02224540774362567
2023-12-13 17:09:16   INFO  epoch: 21/24, acc_iter=82902, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:36:17, time_cost(all): 1 day, 1:24:49/3:00:40, loss=0.293613435424295, d_time=0.00(0.00), f_time=1.09(1.01), b_time=1.01(1.03), norm=2.9441025260035243, lr=0.02206691519176287
2023-12-13 17:10:11   INFO  epoch: 21/24, acc_iter=82952, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:35:38, time_cost(all): 1 day, 1:25:44/3:11:45, loss=0.293426390807207, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=4.843594700804819, lr=0.02188842263990008
2023-12-13 17:11:07   INFO  epoch: 21/24, acc_iter=83002, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:35:53, time_cost(all): 1 day, 1:26:40/2:59:23, loss=0.293239346190118, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=4.701057924758301, lr=0.021709930088037278
2023-12-13 17:12:02   INFO  epoch: 21/24, acc_iter=83052, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:49, time_cost(all): 1 day, 1:27:35/3:00:40, loss=0.29305230157303, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=1.5883503182672047, lr=0.021531437536174484
2023-12-13 17:12:57   INFO  epoch: 21/24, acc_iter=83102, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:00, time_cost(all): 1 day, 1:28:30/3:02:57, loss=0.292865256955941, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.99(1.03), norm=4.5857411415719325, lr=0.021352944984311686
2023-12-13 17:13:53   INFO  epoch: 21/24, acc_iter=83152, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:32:59, time_cost(all): 1 day, 1:29:26/2:57:19, loss=0.292678212338853, d_time=0.00(0.00), f_time=1.04(1.01), b_time=1.19(1.03), norm=3.033474815674932, lr=0.021174452432448893
2023-12-13 17:14:48   INFO  epoch: 21/24, acc_iter=83202, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:33:06, time_cost(all): 1 day, 1:30:21/3:06:00, loss=0.292491167721764, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.16(1.03), norm=2.012342027801729, lr=0.020995959880586095
2023-12-13 17:15:43   INFO  epoch: 21/24, acc_iter=83252, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:32:02, time_cost(all): 1 day, 1:31:16/3:07:24, loss=0.292304123104675, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.16(1.03), norm=4.107624534057541, lr=0.0208174673287233
2023-12-13 17:16:39   INFO  epoch: 21/24, acc_iter=83302, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:40, time_cost(all): 1 day, 1:32:12/2:56:25, loss=0.292117078487587, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.11(1.03), norm=3.762336216389086, lr=0.020638974776860504
2023-12-13 17:17:34   INFO  epoch: 21/24, acc_iter=83352, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:31:12, time_cost(all): 1 day, 1:33:07/3:03:33, loss=0.291930033870498, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.17(1.03), norm=3.5163697902793793, lr=0.02046048222499771
2023-12-13 17:18:29   INFO  epoch: 21/24, acc_iter=83402, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:27:37, time_cost(all): 1 day, 1:34:02/2:50:27, loss=0.29174298925341, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=0.8846345101371845, lr=0.020281989673134913
2023-12-13 17:19:25   INFO  epoch: 21/24, acc_iter=83452, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:23, time_cost(all): 1 day, 1:34:58/2:56:46, loss=0.291555944636321, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.01(1.03), norm=4.560215068303449, lr=0.02010349712127212
2023-12-13 17:20:20   INFO  epoch: 21/24, acc_iter=83502, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:27:04, time_cost(all): 1 day, 1:35:53/2:59:06, loss=0.291368900019233, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.99(1.03), norm=3.82332037329518, lr=0.01992500456940932
2023-12-13 17:21:15   INFO  epoch: 21/24, acc_iter=83552, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:12, time_cost(all): 1 day, 1:36:48/2:59:19, loss=0.291181855402144, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.01(1.03), norm=3.810502608356546, lr=0.019746512017546528
2023-12-13 17:22:11   INFO  epoch: 21/24, acc_iter=83602, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:24:22, time_cost(all): 1 day, 1:37:44/2:47:52, loss=0.290994810785056, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.94(1.03), norm=1.0054994727493747, lr=0.019568019465683727
2023-12-13 17:23:06   INFO  epoch: 21/24, acc_iter=83652, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:12, time_cost(all): 1 day, 1:38:39/2:50:43, loss=0.290807766167967, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=1.7474522469131255, lr=0.019389526913820936
2023-12-13 17:24:01   INFO  epoch: 21/24, acc_iter=83702, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:26, time_cost(all): 1 day, 1:39:34/2:47:08, loss=0.290620721550879, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=0.7160068008652042, lr=0.019211034361958135
2023-12-13 17:24:57   INFO  epoch: 21/24, acc_iter=83752, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:23:25, time_cost(all): 1 day, 1:40:30/2:44:40, loss=0.29043367693379, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.19(1.03), norm=1.0109744310191677, lr=0.019032541810095345
2023-12-13 17:25:52   INFO  epoch: 21/24, acc_iter=83802, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:21:33, time_cost(all): 1 day, 1:41:25/2:48:21, loss=0.290246632316702, d_time=0.00(0.00), f_time=1.21(1.01), b_time=1.12(1.03), norm=2.965314683968727, lr=0.018854049258232544
2023-12-13 17:26:48   INFO  epoch: 21/24, acc_iter=83852, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:39, time_cost(all): 1 day, 1:42:21/2:52:10, loss=0.290059587699613, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=1.1293904362999891, lr=0.01867555670636975
2023-12-13 17:27:43   INFO  epoch: 21/24, acc_iter=83902, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:59, time_cost(all): 1 day, 1:43:16/2:51:49, loss=0.289872543082525, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=3.2399164138704863, lr=0.018497064154506956
2023-12-13 17:28:38   INFO  epoch: 21/24, acc_iter=83952, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:19:33, time_cost(all): 1 day, 1:44:11/2:53:25, loss=0.289685498465436, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=0.6289833393124232, lr=0.018318571602644163
2023-12-13 17:29:34   INFO  epoch: 21/24, acc_iter=84002, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:07, time_cost(all): 1 day, 1:45:07/2:46:56, loss=0.289498453848347, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.0(1.03), norm=4.467607486143945, lr=0.01814007905078136
2023-12-13 17:30:29   INFO  epoch: 21/24, acc_iter=84052, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:17:33, time_cost(all): 1 day, 1:46:02/2:36:45, loss=0.289311409231259, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.12(1.03), norm=0.7247517671394533, lr=0.017961586498918568
2023-12-13 17:31:24   INFO  epoch: 21/24, acc_iter=84102, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:04, time_cost(all): 1 day, 1:46:57/2:44:33, loss=0.28912436461417, d_time=0.00(0.00), f_time=1.1(1.01), b_time=1.14(1.03), norm=0.7336299430269291, lr=0.017783093947055767
2023-12-13 17:32:20   INFO  epoch: 21/24, acc_iter=84152, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:15:28, time_cost(all): 1 day, 1:47:53/2:40:48, loss=0.288937319997082, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.2(1.03), norm=3.9416324821583357, lr=0.017604601395192973
2023-12-13 17:33:15   INFO  epoch: 21/24, acc_iter=84202, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:40, time_cost(all): 1 day, 1:48:48/2:38:20, loss=0.288750275379993, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.06(1.03), norm=2.8808477431682378, lr=0.01742610884333018
2023-12-13 17:34:10   INFO  epoch: 21/24, acc_iter=84252, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:46, time_cost(all): 1 day, 1:49:43/2:43:30, loss=0.288563230762905, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=1.46154987754514, lr=0.017247616291467385
2023-12-13 17:35:06   INFO  epoch: 21/24, acc_iter=84302, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:18, time_cost(all): 1 day, 1:50:39/2:33:55, loss=0.288376186145816, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.98(1.03), norm=2.166432337501992, lr=0.017069123739604584
2023-12-13 17:36:01   INFO  epoch: 21/24, acc_iter=84352, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:26, time_cost(all): 1 day, 1:51:34/2:39:55, loss=0.288189141528728, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.14(1.03), norm=4.491616619936385, lr=0.01689063118774179
2023-12-13 17:36:56   INFO  epoch: 21/24, acc_iter=84402, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:33, time_cost(all): 1 day, 1:52:29/2:37:58, loss=0.288002096911639, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.95(1.03), norm=0.8157111692388836, lr=0.016712138635878997
2023-12-13 17:37:52   INFO  epoch: 21/24, acc_iter=84452, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:21, time_cost(all): 1 day, 1:53:25/2:43:05, loss=0.287815052294551, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.21(1.03), norm=3.7531056618699146, lr=0.016533646084016203
2023-12-13 17:38:47   INFO  epoch: 21/24, acc_iter=84502, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:56, time_cost(all): 1 day, 1:54:20/2:29:45, loss=0.287628007677462, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.08(1.03), norm=3.775677434874801, lr=0.016355153532153402
2023-12-13 17:39:42   INFO  epoch: 21/24, acc_iter=84552, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:33, time_cost(all): 1 day, 1:55:15/2:39:32, loss=0.287440963060374, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.21(1.03), norm=3.098652657592858, lr=0.016176660980290608
2023-12-13 17:40:38   INFO  epoch: 21/24, acc_iter=84602, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:46, time_cost(all): 1 day, 1:56:11/2:37:35, loss=0.287253918443285, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=3.59292302762459, lr=0.015998168428427814
2023-12-13 17:41:33   INFO  epoch: 21/24, acc_iter=84652, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:29, time_cost(all): 1 day, 1:57:06/2:37:28, loss=0.287066873826197, d_time=0.00(0.00), f_time=0.93(1.01), b_time=0.88(1.03), norm=0.8815735310011402, lr=0.015819675876565013
2023-12-13 17:42:28   INFO  epoch: 21/24, acc_iter=84702, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:44, time_cost(all): 1 day, 1:58:01/2:28:13, loss=0.286879829209108, d_time=0.00(0.00), f_time=0.94(1.01), b_time=1.1(1.03), norm=2.2624512486148927, lr=0.01564118332470222
2023-12-13 17:43:24   INFO  epoch: 21/24, acc_iter=84752, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:48, time_cost(all): 1 day, 1:58:57/2:33:27, loss=0.286692784592019, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.17(1.03), norm=2.300216310663003, lr=0.015462690772839419
2023-12-13 17:44:19   INFO  epoch: 21/24, acc_iter=84802, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:03:08, time_cost(all): 1 day, 1:59:52/2:35:03, loss=0.286505739974931, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.02(1.03), norm=3.5461406111854434, lr=0.015284198220976625
2023-12-13 17:45:14   INFO  epoch: 21/24, acc_iter=84852, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:02:01, time_cost(all): 1 day, 2:00:47/2:23:51, loss=0.286318695357842, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.05(1.03), norm=1.2525524382658868, lr=0.01510570566911383
2023-12-13 17:46:10   INFO  epoch: 21/24, acc_iter=84902, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:08, time_cost(all): 1 day, 2:01:43/2:29:12, loss=0.286131650740754, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.13(1.03), norm=0.7600680667717592, lr=0.014927213117251037
2023-12-13 17:47:05   INFO  epoch: 21/24, acc_iter=84952, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 1 day, 2:02:38/2:33:07, loss=0.285944606123665, d_time=0.00(0.00), f_time=0.94(1.01), b_time=0.98(1.03), norm=4.187250891803417, lr=0.014748720565388236
2023-12-13 17:48:01   INFO  epoch: 22/24, acc_iter=85014, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:10:26, time_cost(all): 1 day, 2:03:34/2:22:24, loss=0.285712670798476, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.99(1.03), norm=2.471046471912831, lr=0.014527389801078372
2023-12-13 17:48:56   INFO  epoch: 22/24, acc_iter=85064, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:06:06, time_cost(all): 1 day, 2:04:29/2:20:50, loss=0.285525626181387, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.88(1.03), norm=1.9995459119342907, lr=0.014348897249215578
2023-12-13 17:49:51   INFO  epoch: 22/24, acc_iter=85114, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:10:31, time_cost(all): 1 day, 2:05:24/2:31:39, loss=0.285338581564298, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.17(1.03), norm=4.863582968013001, lr=0.014170404697352777
2023-12-13 17:50:47   INFO  epoch: 22/24, acc_iter=85164, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:09:56, time_cost(all): 1 day, 2:06:20/2:23:19, loss=0.28515153694721, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.21(1.03), norm=0.8756370101526741, lr=0.01399191214548999
2023-12-13 17:51:42   INFO  epoch: 22/24, acc_iter=85214, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:05:39, time_cost(all): 1 day, 2:07:15/2:27:56, loss=0.284964492330121, d_time=0.00(0.00), f_time=1.12(1.01), b_time=0.88(1.03), norm=0.8479700463624849, lr=0.01381341959362719
2023-12-13 17:52:37   INFO  epoch: 22/24, acc_iter=85264, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:02:46, time_cost(all): 1 day, 2:08:10/2:26:55, loss=0.284777447713033, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.05(1.03), norm=1.4937878104825748, lr=0.013634927041764396
2023-12-13 17:53:33   INFO  epoch: 22/24, acc_iter=85314, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:07:28, time_cost(all): 1 day, 2:09:06/2:26:56, loss=0.284590403095944, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.22(1.03), norm=1.5572010013432782, lr=0.013456434489901595
2023-12-13 17:54:28   INFO  epoch: 22/24, acc_iter=85364, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:05:38, time_cost(all): 1 day, 2:10:01/2:16:05, loss=0.284403358478856, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.96(1.03), norm=2.7526927259836507, lr=0.0132779419380388
2023-12-13 17:55:23   INFO  epoch: 22/24, acc_iter=85414, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:04:05, time_cost(all): 1 day, 2:10:56/2:24:03, loss=0.284216313861767, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.99(1.03), norm=3.3395838868036, lr=0.013099449386176007
2023-12-13 17:56:19   INFO  epoch: 22/24, acc_iter=85464, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:03:59, time_cost(all): 1 day, 2:11:52/2:23:36, loss=0.284029269244679, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.93(1.03), norm=2.5108931674668544, lr=0.012920956834313213
2023-12-13 17:57:14   INFO  epoch: 22/24, acc_iter=85514, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:00:05, time_cost(all): 1 day, 2:12:47/2:18:15, loss=0.28384222462759, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.93(1.03), norm=1.4691385217828565, lr=0.012742464282450412
2023-12-13 17:58:09   INFO  epoch: 22/24, acc_iter=85564, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/0:57:50, time_cost(all): 1 day, 2:13:42/2:15:13, loss=0.283655180010502, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.16(1.03), norm=0.7582023646611448, lr=0.012563971730587618
2023-12-13 17:59:05   INFO  epoch: 22/24, acc_iter=85614, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/0:58:37, time_cost(all): 1 day, 2:14:38/2:10:36, loss=0.283468135393413, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.89(1.03), norm=4.933630003315444, lr=0.012385479178724824
2023-12-13 18:00:00   INFO  epoch: 22/24, acc_iter=85664, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/1:00:20, time_cost(all): 1 day, 2:15:33/2:18:09, loss=0.283281090776325, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.08(1.03), norm=3.0579327630127344, lr=0.01220698662686203
2023-12-13 18:00:55   INFO  epoch: 22/24, acc_iter=85714, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/1:00:03, time_cost(all): 1 day, 2:16:28/2:17:43, loss=0.283094046159236, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.96(1.03), norm=1.257609107727507, lr=0.01202849407499923
2023-12-13 18:01:51   INFO  epoch: 22/24, acc_iter=85764, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:55:19, time_cost(all): 1 day, 2:17:24/2:11:29, loss=0.282907001542147, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.14(1.03), norm=4.81181244669492, lr=0.011850001523136436
2023-12-13 18:02:46   INFO  epoch: 22/24, acc_iter=85814, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:56:41, time_cost(all): 1 day, 2:18:19/2:05:36, loss=0.282719956925059, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.11(1.03), norm=3.4021164075931165, lr=0.011671508971273642
2023-12-13 18:03:41   INFO  epoch: 22/24, acc_iter=85864, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:57:09, time_cost(all): 1 day, 2:19:14/2:09:40, loss=0.28253291230797, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=2.568456338689561, lr=0.011493016419410848
2023-12-13 18:04:37   INFO  epoch: 22/24, acc_iter=85914, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:53:17, time_cost(all): 1 day, 2:20:10/2:14:48, loss=0.282345867690882, d_time=0.00(0.00), f_time=1.05(1.01), b_time=0.93(1.03), norm=4.053734097692757, lr=0.011314523867548047
2023-12-13 18:05:32   INFO  epoch: 22/24, acc_iter=85964, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:53:04, time_cost(all): 1 day, 2:21:05/2:05:31, loss=0.282158823073793, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.12(1.03), norm=3.7733578524148403, lr=0.011136031315685246
2023-12-13 18:06:27   INFO  epoch: 22/24, acc_iter=86014, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:51:45, time_cost(all): 1 day, 2:22:00/2:09:15, loss=0.281971778456705, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.95(1.03), norm=1.0800938502763386, lr=0.010957538763822453
2023-12-13 18:07:23   INFO  epoch: 22/24, acc_iter=86064, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:53:15, time_cost(all): 1 day, 2:22:56/2:05:23, loss=0.281784733839616, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.08(1.03), norm=0.8175578204136811, lr=0.010779046211959659
2023-12-13 18:08:18   INFO  epoch: 22/24, acc_iter=86114, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:51:30, time_cost(all): 1 day, 2:23:51/2:00:36, loss=0.281597689222528, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.19(1.03), norm=1.2052512912791644, lr=0.010600553660096865
2023-12-13 18:09:14   INFO  epoch: 22/24, acc_iter=86164, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:49:35, time_cost(all): 1 day, 2:24:47/2:04:43, loss=0.281410644605439, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.11(1.03), norm=2.5468563627677194, lr=0.010422061108234064
2023-12-13 18:10:09   INFO  epoch: 22/24, acc_iter=86214, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:47:11, time_cost(all): 1 day, 2:25:42/2:02:41, loss=0.281223599988351, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.09(1.03), norm=4.385515299174875, lr=0.01024356855637127
2023-12-13 18:11:04   INFO  epoch: 22/24, acc_iter=86264, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:48:38, time_cost(all): 1 day, 2:26:37/2:01:10, loss=0.281036555371262, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=3.495462210645065, lr=0.010065076004508476
2023-12-13 18:12:00   INFO  epoch: 22/24, acc_iter=86314, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:48:34, time_cost(all): 1 day, 2:27:33/2:06:06, loss=0.280849510754174, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.21(1.03), norm=4.556862286185507, lr=0.009886583452645682
2023-12-13 18:12:55   INFO  epoch: 22/24, acc_iter=86364, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:47:21, time_cost(all): 1 day, 2:28:28/2:05:55, loss=0.280662466137085, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.85(1.03), norm=1.928386763405825, lr=0.009708090900782881
2023-12-13 18:13:50   INFO  epoch: 22/24, acc_iter=86414, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:43:05, time_cost(all): 1 day, 2:29:23/2:04:33, loss=0.280475421519996, d_time=0.00(0.00), f_time=1.2(1.01), b_time=1.22(1.03), norm=2.734401879419373, lr=0.009529598348920088
2023-12-13 18:14:46   INFO  epoch: 22/24, acc_iter=86464, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:07, time_cost(all): 1 day, 2:30:19/2:04:58, loss=0.280288376902908, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.16(1.03), norm=2.793505164994615, lr=0.009351105797057287
2023-12-13 18:15:41   INFO  epoch: 22/24, acc_iter=86514, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:44, time_cost(all): 1 day, 2:31:14/2:04:11, loss=0.280101332285819, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.0(1.03), norm=2.2379013807879624, lr=0.0091726132451945
2023-12-13 18:16:36   INFO  epoch: 22/24, acc_iter=86564, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:30, time_cost(all): 1 day, 2:32:09/1:55:24, loss=0.279914287668731, d_time=0.00(0.00), f_time=1.21(1.01), b_time=0.96(1.03), norm=4.2148634707121655, lr=0.008994120693331699
2023-12-13 18:17:32   INFO  epoch: 22/24, acc_iter=86614, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:40:56, time_cost(all): 1 day, 2:33:05/1:58:45, loss=0.279727243051642, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.97(1.03), norm=1.9670574932844245, lr=0.008815628141468905
2023-12-13 18:18:27   INFO  epoch: 22/24, acc_iter=86664, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:39:15, time_cost(all): 1 day, 2:34:00/1:54:39, loss=0.279540198434554, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.08(1.03), norm=2.243507878802914, lr=0.008637135589606104
2023-12-13 18:19:22   INFO  epoch: 22/24, acc_iter=86714, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:37:04, time_cost(all): 1 day, 2:34:55/1:49:27, loss=0.279353153817465, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.83(1.03), norm=4.712793457260874, lr=0.00845864303774331
2023-12-13 18:20:18   INFO  epoch: 22/24, acc_iter=86764, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:28, time_cost(all): 1 day, 2:35:51/1:50:32, loss=0.279166109200377, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.99(1.03), norm=3.859258593340237, lr=0.008280150485880516
2023-12-13 18:21:13   INFO  epoch: 22/24, acc_iter=86814, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:38:34, time_cost(all): 1 day, 2:36:46/1:53:39, loss=0.278979064583288, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.06(1.03), norm=1.3236275953665793, lr=0.008101657934017722
2023-12-13 18:22:08   INFO  epoch: 22/24, acc_iter=86864, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:37:25, time_cost(all): 1 day, 2:37:41/1:54:14, loss=0.2787920199662, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.83(1.03), norm=3.005106854741626, lr=0.007923165382154922
2023-12-13 18:23:04   INFO  epoch: 22/24, acc_iter=86914, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:33:54, time_cost(all): 1 day, 2:38:37/1:49:36, loss=0.278604975349111, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.94(1.03), norm=2.668907683907876, lr=0.007744672830292128
2023-12-13 18:23:59   INFO  epoch: 22/24, acc_iter=86964, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:27, time_cost(all): 1 day, 2:39:32/1:48:58, loss=0.278417930732023, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.03(1.03), norm=4.044042917333879, lr=0.007566180278429334
2023-12-13 18:24:54   INFO  epoch: 22/24, acc_iter=87014, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:35:01, time_cost(all): 1 day, 2:40:27/1:50:42, loss=0.278230886114934, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.85(1.03), norm=0.6000674229006646, lr=0.00738768772656654
2023-12-13 18:25:50   INFO  epoch: 22/24, acc_iter=87064, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:32:26, time_cost(all): 1 day, 2:41:23/1:49:54, loss=0.278043841497846, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.1(1.03), norm=0.946143430547718, lr=0.007209195174703739
2023-12-13 18:26:45   INFO  epoch: 22/24, acc_iter=87114, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:33:09, time_cost(all): 1 day, 2:42:18/1:50:34, loss=0.277856796880757, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.92(1.03), norm=4.660949683036164, lr=0.007030702622840945
2023-12-13 18:27:40   INFO  epoch: 22/24, acc_iter=87164, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:29:56, time_cost(all): 1 day, 2:43:13/1:43:26, loss=0.277669752263668, d_time=0.00(0.00), f_time=0.91(1.01), b_time=1.15(1.03), norm=3.014505179231932, lr=0.006852210070978151
2023-12-13 18:28:36   INFO  epoch: 22/24, acc_iter=87214, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:28:24, time_cost(all): 1 day, 2:44:09/1:49:21, loss=0.27748270764658, d_time=0.00(0.00), f_time=1.16(1.01), b_time=1.07(1.03), norm=2.555353533013214, lr=0.006673717519115357
2023-12-13 18:29:31   INFO  epoch: 22/24, acc_iter=87264, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:54, time_cost(all): 1 day, 2:45:04/1:48:47, loss=0.277295663029491, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.06(1.03), norm=3.0971465828806632, lr=0.006495224967252557
2023-12-13 18:30:27   INFO  epoch: 22/24, acc_iter=87314, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:24, time_cost(all): 1 day, 2:46:00/1:40:04, loss=0.277108618412403, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.83(1.03), norm=1.0915454357816863, lr=0.006316732415389763
2023-12-13 18:31:22   INFO  epoch: 22/24, acc_iter=87364, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:25:41, time_cost(all): 1 day, 2:46:55/1:48:04, loss=0.276921573795314, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.17(1.03), norm=1.619970151876939, lr=0.006138239863526962
2023-12-13 18:32:17   INFO  epoch: 22/24, acc_iter=87414, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:13, time_cost(all): 1 day, 2:47:50/1:38:28, loss=0.276734529178226, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.98(1.03), norm=4.049930349778188, lr=0.005959747311664168
2023-12-13 18:33:13   INFO  epoch: 22/24, acc_iter=87464, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:26:19, time_cost(all): 1 day, 2:48:46/1:44:42, loss=0.276547484561137, d_time=0.00(0.00), f_time=1.02(1.01), b_time=1.03(1.03), norm=3.6379155933984753, lr=0.005781254759801374
2023-12-13 18:34:08   INFO  epoch: 22/24, acc_iter=87514, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:23:45, time_cost(all): 1 day, 2:49:41/1:36:35, loss=0.276360439944049, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.08(1.03), norm=1.0052965321196332, lr=0.00560276220793858
2023-12-13 18:35:03   INFO  epoch: 22/24, acc_iter=87564, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:15, time_cost(all): 1 day, 2:50:36/1:43:38, loss=0.27617339532696, d_time=0.00(0.00), f_time=1.0(1.01), b_time=1.07(1.03), norm=0.6384084161293948, lr=0.005424269656075779
2023-12-13 18:35:59   INFO  epoch: 22/24, acc_iter=87614, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:22:05, time_cost(all): 1 day, 2:51:32/1:41:32, loss=0.275986350709872, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.09(1.03), norm=3.4463439322103677, lr=0.005245777104212986
2023-12-13 18:36:54   INFO  epoch: 22/24, acc_iter=87664, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:22:23, time_cost(all): 1 day, 2:52:27/1:36:21, loss=0.275799306092783, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.2(1.03), norm=0.5813851552571152, lr=0.005067284552350192
2023-12-13 18:37:49   INFO  epoch: 22/24, acc_iter=87714, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:41, time_cost(all): 1 day, 2:53:22/1:37:30, loss=0.275612261475695, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.96(1.03), norm=2.631611059955539, lr=0.004968883177973414
2023-12-13 18:38:45   INFO  epoch: 22/24, acc_iter=87764, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:20:14, time_cost(all): 1 day, 2:54:18/1:36:15, loss=0.275425216858606, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.87(1.03), norm=0.6771152512153947, lr=0.004918939639795153
2023-12-13 18:39:40   INFO  epoch: 22/24, acc_iter=87814, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:17:59, time_cost(all): 1 day, 2:55:13/1:30:03, loss=0.275238172241518, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.12(1.03), norm=4.609190220678318, lr=0.004868996101616892
2023-12-13 18:40:35   INFO  epoch: 22/24, acc_iter=87864, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:17:43, time_cost(all): 1 day, 2:56:08/1:37:05, loss=0.275051127624429, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=4.02014617929896, lr=0.004819052563438631
2023-12-13 18:41:31   INFO  epoch: 22/24, acc_iter=87914, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:01, time_cost(all): 1 day, 2:57:04/1:34:12, loss=0.27486408300734, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.9(1.03), norm=3.5869690225273247, lr=0.00476910902526037
2023-12-13 18:42:26   INFO  epoch: 22/24, acc_iter=87964, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:28, time_cost(all): 1 day, 2:57:59/1:28:14, loss=0.274677038390252, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.95(1.03), norm=3.0244626525425335, lr=0.004719165487082109
2023-12-13 18:43:21   INFO  epoch: 22/24, acc_iter=88014, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:21, time_cost(all): 1 day, 2:58:54/1:29:35, loss=0.274489993773163, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.15(1.03), norm=1.6880772384554974, lr=0.004669221948903848
2023-12-13 18:44:17   INFO  epoch: 22/24, acc_iter=88064, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:13:51, time_cost(all): 1 day, 2:59:50/1:34:10, loss=0.274302949156075, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.19(1.03), norm=4.920249852332695, lr=0.004619278410725586
2023-12-13 18:45:12   INFO  epoch: 22/24, acc_iter=88114, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:31, time_cost(all): 1 day, 3:00:45/1:31:01, loss=0.274115904538986, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.2(1.03), norm=0.912517795222657, lr=0.004569334872547326
2023-12-13 18:46:07   INFO  epoch: 22/24, acc_iter=88164, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:12:05, time_cost(all): 1 day, 3:01:40/1:31:51, loss=0.273928859921898, d_time=0.00(0.00), f_time=1.11(1.01), b_time=0.96(1.03), norm=3.001092685068133, lr=0.004519391334369065
2023-12-13 18:47:03   INFO  epoch: 22/24, acc_iter=88214, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:08, time_cost(all): 1 day, 3:02:36/1:24:38, loss=0.273741815304809, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.84(1.03), norm=2.176755726337247, lr=0.004469447796190803
2023-12-13 18:47:58   INFO  epoch: 22/24, acc_iter=88264, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:12, time_cost(all): 1 day, 3:03:31/1:24:37, loss=0.273554770687721, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.08(1.03), norm=4.337089020012639, lr=0.004419504258012542
2023-12-13 18:48:53   INFO  epoch: 22/24, acc_iter=88314, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:37, time_cost(all): 1 day, 3:04:26/1:27:28, loss=0.273367726070632, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.93(1.03), norm=3.1996509095913015, lr=0.004369560719834281
2023-12-13 18:49:49   INFO  epoch: 22/24, acc_iter=88364, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:50, time_cost(all): 1 day, 3:05:22/1:28:18, loss=0.273180681453544, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.86(1.03), norm=3.6086446857836427, lr=0.00431961718165602
2023-12-13 18:50:44   INFO  epoch: 22/24, acc_iter=88414, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:40, time_cost(all): 1 day, 3:06:17/1:27:03, loss=0.272993636836455, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.14(1.03), norm=0.5476431952827527, lr=0.004269673643477759
2023-12-13 18:51:40   INFO  epoch: 22/24, acc_iter=88464, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:22, time_cost(all): 1 day, 3:07:13/1:20:11, loss=0.272806592219367, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.89(1.03), norm=3.0108420799272615, lr=0.004219730105299498
2023-12-13 18:52:35   INFO  epoch: 22/24, acc_iter=88514, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:34, time_cost(all): 1 day, 3:08:08/1:17:47, loss=0.272619547602278, d_time=0.00(0.00), f_time=1.19(1.01), b_time=1.06(1.03), norm=3.131834636881935, lr=0.004169786567121237
2023-12-13 18:53:30   INFO  epoch: 22/24, acc_iter=88564, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:41, time_cost(all): 1 day, 3:09:03/1:19:29, loss=0.272432502985189, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.87(1.03), norm=1.2688795611942445, lr=0.004119843028942975
2023-12-13 18:54:26   INFO  epoch: 22/24, acc_iter=88614, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:54, time_cost(all): 1 day, 3:09:59/1:16:18, loss=0.272245458368101, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.91(1.03), norm=1.4355239031770903, lr=0.004069899490764715
2023-12-13 18:55:21   INFO  epoch: 22/24, acc_iter=88664, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:54, time_cost(all): 1 day, 3:10:54/1:17:54, loss=0.272058413751012, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.97(1.03), norm=3.967371722319596, lr=0.004019955952586454
2023-12-13 18:56:16   INFO  epoch: 22/24, acc_iter=88714, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:01:59, time_cost(all): 1 day, 3:11:49/1:17:54, loss=0.271871369133924, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.09(1.03), norm=4.84732710720334, lr=0.003970012414408192
2023-12-13 18:57:12   INFO  epoch: 22/24, acc_iter=88764, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:06, time_cost(all): 1 day, 3:12:45/1:19:36, loss=0.271684324516835, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.94(1.03), norm=4.7650220099660885, lr=0.003920068876229931
2023-12-13 18:58:07   INFO  epoch: 22/24, acc_iter=88814, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:12, time_cost(all): 1 day, 3:13:40/1:14:31, loss=0.271497279899747, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.94(1.03), norm=2.4160243286022522, lr=0.00387012533805167
2023-12-13 18:59:02   INFO  epoch: 23/24, acc_iter=88876, cur_iter=50/3862, batch_size=32, time_cost(epoch): 0:00:55/1:13:25, time_cost(all): 1 day, 3:14:35/1:13:53, loss=0.271265344574557, d_time=0.00(0.00), f_time=0.95(1.01), b_time=1.05(1.03), norm=0.7132053227091333, lr=0.003808195350710626
2023-12-13 18:59:58   INFO  epoch: 23/24, acc_iter=88926, cur_iter=100/3862, batch_size=32, time_cost(epoch): 0:01:50/1:08:01, time_cost(all): 1 day, 3:15:31/1:13:35, loss=0.271078299957469, d_time=0.00(0.00), f_time=0.92(1.01), b_time=0.89(1.03), norm=4.5531234288641595, lr=0.003758251812532365
2023-12-13 19:00:53   INFO  epoch: 23/24, acc_iter=88976, cur_iter=150/3862, batch_size=32, time_cost(epoch): 0:02:46/1:05:52, time_cost(all): 1 day, 3:16:26/1:11:33, loss=0.27089125534038, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.16(1.03), norm=2.2327659107724243, lr=0.003708308274354104
2023-12-13 19:01:48   INFO  epoch: 23/24, acc_iter=89026, cur_iter=200/3862, batch_size=32, time_cost(epoch): 0:03:41/1:09:02, time_cost(all): 1 day, 3:17:21/1:13:45, loss=0.270704210723291, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.94(1.03), norm=0.8702911564326404, lr=0.003658364736175843
2023-12-13 19:02:44   INFO  epoch: 23/24, acc_iter=89076, cur_iter=250/3862, batch_size=32, time_cost(epoch): 0:04:36/1:04:22, time_cost(all): 1 day, 3:18:17/1:08:20, loss=0.270517166106203, d_time=0.00(0.00), f_time=1.09(1.01), b_time=0.96(1.03), norm=3.591897922456815, lr=0.003608421197997582
2023-12-13 19:03:39   INFO  epoch: 23/24, acc_iter=89126, cur_iter=300/3862, batch_size=32, time_cost(epoch): 0:05:32/1:07:48, time_cost(all): 1 day, 3:19:12/1:10:34, loss=0.270330121489114, d_time=0.00(0.00), f_time=1.14(1.01), b_time=1.15(1.03), norm=1.9219376372151595, lr=0.003558477659819321
2023-12-13 19:04:34   INFO  epoch: 23/24, acc_iter=89176, cur_iter=350/3862, batch_size=32, time_cost(epoch): 0:06:27/1:03:47, time_cost(all): 1 day, 3:20:07/1:06:24, loss=0.270143076872026, d_time=0.00(0.00), f_time=0.96(1.01), b_time=1.01(1.03), norm=2.7129946309268904, lr=0.00350853412164106
2023-12-13 19:05:30   INFO  epoch: 23/24, acc_iter=89226, cur_iter=400/3862, batch_size=32, time_cost(epoch): 0:07:22/1:04:38, time_cost(all): 1 day, 3:21:03/1:09:53, loss=0.269956032254937, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.96(1.03), norm=2.4999610921465147, lr=0.003458590583462799
2023-12-13 19:06:25   INFO  epoch: 23/24, acc_iter=89276, cur_iter=450/3862, batch_size=32, time_cost(epoch): 0:08:18/1:01:07, time_cost(all): 1 day, 3:21:58/1:06:08, loss=0.269768987637849, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.95(1.03), norm=3.1874354997773025, lr=0.003408647045284537
2023-12-13 19:07:20   INFO  epoch: 23/24, acc_iter=89326, cur_iter=500/3862, batch_size=32, time_cost(epoch): 0:09:13/1:03:57, time_cost(all): 1 day, 3:22:53/1:04:55, loss=0.26958194302076, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.06(1.03), norm=1.659188795390644, lr=0.003358703507106276
2023-12-13 19:08:16   INFO  epoch: 23/24, acc_iter=89376, cur_iter=550/3862, batch_size=32, time_cost(epoch): 0:10:08/1:02:39, time_cost(all): 1 day, 3:23:49/1:05:28, loss=0.269394898403672, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.95(1.03), norm=3.1590349280419754, lr=0.003308759968928016
2023-12-13 19:09:11   INFO  epoch: 23/24, acc_iter=89426, cur_iter=600/3862, batch_size=32, time_cost(epoch): 0:11:04/1:02:45, time_cost(all): 1 day, 3:24:44/1:02:53, loss=0.269207853786583, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.2(1.03), norm=1.9736250420342523, lr=0.003258816430749754
2023-12-13 19:10:06   INFO  epoch: 23/24, acc_iter=89476, cur_iter=650/3862, batch_size=32, time_cost(epoch): 0:11:59/1:00:48, time_cost(all): 1 day, 3:25:39/1:01:38, loss=0.269020809169495, d_time=0.00(0.00), f_time=1.18(1.01), b_time=0.99(1.03), norm=2.90102931593345, lr=0.003208872892571493
2023-12-13 19:11:02   INFO  epoch: 23/24, acc_iter=89526, cur_iter=700/3862, batch_size=32, time_cost(epoch): 0:12:54/0:57:21, time_cost(all): 1 day, 3:26:35/1:05:12, loss=0.268833764552406, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.05(1.03), norm=3.7965702822939646, lr=0.003158929354393232
2023-12-13 19:11:57   INFO  epoch: 23/24, acc_iter=89576, cur_iter=750/3862, batch_size=32, time_cost(epoch): 0:13:50/0:58:30, time_cost(all): 1 day, 3:27:30/1:00:52, loss=0.268646719935318, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.14(1.03), norm=3.633304018425356, lr=0.003108985816214971
2023-12-13 19:12:53   INFO  epoch: 23/24, acc_iter=89626, cur_iter=800/3862, batch_size=32, time_cost(epoch): 0:14:45/0:56:54, time_cost(all): 1 day, 3:28:26/1:03:51, loss=0.268459675318229, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.2(1.03), norm=1.965544157258643, lr=0.00305904227803671
2023-12-13 19:13:48   INFO  epoch: 23/24, acc_iter=89676, cur_iter=850/3862, batch_size=32, time_cost(epoch): 0:15:40/0:53:34, time_cost(all): 1 day, 3:29:21/1:03:37, loss=0.26827263070114, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=3.1963199499151265, lr=0.003009098739858449
2023-12-13 19:14:43   INFO  epoch: 23/24, acc_iter=89726, cur_iter=900/3862, batch_size=32, time_cost(epoch): 0:16:36/0:52:00, time_cost(all): 1 day, 3:30:16/1:01:25, loss=0.268085586084052, d_time=0.00(0.00), f_time=1.01(1.01), b_time=1.12(1.03), norm=1.50150977964614, lr=0.002959155201680188
2023-12-13 19:15:39   INFO  epoch: 23/24, acc_iter=89776, cur_iter=950/3862, batch_size=32, time_cost(epoch): 0:17:31/0:56:20, time_cost(all): 1 day, 3:31:12/0:57:04, loss=0.267898541466963, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.95(1.03), norm=2.9847603748017764, lr=0.002909211663501926
2023-12-13 19:16:34   INFO  epoch: 23/24, acc_iter=89826, cur_iter=1000/3862, batch_size=32, time_cost(epoch): 0:18:26/0:55:10, time_cost(all): 1 day, 3:32:07/0:58:11, loss=0.267711496849875, d_time=0.00(0.00), f_time=1.07(1.01), b_time=0.94(1.03), norm=0.5617044072401713, lr=0.002859268125323665
2023-12-13 19:17:29   INFO  epoch: 23/24, acc_iter=89876, cur_iter=1050/3862, batch_size=32, time_cost(epoch): 0:19:22/0:52:13, time_cost(all): 1 day, 3:33:02/0:55:11, loss=0.267524452232786, d_time=0.00(0.00), f_time=1.14(1.01), b_time=0.97(1.03), norm=0.7544967483790471, lr=0.002809324587145404
2023-12-13 19:18:25   INFO  epoch: 23/24, acc_iter=89926, cur_iter=1100/3862, batch_size=32, time_cost(epoch): 0:20:17/0:50:28, time_cost(all): 1 day, 3:33:58/0:53:22, loss=0.267337407615698, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.02(1.03), norm=1.1961948493845935, lr=0.002759381048967143
2023-12-13 19:19:20   INFO  epoch: 23/24, acc_iter=89976, cur_iter=1150/3862, batch_size=32, time_cost(epoch): 0:21:12/0:48:15, time_cost(all): 1 day, 3:34:53/0:57:14, loss=0.267150362998609, d_time=0.00(0.00), f_time=1.15(1.01), b_time=0.96(1.03), norm=3.529647273449285, lr=0.002709437510788882
2023-12-13 19:20:15   INFO  epoch: 23/24, acc_iter=90026, cur_iter=1200/3862, batch_size=32, time_cost(epoch): 0:22:08/0:50:02, time_cost(all): 1 day, 3:35:48/0:55:06, loss=0.266963318381521, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.99(1.03), norm=0.6444610377096424, lr=0.002659493972610621
2023-12-13 19:21:11   INFO  epoch: 23/24, acc_iter=90076, cur_iter=1250/3862, batch_size=32, time_cost(epoch): 0:23:03/0:45:53, time_cost(all): 1 day, 3:36:44/0:51:05, loss=0.266776273764432, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.84(1.03), norm=2.3140265841176686, lr=0.00260955043443236
2023-12-13 19:22:06   INFO  epoch: 23/24, acc_iter=90126, cur_iter=1300/3862, batch_size=32, time_cost(epoch): 0:23:59/0:48:33, time_cost(all): 1 day, 3:37:39/0:53:28, loss=0.266589229147344, d_time=0.00(0.00), f_time=1.11(1.01), b_time=1.18(1.03), norm=2.079837776453904, lr=0.002559606896254099
2023-12-13 19:23:01   INFO  epoch: 23/24, acc_iter=90176, cur_iter=1350/3862, batch_size=32, time_cost(epoch): 0:24:54/0:46:40, time_cost(all): 1 day, 3:38:34/0:52:16, loss=0.266402184530255, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.14(1.03), norm=2.325697153578948, lr=0.002509663358075838
2023-12-13 19:23:57   INFO  epoch: 23/24, acc_iter=90226, cur_iter=1400/3862, batch_size=32, time_cost(epoch): 0:25:49/0:45:50, time_cost(all): 1 day, 3:39:30/0:50:18, loss=0.266215139913167, d_time=0.00(0.00), f_time=1.17(1.01), b_time=0.93(1.03), norm=1.2041552029262381, lr=0.002459719819897576
2023-12-13 19:24:52   INFO  epoch: 23/24, acc_iter=90276, cur_iter=1450/3862, batch_size=32, time_cost(epoch): 0:26:45/0:43:43, time_cost(all): 1 day, 3:40:25/0:51:50, loss=0.266028095296078, d_time=0.00(0.00), f_time=1.19(1.01), b_time=0.9(1.03), norm=1.457428099206219, lr=0.002409776281719315
2023-12-13 19:25:47   INFO  epoch: 23/24, acc_iter=90326, cur_iter=1500/3862, batch_size=32, time_cost(epoch): 0:27:40/0:42:20, time_cost(all): 1 day, 3:41:20/0:48:41, loss=0.265841050678989, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.92(1.03), norm=3.3618675666686118, lr=0.002359832743541054
2023-12-13 19:26:43   INFO  epoch: 23/24, acc_iter=90376, cur_iter=1550/3862, batch_size=32, time_cost(epoch): 0:28:35/0:43:46, time_cost(all): 1 day, 3:42:16/0:48:13, loss=0.265654006061901, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.98(1.03), norm=4.0177945844989456, lr=0.002309889205362793
2023-12-13 19:27:38   INFO  epoch: 23/24, acc_iter=90426, cur_iter=1600/3862, batch_size=32, time_cost(epoch): 0:29:31/0:42:14, time_cost(all): 1 day, 3:43:11/0:48:19, loss=0.265466961444812, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.04(1.03), norm=0.6203099534345583, lr=0.002259945667184532
2023-12-13 19:28:33   INFO  epoch: 23/24, acc_iter=90476, cur_iter=1650/3862, batch_size=32, time_cost(epoch): 0:30:26/0:42:21, time_cost(all): 1 day, 3:44:06/0:47:28, loss=0.265279916827724, d_time=0.00(0.00), f_time=1.04(1.01), b_time=0.91(1.03), norm=4.717473499245176, lr=0.002210002129006271
2023-12-13 19:29:29   INFO  epoch: 23/24, acc_iter=90526, cur_iter=1700/3862, batch_size=32, time_cost(epoch): 0:31:21/0:38:47, time_cost(all): 1 day, 3:45:02/0:43:15, loss=0.265092872210635, d_time=0.00(0.00), f_time=1.03(1.01), b_time=1.22(1.03), norm=2.582872941182001, lr=0.00216005859082801
2023-12-13 19:30:24   INFO  epoch: 23/24, acc_iter=90576, cur_iter=1750/3862, batch_size=32, time_cost(epoch): 0:32:17/0:38:43, time_cost(all): 1 day, 3:45:57/0:46:04, loss=0.264905827593547, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=1.0527597129716955, lr=0.002110115052649749
2023-12-13 19:31:19   INFO  epoch: 23/24, acc_iter=90626, cur_iter=1800/3862, batch_size=32, time_cost(epoch): 0:33:12/0:37:32, time_cost(all): 1 day, 3:46:52/0:42:34, loss=0.264718782976458, d_time=0.00(0.00), f_time=1.03(1.01), b_time=0.96(1.03), norm=3.1017796945125498, lr=0.002060171514471488
2023-12-13 19:32:15   INFO  epoch: 23/24, acc_iter=90676, cur_iter=1850/3862, batch_size=32, time_cost(epoch): 0:34:07/0:36:10, time_cost(all): 1 day, 3:47:48/0:44:04, loss=0.26453173835937, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.96(1.03), norm=4.1657164683228505, lr=0.002010227976293226
2023-12-13 19:33:10   INFO  epoch: 23/24, acc_iter=90726, cur_iter=1900/3862, batch_size=32, time_cost(epoch): 0:35:03/0:36:12, time_cost(all): 1 day, 3:48:43/0:40:46, loss=0.264344693742281, d_time=0.00(0.00), f_time=0.98(1.01), b_time=1.13(1.03), norm=2.8512101474276457, lr=0.001960284438114965
2023-12-13 19:34:05   INFO  epoch: 23/24, acc_iter=90776, cur_iter=1950/3862, batch_size=32, time_cost(epoch): 0:35:58/0:34:34, time_cost(all): 1 day, 3:49:38/0:41:23, loss=0.264157649125193, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.13(1.03), norm=1.975298131339157, lr=0.001910340899936705
2023-12-13 19:35:01   INFO  epoch: 23/24, acc_iter=90826, cur_iter=2000/3862, batch_size=32, time_cost(epoch): 0:36:53/0:34:34, time_cost(all): 1 day, 3:50:34/0:40:51, loss=0.263970604508104, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.12(1.03), norm=4.541545498751835, lr=0.001860397361758443
2023-12-13 19:35:56   INFO  epoch: 23/24, acc_iter=90876, cur_iter=2050/3862, batch_size=32, time_cost(epoch): 0:37:49/0:33:43, time_cost(all): 1 day, 3:51:29/0:37:20, loss=0.263783559891016, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.97(1.03), norm=2.102358724071871, lr=0.001810453823580182
2023-12-13 19:36:52   INFO  epoch: 23/24, acc_iter=90926, cur_iter=2100/3862, batch_size=32, time_cost(epoch): 0:38:44/0:31:10, time_cost(all): 1 day, 3:52:25/0:36:09, loss=0.263596515273927, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.03(1.03), norm=2.262545050291542, lr=0.001760510285401921
2023-12-13 19:37:47   INFO  epoch: 23/24, acc_iter=90976, cur_iter=2150/3862, batch_size=32, time_cost(epoch): 0:39:39/0:31:21, time_cost(all): 1 day, 3:53:20/0:36:01, loss=0.263409470656839, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.86(1.03), norm=1.6246098658028376, lr=0.001710566747223659
2023-12-13 19:38:42   INFO  epoch: 23/24, acc_iter=91026, cur_iter=2200/3862, batch_size=32, time_cost(epoch): 0:40:35/0:30:30, time_cost(all): 1 day, 3:54:15/0:34:51, loss=0.26322242603975, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.09(1.03), norm=0.867420001057565, lr=0.001660623209045399
2023-12-13 19:39:38   INFO  epoch: 23/24, acc_iter=91076, cur_iter=2250/3862, batch_size=32, time_cost(epoch): 0:41:30/0:30:45, time_cost(all): 1 day, 3:55:11/0:33:41, loss=0.263035381422661, d_time=0.00(0.00), f_time=0.97(1.01), b_time=0.95(1.03), norm=2.59733111947542, lr=0.001610679670867138
2023-12-13 19:40:33   INFO  epoch: 23/24, acc_iter=91126, cur_iter=2300/3862, batch_size=32, time_cost(epoch): 0:42:25/0:28:18, time_cost(all): 1 day, 3:56:06/0:34:23, loss=0.262848336805573, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.02(1.03), norm=4.895709446253069, lr=0.001560736132688877
2023-12-13 19:41:28   INFO  epoch: 23/24, acc_iter=91176, cur_iter=2350/3862, batch_size=32, time_cost(epoch): 0:43:21/0:27:25, time_cost(all): 1 day, 3:57:01/0:34:02, loss=0.262661292188484, d_time=0.00(0.00), f_time=1.06(1.01), b_time=1.19(1.03), norm=3.567102855999396, lr=0.001510792594510615
2023-12-13 19:42:24   INFO  epoch: 23/24, acc_iter=91226, cur_iter=2400/3862, batch_size=32, time_cost(epoch): 0:44:16/0:26:59, time_cost(all): 1 day, 3:57:57/0:31:27, loss=0.262474247571396, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.1(1.03), norm=4.178477136626394, lr=0.001460849056332354
2023-12-13 19:43:19   INFO  epoch: 23/24, acc_iter=91276, cur_iter=2450/3862, batch_size=32, time_cost(epoch): 0:45:12/0:25:25, time_cost(all): 1 day, 3:58:52/0:30:10, loss=0.262287202954307, d_time=0.00(0.00), f_time=0.96(1.01), b_time=0.98(1.03), norm=2.7657303667526127, lr=0.001410905518154093
2023-12-13 19:44:14   INFO  epoch: 23/24, acc_iter=91326, cur_iter=2500/3862, batch_size=32, time_cost(epoch): 0:46:07/0:26:08, time_cost(all): 1 day, 3:59:47/0:30:28, loss=0.262100158337219, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.09(1.03), norm=1.9381180445468675, lr=0.001360961979975833
2023-12-13 19:45:10   INFO  epoch: 23/24, acc_iter=91376, cur_iter=2550/3862, batch_size=32, time_cost(epoch): 0:47:02/0:24:38, time_cost(all): 1 day, 4:00:43/0:29:34, loss=0.26191311372013, d_time=0.00(0.00), f_time=0.99(1.01), b_time=1.21(1.03), norm=3.6384772847285696, lr=0.001311018441797571
2023-12-13 19:46:05   INFO  epoch: 23/24, acc_iter=91426, cur_iter=2600/3862, batch_size=32, time_cost(epoch): 0:47:58/0:24:02, time_cost(all): 1 day, 4:01:38/0:29:00, loss=0.261726069103042, d_time=0.00(0.00), f_time=1.13(1.01), b_time=1.22(1.03), norm=3.064898650372065, lr=0.00126107490361931
2023-12-13 19:47:00   INFO  epoch: 23/24, acc_iter=91476, cur_iter=2650/3862, batch_size=32, time_cost(epoch): 0:48:53/0:23:05, time_cost(all): 1 day, 4:02:33/0:28:34, loss=0.261539024485953, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.89(1.03), norm=3.9088090289866364, lr=0.001211131365441049
2023-12-13 19:47:56   INFO  epoch: 23/24, acc_iter=91526, cur_iter=2700/3862, batch_size=32, time_cost(epoch): 0:49:48/0:20:35, time_cost(all): 1 day, 4:03:29/0:25:16, loss=0.261351979868865, d_time=0.00(0.00), f_time=0.93(1.01), b_time=1.04(1.03), norm=3.990494100209409, lr=0.001161187827262788
2023-12-13 19:48:51   INFO  epoch: 23/24, acc_iter=91576, cur_iter=2750/3862, batch_size=32, time_cost(epoch): 0:50:44/0:19:58, time_cost(all): 1 day, 4:04:24/0:24:50, loss=0.261164935251776, d_time=0.00(0.00), f_time=1.06(1.01), b_time=0.84(1.03), norm=2.920481320179091, lr=0.001111244289084527
2023-12-13 19:49:46   INFO  epoch: 23/24, acc_iter=91626, cur_iter=2800/3862, batch_size=32, time_cost(epoch): 0:51:39/0:19:18, time_cost(all): 1 day, 4:05:19/0:23:37, loss=0.260977890634688, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.93(1.03), norm=4.325544832847209, lr=0.001061300750906265
2023-12-13 19:50:42   INFO  epoch: 23/24, acc_iter=91676, cur_iter=2850/3862, batch_size=32, time_cost(epoch): 0:52:34/0:18:16, time_cost(all): 1 day, 4:06:15/0:24:45, loss=0.260790846017599, d_time=0.00(0.00), f_time=1.01(1.01), b_time=0.86(1.03), norm=1.306256481875988, lr=0.001011357212728005
2023-12-13 19:51:37   INFO  epoch: 23/24, acc_iter=91726, cur_iter=2900/3862, batch_size=32, time_cost(epoch): 0:53:30/0:18:27, time_cost(all): 1 day, 4:07:10/0:22:11, loss=0.260603801400511, d_time=0.00(0.00), f_time=1.05(1.01), b_time=1.0(1.03), norm=3.195612255313166, lr=0.000961413674549743
2023-12-13 19:52:32   INFO  epoch: 23/24, acc_iter=91776, cur_iter=2950/3862, batch_size=32, time_cost(epoch): 0:54:25/0:16:37, time_cost(all): 1 day, 4:08:05/0:22:47, loss=0.260416756783422, d_time=0.00(0.00), f_time=1.17(1.01), b_time=1.03(1.03), norm=0.6323636106218633, lr=0.000911470136371482
2023-12-13 19:53:28   INFO  epoch: 23/24, acc_iter=91826, cur_iter=3000/3862, batch_size=32, time_cost(epoch): 0:55:20/0:16:35, time_cost(all): 1 day, 4:09:01/0:21:58, loss=0.260229712166333, d_time=0.00(0.00), f_time=0.97(1.01), b_time=1.2(1.03), norm=0.981920428325253, lr=0.000861526598193221
2023-12-13 19:54:23   INFO  epoch: 23/24, acc_iter=91876, cur_iter=3050/3862, batch_size=32, time_cost(epoch): 0:56:16/0:14:40, time_cost(all): 1 day, 4:09:56/0:19:12, loss=0.260042667549245, d_time=0.00(0.00), f_time=0.92(1.01), b_time=1.21(1.03), norm=1.1921854089266708, lr=0.000811583060014959
2023-12-13 19:55:18   INFO  epoch: 23/24, acc_iter=91926, cur_iter=3100/3862, batch_size=32, time_cost(epoch): 0:57:11/0:14:03, time_cost(all): 1 day, 4:10:51/0:19:43, loss=0.259855622932156, d_time=0.00(0.00), f_time=1.08(1.01), b_time=1.14(1.03), norm=4.106822765924412, lr=0.000761639521836699
2023-12-13 19:56:14   INFO  epoch: 23/24, acc_iter=91976, cur_iter=3150/3862, batch_size=32, time_cost(epoch): 0:58:06/0:12:40, time_cost(all): 1 day, 4:11:47/0:17:24, loss=0.259668578315068, d_time=0.00(0.00), f_time=1.18(1.01), b_time=1.17(1.03), norm=1.7936814106914887, lr=0.000711695983658438
2023-12-13 19:57:09   INFO  epoch: 23/24, acc_iter=92026, cur_iter=3200/3862, batch_size=32, time_cost(epoch): 0:59:02/0:11:43, time_cost(all): 1 day, 4:12:42/0:17:54, loss=0.259481533697979, d_time=0.00(0.00), f_time=1.13(1.01), b_time=0.87(1.03), norm=3.230271176943351, lr=0.000661752445480177
2023-12-13 19:58:05   INFO  epoch: 23/24, acc_iter=92076, cur_iter=3250/3862, batch_size=32, time_cost(epoch): 0:59:57/0:11:30, time_cost(all): 1 day, 4:13:38/0:16:48, loss=0.259294489080891, d_time=0.00(0.00), f_time=1.15(1.01), b_time=1.18(1.03), norm=2.25061773928564, lr=0.000611808907301915
2023-12-13 19:59:00   INFO  epoch: 23/24, acc_iter=92126, cur_iter=3300/3862, batch_size=32, time_cost(epoch): 1:00:52/0:10:30, time_cost(all): 1 day, 4:14:33/0:15:24, loss=0.259107444463802, d_time=0.00(0.00), f_time=0.95(1.01), b_time=0.91(1.03), norm=3.229715631352162, lr=0.000561865369123654
2023-12-13 19:59:55   INFO  epoch: 23/24, acc_iter=92176, cur_iter=3350/3862, batch_size=32, time_cost(epoch): 1:01:48/0:09:01, time_cost(all): 1 day, 4:15:28/0:14:11, loss=0.258920399846714, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.86(1.03), norm=2.7424342617502204, lr=0.000511921830945393
2023-12-13 20:00:51   INFO  epoch: 23/24, acc_iter=92226, cur_iter=3400/3862, batch_size=32, time_cost(epoch): 1:02:43/0:08:54, time_cost(all): 1 day, 4:16:24/0:14:14, loss=0.258733355229625, d_time=0.00(0.00), f_time=0.98(1.01), b_time=0.97(1.03), norm=3.1693635534074307, lr=0.000461978292767132
2023-12-13 20:01:46   INFO  epoch: 23/24, acc_iter=92276, cur_iter=3450/3862, batch_size=32, time_cost(epoch): 1:03:38/0:07:58, time_cost(all): 1 day, 4:17:19/0:12:48, loss=0.258546310612537, d_time=0.00(0.00), f_time=1.02(1.01), b_time=0.9(1.03), norm=4.291218664784456, lr=0.000412034754588871
2023-12-13 20:02:41   INFO  epoch: 23/24, acc_iter=92326, cur_iter=3500/3862, batch_size=32, time_cost(epoch): 1:04:34/0:06:30, time_cost(all): 1 day, 4:18:14/0:11:27, loss=0.258359265995448, d_time=0.00(0.00), f_time=1.07(1.01), b_time=1.21(1.03), norm=4.0334611704060475, lr=0.00036209121641061
2023-12-13 20:03:37   INFO  epoch: 23/24, acc_iter=92376, cur_iter=3550/3862, batch_size=32, time_cost(epoch): 1:05:29/0:05:30, time_cost(all): 1 day, 4:19:10/0:10:59, loss=0.25817222137836, d_time=0.00(0.00), f_time=1.16(1.01), b_time=0.9(1.03), norm=1.9931447792704922, lr=0.000312147678232349
2023-12-13 20:04:32   INFO  epoch: 23/24, acc_iter=92426, cur_iter=3600/3862, batch_size=32, time_cost(epoch): 1:06:25/0:04:54, time_cost(all): 1 day, 4:20:05/0:09:53, loss=0.257985176761271, d_time=0.00(0.00), f_time=1.2(1.01), b_time=0.99(1.03), norm=3.3151712133999327, lr=0.000262204140054088
2023-12-13 20:05:27   INFO  epoch: 23/24, acc_iter=92476, cur_iter=3650/3862, batch_size=32, time_cost(epoch): 1:07:20/0:03:50, time_cost(all): 1 day, 4:21:00/0:08:42, loss=0.257798132144182, d_time=0.00(0.00), f_time=1.1(1.01), b_time=0.96(1.03), norm=2.5513129386609106, lr=0.000212260601875827
2023-12-13 20:06:23   INFO  epoch: 23/24, acc_iter=92526, cur_iter=3700/3862, batch_size=32, time_cost(epoch): 1:08:15/0:02:56, time_cost(all): 1 day, 4:21:56/0:08:18, loss=0.257611087527094, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.87(1.03), norm=1.113667702708976, lr=0.000162317063697566
2023-12-13 20:07:18   INFO  epoch: 23/24, acc_iter=92576, cur_iter=3750/3862, batch_size=32, time_cost(epoch): 1:09:11/0:01:59, time_cost(all): 1 day, 4:22:51/0:07:15, loss=0.257424042910005, d_time=0.00(0.00), f_time=1.08(1.01), b_time=0.96(1.03), norm=3.2370890186927, lr=0.000112373525519305
2023-12-13 20:08:13   INFO  epoch: 23/24, acc_iter=92626, cur_iter=3800/3862, batch_size=32, time_cost(epoch): 1:10:06/0:01:07, time_cost(all): 1 day, 4:23:46/0:06:10, loss=0.257236998292917, d_time=0.00(0.00), f_time=1.12(1.01), b_time=1.02(1.03), norm=1.2863598406664274, lr=6.2429987341043e-05
2023-12-13 20:09:09   INFO  epoch: 23/24, acc_iter=92676, cur_iter=3850/3862, batch_size=32, time_cost(epoch): 1:11:01/0:00:13, time_cost(all): 1 day, 4:24:42/0:05:24, loss=0.257049953675828, d_time=0.00(0.00), f_time=0.99(1.01), b_time=0.84(1.03), norm=0.7437314625883704, lr=1.2486449162782e-05
2023-12-13 20:09:09   INFO  **********************End training cfgs/picture_models/picture_nuscenes_detection(default)**********************



2023-12-13 20:09:09   INFO  **********************Start evaluation cfgs/picture_models/picture_nuscenes_detection(default)**********************
2023-12-13 20:09:09   INFO  Loading NuScenes dataset
2023-12-13 20:09:09   INFO  Total samples for NuScenes dataset: 6019
2023-12-13 20:09:09   INFO  ==> Loading parameters from checkpoint xxxxxxxx to CPU
2023-12-13 20:09:09   INFO  ==> Checkpoint trained from version: pcdet+0.6.0+483aa01
2023-12-13 20:09:09   INFO  ==> Done (loaded 449/449)
2023-12-13 20:09:09   INFO  *************** Epoch 24 EVALUATION *****************
2023-12-13 20:10:27   INFO  *************** Performance of Epoch 24 *****************
2023-12-13 20:10:27   INFO  Generate label finished(sec_per_example: 0.0100 second).
2023-12-13 20:10:27   INFO  recall_roi_0.3: 0.000000
2023-12-13 20:10:27   INFO  recall_rcnn_0.3: 0.816033
2023-12-13 20:10:27   INFO  recall_roi_0.5: 0.000000
2023-12-13 20:10:27   INFO  recall_rcnn_0.5: 0.625637
2023-12-13 20:10:27   INFO  recall_roi_0.7: 0.000000
2023-12-13 20:10:27   INFO  recall_rcnn_0.7: 0.309452
2023-12-13 20:10:27   INFO  Average predicted number of objects(6019 samples): 160.890
2023-12-13 20:10:27   INFO  Deleting GT database from shared memory
2023-12-13 20:10:27   INFO  GT database has been removed from shared memory
2023-12-13 20:15:42   INFO  The predictions of NuScenes have been saved to xxxxxxxxxxxxxxxx
2023-12-13 20:15:42   INFO  ----------------Nuscene detection_cvpr_2019 results-----------------
***car error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.15, 0.14, 0.06, 0.23, 0.19 | 81.43, 89.83, 91.89, 93.68 | mean AP: 0.8979855194430711
***truck error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.29, 0.17, 0.08, 0.21, 0.20 | 43.68, 61.75, 69.76, 73.18 | mean AP: 0.6198188230941965
***construction_vehicle error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.63, 0.42, 0.76, 0.12, 0.30 | 7.15, 23.52, 36.34, 46.02 | mean AP: 0.2819406231443876
***bus error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.29, 0.16, 0.05, 0.33, 0.22 | 54.25, 75.86, 86.75, 89.21 | mean AP: 0.7663175484112302
***trailer error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.52, 0.21, 0.40, 0.15, 0.16 | 15.28, 37.61, 58.67, 67.29 | mean AP: 0.4569849407391441
***barrier error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.18, 0.27, 0.05, nan, nan | 63.05, 72.68, 76.39, 77.68 | mean AP: 0.7237369274158680
***motorcycle error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.14, 0.22, 0.23, 0.34, 0.26 | 64.67, 75.02, 76.38, 76.94 | mean AP: 0.7332336002908450
***bicycle error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.12, 0.25, 0.44, 0.15, 0.01 | 59.61, 62.35, 62.02, 62.86 | mean AP: 0.6160044208262916
***pedestrian error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.14, 0.26, 0.32, 0.20, 0.08 | 88.61, 89.62, 90.35, 91.38 | mean AP: 0.9029760404222692
***traffic_cone error@trans, scale, orient, vel, attr | AP@0.5, 1.0, 2.0, 4.0
0.12, 0.31, nan, nan, nan | 77.05, 77.92, 79.34, 81.83 | mean AP: 0.7915784059051315
--------------average performance-------------
trans_err:	 0.2582
scale_err:	 0.2416
orient_err:	 0.2648
vel_err:	 0.2161
attr_err:	 0.1774
mAP:	 0.6808
NDS:	 0.7262

2023-12-13 20:17:58   INFO  Result is save to xxxxxxxxxxxxxxx
2023-12-13 20:17:58   INFO  ****************Evaluation done.*****************
2023-12-13 20:17:58   INFO  Epoch 24 has been evaluated
2023-12-13 20:17:58   INFO  **********************End evaluation cfgs/picture_models/picture_nuscenes_detection(default)**********************
