/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Humanoid-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 3
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>

<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
 86%|████████▋ | 1726/2000 [00:01<00:00, 895.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 896.13it/s]
episodes                                   77
episode_length                      25.974026
returns                            129.207901
return_std                          47.085484
average_reward                       4.974504
round_time             0 days 00:00:02.242231
episodes_test                           399.0
episode_length_test                 25.035088
returns_test                       125.735379
return_std_test                     41.750761
average_reward_test                  5.022292
round_time_test        0 days 00:00:11.448060
round_time_total       0 days 00:00:13.082611 

=== epoch 1/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 68%|██████▊   | 1365/2000 [00:01<00:00, 905.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 894.28it/s]
episodes                                  150
episode_length                      26.546667
returns                            132.068299
return_std                          49.179146
average_reward                        4.97395
round_time             0 days 00:00:02.698472
episodes_test                           397.0
episode_length_test                 25.183879
returns_test                       126.026648
return_std_test                     34.947138
average_reward_test                  5.004241
round_time_test        0 days 00:00:11.242637
round_time_total       0 days 00:00:12.853210 

=== epoch 1/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 67%|██████▋   | 1343/2000 [00:01<00:00, 886.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 888.03it/s]
episodes                                  226
episode_length                       26.49115
returns                             131.43528
return_std                          49.945644
average_reward                       4.961293
round_time             0 days 00:00:02.704804
episodes_test                           399.0
episode_length_test                 25.062657
returns_test                       125.727015
return_std_test                     37.851641
average_reward_test                  5.016508
round_time_test        0 days 00:00:11.352990
round_time_total       0 days 00:00:12.976365 

=== epoch 1/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 73%|███████▎  | 1457/2000 [00:01<00:00, 905.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 897.00it/s]
episodes                                  306
episode_length                      26.117647
returns                            129.503655
return_std                          47.424763
average_reward                       4.958388
round_time             0 days 00:00:02.680238
episodes_test                           401.0
episode_length_test                 24.890274
returns_test                       124.783179
return_std_test                     34.081005
average_reward_test                  5.012913
round_time_test        0 days 00:00:11.331487
round_time_total       0 days 00:00:13.014231 

=== epoch 1/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 67%|██████▋   | 1346/2000 [00:01<00:00, 899.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 887.61it/s]
episodes                                  382
episode_length                       26.10733
returns                            129.342844
return_std                          46.251654
average_reward                       4.954568
round_time             0 days 00:00:02.757362
episodes_test                           391.0
episode_length_test                  25.56266
returns_test                       128.465058
return_std_test                     36.094714
average_reward_test                  5.025464
round_time_test        0 days 00:00:11.298334
round_time_total       0 days 00:00:12.974597 

=== epoch 1/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 5/2000 [00:01<09:44,  3.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [08:35<00:00,  3.88it/s]
starting training
episodes                                  408
episode_length                      24.490196
returns                             120.61575
return_std                          45.354519
average_reward                       4.925012
round_time             0 days 00:08:35.467468
episodes_test                           398.0
episode_length_test                 25.123116
returns_test                       126.130815
return_std_test                     33.621958
average_reward_test                  5.020496
round_time_test        0 days 00:00:11.463025
round_time_total       0 days 00:08:35.468556
loss_total                       33906.985885
loss_critic                      42646.681086
loss_actor                       -1051.798175
memory_size                         1609.7555 

=== epoch 1/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:58,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:04<00:00,  3.31it/s]
episodes                                  463
episode_length                       21.49892
returns                            105.392153
return_std                          41.553893
average_reward                       4.899852
round_time             0 days 00:10:04.755171
episodes_test                           633.0
episode_length_test                 15.797788
returns_test                        77.197984
return_std_test                      2.822892
average_reward_test                  4.886632
round_time_test        0 days 00:00:11.292576
round_time_total       0 days 00:10:04.756632
loss_total                  1960974280.398355
loss_critic                 2451255932.767594
loss_actor                     -152516.911519
memory_size                         1642.8175 

=== epoch 1/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:32,  2.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  524
episode_length                       19.04771
returns                             92.881684
return_std                          35.142344
average_reward                       4.876505
round_time             0 days 00:10:24.988331
episodes_test                           661.0
episode_length_test                 15.124054
returns_test                        73.049608
return_std_test                      3.583228
average_reward_test                  4.830039
round_time_test        0 days 00:00:12.849338
round_time_total       0 days 00:10:24.989512
loss_total                714964127265.280029
loss_critic               893705796984.832031
loss_actor                    -2609670.872219
memory_size                            1644.0 

=== epoch 1/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                  588
episode_length                      16.967687
returns                             82.251858
return_std                           28.75587
average_reward                       4.846627
round_time             0 days 00:10:29.956029
episodes_test                           688.0
episode_length_test                 14.520349
returns_test                        69.765496
return_std_test                      3.137651
average_reward_test                  4.804662
round_time_test        0 days 00:00:13.125950
round_time_total       0 days 00:10:29.957130
loss_total              12774657537277.951172
loss_critic             15968324316037.119141
loss_actor                    -10839940.20625
memory_size                            1644.0 

=== epoch 1/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  658
episode_length                      15.179331
returns                               73.1371
return_std                          16.219007
average_reward                       4.818398
round_time             0 days 00:10:26.363383
episodes_test                           721.0
episode_length_test                 13.851595
returns_test                         66.45685
return_std_test                      4.933399
average_reward_test                  4.797786
round_time_test        0 days 00:00:13.041542
round_time_total       0 days 00:10:26.364606
loss_total               77152116954628.09375
loss_critic              96440151744446.46875
loss_actor                      -25914255.363
memory_size                            1644.0 

=== epoch 1/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  699
episode_length                      14.271817
returns                             68.674343
return_std                           6.479477
average_reward                       4.811923
round_time             0 days 00:10:26.133279
episodes_test                           712.0
episode_length_test                 14.035112
returns_test                        67.285798
return_std_test                      3.540872
average_reward_test                  4.794107
round_time_test        0 days 00:00:13.215421
round_time_total       0 days 00:10:26.134373
loss_total               275181796981735.4375
loss_critic              343977246962221.0625
loss_actor                      -48318818.979
memory_size                            1644.0 

=== epoch 1/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:01,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                  715
episode_length                      13.969231
returns                             67.068026
return_std                           4.191006
average_reward                       4.801125
round_time             0 days 00:10:27.711823
episodes_test                           688.0
episode_length_test                  14.52907
returns_test                        69.644985
return_std_test                      5.709526
average_reward_test                  4.793511
round_time_test        0 days 00:00:13.243616
round_time_total       0 days 00:10:27.712927
loss_total                754116479082823.625
loss_critic               942645582332166.125
loss_actor                      -78985841.196
memory_size                            1644.0 

=== epoch 1/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  726
episode_length                      13.750689
returns                             65.932766
return_std                           3.670564
average_reward                        4.79487
round_time             0 days 00:10:25.032325
episodes_test                           742.0
episode_length_test                 13.467655
returns_test                        64.525986
return_std_test                      2.971469
average_reward_test                  4.791197
round_time_test        0 days 00:00:13.345004
round_time_total       0 days 00:10:25.033741
loss_total                 1723291251664486.5
loss_critic               2154114028291489.75
loss_actor                     -117819140.792
memory_size                            1644.0 

=== epoch 1/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  720
episode_length                      13.865278
returns                             66.433511
return_std                           4.118769
average_reward                       4.791367
round_time             0 days 00:10:24.751432
episodes_test                           704.0
episode_length_test                   14.1875
returns_test                        67.913796
return_std_test                      5.275496
average_reward_test                  4.786911
round_time_test        0 days 00:00:13.060225
round_time_total       0 days 00:10:24.752529
loss_total                 3440187066304954.5
loss_critic                4300233756108652.5
loss_actor                     -165815346.968
memory_size                            1644.0 

=== epoch 1/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:11,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  717
episode_length                       13.93166
returns                             66.656699
return_std                           4.154909
average_reward                       4.784573
round_time             0 days 00:10:26.325346
episodes_test                           716.0
episode_length_test                 13.952514
returns_test                         66.57591
return_std_test                      5.791268
average_reward_test                  4.771625
round_time_test        0 days 00:00:13.371284
round_time_total       0 days 00:10:26.326534
loss_total                 6294703868093661.0
loss_critic                7868379702543516.0
loss_actor                      -224117351.36
memory_size                            1644.0 

=== epoch 1/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:27,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  717
episode_length                      13.930265
returns                             66.617492
return_std                           4.365261
average_reward                       4.782241
round_time             0 days 00:10:26.320879
episodes_test                           702.0
episode_length_test                 14.240741
returns_test                        67.657595
return_std_test                      3.419036
average_reward_test                  4.751021
round_time_test        0 days 00:00:13.420611
round_time_total       0 days 00:10:26.322147
loss_total                10950087110618514.0
loss_critic               13687608637118218.0
loss_actor                     -293092787.688
memory_size                            1644.0 

=== epoch 1/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:45,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                  712
episode_length                      14.035112
returns                             67.047149
return_std                           4.724589
average_reward                        4.77712
round_time             0 days 00:10:24.197486
episodes_test                           728.0
episode_length_test                 13.728022
returns_test                        65.690113
return_std_test                      3.073508
average_reward_test                  4.785139
round_time_test        0 days 00:00:13.761540
round_time_total       0 days 00:10:24.198765
loss_total                17983018489232752.0
loss_critic               22478772718081672.0
loss_actor                     -373032522.912
memory_size                            1644.0 

=== epoch 1/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:08,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                  701
episode_length                      14.252496
returns                             68.046232
return_std                           5.005129
average_reward                       4.774329
round_time             0 days 00:10:28.757154
episodes_test                           666.0
episode_length_test                 15.003003
returns_test                        71.491757
return_std_test                      11.00994
average_reward_test                  4.765193
round_time_test        0 days 00:00:13.542524
round_time_total       0 days 00:10:28.758373
loss_total                28054833917010640.0
loss_critic               35068541779935496.0
loss_actor                     -464592110.576
memory_size                            1644.0 

=== epoch 1/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                  698
episode_length                      14.312321
returns                             68.283512
return_std                            7.00273
average_reward                       4.770988
round_time             0 days 00:10:23.284177
episodes_test                           718.0
episode_length_test                 13.910864
returns_test                        66.448113
return_std_test                       4.16543
average_reward_test                  4.776723
round_time_test        0 days 00:00:13.205787
round_time_total       0 days 00:10:23.285507
loss_total                42966675047602720.0
loss_critic               53708342857094400.0
loss_actor                     -571020127.904
memory_size                         1646.4975 

=== epoch 1/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:55,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  708
episode_length                      14.101695
returns                             67.314386
return_std                           7.308842
average_reward                       4.773515
round_time             0 days 00:10:25.471140
episodes_test                           770.0
episode_length_test                 12.975325
returns_test                        61.962451
return_std_test                       3.35756
average_reward_test                  4.775421
round_time_test        0 days 00:00:13.496156
round_time_total       0 days 00:10:25.472233
loss_total                62235278874165376.0
loss_critic               77794097227712432.0
loss_actor                     -688814926.432
memory_size                            1653.0 

=== epoch 1/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:18,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  717
episode_length                      13.938633
returns                             66.511233
return_std                           7.395215
average_reward                       4.771738
round_time             0 days 00:10:25.988882
episodes_test                           766.0
episode_length_test                 13.046997
returns_test                        62.302696
return_std_test                      3.665136
average_reward_test                  4.775283
round_time_test        0 days 00:00:13.645228
round_time_total       0 days 00:10:25.989967
loss_total                88392488705633296.0
loss_critic              110490608818846688.0
loss_actor                     -818297859.456
memory_size                            1653.0 

=== epoch 1/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                  723
episode_length                      13.789765
returns                             65.818969
return_std                           7.647406
average_reward                       4.772568
round_time             0 days 00:10:23.637392
episodes_test                           771.0
episode_length_test                 12.963684
returns_test                        61.904486
return_std_test                      2.874724
average_reward_test                  4.775253
round_time_test        0 days 00:00:13.338499
round_time_total       0 days 00:10:23.638513
loss_total               121910811985715392.0
loss_critic              152388512492136960.0
loss_actor                     -960687669.312
memory_size                            1653.0 

=== epoch 1/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  720
episode_length                         13.875
returns                             66.187356
return_std                          10.091665
average_reward                       4.770265
round_time             0 days 00:10:26.404272
episodes_test                           646.0
episode_length_test                 15.455108
returns_test                        73.486936
return_std_test                     15.182743
average_reward_test                  4.754811
round_time_test        0 days 00:00:14.016954
round_time_total       0 days 00:10:26.405363
loss_total               167090790679319680.0
loss_critic              208863484762315040.0
loss_actor                    -1121470638.272
memory_size                          1656.793 

=== epoch 1/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  730
episode_length                      13.683562
returns                             65.280405
return_std                           8.588772
average_reward                       4.770748
round_time             0 days 00:10:24.995395
episodes_test                           682.0
episode_length_test                 14.646628
returns_test                         69.59144
return_std_test                      7.156368
average_reward_test                  4.751399
round_time_test        0 days 00:00:13.593802
round_time_total       0 days 00:10:24.996832
loss_total               231200492337363936.0
loss_critic              289000610744485568.0
loss_actor                    -1306654880.896
memory_size                            1668.0 

=== epoch 1/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:37,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                  726
episode_length                      13.761708
returns                             65.639615
return_std                           8.523418
average_reward                       4.769752
round_time             0 days 00:10:24.419952
episodes_test                           729.0
episode_length_test                 13.717421
returns_test                        65.398429
return_std_test                      4.055527
average_reward_test                  4.767545
round_time_test        0 days 00:00:13.493662
round_time_total       0 days 00:10:24.421147
loss_total               304079243483306496.0
loss_critic              380099047501512832.0
loss_actor                    -1501179341.312
memory_size                            1668.0 

=== epoch 1/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  721
episode_length                      13.841886
returns                             66.014386
return_std                           8.369329
average_reward                       4.769145
round_time             0 days 00:10:24.644444
episodes_test                           741.0
episode_length_test                 13.480432
returns_test                        64.330224
return_std_test                       3.00314
average_reward_test                  4.772098
round_time_test        0 days 00:00:13.554391
round_time_total       0 days 00:10:24.645543
loss_total               402081693014484352.0
loss_critic              502602108062570368.0
loss_actor                    -1721442740.544
memory_size                            1668.0 

=== epoch 1/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:30,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  717
episode_length                      13.921897
returns                             66.344327
return_std                           8.100187
average_reward                       4.765363
round_time             0 days 00:10:24.591548
episodes_test                           747.0
episode_length_test                 13.382865
returns_test                        63.824513
return_std_test                      3.273522
average_reward_test                  4.769139
round_time_test        0 days 00:00:13.679601
round_time_total       0 days 00:10:24.592779
loss_total               508388810838700480.0
loss_critic              635486003105162624.0
loss_actor                    -1945589486.656
memory_size                            1668.0 

=== epoch 1/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                  722
episode_length                       13.83518
returns                              65.84488
return_std                           5.623625
average_reward                       4.759257
round_time             0 days 00:10:29.137179
episodes_test                           712.0
episode_length_test                 14.032303
returns_test                        66.674579
return_std_test                      5.513759
average_reward_test                  4.751543
round_time_test        0 days 00:00:13.925891
round_time_total       0 days 00:10:29.138288
loss_total               659145162344481408.0
loss_critic              823931437878888832.0
loss_actor                     -2205485754.56
memory_size                            1668.0 

=== epoch 1/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:31,  2.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  716
episode_length                      13.938547
returns                             66.285282
return_std                           6.062998
average_reward                       4.755575
round_time             0 days 00:10:26.482215
episodes_test                           698.0
episode_length_test                 14.310888
returns_test                        67.940794
return_std_test                       9.23256
average_reward_test                  4.747525
round_time_test        0 days 00:00:13.632715
round_time_total       0 days 00:10:26.483583
loss_total               838850942723638144.0
loss_critic             1048563661967707904.0
loss_actor                    -2483290612.992
memory_size                            1668.0 

=== epoch 1/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  714
episode_length                      13.980392
returns                             66.453556
return_std                           6.814696
average_reward                       4.753368
round_time             0 days 00:10:25.258729
episodes_test                           692.0
episode_length_test                 14.442197
returns_test                        68.517851
return_std_test                       8.09801
average_reward_test                  4.744332
round_time_test        0 days 00:00:13.408962
round_time_total       0 days 00:10:25.260168
loss_total              1051677194209957248.0
loss_critic             1314596471824481024.0
loss_actor                    -2779741596.416
memory_size                            1668.0 

=== epoch 1/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                  710
episode_length                      14.073239
returns                             66.884591
return_std                           8.200656
average_reward                       4.752647
round_time             0 days 00:10:30.081317
episodes_test                           749.0
episode_length_test                   13.3498
returns_test                        63.631911
return_std_test                      6.845417
average_reward_test                  4.766523
round_time_test        0 days 00:00:13.830882
round_time_total       0 days 00:10:30.082401
loss_total              1315112564234458880.0
loss_critic             1643890673604804864.0
loss_actor                    -3101970097.024
memory_size                          1670.709 

=== epoch 1/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  715
episode_length                      13.973427
returns                             66.445692
return_std                             8.2647
average_reward                       4.755182
round_time             0 days 00:10:24.832562
episodes_test                           751.0
episode_length_test                 13.303595
returns_test                        63.424199
return_std_test                       4.41346
average_reward_test                  4.767455
round_time_test        0 days 00:00:13.512410
round_time_total       0 days 00:10:24.833652
loss_total              1609899074956967168.0
loss_critic             2012373810728040192.0
loss_actor                    -3435256897.536
memory_size                            1677.0 

=== epoch 1/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                  722
episode_length                      13.828255
returns                             65.828961
return_std                           7.896955
average_reward                       4.760496
round_time             0 days 00:10:28.061170
episodes_test                           739.0
episode_length_test                 13.522327
returns_test                        64.410556
return_std_test                      5.186633
average_reward_test                  4.763295
round_time_test        0 days 00:00:13.533040
round_time_total       0 days 00:10:28.062331
loss_total              1989386272060420864.0
loss_critic             2486732801085813248.0
loss_actor                    -3827011253.376
memory_size                            1677.0 

=== epoch 1/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:58,  2.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                  731
episode_length                      13.667579
returns                             65.133685
return_std                           8.202277
average_reward                       4.765583
round_time             0 days 00:10:26.969922
episodes_test                           725.0
episode_length_test                 13.776552
returns_test                        65.701595
return_std_test                      7.884342
average_reward_test                  4.769078
round_time_test        0 days 00:00:13.426248
round_time_total       0 days 00:10:26.971187
loss_total              2478570643209270272.0
loss_critic             3098213249216394752.0
loss_actor                    -4220510359.936
memory_size                          1680.563 

=== epoch 1/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:12,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                  735
episode_length                       13.57551
returns                             64.729435
return_std                           7.577858
average_reward                       4.768107
round_time             0 days 00:10:30.833163
episodes_test                           757.0
episode_length_test                 13.202114
returns_test                        63.036034
return_std_test                      5.434064
average_reward_test                  4.774708
round_time_test        0 days 00:00:13.592884
round_time_total       0 days 00:10:30.834464
loss_total              2999398518341868032.0
loss_critic             3749248082317414400.0
loss_actor                     -4652099156.48
memory_size                            1681.0 

=== epoch 1/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  742
episode_length                      13.458221
returns                             64.172588
return_std                           6.046782
average_reward                       4.768298
round_time             0 days 00:10:25.770896
episodes_test                           726.0
episode_length_test                 13.767218
returns_test                        65.649409
return_std_test                      5.205125
average_reward_test                  4.768552
round_time_test        0 days 00:00:13.358620
round_time_total       0 days 00:10:25.772210
loss_total              3637322285304453632.0
loss_critic             4546652780867344384.0
loss_actor                    -5105818282.752
memory_size                            1681.0 

=== epoch 1/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:28,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                  746
episode_length                      13.391421
returns                             63.864986
return_std                            5.40175
average_reward                       4.769121
round_time             0 days 00:10:28.340996
episodes_test                           766.0
episode_length_test                 13.044386
returns_test                        62.243362
return_std_test                      3.587346
average_reward_test                  4.771696
round_time_test        0 days 00:00:13.614410
round_time_total       0 days 00:10:28.342196
loss_total              4313093760033119744.0
loss_critic             5391367111668152320.0
loss_actor                    -5567114331.904
memory_size                            1681.0 

=== epoch 1/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:59,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  753
episode_length                      13.270916
returns                             63.326953
return_std                           4.663789
average_reward                       4.771873
round_time             0 days 00:10:25.973385
episodes_test                           762.0
episode_length_test                 13.112861
returns_test                         62.59956
return_std_test                      3.014785
average_reward_test                   4.77391
round_time_test        0 days 00:00:13.732882
round_time_total       0 days 00:10:25.974470
loss_total              5147589031922004992.0
loss_critic             6434486172340661248.0
loss_actor                     -6063882164.48
memory_size                            1681.0 

=== epoch 1/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                  755
episode_length                      13.234437
returns                             63.139265
return_std                           3.158068
average_reward                       4.770829
round_time             0 days 00:10:27.981735
episodes_test                           758.0
episode_length_test                 13.192612
returns_test                        62.970439
return_std_test                      3.094045
average_reward_test                  4.773159
round_time_test        0 days 00:00:13.420302
round_time_total       0 days 00:10:27.983160
loss_total              6075957942061413376.0
loss_critic             7594947297147199488.0
loss_actor                    -6603719379.456
memory_size                            1681.0 

=== epoch 1/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                  754
episode_length                      13.259947
returns                             63.245135
return_std                           3.645457
average_reward                       4.769647
round_time             0 days 00:10:23.226991
episodes_test                           758.0
episode_length_test                 13.183377
returns_test                        62.896719
return_std_test                      4.154133
average_reward_test                   4.77094
round_time_test        0 days 00:00:13.665996
round_time_total       0 days 00:10:23.228337
loss_total              7176635894113885184.0
loss_critic             8970794715428715520.0
loss_actor                    -7173164859.904
memory_size                            1681.0 

=== epoch 1/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:19,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  752
episode_length                      13.285904
returns                             63.362281
return_std                           3.981051
average_reward                       4.769145
round_time             0 days 00:10:24.915468
episodes_test                           761.0
episode_length_test                  13.12615
returns_test                        62.597658
return_std_test                      3.806611
average_reward_test                  4.768942
round_time_test        0 days 00:00:13.420413
round_time_total       0 days 00:10:24.916549
loss_total              8459332196495714304.0
loss_critic            10574165080074424320.0
loss_actor                    -7773945177.344
memory_size                            1681.0 

=== epoch 1/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  751
episode_length                       13.29028
returns                             63.380631
return_std                           4.031685
average_reward                       4.768934
round_time             0 days 00:10:26.212538
episodes_test                           749.0
episode_length_test                 13.345794
returns_test                        63.657422
return_std_test                       3.68628
average_reward_test                  4.769863
round_time_test        0 days 00:00:13.262855
round_time_total       0 days 00:10:26.213774
loss_total              9893282373673998336.0
loss_critic            12366602734820665344.0
loss_actor                    -8410478153.728
memory_size                            1681.0 

=== epoch 1/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<13:35,  2.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  748
episode_length                      13.346257
returns                             63.630014
return_std                           4.237733
average_reward                       4.767638
round_time             0 days 00:10:25.862828
episodes_test                           758.0
episode_length_test                 13.188654
returns_test                        62.952606
return_std_test                      2.578907
average_reward_test                  4.773259
round_time_test        0 days 00:00:13.343625
round_time_total       0 days 00:10:25.864005
loss_total             11220498066516342784.0
loss_critic            14025622330051596288.0
loss_actor                 -8992799901.440001
memory_size                            1681.0 

=== epoch 1/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                  744
episode_length                      13.413978
returns                             63.945164
return_std                           4.474001
average_reward                       4.767047
round_time             0 days 00:10:27.272191
episodes_test                           738.0
episode_length_test                 13.542005
returns_test                        64.597664
return_std_test                      3.966815
average_reward_test                  4.770182
round_time_test        0 days 00:00:13.344074
round_time_total       0 days 00:10:27.273283
loss_total             12868203583193610240.0
loss_critic            16085254205694654464.0
loss_actor                 -9615774351.360001
memory_size                            1681.0 

=== epoch 1/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  740
episode_length                      13.491892
returns                             64.316265
return_std                           4.959953
average_reward                       4.767041
round_time             0 days 00:10:25.043804
episodes_test                           719.0
episode_length_test                  13.89847
returns_test                        66.218361
return_std_test                      4.996649
average_reward_test                  4.764457
round_time_test        0 days 00:00:13.399366
round_time_total       0 days 00:10:25.045246
loss_total             14976760301568858112.0
loss_critic            18720950093012197376.0
loss_actor                    -10375673451.52
memory_size                         1681.1525 

=== epoch 1/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                  739
episode_length                       13.49797
returns                             64.362567
return_std                           4.651832
average_reward                       4.768242
round_time             0 days 00:10:29.010323
episodes_test                           734.0
episode_length_test                 13.604905
returns_test                        64.852698
return_std_test                      3.937463
average_reward_test                   4.76673
round_time_test        0 days 00:00:13.128579
round_time_total       0 days 00:10:29.011841
loss_total             16886309612702289920.0
loss_critic            21107886674410782720.0
loss_actor                   -11015599205.888
memory_size                            1683.0 

=== epoch 1/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:15,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                  733
episode_length                      13.637108
returns                             65.061018
return_std                           8.048015
average_reward                       4.770887
round_time             0 days 00:10:25.380922
episodes_test                           723.0
episode_length_test                 13.817427
returns_test                         65.86755
return_std_test                      4.387689
average_reward_test                  4.767038
round_time_test        0 days 00:00:13.367101
round_time_total       0 days 00:10:25.382010
loss_total             19287756702490066944.0
loss_critic            24109695428962082816.0
loss_actor                -11789956732.927999
memory_size                          1687.731 

=== epoch 1/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                  734
episode_length                      13.606267
returns                             64.926648
return_std                            7.96657
average_reward                        4.77182
round_time             0 days 00:10:27.128555
episodes_test                           742.0
episode_length_test                 13.462264
returns_test                         64.19554
return_std_test                      4.436797
average_reward_test                  4.768579
round_time_test        0 days 00:00:13.245321
round_time_total       0 days 00:10:27.129638
loss_total             23088642788754563072.0
loss_critic            28860802980855046144.0
loss_actor                -12790356415.488001
memory_size                            1702.0 

=== epoch 1/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:29,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                  740
episode_length                      13.504054
returns                             64.446392
return_std                           7.851628
average_reward                       4.772385
round_time             0 days 00:10:26.539892
episodes_test                           765.0
episode_length_test                 13.056209
returns_test                        62.310109
return_std_test                      3.062748
average_reward_test                  4.772474
round_time_test        0 days 00:00:14.123317
round_time_total       0 days 00:10:26.540979
loss_total             27029329389801762816.0
loss_critic            33786661210998452224.0
loss_actor                -13832396795.903999
memory_size                            1702.0 

=== epoch 1/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:22,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  745
episode_length                      13.398658
returns                             63.951825
return_std                           7.337485
average_reward                       4.772981
round_time             0 days 00:10:27.606543
episodes_test                           767.0
episode_length_test                 13.022164
returns_test                         62.15416
return_std_test                      2.280984
average_reward_test                  4.772959
round_time_test        0 days 00:00:13.498242
round_time_total       0 days 00:10:27.607830
loss_total             31766934346818629632.0
loss_critic            39708667276565086208.0
loss_actor                -14897346402.304001
memory_size                            1702.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<10:44,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:20<00:00,  3.22it/s]
episodes                                  150
episode_length                      13.286667
returns                             63.440525
return_std                           3.151045
average_reward                       4.774859
round_time             0 days 00:10:20.623747
episodes_test                           761.0
episode_length_test                 13.134034
returns_test                        62.702959
return_std_test                      3.988543
average_reward_test                  4.774106
round_time_test        0 days 00:00:13.377257
round_time_total       0 days 00:10:20.624840
loss_total             36608882921027551232.0
loss_critic            45761102794627440640.0
loss_actor                -15993146536.959999
memory_size                            1702.0 

=== epoch 2/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                  297
episode_length                       13.43771
returns                             64.121774
return_std                           3.804754
average_reward                       4.771816
round_time             0 days 00:10:22.770434
episodes_test                           745.0
episode_length_test                 13.410738
returns_test                        63.981441
return_std_test                      5.038787
average_reward_test                  4.770925
round_time_test        0 days 00:00:13.485542
round_time_total       0 days 00:10:22.771528
loss_total             42179104541477232640.0
loss_critic            52723879773597736960.0
loss_actor                -17129276628.992001
memory_size                            1702.0 

=== epoch 2/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:19<00:00,  3.23it/s]
episodes                                  445
episode_length                      13.469663
returns                             64.241253
return_std                            3.89326
average_reward                       4.769362
round_time             0 days 00:10:20.484400
episodes_test                           740.0
episode_length_test                 13.509459
returns_test                        64.415177
return_std_test                      5.795543
average_reward_test                  4.768184
round_time_test        0 days 00:00:13.389163
round_time_total       0 days 00:10:20.485496
loss_total             48264536726195806208.0
loss_critic            60330669829123858432.0
loss_actor                -18268664141.824001
memory_size                            1702.0 

=== epoch 2/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                  597
episode_length                       13.39531
returns                             63.903266
return_std                           3.677804
average_reward                       4.770598
round_time             0 days 00:10:21.858517
episodes_test                           728.0
episode_length_test                 13.726648
returns_test                        65.390971
return_std_test                      6.323184
average_reward_test                  4.763817
round_time_test        0 days 00:00:13.179216
round_time_total       0 days 00:10:21.859614
loss_total             55323403150029987840.0
loss_critic            69154252789372461056.0
loss_actor                   -19489598349.312
memory_size                            1702.0 

=== epoch 2/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                  746
episode_length                      13.404826
returns                              63.94375
return_std                           3.749853
average_reward                       4.770204
round_time             0 days 00:10:21.869125
episodes_test                           746.0
episode_length_test                 13.402145
returns_test                        63.954706
return_std_test                      3.456485
average_reward_test                  4.772002
round_time_test        0 days 00:00:13.501639
round_time_total       0 days 00:10:21.870587
loss_total             63577464072392859648.0
loss_critic            79471828784821010432.0
loss_actor                   -20842825714.688
memory_size                            1702.0 

=== epoch 2/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                  742
episode_length                      13.451482
returns                              64.14797
return_std                           3.969339
average_reward                       4.768837
round_time             0 days 00:10:19.363518
episodes_test                           736.0
episode_length_test                 13.580163
returns_test                        64.717104
return_std_test                       4.80396
average_reward_test                  4.765577
round_time_test        0 days 00:00:13.206463
round_time_total       0 days 00:10:19.364595
loss_total             72112305405573070848.0
loss_critic            90140380228920049664.0
loss_actor                -22223445810.175999
memory_size                            1702.0 

=== epoch 2/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:51,  2.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                   747
episode_length                       13.372155
returns                              63.770961
return_std                             4.14786
average_reward                         4.76895
round_time              0 days 00:10:23.684360
episodes_test                            726.0
episode_length_test                  13.774105
returns_test                         65.631888
return_std_test                       5.854662
average_reward_test                   4.764875
round_time_test         0 days 00:00:14.065795
round_time_total        0 days 00:10:23.685445
loss_total              80715872263902740480.0
loss_critic            100894838482424020992.0
loss_actor                 -23644099939.327999
memory_size                             1702.0 

=== epoch 2/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                   745
episode_length                       13.394631
returns                              63.873356
return_std                            4.359019
average_reward                        4.768585
round_time              0 days 00:10:26.251333
episodes_test                            742.0
episode_length_test                   13.47035
returns_test                         64.193967
return_std_test                       4.018929
average_reward_test                   4.765603
round_time_test         0 days 00:00:13.227985
round_time_total        0 days 00:10:26.252422
loss_total              92022344128944340992.0
loss_critic            115027928176012165120.0
loss_actor                 -25060772230.144001
memory_size                             1702.0 

=== epoch 2/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:20<00:00,  3.22it/s]
episodes                                   736
episode_length                       13.569293
returns                              64.657006
return_std                            4.643586
average_reward                        4.764958
round_time              0 days 00:10:21.505726
episodes_test                            682.0
episode_length_test                   14.66129
returns_test                         69.573682
return_std_test                       5.997672
average_reward_test                   4.745415
round_time_test         0 days 00:00:13.457936
round_time_total        0 days 00:10:21.507189
loss_total             101185554757609111552.0
loss_critic            126481941326053457920.0
loss_actor                 -26497792065.535999
memory_size                             1702.0 

=== epoch 2/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                   727
episode_length                       13.755158
returns                              65.531818
return_std                            7.881708
average_reward                        4.764163
round_time              0 days 00:10:22.368044
episodes_test                            719.0
episode_length_test                  13.908206
returns_test                         66.215928
return_std_test                       4.819628
average_reward_test                   4.760925
round_time_test         0 days 00:00:13.492367
round_time_total        0 days 00:10:22.369244
loss_total             112910377760110100480.0
loss_critic            141137969720738906112.0
loss_actor                 -27932242029.568001
memory_size                          1716.1955 

=== epoch 2/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:17,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                   730
episode_length                       13.691781
returns                              65.241803
return_std                            7.820307
average_reward                         4.76498
round_time              0 days 00:10:25.904421
episodes_test                            712.0
episode_length_test                  14.044944
returns_test                         66.783761
return_std_test                       5.939826
average_reward_test                   4.755004
round_time_test         0 days 00:00:13.195275
round_time_total        0 days 00:10:25.905511
loss_total             121130655223437852672.0
loss_critic            151413316673043890176.0
loss_actor                    -29113897276.416
memory_size                             1724.0 

=== epoch 2/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   730
episode_length                       13.687671
returns                              65.226899
return_std                            7.590112
average_reward                        4.765382
round_time              0 days 00:10:25.124583
episodes_test                            773.0
episode_length_test                  12.921087
returns_test                         61.679523
return_std_test                       2.163632
average_reward_test                   4.773568
round_time_test         0 days 00:00:13.660453
round_time_total        0 days 00:10:25.126061
loss_total             137327338864677814272.0
loss_critic            171659170754552627200.0
loss_actor                 -30779561945.088001
memory_size                             1724.0 

=== epoch 2/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:50,  2.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   730
episode_length                       13.690411
returns                              65.224371
return_std                            7.418665
average_reward                        4.764265
round_time              0 days 00:10:25.128183
episodes_test                            726.0
episode_length_test                  13.767218
returns_test                         65.576558
return_std_test                       4.079436
average_reward_test                   4.763263
round_time_test         0 days 00:00:13.583187
round_time_total        0 days 00:10:25.129263
loss_total             152035598840160124928.0
loss_critic            190044495339076452352.0
loss_actor                 -32455095470.080002
memory_size                             1724.0 

=== epoch 2/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:03,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                   738
episode_length                        13.52981
returns                              64.507134
return_std                            7.260593
average_reward                        4.767791
round_time              0 days 00:10:27.213587
episodes_test                            738.0
episode_length_test                  13.547425
returns_test                         64.538517
return_std_test                       3.663986
average_reward_test                   4.763922
round_time_test         0 days 00:00:13.490787
round_time_total        0 days 00:10:27.214665
loss_total             167229782602926227456.0
loss_critic            209037224557099679744.0
loss_actor                 -34119459485.695999
memory_size                             1724.0 

=== epoch 2/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                   752
episode_length                       13.295213
returns                              63.414597
return_std                            3.023656
average_reward                        4.769767
round_time              0 days 00:10:22.387173
episodes_test                            751.0
episode_length_test                  13.304927
returns_test                         63.490651
return_std_test                       3.013536
average_reward_test                   4.771962
round_time_test         0 days 00:00:13.454255
round_time_total        0 days 00:10:22.388239
loss_total             187562850846150656000.0
loss_critic            234453559534025506816.0
loss_actor                 -35920851362.816002
memory_size                             1724.0 

=== epoch 2/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:45,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                   751
episode_length                       13.292943
returns                              63.409726
return_std                            2.844272
average_reward                        4.770166
round_time              0 days 00:10:23.978973
episodes_test                            748.0
episode_length_test                  13.367647
returns_test                         63.777243
return_std_test                       4.189957
average_reward_test                   4.771032
round_time_test         0 days 00:00:13.408011
round_time_total        0 days 00:10:23.980106
loss_total             206817688742251134976.0
loss_critic            258522106485786935296.0
loss_actor                 -37740545736.704002
memory_size                             1724.0 

=== epoch 2/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:12,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   750
episode_length                       13.313333
returns                              63.507654
return_std                            2.787646
average_reward                        4.770267
round_time              0 days 00:10:25.401258
episodes_test                            746.0
episode_length_test                  13.391421
returns_test                         63.885046
return_std_test                       2.878568
average_reward_test                   4.770618
round_time_test         0 days 00:00:13.557280
round_time_total        0 days 00:10:25.402562
loss_total             228452883060197523456.0
loss_critic            285566099340338987008.0
loss_actor                     -39652622407.68
memory_size                             1724.0 

=== epoch 2/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:09,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                   755
episode_length                       13.227815
returns                               63.14488
return_std                            2.466894
average_reward                        4.773643
round_time              0 days 00:10:25.866070
episodes_test                            757.0
episode_length_test                  13.192867
returns_test                         62.975287
return_std_test                       3.190669
average_reward_test                    4.77345
round_time_test         0 days 00:00:13.596738
round_time_total        0 days 00:10:25.867580
loss_total             252674952208105701376.0
loss_critic            315843685162796253184.0
loss_actor                 -41679633446.912003
memory_size                             1724.0 

=== epoch 2/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:42,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                   755
episode_length                       13.233113
returns                              63.169404
return_std                            2.371512
average_reward                        4.773584
round_time              0 days 00:10:23.647527
episodes_test                            742.0
episode_length_test                  13.474394
returns_test                         64.257942
return_std_test                       3.775779
average_reward_test                    4.76893
round_time_test         0 days 00:00:13.452670
round_time_total        0 days 00:10:23.648691
loss_total             274066753495209279488.0
loss_critic            342583436187835039744.0
loss_actor                 -43546019358.720001
memory_size                             1724.0 

=== epoch 2/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:25,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                   752
episode_length                       13.281915
returns                              63.396167
return_std                            2.447474
average_reward                        4.773121
round_time              0 days 00:10:26.030110
episodes_test                            745.0
episode_length_test                  13.421477
returns_test                         64.061324
return_std_test                       2.803749
average_reward_test                   4.773062
round_time_test         0 days 00:00:13.086751
round_time_total        0 days 00:10:26.031529
loss_total             303069493360081764352.0
loss_critic            378836859676421914624.0
loss_actor                 -45693207304.192001
memory_size                             1724.0 

=== epoch 2/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                   749
episode_length                       13.336449
returns                              63.656957
return_std                            3.258012
average_reward                        4.773162
round_time              0 days 00:10:22.028937
episodes_test                            742.0
episode_length_test                   13.47035
returns_test                         64.290147
return_std_test                       2.633088
average_reward_test                   4.772734
round_time_test         0 days 00:00:13.188640
round_time_total        0 days 00:10:22.030092
loss_total             328787271245323108352.0
loss_critic            410984081784484003840.0
loss_actor                 -47692244699.136002
memory_size                             1724.0 

=== epoch 2/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   747
episode_length                       13.360107
returns                               63.76735
return_std                            3.285193
average_reward                        4.772967
round_time              0 days 00:10:24.726179
episodes_test                            758.0
episode_length_test                  13.189974
returns_test                         62.991354
return_std_test                       4.102459
average_reward_test                    4.77573
round_time_test         0 days 00:00:13.243408
round_time_total        0 days 00:10:24.727281
loss_total             358658889403229667328.0
loss_critic            448323603817762127872.0
loss_actor                    -49784029044.736
memory_size                             1724.0 

=== epoch 2/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                   746
episode_length                       13.391421
returns                              63.911856
return_std                            3.369118
average_reward                        4.772588
round_time              0 days 00:10:24.461381
episodes_test                            749.0
episode_length_test                  13.344459
returns_test                         63.712897
return_std_test                       3.917947
average_reward_test                   4.774505
round_time_test         0 days 00:00:13.403432
round_time_total        0 days 00:10:24.462723
loss_total             394653102744093392896.0
loss_critic            493316369642819747840.0
loss_actor                 -52169656870.912003
memory_size                             1724.0 

=== epoch 2/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<13:09,  2.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   738
episode_length                       13.517615
returns                              64.472424
return_std                            3.721177
average_reward                        4.769527
round_time              0 days 00:10:24.641347
episodes_test                            723.0
episode_length_test                  13.828492
returns_test                         65.954328
return_std_test                        3.15266
average_reward_test                   4.769486
round_time_test         0 days 00:00:12.748488
round_time_total        0 days 00:10:24.642429
loss_total             428735092116406468608.0
loss_critic            535918856017362550784.0
loss_actor                 -54438894055.424004
memory_size                             1724.0 

=== epoch 2/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:27,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                   731
episode_length                       13.662107
returns                              65.114361
return_std                            3.951738
average_reward                        4.766069
round_time              0 days 00:10:26.747603
episodes_test                            694.0
episode_length_test                  14.393372
returns_test                         68.335537
return_std_test                        7.04808
average_reward_test                    4.74774
round_time_test         0 days 00:00:13.187635
round_time_total        0 days 00:10:26.749037
loss_total             453164278974437392384.0
loss_critic            566455339004961030144.0
loss_actor                 -56304763512.832001
memory_size                             1724.0 

=== epoch 2/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                   726
episode_length                       13.758953
returns                              65.524499
return_std                            3.526131
average_reward                         4.76233
round_time              0 days 00:10:27.769936
episodes_test                            706.0
episode_length_test                  14.152975
returns_test                         67.244838
return_std_test                        3.42955
average_reward_test                    4.75131
round_time_test         0 days 00:00:13.252745
round_time_total        0 days 00:10:27.771378
loss_total             482873322164384497664.0
loss_critic            603591642491017625600.0
loss_actor                 -58108668004.351997
memory_size                             1724.0 

=== epoch 2/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:33,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   718
episode_length                       13.910864
returns                              66.218285
return_std                            3.481841
average_reward                        4.760196
round_time              0 days 00:10:25.061598
episodes_test                            707.0
episode_length_test                  14.141443
returns_test                          67.26351
return_std_test                       3.334191
average_reward_test                   4.756515
round_time_test         0 days 00:00:13.327709
round_time_total        0 days 00:10:25.062672
loss_total             530684664716720340992.0
loss_critic            663355819929371541504.0
loss_actor                 -60492571961.344002
memory_size                             1724.0 

=== epoch 2/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                   711
episode_length                       14.036568
returns                                66.7735
return_std                            3.169685
average_reward                        4.757137
round_time              0 days 00:10:22.809477
episodes_test                            711.0
episode_length_test                  14.046414
returns_test                         66.717751
return_std_test                        3.00571
average_reward_test                   4.749849
round_time_test         0 days 00:00:13.288138
round_time_total        0 days 00:10:22.810647
loss_total             573565348362238689280.0
loss_critic            716956673116277964800.0
loss_actor                 -62948975253.503998
memory_size                             1724.0 

=== epoch 2/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                   709
episode_length                        14.09732
returns                              67.048822
return_std                            3.302585
average_reward                        4.756168
round_time              0 days 00:10:26.532499
episodes_test                            710.0
episode_length_test                  14.069014
returns_test                         66.783498
return_std_test                       4.726779
average_reward_test                   4.746873
round_time_test         0 days 00:00:13.658823
round_time_total        0 days 00:10:26.533960
loss_total             616092967178050994176.0
loss_critic            770116195318828367872.0
loss_actor                 -65388157667.328003
memory_size                             1724.0 

=== epoch 2/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                   708
episode_length                       14.088983
returns                              67.047569
return_std                            4.010511
average_reward                        4.758703
round_time              0 days 00:10:24.263495
episodes_test                            706.0
episode_length_test                  14.145892
returns_test                         67.189276
return_std_test                       4.378931
average_reward_test                   4.749671
round_time_test         0 days 00:00:13.177441
round_time_total        0 days 00:10:24.264918
loss_total             665006444678557597696.0
loss_critic            831258039962452885504.0
loss_actor                 -68008997566.463997
memory_size                             1724.0 

=== epoch 2/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   707
episode_length                       14.118812
returns                              67.192349
return_std                            3.967692
average_reward                        4.759107
round_time              0 days 00:10:25.098085
episodes_test                            718.0
episode_length_test                  13.915042
returns_test                         66.367536
return_std_test                       2.666072
average_reward_test                   4.769508
round_time_test         0 days 00:00:13.074509
round_time_total        0 days 00:10:25.099172
loss_total             723144941644977012736.0
loss_critic            903931161473942552576.0
loss_actor                 -70647553566.720001
memory_size                             1724.0 

=== epoch 2/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:16,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                   707
episode_length                       14.120226
returns                              67.189016
return_std                            4.037533
average_reward                        4.758337
round_time              0 days 00:10:28.710419
episodes_test                            720.0
episode_length_test                  13.888889
returns_test                         66.079446
return_std_test                       2.953969
average_reward_test                    4.75772
round_time_test         0 days 00:00:12.948156
round_time_total        0 days 00:10:28.711854
loss_total             772250893273874563072.0
loss_critic            965313599580699361280.0
loss_actor                    -73092950949.888
memory_size                             1724.0 

=== epoch 2/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    704
episode_length                        14.178977
returns                               67.460552
return_std                             6.066232
average_reward                         4.757624
round_time               0 days 00:10:26.750822
episodes_test                             532.0
episode_length_test                   18.740602
returns_test                          88.974763
return_std_test                       32.918341
average_reward_test                    4.748105
round_time_test          0 days 00:00:13.230199
round_time_total         0 days 00:10:26.752029
loss_total              814427558507024220160.0
loss_critic            1018034430911030099968.0
loss_actor                  -75030355767.296005
memory_size                             1739.54 

=== epoch 2/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    708
episode_length                        14.096045
returns                               67.095902
return_std                             5.940454
average_reward                         4.759923
round_time               0 days 00:10:25.539348
episodes_test                             715.0
episode_length_test                   13.981818
returns_test                          66.517828
return_std_test                        3.692484
average_reward_test                    4.757483
round_time_test          0 days 00:00:13.177932
round_time_total         0 days 00:10:25.540532
loss_total              843588090864497459200.0
loss_critic            1054485095201185464320.0
loss_actor                     -77009454403.584
memory_size                              1740.0 

=== epoch 2/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    712
episode_length                         14.04073
returns                               66.829209
return_std                             5.426126
average_reward                         4.759663
round_time               0 days 00:10:25.536009
episodes_test                             724.0
episode_length_test                   13.808011
returns_test                          65.877094
return_std_test                        2.834119
average_reward_test                    4.770961
round_time_test          0 days 00:00:12.821764
round_time_total         0 days 00:10:25.537097
loss_total              879350929373171679232.0
loss_critic            1099188642628942692352.0
loss_actor                  -78742768310.272003
memory_size                              1740.0 

=== epoch 2/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:19,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                    714
episode_length                        13.987395
returns                               66.577066
return_std                             5.374652
average_reward                         4.759758
round_time               0 days 00:10:22.493804
episodes_test                             733.0
episode_length_test                   13.627558
returns_test                          65.016705
return_std_test                        2.621691
average_reward_test                    4.770991
round_time_test          0 days 00:00:13.156554
round_time_total         0 days 00:10:22.495132
loss_total              958967723941457821696.0
loss_critic            1198709635091632488448.0
loss_actor                  -82151527673.856003
memory_size                              1740.0 

=== epoch 2/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<13:11,  2.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    712
episode_length                        14.016854
returns                               66.733935
return_std                             5.308969
average_reward                         4.760673
round_time               0 days 00:10:26.586514
episodes_test                             713.0
episode_length_test                   14.007013
returns_test                          66.671365
return_std_test                        5.593011
average_reward_test                    4.759791
round_time_test          0 days 00:00:13.372813
round_time_total         0 days 00:10:26.587593
loss_total             1036246436588150980608.0
loss_critic            1295308024879652274176.0
loss_actor                  -85245260210.175995
memory_size                              1740.0 

=== epoch 2/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                    711
episode_length                        14.056259
returns                               66.951545
return_std                             3.181619
average_reward                         4.763131
round_time               0 days 00:10:24.009305
episodes_test                             696.0
episode_length_test                   14.364943
returns_test                          68.394487
return_std_test                        4.115635
average_reward_test                     4.76123
round_time_test          0 days 00:00:12.759247
round_time_total         0 days 00:10:24.010412
loss_total             1120568573821609181184.0
loss_critic            1400710693193308962816.0
loss_actor                  -88194521550.848007
memory_size                              1740.0 

=== epoch 2/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    705
episode_length                        14.168794
returns                               67.532227
return_std                             4.850796
average_reward                         4.766283
round_time               0 days 00:10:27.238287
episodes_test                             689.0
episode_length_test                   14.507983
returns_test                          69.073889
return_std_test                        3.965613
average_reward_test                    4.761123
round_time_test          0 days 00:00:13.088347
round_time_total         0 days 00:10:27.239358
loss_total             1207635962029188317184.0
loss_critic            1509544924736433422336.0
loss_actor                  -91315398410.240005
memory_size                           1740.7335 

=== epoch 2/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    708
episode_length                        14.121469
returns                                67.32292
return_std                             4.909564
average_reward                         4.767437
round_time               0 days 00:10:27.085538
episodes_test                             729.0
episode_length_test                   13.702332
returns_test                           65.37873
return_std_test                        2.641639
average_reward_test                    4.771374
round_time_test          0 days 00:00:13.063247
round_time_total         0 days 00:10:27.086613
loss_total             1255572979182791557120.0
loss_critic            1569466195347206569984.0
loss_actor                  -93968848867.328003
memory_size                              1742.0 

=== epoch 2/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:35,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                    706
episode_length                        14.137394
returns                                67.43366
return_std                             4.896183
average_reward                         4.769861
round_time               0 days 00:10:31.340811
episodes_test                             716.0
episode_length_test                   13.965084
returns_test                          66.633161
return_std_test                        2.606807
average_reward_test                    4.771427
round_time_test          0 days 00:00:13.008948
round_time_total         0 days 00:10:31.341905
loss_total             1364228737541098700800.0
loss_critic            1705285891184028286976.0
loss_actor                  -97404096102.399994
memory_size                              1742.0 

=== epoch 2/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:55,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    712
episode_length                        14.035112
returns                               66.969269
return_std                             4.958162
average_reward                         4.771567
round_time               0 days 00:10:26.601557
episodes_test                             719.0
episode_length_test                   13.897079
returns_test                          66.285792
return_std_test                        3.063057
average_reward_test                    4.769787
round_time_test          0 days 00:00:13.007949
round_time_total         0 days 00:10:26.602762
loss_total             1444898921564136538112.0
loss_critic            1806123620245255225344.0
loss_actor                 -100802928717.824005
memory_size                              1742.0 

=== epoch 2/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:28,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    713
episode_length                        13.992987
returns                               66.742477
return_std                             5.148232
average_reward                         4.769685
round_time               0 days 00:10:25.891833
episodes_test                             715.0
episode_length_test                   13.969231
returns_test                          66.670575
return_std_test                       10.930356
average_reward_test                     4.77267
round_time_test          0 days 00:00:12.917900
round_time_total         0 days 00:10:25.893218
loss_total             1582359700957674340352.0
loss_critic            1977949593097394978816.0
loss_actor                 -104252646060.031998
memory_size                              1742.0 

=== epoch 2/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    719
episode_length                        13.888734
returns                               66.205864
return_std                             3.591693
average_reward                         4.766887
round_time               0 days 00:10:25.077187
episodes_test                             706.0
episode_length_test                   14.150142
returns_test                          67.494564
return_std_test                        7.642618
average_reward_test                     4.76989
round_time_test          0 days 00:00:13.014162
round_time_total         0 days 00:10:25.078264
loss_total             1693927916471407083520.0
loss_critic            2117409859402132094976.0
loss_actor                 -107831497347.072006
memory_size                              1742.0 

=== epoch 2/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.19it/s]
episodes                                    717
episode_length                        13.919107
returns                               66.336965
return_std                             3.602929
average_reward                         4.765885
round_time               0 days 00:10:26.474088
episodes_test                             740.0
episode_length_test                   13.510811
returns_test                          64.449123
return_std_test                        2.938265
average_reward_test                    4.770215
round_time_test          0 days 00:00:13.409214
round_time_total         0 days 00:10:26.475410
loss_total             1778092333599711821824.0
loss_critic            2222615380566222569472.0
loss_actor                 -111469332025.343994
memory_size                              1742.0 

=== epoch 2/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:20,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    720
episode_length                        13.861111
returns                               66.049584
return_std                             4.247344
average_reward                         4.765112
round_time               0 days 00:10:24.548076
episodes_test                             716.0
episode_length_test                   13.952514
returns_test                          66.443831
return_std_test                        4.353404
average_reward_test                    4.762166
round_time_test          0 days 00:00:13.259686
round_time_total         0 days 00:10:24.549215
loss_total             1941572661529232539648.0
loss_critic            2426965786238406623232.0
loss_actor                 -114932389806.080002
memory_size                            1742.285 

=== epoch 2/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    717
episode_length                        13.923291
returns                               66.342443
return_std                             4.280612
average_reward                         4.764793
round_time               0 days 00:10:27.896917
episodes_test                             749.0
episode_length_test                   13.337784
returns_test                          63.668472
return_std_test                        2.433758
average_reward_test                    4.773551
round_time_test          0 days 00:00:13.467505
round_time_total         0 days 00:10:27.897990
loss_total             2045361262535912718336.0
loss_critic            2556701536406041264128.0
loss_actor                 -118617367052.287994
memory_size                              1745.0 

=== epoch 2/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    717
episode_length                        13.930265
returns                               66.403924
return_std                             3.998302
average_reward                         4.766889
round_time               0 days 00:10:27.483047
episodes_test                             692.0
episode_length_test                   14.440751
returns_test                          68.846659
return_std_test                        3.055074
average_reward_test                    4.767543
round_time_test          0 days 00:00:12.857299
round_time_total         0 days 00:10:27.484128
loss_total             2200546919535197028352.0
loss_critic            2750683604567717838848.0
loss_actor                 -122721662517.248001
memory_size                              1745.0 

=== epoch 2/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:16,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    716
episode_length                        13.941341
returns                               66.465676
return_std                             4.032137
average_reward                         4.767513
round_time               0 days 00:10:27.855815
episodes_test                             716.0
episode_length_test                   13.956704
returns_test                           66.56925
return_std_test                        3.069184
average_reward_test                    4.769707
round_time_test          0 days 00:00:13.407159
round_time_total         0 days 00:10:27.856899
loss_total             2352210414954259415040.0
loss_critic            2940262968862957371392.0
loss_actor                 -126625101262.848007
memory_size                              1745.0 

=== epoch 2/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:40,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    715
episode_length                        13.965035
returns                               66.583763
return_std                             4.010669
average_reward                         4.767897
round_time               0 days 00:10:25.915224
episodes_test                             720.0
episode_length_test                   13.873611
returns_test                          66.179656
return_std_test                        2.880642
average_reward_test                    4.770197
round_time_test          0 days 00:00:13.039069
round_time_total         0 days 00:10:25.916300
loss_total             2506014885537008058368.0
loss_critic            3132518553185927823360.0
loss_actor                 -130475390615.552002
memory_size                              1745.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:18,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    145
episode_length                        13.724138
returns                               65.408368
return_std                             3.827534
average_reward                          4.76592
round_time               0 days 00:10:25.847474
episodes_test                             726.0
episode_length_test                   13.769972
returns_test                          65.647327
return_std_test                        3.180409
average_reward_test                    4.767445
round_time_test          0 days 00:00:12.822000
round_time_total         0 days 00:10:25.848590
loss_total             2647437138580668940288.0
loss_critic            3309296365734572064768.0
loss_actor                 -134358295191.552002
memory_size                              1745.0 

=== epoch 3/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:40,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                    291
episode_length                        13.714777
returns                               65.391203
return_std                             3.361728
average_reward                          4.76795
round_time               0 days 00:10:42.505128
episodes_test                             733.0
episode_length_test                   13.627558
returns_test                          65.012657
return_std_test                        3.078351
average_reward_test                    4.770683
round_time_test          0 days 00:00:13.105922
round_time_total         0 days 00:10:42.506213
loss_total             2814373650039211294720.0
loss_critic            3517966996279249076224.0
loss_actor                 -138521997033.471985
memory_size                              1745.0 

=== epoch 3/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:23,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                    435
episode_length                        13.788506
returns                               65.741137
return_std                             3.282734
average_reward                         4.767864
round_time               0 days 00:11:03.334194
episodes_test                             706.0
episode_length_test                   14.152975
returns_test                          67.472439
return_std_test                        3.006332
average_reward_test                    4.767391
round_time_test          0 days 00:00:12.927684
round_time_total         0 days 00:11:03.335312
loss_total             2955931830058322231296.0
loss_critic            3694914721056847560704.0
loss_actor                 -142618850418.687988
memory_size                              1745.0 

=== epoch 3/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:36,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                    580
episode_length                         13.77931
returns                               65.711485
return_std                             3.219498
average_reward                         4.768879
round_time               0 days 00:10:24.460305
episodes_test                             725.0
episode_length_test                   13.790345
returns_test                          65.804059
return_std_test                        3.048426
average_reward_test                    4.771768
round_time_test          0 days 00:00:13.044264
round_time_total         0 days 00:10:24.461714
loss_total             3176644052219367784448.0
loss_critic            3970804997860952637440.0
loss_actor                 -147233653334.015991
memory_size                              1745.0 

=== epoch 3/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    722
episode_length                         13.84903
returns                               66.037008
return_std                             3.252083
average_reward                         4.768365
round_time               0 days 00:10:25.878466
episodes_test                             716.0
episode_length_test                   13.958101
returns_test                          66.579574
return_std_test                        3.227239
average_reward_test                    4.769981
round_time_test          0 days 00:00:13.037315
round_time_total         0 days 00:10:25.879638
loss_total             3371124666537625518080.0
loss_critic            4213905762046824153088.0
loss_actor                 -151785433272.320007
memory_size                              1745.0 

=== epoch 3/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    716
episode_length                        13.955307
returns                                66.55353
return_std                             3.126192
average_reward                         4.769063
round_time               0 days 00:10:26.426175
episodes_test                             705.0
episode_length_test                   14.165957
returns_test                          67.544599
return_std_test                        3.567064
average_reward_test                    4.768115
round_time_test          0 days 00:00:13.173622
round_time_total         0 days 00:10:26.427255
loss_total             3590822403061534687232.0
loss_critic            4488527927019433885696.0
loss_actor                 -155967406772.223999
memory_size                              1745.0 

=== epoch 3/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:09,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                    712
episode_length                        14.022472
returns                               66.874555
return_std                             3.132236
average_reward                         4.769067
round_time               0 days 00:10:26.253585
episodes_test                             703.0
episode_length_test                   14.214794
returns_test                          67.771088
return_std_test                         2.99768
average_reward_test                    4.767658
round_time_test          0 days 00:00:12.826887
round_time_total         0 days 00:10:26.255079
loss_total             3824710057858570911744.0
loss_critic            4780887498295294820352.0
loss_actor                 -161280794345.471985
memory_size                              1745.0 

=== epoch 3/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:29,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                    711
episode_length                        14.032349
returns                               66.924205
return_std                             3.130522
average_reward                         4.769269
round_time               0 days 00:10:23.866635
episodes_test                             704.0
episode_length_test                   14.191761
returns_test                          67.665691
return_std_test                        2.885123
average_reward_test                    4.767962
round_time_test          0 days 00:00:13.247656
round_time_total         0 days 00:10:23.867724
loss_total             4048114596334627454976.0
loss_critic            5060143158811953201152.0
loss_actor                 -164843663007.743988
memory_size                              1745.0 

=== epoch 3/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    706
episode_length                        14.137394
returns                               67.473898
return_std                             7.390258
average_reward                         4.772712
round_time               0 days 00:10:25.551807
episodes_test                             723.0
episode_length_test                   13.813278
returns_test                          65.895356
return_std_test                        3.139058
average_reward_test                    4.770439
round_time_test          0 days 00:00:13.212267
round_time_total         0 days 00:10:25.552895
loss_total             4156043493838538407936.0
loss_critic            5195054282204769157120.0
loss_actor                 -168799290195.967987
memory_size                           1749.4415 

=== epoch 3/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    709
episode_length                        14.080395
returns                                 67.2102
return_std                             7.405582
average_reward                         4.773327
round_time               0 days 00:10:28.539917
episodes_test                             690.0
episode_length_test                   14.486957
returns_test                          69.009296
return_std_test                         6.94201
average_reward_test                     4.76358
round_time_test          0 days 00:00:12.965331
round_time_total         0 days 00:10:28.541020
loss_total             4461415451470174093312.0
loss_critic            5576769218618633224192.0
loss_actor                  -174475279114.23999
memory_size                              1763.0 

=== epoch 3/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    713
episode_length                        14.012623
returns                               66.885592
return_std                             7.401298
average_reward                         4.773229
round_time               0 days 00:10:24.861617
episodes_test                             711.0
episode_length_test                   14.057665
returns_test                          67.068029
return_std_test                        3.155067
average_reward_test                    4.770939
round_time_test          0 days 00:00:13.025963
round_time_total         0 days 00:10:24.862867
loss_total             4697887434496300548096.0
loss_critic            5872359187848734179328.0
loss_actor                 -179196688982.015991
memory_size                              1763.0 

=== epoch 3/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    712
episode_length                         14.02809
returns                               66.953616
return_std                             7.425006
average_reward                          4.77281
round_time               0 days 00:10:26.629606
episodes_test                             721.0
episode_length_test                   13.862691
returns_test                           66.09323
return_std_test                        3.229185
average_reward_test                    4.767729
round_time_test          0 days 00:00:13.275341
round_time_total         0 days 00:10:26.630785
loss_total             5033905836182713925632.0
loss_critic            6292382173490464161792.0
loss_actor                 -184990164664.320007
memory_size                              1763.0 

=== epoch 3/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:53,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    715
episode_length                        13.977622
returns                               66.716579
return_std                             7.419239
average_reward                         4.773126
round_time               0 days 00:10:28.394220
episodes_test                             724.0
episode_length_test                   13.809392
returns_test                          65.853361
return_std_test                        2.927847
average_reward_test                    4.768769
round_time_test          0 days 00:00:13.095835
round_time_total         0 days 00:10:28.395628
loss_total             5401820510337179320320.0
loss_critic            6752275520581593464832.0
loss_actor                 -191189801025.536011
memory_size                              1763.0 

=== epoch 3/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    721
episode_length                        13.861304
returns                               66.114076
return_std                             3.150532
average_reward                         4.769706
round_time               0 days 00:10:25.196338
episodes_test                             727.0
episode_length_test                   13.755158
returns_test                          65.642831
return_std_test                         3.31215
average_reward_test                    4.772234
round_time_test          0 days 00:00:13.187432
round_time_total         0 days 00:10:25.197442
loss_total             5677986505743614869504.0
loss_critic            7097483009245322412032.0
loss_actor                 -196688277913.600006
memory_size                              1763.0 

=== epoch 3/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:33,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                    721
episode_length                        13.855756
returns                               66.081849
return_std                             3.752317
average_reward                         4.769251
round_time               0 days 00:10:31.406887
episodes_test                             741.0
episode_length_test                    13.48583
returns_test                          64.360396
return_std_test                        2.764479
average_reward_test                    4.772458
round_time_test          0 days 00:00:13.056000
round_time_total         0 days 00:10:31.408030
loss_total             6095378604309987983360.0
loss_critic            7619223118098065260544.0
loss_actor                 -201819514028.032013
memory_size                           1763.2795 

=== epoch 3/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                    723
episode_length                        13.817427
returns                               65.900463
return_std                             3.662802
average_reward                         4.769397
round_time               0 days 00:10:25.094465
episodes_test                             732.0
episode_length_test                   13.644809
returns_test                          65.069889
return_std_test                        2.946949
average_reward_test                    4.768861
round_time_test          0 days 00:00:13.143703
round_time_total         0 days 00:10:25.095871
loss_total             6432556259984296378368.0
loss_critic            8040695190259409879040.0
loss_actor                 -207657411215.359985
memory_size                              1764.0 

=== epoch 3/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:08,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    728
episode_length                        13.726648
returns                               65.478112
return_std                              3.64446
average_reward                         4.770154
round_time               0 days 00:10:27.665974
episodes_test                             730.0
episode_length_test                   13.693151
returns_test                          65.315746
return_std_test                        3.087723
average_reward_test                    4.769989
round_time_test          0 days 00:00:13.286278
round_time_total         0 days 00:10:27.667403
loss_total             6741401918458653835264.0
loss_critic            8426752255928454610944.0
loss_actor                 -213551444566.015991
memory_size                              1764.0 

=== epoch 3/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:36,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                    729
episode_length                        13.698217
returns                               65.337555
return_std                             3.697252
average_reward                         4.769789
round_time               0 days 00:10:28.410721
episodes_test                             736.0
episode_length_test                   13.581522
returns_test                            64.7798
return_std_test                         2.93798
average_reward_test                    4.769722
round_time_test          0 days 00:00:13.206626
round_time_total         0 days 00:10:28.411793
loss_total             7157518385659898757120.0
loss_critic            8946897836481940815872.0
loss_actor                 -219099223900.160004
memory_size                              1764.0 

=== epoch 3/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                    725
episode_length                        13.768276
returns                               65.678669
return_std                              5.04966
average_reward                         4.770281
round_time               0 days 00:10:35.512006
episodes_test                             725.0
episode_length_test                   13.777931
returns_test                          65.725597
return_std_test                        2.782413
average_reward_test                    4.770375
round_time_test          0 days 00:00:13.219743
round_time_total         0 days 00:10:35.513104
loss_total             7651242280711362707456.0
loss_critic            9564052697590894952448.0
loss_actor                 -224224508649.471985
memory_size                           1769.2745 

=== epoch 3/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:47,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                    725
episode_length                         13.77931
returns                               65.717345
return_std                             4.614867
average_reward                         4.769273
round_time               0 days 00:10:27.295015
episodes_test                             723.0
episode_length_test                   13.821577
returns_test                          65.931944
return_std_test                        2.652816
average_reward_test                    4.770229
round_time_test          0 days 00:00:13.144065
round_time_total         0 days 00:10:27.296274
loss_total             7965163367331702767616.0
loss_critic            9956454049779422330880.0
loss_actor                 -230104663728.127991
memory_size                              1771.0 

=== epoch 3/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     720
episode_length                         13.866667
returns                                66.116023
return_std                              4.695253
average_reward                          4.767997
round_time                0 days 00:10:29.790613
episodes_test                              714.0
episode_length_test                    14.002801
returns_test                           66.765859
return_std_test                         2.749649
average_reward_test                     4.768066
round_time_test           0 days 00:00:12.874173
round_time_total          0 days 00:10:29.791697
loss_total              8389684373063076937728.0
loss_critic            10487105295403167252480.0
loss_actor                  -235728427237.376007
memory_size                               1771.0 

=== epoch 3/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:09,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                     717
episode_length                         13.910739
returns                                66.319403
return_std                              4.575364
average_reward                          4.767318
round_time                0 days 00:10:30.690542
episodes_test                              711.0
episode_length_test                    14.053446
returns_test                            66.97468
return_std_test                         2.716877
average_reward_test                     4.765727
round_time_test           0 days 00:00:12.828755
round_time_total          0 days 00:10:30.691798
loss_total              8830084168390309576704.0
loss_critic            11037605014581303836672.0
loss_actor                  -241644602474.496002
memory_size                               1771.0 

=== epoch 3/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                     717
episode_length                         13.933054
returns                                 66.42728
return_std                              4.608216
average_reward                          4.767599
round_time                0 days 00:10:33.822746
episodes_test                              712.0
episode_length_test                    14.026685
returns_test                            66.92009
return_std_test                            4.284
average_reward_test                     4.770907
round_time_test           0 days 00:00:12.944752
round_time_total          0 days 00:10:33.824169
loss_total              9312221778218277928960.0
loss_critic            11640277015220235272192.0
loss_actor                  -247441703583.743988
memory_size                               1771.0 

=== epoch 3/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:58,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                     719
episode_length                          13.89847
returns                                66.239518
return_std                              3.235868
average_reward                          4.765961
round_time                0 days 00:10:32.190697
episodes_test                              727.0
episode_length_test                    13.752407
returns_test                           65.622051
return_std_test                         2.539784
average_reward_test                      4.77171
round_time_test           0 days 00:00:12.993220
round_time_total          0 days 00:10:32.191806
loss_total              9705837293407259590656.0
loss_critic            12132296411669369716736.0
loss_actor                  -253268217651.200012
memory_size                               1771.0 

=== epoch 3/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:16,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     715
episode_length                         13.962238
returns                                66.565542
return_std                              3.242757
average_reward                          4.767523
round_time                0 days 00:10:28.843844
episodes_test                              712.0
episode_length_test                    14.030899
returns_test                           66.772711
return_std_test                         3.907871
average_reward_test                     4.758989
round_time_test           0 days 00:00:13.266467
round_time_total          0 days 00:10:28.844910
loss_total              9936594257131054563328.0
loss_critic            12420742600596698169344.0
loss_actor                  -258141430439.936005
memory_size                               1771.0 

=== epoch 3/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:28,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     715
episode_length                         13.974825
returns                                66.641216
return_std                              3.254144
average_reward                          4.768691
round_time                0 days 00:10:29.535954
episodes_test                              699.0
episode_length_test                    14.304721
returns_test                           68.237492
return_std_test                         2.798363
average_reward_test                     4.770295
round_time_test           0 days 00:00:13.415584
round_time_total          0 days 00:10:29.537029
loss_total              9798991078350529757184.0
loss_critic            12248738632011670880256.0
loss_actor                  -257076692811.776001
memory_size                               1771.0 

=== epoch 3/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     711
episode_length                         14.059072
returns                                67.046916
return_std                              3.322489
average_reward                          4.768973
round_time                0 days 00:10:29.853882
episodes_test                              708.0
episode_length_test                    14.117232
returns_test                           67.354702
return_std_test                         2.574115
average_reward_test                     4.771127
round_time_test           0 days 00:00:13.193903
round_time_total          0 days 00:10:29.854967
loss_total              9579219064732023521280.0
loss_critic            11974023623643892809728.0
loss_actor                  -256441212715.007996
memory_size                               1771.0 

=== epoch 3/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:14,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                     706
episode_length                         14.133144
returns                                67.405022
return_std                              3.044006
average_reward                          4.769301
round_time                0 days 00:10:24.684070
episodes_test                              701.0
episode_length_test                    14.263909
returns_test                           68.044263
return_std_test                         2.644465
average_reward_test                     4.770392
round_time_test           0 days 00:00:12.713084
round_time_total          0 days 00:10:24.685320
loss_total              9585433698529208958976.0
loss_critic            11981791917825515847680.0
loss_actor                  -257688127823.872009
memory_size                               1771.0 

=== epoch 3/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     705
episode_length                         14.165957
returns                                67.574022
return_std                              2.790785
average_reward                          4.770156
round_time                0 days 00:10:27.782919
episodes_test                              713.0
episode_length_test                    14.009818
returns_test                           66.840986
return_std_test                         2.575339
average_reward_test                     4.771024
round_time_test           0 days 00:00:13.266812
round_time_total          0 days 00:10:27.784330
loss_total              9881112334769512775680.0
loss_critic            12351390204013142081536.0
loss_actor                  -263184862494.720001
memory_size                               1771.0 

=== epoch 3/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     705
episode_length                         14.177305
returns                                67.626429
return_std                              2.865354
average_reward                          4.770068
round_time                0 days 00:10:26.565957
episodes_test                              708.0
episode_length_test                    14.122881
returns_test                           67.384167
return_std_test                         2.510628
average_reward_test                      4.77129
round_time_test           0 days 00:00:12.951152
round_time_total          0 days 00:10:26.567419
loss_total             10538152252854922903552.0
loss_critic            13172690068229937168384.0
loss_actor                   -269954481930.23999
memory_size                               1771.0 

=== epoch 3/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:35,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     702
episode_length                         14.219373
returns                                67.841278
return_std                              4.223138
average_reward                          4.771025
round_time                0 days 00:10:29.447259
episodes_test                              691.0
episode_length_test                     14.45152
returns_test                           68.897218
return_std_test                         4.862245
average_reward_test                     4.767483
round_time_test           0 days 00:00:13.015579
round_time_total          0 days 00:10:29.448639
loss_total             10875404794946784329728.0
loss_critic            13594255746865110712320.0
loss_actor                  -274930666708.992004
memory_size                            1772.0775 

=== epoch 3/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     702
episode_length                          14.22792
returns                                67.871233
return_std                              4.271732
average_reward                          4.770274
round_time                0 days 00:10:26.671597
episodes_test                              700.0
episode_length_test                    14.277143
returns_test                           68.091211
return_std_test                         2.952043
average_reward_test                     4.769264
round_time_test           0 days 00:00:12.790489
round_time_total          0 days 00:10:26.672880
loss_total             11422166054400667156480.0
loss_critic            14277707319528799600640.0
loss_actor                  -278908460834.815979
memory_size                               1773.0 

=== epoch 3/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:21,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     700
episode_length                         14.268571
returns                                68.057451
return_std                              4.289555
average_reward                          4.769752
round_time                0 days 00:10:28.894546
episodes_test                              694.0
episode_length_test                    14.394813
returns_test                           68.651535
return_std_test                         3.029917
average_reward_test                     4.769188
round_time_test           0 days 00:00:12.961935
round_time_total          0 days 00:10:28.895901
loss_total             11832893865850046513152.0
loss_critic            14791117074411110268928.0
loss_actor                  -283855991357.440002
memory_size                               1773.0 

=== epoch 3/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                     697
episode_length                         14.332855
returns                                68.357666
return_std                              4.274317
average_reward                          4.769318
round_time                0 days 00:10:25.354421
episodes_test                              707.0
episode_length_test                    14.142857
returns_test                           67.475064
return_std_test                         2.882443
average_reward_test                     4.770979
round_time_test           0 days 00:00:12.727859
round_time_total          0 days 00:10:25.355882
loss_total             12280346398398833754112.0
loss_critic            15350432724264125726720.0
loss_actor                  -290793413779.455994
memory_size                               1773.0 

=== epoch 3/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:18,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                     693
episode_length                         14.395382
returns                                68.625178
return_std                              4.334287
average_reward                           4.76713
round_time                0 days 00:10:24.794784
episodes_test                              695.0
episode_length_test                     14.38705
returns_test                           68.612353
return_std_test                         2.897174
average_reward_test                     4.769048
round_time_test           0 days 00:00:12.960773
round_time_total          0 days 00:10:24.795898
loss_total             12695637801926217695232.0
loss_critic            15869546991480469454848.0
loss_actor                  -296111516483.583984
memory_size                               1773.0 

=== epoch 3/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:25,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     692
episode_length                         14.443642
returns                                68.801146
return_std                              3.237282
average_reward                          4.763449
round_time                0 days 00:10:26.857419
episodes_test                              685.0
episode_length_test                    14.581022
returns_test                           69.416328
return_std_test                         4.226272
average_reward_test                     4.760764
round_time_test           0 days 00:00:13.451060
round_time_total          0 days 00:10:26.858585
loss_total             13402238442157820084224.0
loss_critic            16752797771222296297472.0
loss_actor                  -302462329643.007996
memory_size                               1773.0 

=== epoch 3/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     688
episode_length                         14.518895
returns                                69.086367
return_std                              3.559413
average_reward                           4.75841
round_time                0 days 00:10:29.819472
episodes_test                              684.0
episode_length_test                    14.612573
returns_test                           69.485274
return_std_test                         4.592887
average_reward_test                     4.755202
round_time_test           0 days 00:00:12.933523
round_time_total          0 days 00:10:29.820548
loss_total             13415855774533540642816.0
loss_critic            16769819446332467707904.0
loss_actor                  -304939256733.695984
memory_size                               1773.0 

=== epoch 3/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:51,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     686
episode_length                         14.552478
returns                                69.232629
return_std                              3.506432
average_reward                          4.757446
round_time                0 days 00:10:30.280881
episodes_test                              680.0
episode_length_test                    14.695588
returns_test                           69.918321
return_std_test                         3.430387
average_reward_test                     4.757804
round_time_test           0 days 00:00:12.726321
round_time_total          0 days 00:10:30.282227
loss_total             13911412824769692696576.0
loss_critic            17389265742309525291008.0
loss_actor                  -309874434244.607971
memory_size                               1773.0 

=== epoch 3/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     685
episode_length                         14.591241
returns                                69.342199
return_std                               3.86438
average_reward                          4.752338
round_time                0 days 00:10:27.773734
episodes_test                              692.0
episode_length_test                    14.436416
returns_test                           68.800075
return_std_test                         7.241522
average_reward_test                     4.765764
round_time_test           0 days 00:00:12.967055
round_time_total          0 days 00:10:27.774803
loss_total             14463228131671063658496.0
loss_critic            18079034859258847952896.0
loss_actor                  -317695992561.664001
memory_size                               1773.0 

=== epoch 3/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:31,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     682
episode_length                         14.643695
returns                                69.500492
return_std                              4.009265
average_reward                          4.746134
round_time                0 days 00:10:26.425972
episodes_test                              680.0
episode_length_test                    14.694118
returns_test                           69.522251
return_std_test                         4.193387
average_reward_test                     4.731344
round_time_test           0 days 00:00:13.056722
round_time_total          0 days 00:10:26.427388
loss_total             14881482718824198307840.0
loss_critic            18601853111144296742912.0
loss_actor                  -322212435312.640015
memory_size                               1773.0 

=== epoch 3/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                     683
episode_length                         14.620791
returns                                69.375702
return_std                              3.937497
average_reward                          4.745033
round_time                0 days 00:10:24.093802
episodes_test                              674.0
episode_length_test                    14.832344
returns_test                           70.154577
return_std_test                         4.568594
average_reward_test                     4.729877
round_time_test           0 days 00:00:12.769760
round_time_total          0 days 00:10:24.095154
loss_total             15664568792323742236672.0
loss_critic            19580710671141685952512.0
loss_actor                  -330522730348.544006
memory_size                               1773.0 

=== epoch 3/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                     683
episode_length                         14.620791
returns                                 69.36708
return_std                              4.058779
average_reward                          4.744456
round_time                0 days 00:10:25.236665
episodes_test                              673.0
episode_length_test                    14.858841
returns_test                           70.292271
return_std_test                         7.123327
average_reward_test                      4.73067
round_time_test           0 days 00:00:13.132256
round_time_total          0 days 00:10:25.238124
loss_total             16279871021446256918528.0
loss_critic            20349838442485916893184.0
loss_actor                  -335764690108.416016
memory_size                               1773.0 

=== epoch 3/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     680
episode_length                         14.685294
returns                                69.584434
return_std                              4.258688
average_reward                          4.738371
round_time                0 days 00:10:28.330114
episodes_test                              679.0
episode_length_test                    14.709867
returns_test                           69.737911
return_std_test                         4.077599
average_reward_test                     4.740921
round_time_test           0 days 00:00:13.114505
round_time_total          0 days 00:10:28.331197
loss_total             16516890235526098976768.0
loss_critic            20646112441226898702336.0
loss_actor                  -340301060194.304016
memory_size                               1773.0 

=== epoch 3/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.18it/s]
episodes                                     678
episode_length                         14.728614
returns                                69.787399
return_std                              4.227515
average_reward                          4.738271
round_time                0 days 00:10:28.539043
episodes_test                              678.0
episode_length_test                    14.737463
returns_test                           69.873374
return_std_test                         3.825835
average_reward_test                     4.741243
round_time_test           0 days 00:00:13.150908
round_time_total          0 days 00:10:28.540105
loss_total             17470086157057090650112.0
loss_critic            21837607340255517605888.0
loss_actor                  -348654044889.088013
memory_size                               1773.0 

=== epoch 3/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     681
episode_length                         14.681351
returns                                69.642977
return_std                              4.031522
average_reward                          4.743635
round_time                0 days 00:10:26.727289
episodes_test                              681.0
episode_length_test                    14.681351
returns_test                            69.72927
return_std_test                         3.430365
average_reward_test                     4.749553
round_time_test           0 days 00:00:13.074832
round_time_total          0 days 00:10:26.728718
loss_total             18439494233635556425728.0
loss_critic            23049367408534787981312.0
loss_actor                  -356055373004.799988
memory_size                               1773.0 

=== epoch 3/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     681
episode_length                         14.657856
returns                                69.575389
return_std                              4.025819
average_reward                          4.746669
round_time                0 days 00:10:26.105751
episodes_test                              691.0
episode_length_test                    14.468886
returns_test                           68.912721
return_std_test                         3.346881
average_reward_test                     4.762842
round_time_test           0 days 00:00:12.798065
round_time_total          0 days 00:10:26.106915
loss_total             19263690657195538513920.0
loss_critic            24079612903926293069824.0
loss_actor                  -363831705092.096008
memory_size                               1773.0 

=== epoch 3/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     684
episode_length                         14.614035
returns                                69.407394
return_std                              3.634456
average_reward                          4.749366
round_time                0 days 00:10:29.521385
episodes_test                              685.0
episode_length_test                     14.59708
returns_test                           69.498009
return_std_test                         3.751206
average_reward_test                     4.761102
round_time_test           0 days 00:00:13.509381
round_time_total          0 days 00:10:29.522550
loss_total             19617352835348949893120.0
loss_critic            24521690610714720337920.0
loss_actor                   -369361301012.47998
memory_size                               1773.0 

=== epoch 3/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:14,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     685
episode_length                         14.586861
returns                                69.309829
return_std                               3.44542
average_reward                          4.751268
round_time                0 days 00:10:29.907679
episodes_test                              687.0
episode_length_test                     14.54294
returns_test                           69.042428
return_std_test                         3.781974
average_reward_test                     4.747506
round_time_test           0 days 00:00:13.089572
round_time_total          0 days 00:10:29.908738
loss_total             20142143902843442036736.0
loss_critic            25177679458875111112704.0
loss_actor                  -374322554290.176025
memory_size                               1773.0 

=== epoch 3/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:45,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     687
episode_length                          14.54294
returns                                69.108846
return_std                               3.41073
average_reward                          4.752101
round_time                0 days 00:10:29.207514
episodes_test                              688.0
episode_length_test                    14.521802
returns_test                           68.999614
return_std_test                         3.913514
average_reward_test                     4.751468
round_time_test           0 days 00:00:13.256431
round_time_total          0 days 00:10:29.208865
loss_total             20904785424497046978560.0
loss_critic            26130981313795059613696.0
loss_actor                  -382755226386.432007
memory_size                               1773.0 

=== epoch 3/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                     687
episode_length                         14.540029
returns                                69.087618
return_std                               3.69984
average_reward                          4.751568
round_time                0 days 00:10:25.578500
episodes_test                              689.0
episode_length_test                    14.509434
returns_test                           68.922082
return_std_test                         4.198815
average_reward_test                     4.750181
round_time_test           0 days 00:00:13.101035
round_time_total          0 days 00:10:25.579884
loss_total             22138165284840715321344.0
loss_critic            27672706116636278521856.0
loss_actor                  -391474401492.992004
memory_size                               1773.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                     137
episode_length                          14.59854
returns                                69.467561
return_std                              3.418937
average_reward                          4.758528
round_time                0 days 00:10:24.885896
episodes_test                              690.0
episode_length_test                    14.492754
returns_test                           68.984874
return_std_test                         3.329573
average_reward_test                     4.759956
round_time_test           0 days 00:00:12.747546
round_time_total          0 days 00:10:24.887186
loss_total             23519775362874710425600.0
loss_critic            29399718652043170152448.0
loss_actor                  -401846016688.127991
memory_size                               1773.0 

=== epoch 4/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                     274
episode_length                         14.583942
returns                                69.398354
return_std                              3.340279
average_reward                          4.758649
round_time                0 days 00:10:23.010502
episodes_test                              691.0
episode_length_test                     14.46165
returns_test                           68.941564
return_std_test                         2.953873
average_reward_test                     4.767217
round_time_test           0 days 00:00:12.765264
round_time_total          0 days 00:10:23.011783
loss_total             24463889260939376066560.0
loss_critic            30579861029127603617792.0
loss_actor                   -408844558893.05603
memory_size                               1773.0 

=== epoch 4/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:31,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     412
episode_length                         14.553398
returns                                69.243604
return_std                               3.52096
average_reward                          4.757951
round_time                0 days 00:10:25.538354
episodes_test                              685.0
episode_length_test                    14.592701
returns_test                           69.511363
return_std_test                         3.211007
average_reward_test                     4.763465
round_time_test           0 days 00:00:13.044730
round_time_total          0 days 00:10:25.539441
loss_total             25150788482388581154816.0
loss_critic            31438485048902232834048.0
loss_actor                  -417315581214.719971
memory_size                               1773.0 

=== epoch 4/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:44,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     550
episode_length                         14.543636
returns                                69.193211
return_std                              3.463101
average_reward                          4.757647
round_time                0 days 00:10:26.228448
episodes_test                              694.0
episode_length_test                    14.407781
returns_test                           68.582372
return_std_test                         3.479434
average_reward_test                     4.760108
round_time_test           0 days 00:00:12.947650
round_time_total          0 days 00:10:26.229878
loss_total             26324469761260405129216.0
loss_critic            32905586636936701804544.0
loss_actor                  -425945997049.856018
memory_size                               1773.0 

=== epoch 4/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:59,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:23<00:00,  3.21it/s]
episodes                                     685
episode_length                         14.581022
returns                                69.324433
return_std                              3.533946
average_reward                          4.754452
round_time                0 days 00:10:23.769076
episodes_test                              682.0
episode_length_test                    14.645161
returns_test                           69.611549
return_std_test                         3.326106
average_reward_test                     4.753238
round_time_test           0 days 00:00:13.360252
round_time_total          0 days 00:10:23.770367
loss_total             27015386528532446314496.0
loss_critic            33769232612211564740608.0
loss_actor                  -433811352305.664001
memory_size                               1773.0 

=== epoch 4/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     682
episode_length                         14.658358
returns                                69.656081
return_std                              6.618683
average_reward                          4.752004
round_time                0 days 00:10:27.961197
episodes_test                              684.0
episode_length_test                    14.616959
returns_test                            69.44122
return_std_test                          3.57499
average_reward_test                     4.750761
round_time_test           0 days 00:00:12.774396
round_time_total          0 days 00:10:27.962295
loss_total             28592536696708055171072.0
loss_critic            35740670271765583233024.0
loss_actor                  -441597324607.487976
memory_size                            1784.0495 

=== epoch 4/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:47,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     681
episode_length                         14.654919
returns                                69.630239
return_std                              6.610832
average_reward                          4.750845
round_time                0 days 00:10:27.224421
episodes_test                              687.0
episode_length_test                    14.547307
returns_test                           69.125786
return_std_test                          3.58687
average_reward_test                     4.751826
round_time_test           0 days 00:00:13.172464
round_time_total          0 days 00:10:27.225743
loss_total             29947145252617590931456.0
loss_critic            37433930942867864289280.0
loss_actor                  -449597430333.440002
memory_size                               1787.0 

=== epoch 4/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:23,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     682
episode_length                         14.631965
returns                                69.542254
return_std                              6.524005
average_reward                          4.752798
round_time                0 days 00:10:30.035329
episodes_test                              679.0
episode_length_test                    14.727541
returns_test                           70.213388
return_std_test                          7.77612
average_reward_test                     4.767489
round_time_test           0 days 00:00:13.196300
round_time_total          0 days 00:10:30.036489
loss_total             30753433130836684701696.0
loss_critic            38441790781212321644544.0
loss_actor                  -455425077428.223999
memory_size                               1787.0 

=== epoch 4/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     683
episode_length                         14.619327
returns                                69.496077
return_std                              6.509216
average_reward                          4.753724
round_time                0 days 00:10:28.091233
episodes_test                              693.0
episode_length_test                    14.412698
returns_test                           68.658124
return_std_test                         3.383491
average_reward_test                     4.763749
round_time_test           0 days 00:00:13.063544
round_time_total          0 days 00:10:28.092559
loss_total             31560683310639604563968.0
loss_critic            39450853521447096156160.0
loss_actor                  -463824115138.559998
memory_size                               1787.0 

=== epoch 4/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     684
episode_length                         14.612573
returns                                69.477444
return_std                               6.51565
average_reward                          4.754669
round_time                0 days 00:10:26.358395
episodes_test                              688.0
episode_length_test                    14.520349
returns_test                           69.132235
return_std_test                         3.834124
average_reward_test                     4.761077
round_time_test           0 days 00:00:12.888542
round_time_total          0 days 00:10:26.359548
loss_total             33216959853585418420224.0
loss_critic            41521199161145076744192.0
loss_actor                  -472426144038.911987
memory_size                               1787.0 

=== epoch 4/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:25,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     686
episode_length                         14.556851
returns                                69.229576
return_std                              3.394536
average_reward                          4.755823
round_time                0 days 00:10:27.443965
episodes_test                              684.0
episode_length_test                    14.609649
returns_test                            69.52321
return_std_test                         3.427981
average_reward_test                     4.758737
round_time_test           0 days 00:00:13.131993
round_time_total          0 days 00:10:27.445452
loss_total             34298570419877208653824.0
loss_critic            42873212293152310820864.0
loss_actor                  -480209168089.088013
memory_size                               1787.0 

=== epoch 4/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     685
episode_length                         14.583942
returns                                69.328709
return_std                              3.413577
average_reward                          4.753788
round_time                0 days 00:10:27.360830
episodes_test                              683.0
episode_length_test                    14.641288
returns_test                           69.708309
return_std_test                         3.363706
average_reward_test                     4.761077
round_time_test           0 days 00:00:13.018389
round_time_total          0 days 00:10:27.361918
loss_total             35106465063660905562112.0
loss_critic            43883080586622935236608.0
loss_actor                  -488667904278.528015
memory_size                               1787.0 

=== epoch 4/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:24,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     681
episode_length                         14.665198
returns                                 69.63671
return_std                              3.665677
average_reward                          4.748451
round_time                0 days 00:10:28.392855
episodes_test                              676.0
episode_length_test                    14.789941
returns_test                            70.10739
return_std_test                         4.116022
average_reward_test                      4.74024
round_time_test           0 days 00:00:13.539035
round_time_total          0 days 00:10:28.393948
loss_total             36234877570393938329600.0
loss_critic            45293596187106647998464.0
loss_actor                  -497569956806.656006
memory_size                               1787.0 

=== epoch 4/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:40,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     676
episode_length                         14.773669
returns                                70.056909
return_std                              4.729778
average_reward                           4.74203
round_time                0 days 00:10:26.708240
episodes_test                              682.0
episode_length_test                    14.655425
returns_test                           69.589684
return_std_test                         3.576883
average_reward_test                     4.748426
round_time_test           0 days 00:00:13.445985
round_time_total          0 days 00:10:26.709612
loss_total             37574870306765339099136.0
loss_critic            46968587044801986166784.0
loss_actor                  -506758759088.127991
memory_size                               1787.0 

=== epoch 4/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:03,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     677
episode_length                         14.750369
returns                                69.984258
return_std                              4.813002
average_reward                          4.744605
round_time                0 days 00:10:28.322967
episodes_test                              680.0
episode_length_test                    14.697059
returns_test                           69.823584
return_std_test                         3.362975
average_reward_test                     4.750889
round_time_test           0 days 00:00:13.345966
round_time_total          0 days 00:10:28.324047
loss_total             38707409027802513539072.0
loss_critic            48384260369115049361408.0
loss_actor                  -513150792876.031982
memory_size                               1787.0 

=== epoch 4/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     678
episode_length                         14.718289
returns                                69.876681
return_std                              4.765986
average_reward                          4.747658
round_time                0 days 00:10:28.830251
episodes_test                              687.0
episode_length_test                    14.540029
returns_test                           69.255567
return_std_test                         3.204431
average_reward_test                     4.763097
round_time_test           0 days 00:00:12.917712
round_time_total          0 days 00:10:28.831452
loss_total             40201303106480964108288.0
loss_critic            50251627987166356832256.0
loss_actor                  -524262825787.392029
memory_size                               1787.0 

=== epoch 4/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     679
episode_length                         14.698085
returns                                69.824446
return_std                              4.786742
average_reward                          4.750526
round_time                0 days 00:10:29.737601
episodes_test                              688.0
episode_length_test                    14.527616
returns_test                           69.220834
return_std_test                         3.023547
average_reward_test                     4.764794
round_time_test           0 days 00:00:13.001991
round_time_total          0 days 00:10:29.739114
loss_total             41460585577425965940736.0
loss_critic            51825730981553493770240.0
loss_actor                  -531861675491.328003
memory_size                               1787.0 

=== epoch 4/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     682
episode_length                         14.653959
returns                                69.682921
return_std                              4.577847
average_reward                          4.755217
round_time                0 days 00:10:29.301110
episodes_test                              690.0
episode_length_test                    14.473913
returns_test                           68.964696
return_std_test                         3.064139
average_reward_test                     4.764761
round_time_test           0 days 00:00:13.098756
round_time_total          0 days 00:10:29.302210
loss_total             42829064592498603589632.0
loss_critic            53536329751238717472768.0
loss_actor                  -539978692149.247986
memory_size                               1787.0 

=== epoch 4/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     686
episode_length                         14.567055
returns                                69.371712
return_std                              3.502497
average_reward                          4.762259
round_time                0 days 00:10:28.051446
episodes_test                              688.0
episode_length_test                    14.523256
returns_test                           69.205542
return_std_test                         3.159536
average_reward_test                     4.765164
round_time_test           0 days 00:00:13.117107
round_time_total          0 days 00:10:28.052521
loss_total             44753942820881168334848.0
loss_critic            55942427508850890375168.0
loss_actor                  -548967783137.280029
memory_size                               1787.0 

=== epoch 4/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:04,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     686
episode_length                         14.555394
returns                                69.306571
return_std                              3.366684
average_reward                            4.7616
round_time                0 days 00:10:28.748599
episodes_test                              688.0
episode_length_test                    14.527616
returns_test                           69.163301
return_std_test                         3.270095
average_reward_test                     4.760846
round_time_test           0 days 00:00:13.077748
round_time_total          0 days 00:10:28.749821
loss_total             45775735337869377536000.0
loss_critic            57219668154241727332352.0
loss_actor                  -559092924317.696045
memory_size                               1787.0 

=== epoch 4/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:44,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     687
episode_length                         14.538574
returns                                69.217617
return_std                              3.408169
average_reward                          4.761023
round_time                0 days 00:10:28.178862
episodes_test                              693.0
episode_length_test                    14.422799
returns_test                           68.736594
return_std_test                         3.116489
average_reward_test                     4.765838
round_time_test           0 days 00:00:13.142895
round_time_total          0 days 00:10:28.180312
loss_total             47867656759048820228096.0
loss_critic            59834569883991186341888.0
loss_actor                  -567247941189.631958
memory_size                               1787.0 

=== epoch 4/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:23,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     687
episode_length                         14.532751
returns                                69.190346
return_std                              3.467688
average_reward                          4.761024
round_time                0 days 00:10:30.391943
episodes_test                              690.0
episode_length_test                    14.492754
returns_test                           69.029828
return_std_test                         3.345613
average_reward_test                     4.763058
round_time_test           0 days 00:00:13.209725
round_time_total          0 days 00:10:30.393037
loss_total             48321826943836908158976.0
loss_critic            60402282631583326273536.0
loss_actor                  -576437120237.567993
memory_size                               1787.0 

=== epoch 4/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     689
episode_length                         14.496372
returns                                69.033732
return_std                              3.581612
average_reward                          4.762156
round_time                0 days 00:10:29.575295
episodes_test                              695.0
episode_length_test                    14.381295
returns_test                           68.545718
return_std_test                         3.137289
average_reward_test                     4.766335
round_time_test           0 days 00:00:12.928766
round_time_total          0 days 00:10:29.576382
loss_total             50357334921626326663168.0
loss_critic            62946667565820971843584.0
loss_actor                  -585608564654.079956
memory_size                               1787.0 

=== epoch 4/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     689
episode_length                         14.499274
returns                                69.041545
return_std                              3.868474
average_reward                          4.761737
round_time                0 days 00:10:28.181629
episodes_test                              694.0
episode_length_test                    14.396254
returns_test                           68.574855
return_std_test                         3.448954
average_reward_test                     4.763401
round_time_test           0 days 00:00:13.428336
round_time_total          0 days 00:10:28.182713
loss_total             52348583185232108191744.0
loss_critic            65435727934734691467264.0
loss_actor                  -596584328757.248047
memory_size                               1787.0 

=== epoch 4/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     689
episode_length                         14.481858
returns                                68.989682
return_std                              3.768364
average_reward                          4.763878
round_time                0 days 00:10:29.692648
episodes_test                              690.0
episode_length_test                    14.475362
returns_test                           68.960541
return_std_test                         4.572883
average_reward_test                      4.76402
round_time_test           0 days 00:00:13.438864
round_time_total          0 days 00:10:29.694027
loss_total             53620756543577713016832.0
loss_critic            67025944542876180414464.0
loss_actor                  -605992686092.287964
memory_size                               1787.0 

=== epoch 4/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     690
episode_length                          14.45942
returns                                68.883969
return_std                              3.636598
average_reward                          4.763622
round_time                0 days 00:10:29.825822
episodes_test                              692.0
episode_length_test                    14.445087
returns_test                           68.810084
return_std_test                         3.368051
average_reward_test                     4.763578
round_time_test           0 days 00:00:12.785150
round_time_total          0 days 00:10:29.826936
loss_total             55316999192108460933120.0
loss_critic            69146247841436200337408.0
loss_actor                  -613431353475.072021
memory_size                               1787.0 

=== epoch 4/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<14:26,  2.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     693
episode_length                         14.419913
returns                                 68.70964
return_std                              3.489419
average_reward                          4.764898
round_time                0 days 00:10:28.627012
episodes_test                              694.0
episode_length_test                    14.394813
returns_test                           68.597373
return_std_test                         3.104212
average_reward_test                     4.765437
round_time_test           0 days 00:00:12.891530
round_time_total          0 days 00:10:28.628328
loss_total             57718277036395747868672.0
loss_critic            72147845113862726615040.0
loss_actor                  -623926385950.719971
memory_size                               1787.0 

=== epoch 4/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     692
episode_length                         14.436416
returns                                68.784038
return_std                              3.569437
average_reward                          4.764647
round_time                0 days 00:10:26.635457
episodes_test                              691.0
episode_length_test                     14.47178
returns_test                           68.931274
return_std_test                         3.217217
average_reward_test                     4.763151
round_time_test           0 days 00:00:12.702871
round_time_total          0 days 00:10:26.636543
loss_total             58347479648849086119936.0
loss_critic            72934348368733357998080.0
loss_actor                  -632729513361.407959
memory_size                               1787.0 

=== epoch 4/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                     693
episode_length                          14.40404
returns                                68.656726
return_std                              3.042928
average_reward                          4.766486
round_time                0 days 00:10:29.966180
episodes_test                              690.0
episode_length_test                    14.484058
returns_test                           69.004555
return_std_test                          3.19835
average_reward_test                     4.764202
round_time_test           0 days 00:00:13.499089
round_time_total          0 days 00:10:29.967487
loss_total             59964863577727944359936.0
loss_critic            74956078283772571680768.0
loss_actor                   -642440813543.42395
memory_size                               1787.0 

=== epoch 4/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     692
episode_length                         14.424855
returns                                 68.73894
return_std                              3.011353
average_reward                          4.765339
round_time                0 days 00:10:28.997929
episodes_test                              686.0
episode_length_test                    14.569971
returns_test                           69.443356
return_std_test                         2.929736
average_reward_test                     4.766214
round_time_test           0 days 00:00:13.041966
round_time_total          0 days 00:10:28.999344
loss_total             61730764434145449869312.0
loss_critic            77163454277733257838592.0
loss_actor                  -654094359101.439941
memory_size                               1787.0 

=== epoch 4/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     692
episode_length                         14.442197
returns                                68.824537
return_std                              3.102219
average_reward                          4.765536
round_time                0 days 00:10:27.595886
episodes_test                              688.0
episode_length_test                    14.521802
returns_test                           69.186054
return_std_test                         3.398453
average_reward_test                     4.764305
round_time_test           0 days 00:00:13.044605
round_time_total          0 days 00:10:27.597268
loss_total             64253125758524833398784.0
loss_critic            80316405810765884293120.0
loss_actor                  -663756159057.920044
memory_size                               1787.0 

=== epoch 4/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:06,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     691
episode_length                         14.465991
returns                                 68.94323
return_std                              3.058461
average_reward                          4.765908
round_time                0 days 00:10:28.222026
episodes_test                              691.0
episode_length_test                    14.463097
returns_test                           68.909227
return_std_test                         3.129261
average_reward_test                     4.764513
round_time_test           0 days 00:00:13.113263
round_time_total          0 days 00:10:28.223173
loss_total             66307112374880524304384.0
loss_critic            82883889035330073919488.0
loss_actor                  -674652087287.807983
memory_size                               1787.0 

=== epoch 4/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                     690
episode_length                         14.478261
returns                                68.996715
return_std                              2.884963
average_reward                          4.765556
round_time                0 days 00:10:28.288169
episodes_test                              690.0
episode_length_test                    14.478261
returns_test                           68.954754
return_std_test                         3.363057
average_reward_test                     4.762662
round_time_test           0 days 00:00:13.189519
round_time_total          0 days 00:10:28.289252
loss_total             69687461390806490808320.0
loss_critic            87109325214321116971008.0
loss_actor                  -685268361183.232056
memory_size                               1787.0 

=== epoch 4/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:51,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     689
episode_length                         14.487663
returns                                 69.02436
return_std                              2.971611
average_reward                          4.764366
round_time                0 days 00:10:27.282306
episodes_test                              692.0
episode_length_test                    14.439306
returns_test                           68.807793
return_std_test                         3.213927
average_reward_test                     4.765344
round_time_test           0 days 00:00:12.909663
round_time_total          0 days 00:10:27.283401
loss_total             71663457466676638908416.0
loss_critic            89579320392475404664832.0
loss_actor                  -697641656582.144043
memory_size                               1787.0 

=== epoch 4/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                     692
episode_length                         14.446532
returns                                68.846573
return_std                              2.951617
average_reward                          4.765644
round_time                0 days 00:10:29.232847
episodes_test                              694.0
episode_length_test                    14.399135
returns_test                           68.637672
return_std_test                         2.957378
average_reward_test                     4.766813
round_time_test           0 days 00:00:13.079061
round_time_total          0 days 00:10:29.233916
loss_total             73723019614806301736960.0
loss_critic            92153772999387436285952.0
loss_actor                  -709297089740.800049
memory_size                               1787.0 

=== epoch 4/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                     692
episode_length                         14.436416
returns                                68.800538
return_std                              2.842019
average_reward                          4.765763
round_time                0 days 00:10:26.079297
episodes_test                              691.0
episode_length_test                     14.47178
returns_test                            68.95458
return_std_test                         3.222792
average_reward_test                     4.764762
round_time_test           0 days 00:00:13.074796
round_time_total          0 days 00:10:26.080486
loss_total             76588412790942308761600.0
loss_critic            95735514212007839531008.0
loss_actor                  -721599967657.984009
memory_size                               1787.0 

=== epoch 4/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:06,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                     692
episode_length                          14.42052
returns                                68.724631
return_std                              2.925206
average_reward                          4.765749
round_time                0 days 00:10:27.413068
episodes_test                              696.0
episode_length_test                    14.357759
returns_test                           68.434184
return_std_test                         2.940561
average_reward_test                     4.766367
round_time_test           0 days 00:00:13.013044
round_time_total          0 days 00:10:27.414483
loss_total             78662167885804744998912.0
loss_critic            98327708031327760547840.0
loss_actor                  -734252610125.823975
memory_size                               1787.0 

=== epoch 4/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      692
episode_length                          14.436416
returns                                 68.790779
return_std                               3.107644
average_reward                           4.765069
round_time                 0 days 00:10:28.718236
episodes_test                               692.0
episode_length_test                     14.432081
returns_test                            68.781128
return_std_test                          3.187667
average_reward_test                      4.765853
round_time_test            0 days 00:00:13.133962
round_time_total           0 days 00:10:28.719296
loss_total              81059288512445097705472.0
loss_critic            101324108877115647066112.0
loss_actor                    -742687306776.57605
memory_size                                1787.0 

=== epoch 4/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:51,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                      692
episode_length                          14.445087
returns                                 68.823083
return_std                               3.367127
average_reward                           4.764457
round_time                 0 days 00:10:31.055830
episodes_test                               691.0
episode_length_test                     14.455861
returns_test                            68.922017
return_std_test                          3.247952
average_reward_test                      4.767764
round_time_test            0 days 00:00:12.899138
round_time_total           0 days 00:10:31.056903
loss_total              83578072692942650212352.0
loss_critic            104472589058546023268352.0
loss_actor                   -755959514234.880005
memory_size                                1787.0 

=== epoch 4/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      690
episode_length                           14.45942
returns                                 68.892932
return_std                               3.388366
average_reward                           4.764558
round_time                 0 days 00:10:28.981129
episodes_test                               694.0
episode_length_test                     14.397695
returns_test                            68.654041
return_std_test                          2.901949
average_reward_test                      4.768431
round_time_test            0 days 00:00:13.087042
round_time_total           0 days 00:10:28.982283
loss_total              86650506151662979121152.0
loss_critic            108313130769919367970816.0
loss_actor                   -766442519199.744019
memory_size                                1787.0 

=== epoch 4/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:01,  2.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      691
episode_length                          14.444284
returns                                 68.839019
return_std                                3.37164
average_reward                            4.76584
round_time                 0 days 00:10:29.008099
episodes_test                               689.0
episode_length_test                     14.507983
returns_test                            69.167087
return_std_test                          2.786879
average_reward_test                      4.767541
round_time_test            0 days 00:00:13.207996
round_time_total           0 days 00:10:29.009170
loss_total              88923101933354562355200.0
loss_critic            111153875482960103211008.0
loss_actor                   -776278724083.712036
memory_size                                1787.0 

=== epoch 4/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      692
episode_length                          14.427746
returns                                 68.769667
return_std                               3.301112
average_reward                           4.766487
round_time                 0 days 00:10:27.119967
episodes_test                               697.0
episode_length_test                     14.345768
returns_test                            68.376001
return_std_test                          2.772111
average_reward_test                      4.766297
round_time_test            0 days 00:00:12.934804
round_time_total           0 days 00:10:27.121106
loss_total              90161622754848279625728.0
loss_critic            112702026335875730767872.0
loss_actor                   -790898745507.839966
memory_size                                1787.0 

=== epoch 4/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      695
episode_length                           14.37554
returns                                 68.542774
return_std                               3.046964
average_reward                           4.768033
round_time                 0 days 00:10:27.781870
episodes_test                               693.0
episode_length_test                     14.430014
returns_test                            68.726484
return_std_test                          3.811584
average_reward_test                      4.762745
round_time_test            0 days 00:00:13.086472
round_time_total           0 days 00:10:27.782930
loss_total              95396649059101261692928.0
loss_critic            119245809221258505617408.0
loss_actor                   -803140830494.719971
memory_size                                1787.0 

=== epoch 4/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:20,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      697
episode_length                          14.341463
returns                                 68.397848
return_std                               3.200743
average_reward                           4.769262
round_time                 0 days 00:10:29.225614
episodes_test                               697.0
episode_length_test                     14.341463
returns_test                            68.355145
return_std_test                          3.017411
average_reward_test                       4.76628
round_time_test            0 days 00:00:13.226864
round_time_total           0 days 00:10:29.226731
loss_total              97227837781913437732864.0
loss_critic            121534795162491352514560.0
loss_actor                   -816553589866.495972
memory_size                                1787.0 

=== epoch 4/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:51,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      698
episode_length                          14.316619
returns                                 68.267912
return_std                               3.285268
average_reward                           4.768457
round_time                 0 days 00:10:29.617095
episodes_test                               697.0
episode_length_test                     14.341463
returns_test                             68.34832
return_std_test                          2.833208
average_reward_test                      4.765805
round_time_test            0 days 00:00:12.975432
round_time_total           0 days 00:10:29.618176
loss_total              98858008767703482892288.0
loss_critic            123572508858137180110848.0
loss_actor                   -825942452338.687988
memory_size                                1787.0 

=== epoch 4/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.312321
returns                                 68.246344
return_std                                3.34936
average_reward                           4.768348
round_time                 0 days 00:10:26.603624
episodes_test                               699.0
episode_length_test                     14.291845
returns_test                            68.097463
return_std_test                          3.645693
average_reward_test                        4.7648
round_time_test            0 days 00:00:12.965198
round_time_total           0 days 00:10:26.605002
loss_total             100551963372576644792320.0
loss_critic            125689952110850934308864.0
loss_actor                    -833230949908.47998
memory_size                                1787.0 

=== epoch 4/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      698
episode_length                          14.310888
returns                                  68.22916
return_std                               3.756276
average_reward                           4.767648
round_time                 0 days 00:10:28.567825
episodes_test                               694.0
episode_length_test                     14.396254
returns_test                            68.632033
return_std_test                          3.323607
average_reward_test                      4.767357
round_time_test            0 days 00:00:13.014861
round_time_total           0 days 00:10:28.568935
loss_total             105320570220094951522304.0
loss_critic            131650710623523952394240.0
loss_actor                   -848613446942.719971
memory_size                                1787.0 

=== epoch 4/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                      698
episode_length                          14.315186
returns                                 68.244452
return_std                               3.837076
average_reward                           4.767293
round_time                 0 days 00:10:25.348726
episodes_test                               698.0
episode_length_test                     14.308023
returns_test                            68.239636
return_std_test                          2.646316
average_reward_test                      4.769339
round_time_test            0 days 00:00:13.239307
round_time_total           0 days 00:10:25.350148
loss_total             108889034431452497838080.0
loss_critic            136111290642274707308544.0
loss_actor                   -864259376381.952026
memory_size                                1787.0 

=== epoch 4/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      697
episode_length                          14.311334
returns                                 68.230062
return_std                               3.563141
average_reward                           4.767145
round_time                 0 days 00:10:28.499146
episodes_test                               697.0
episode_length_test                     14.347202
returns_test                            68.381399
return_std_test                          3.199333
average_reward_test                      4.766184
round_time_test            0 days 00:00:12.903413
round_time_total           0 days 00:10:28.500215
loss_total             113195548984340876623872.0
loss_critic            141494433929649650139136.0
loss_actor                   -878386843942.911987
memory_size                                1787.0 

=== epoch 4/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      698
episode_length                          14.310888
returns                                 68.241521
return_std                               3.455663
average_reward                           4.768476
round_time                 0 days 00:10:28.425992
episodes_test                               697.0
episode_length_test                     14.335725
returns_test                            68.338255
return_std_test                          2.764426
average_reward_test                      4.766993
round_time_test            0 days 00:00:13.021102
round_time_total           0 days 00:10:28.427054
loss_total             116484879394987856363520.0
loss_critic            145606096954780310044672.0
loss_actor                   -891192011718.656006
memory_size                                1787.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  0%|          | 5/2000 [00:01<12:42,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      140
episode_length                          14.285714
returns                                 68.079817
return_std                                2.82877
average_reward                           4.765587
round_time                 0 days 00:10:27.149129
episodes_test                               699.0
episode_length_test                     14.287554
returns_test                            68.136327
return_std_test                          2.738941
average_reward_test                      4.768927
round_time_test            0 days 00:00:13.207291
round_time_total           0 days 00:10:27.150287
loss_total             121184889998048246104064.0
loss_critic            151481110173139953778688.0
loss_actor                   -904193985609.728027
memory_size                                1787.0 

=== epoch 5/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:14,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      280
episode_length                          14.267857
returns                                 68.004729
return_std                               2.712754
average_reward                           4.766326
round_time                 0 days 00:10:28.758280
episodes_test                               700.0
episode_length_test                         14.28
returns_test                            68.094591
return_std_test                          2.938895
average_reward_test                       4.76856
round_time_test            0 days 00:00:12.859231
round_time_total           0 days 00:10:28.759408
loss_total             122984066310550331588608.0
loss_critic            153730080388690131025920.0
loss_actor                   -916399354576.895996
memory_size                                1787.0 

=== epoch 5/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:21,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      419
episode_length                          14.317422
returns                                 68.267723
return_std                               3.380343
average_reward                           4.768178
round_time                 0 days 00:10:27.069428
episodes_test                               699.0
episode_length_test                     14.290415
returns_test                            68.133478
return_std_test                          2.723987
average_reward_test                      4.767781
round_time_test            0 days 00:00:13.254053
round_time_total           0 days 00:10:27.070540
loss_total             127293573620232886419456.0
loss_critic            159116964516786132221952.0
loss_actor                   -932505225527.296021
memory_size                                1787.0 

=== epoch 5/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:29,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      558
episode_length                          14.324373
returns                                 68.306108
return_std                               3.177263
average_reward                           4.768544
round_time                 0 days 00:10:30.271078
episodes_test                               696.0
episode_length_test                     14.353448
returns_test                            68.466911
return_std_test                          2.693696
average_reward_test                      4.770069
round_time_test            0 days 00:00:12.940819
round_time_total           0 days 00:10:30.272300
loss_total             131593182772287437275136.0
loss_critic            164491475874100644151296.0
loss_actor                   -945698708488.192017
memory_size                                1787.0 

=== epoch 5/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                      698
episode_length                          14.312321
returns                                 68.256958
return_std                               3.037291
average_reward                           4.769126
round_time                 0 days 00:10:31.397851
episodes_test                               699.0
episode_length_test                     14.298999
returns_test                            68.193114
return_std_test                          2.904255
average_reward_test                      4.769108
round_time_test            0 days 00:00:12.871352
round_time_total           0 days 00:10:31.399118
loss_total             135676485732096427425792.0
loss_critic            169595604426931973390336.0
loss_actor                   -961003192123.391968
memory_size                                1787.0 

=== epoch 5/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:20,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      699
episode_length                          14.304721
returns                                 68.234818
return_std                               3.017106
average_reward                           4.770106
round_time                 0 days 00:10:26.406231
episodes_test                               698.0
episode_length_test                     14.309456
returns_test                             68.22512
return_std_test                           2.89046
average_reward_test                      4.767837
round_time_test            0 days 00:00:12.925769
round_time_total           0 days 00:10:26.407312
loss_total             139863598930878735056896.0
loss_critic            174829495637179480670208.0
loss_actor                   -975149193494.527954
memory_size                                1787.0 

=== epoch 5/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:09,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.310888
returns                                 68.264984
return_std                               3.044596
average_reward                           4.770121
round_time                 0 days 00:10:28.238656
episodes_test                               699.0
episode_length_test                      14.30186
returns_test                            68.208091
return_std_test                          3.009785
average_reward_test                      4.769198
round_time_test            0 days 00:00:12.833711
round_time_total           0 days 00:10:28.239735
loss_total             145269093743082092888064.0
loss_critic            181586363949771695063040.0
loss_actor                   -988602317209.599976
memory_size                                1787.0 

=== epoch 5/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      699
episode_length                          14.280401
returns                                   68.1087
return_std                               2.650346
average_reward                           4.769438
round_time                 0 days 00:10:27.606393
episodes_test                               700.0
episode_length_test                     14.278571
returns_test                            68.083765
return_std_test                          2.607053
average_reward_test                      4.768269
round_time_test            0 days 00:00:12.980370
round_time_total           0 days 00:10:27.607798
loss_total             149084231200648405712896.0
loss_critic            186355285776796124971008.0
loss_actor                  -1006989209698.303955
memory_size                                1787.0 

=== epoch 5/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      698
episode_length                           14.30086
returns                                 68.206203
return_std                               2.683843
average_reward                           4.769379
round_time                 0 days 00:10:29.176174
episodes_test                               693.0
episode_length_test                     14.419913
returns_test                            68.785449
return_std_test                          2.865227
average_reward_test                      4.770186
round_time_test            0 days 00:00:13.185804
round_time_total           0 days 00:10:29.177337
loss_total             152553251498439768801280.0
loss_critic            190691560883885917077504.0
loss_actor                  -1018851668131.839966
memory_size                                1787.0 

=== epoch 5/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                      697
episode_length                           14.33142
returns                                 68.329123
return_std                               2.754114
average_reward                           4.767795
round_time                 0 days 00:10:30.700465
episodes_test                               695.0
episode_length_test                     14.374101
returns_test                             68.54394
return_std_test                          2.797342
average_reward_test                      4.768571
round_time_test            0 days 00:00:13.238719
round_time_total           0 days 00:10:30.701970
loss_total             159488859546741383364608.0
loss_critic            199361070871079418855424.0
loss_actor                  -1033966666743.807983
memory_size                                1787.0 

=== epoch 5/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:41,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      696
episode_length                          14.347701
returns                                 68.400688
return_std                               2.783271
average_reward                           4.767371
round_time                 0 days 00:10:29.116655
episodes_test                               697.0
episode_length_test                     14.345768
returns_test                            68.410215
return_std_test                          2.902748
average_reward_test                      4.768684
round_time_test            0 days 00:00:13.242459
round_time_total           0 days 00:10:29.117722
loss_total             163646224916243026018304.0
loss_critic            204557777548053588213760.0
loss_actor                  -1051676033286.144043
memory_size                                1787.0 

=== epoch 5/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      696
episode_length                          14.343391
returns                                 68.385354
return_std                               2.728464
average_reward                           4.767728
round_time                 0 days 00:10:30.292878
episodes_test                               696.0
episode_length_test                     14.354885
returns_test                            68.453819
return_std_test                          2.775272
average_reward_test                      4.768685
round_time_test            0 days 00:00:13.465755
round_time_total           0 days 00:10:30.294217
loss_total             168996883408491173117952.0
loss_critic            211246100371755667292160.0
loss_actor                  -1068020079951.871948
memory_size                                1787.0 

=== epoch 5/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      695
episode_length                          14.371223
returns                                 68.522688
return_std                               2.673201
average_reward                           4.768057
round_time                 0 days 00:10:26.393957
episodes_test                               697.0
episode_length_test                     14.335725
returns_test                            68.357152
return_std_test                          2.644266
average_reward_test                       4.76832
round_time_test            0 days 00:00:12.984871
round_time_total           0 days 00:10:26.395183
loss_total             173976034477589563179008.0
loss_critic            217470039203625079668736.0
loss_actor                  -1084178441732.095947
memory_size                                1787.0 

=== epoch 5/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:06,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      697
episode_length                          14.340029
returns                                 68.368486
return_std                               2.758025
average_reward                           4.767692
round_time                 0 days 00:10:28.454861
episodes_test                               697.0
episode_length_test                     14.347202
returns_test                            68.418785
return_std_test                          2.539003
average_reward_test                      4.768789
round_time_test            0 days 00:00:12.832561
round_time_total           0 days 00:10:28.455934
loss_total             179604375271647343542272.0
loss_critic            224505465196197313511424.0
loss_actor                  -1097699806871.552002
memory_size                                1787.0 

=== epoch 5/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      697
episode_length                          14.327116
returns                                 68.322007
return_std                                2.80224
average_reward                           4.768724
round_time                 0 days 00:10:27.909881
episodes_test                               701.0
episode_length_test                     14.262482
returns_test                            67.995487
return_std_test                          2.752284
average_reward_test                      4.767461
round_time_test            0 days 00:00:13.093153
round_time_total           0 days 00:10:27.911042
loss_total             184933359902458380288000.0
loss_critic            231166695801189438586880.0
loss_actor                  -1116007293255.679932
memory_size                                1787.0 

=== epoch 5/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:25,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      698
episode_length                          14.308023
returns                                 68.235183
return_std                               2.786272
average_reward                           4.769015
round_time                 0 days 00:10:28.664679
episodes_test                               702.0
episode_length_test                     14.237892
returns_test                            67.917018
return_std_test                          2.650168
average_reward_test                      4.770182
round_time_test            0 days 00:00:12.971184
round_time_total           0 days 00:10:28.665851
loss_total             191069245845104922460160.0
loss_critic            238836553060612590010368.0
loss_actor                  -1134118133891.072021
memory_size                                1787.0 

=== epoch 5/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      697
episode_length                          14.318508
returns                                 68.284294
return_std                               2.881246
average_reward                           4.768949
round_time                 0 days 00:10:29.220124
episodes_test                               702.0
episode_length_test                     14.242165
returns_test                            67.940336
return_std_test                          2.369266
average_reward_test                       4.77039
round_time_test            0 days 00:00:12.951839
round_time_total           0 days 00:10:29.221644
loss_total             197092167921736915877888.0
loss_critic            246365205461621905817600.0
loss_actor                   -1150280109391.87207
memory_size                                1787.0 

=== epoch 5/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.293696
returns                                 68.158314
return_std                               2.876987
average_reward                           4.768422
round_time                 0 days 00:10:28.336447
episodes_test                               703.0
episode_length_test                     14.223329
returns_test                            67.854279
return_std_test                          2.365172
average_reward_test                      4.770648
round_time_test            0 days 00:00:13.183624
round_time_total           0 days 00:10:28.337798
loss_total             201382466982691970482176.0
loss_critic            251728079170722122956800.0
loss_actor                  -1166177270956.031982
memory_size                                1787.0 

=== epoch 5/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:46,  2.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.303725
returns                                 68.195468
return_std                               3.121801
average_reward                           4.767663
round_time                 0 days 00:10:27.691094
episodes_test                               700.0
episode_length_test                     14.285714
returns_test                            68.130135
return_std_test                          2.906617
average_reward_test                      4.769109
round_time_test            0 days 00:00:12.934401
round_time_total           0 days 00:10:27.692350
loss_total             209434710979359503548416.0
loss_critic            261793384183445072117760.0
loss_actor                  -1185824542851.072021
memory_size                                1787.0 

=== epoch 5/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      698
episode_length                          14.297994
returns                                 68.157227
return_std                               3.126718
average_reward                           4.766893
round_time                 0 days 00:10:26.188228
episodes_test                               702.0
episode_length_test                     14.236467
returns_test                            67.898767
return_std_test                          2.609645
average_reward_test                      4.769381
round_time_test            0 days 00:00:12.858188
round_time_total           0 days 00:10:26.189534
loss_total             216396861401892617453568.0
loss_critic            270496072400762621132800.0
loss_actor                  -1202702729117.696045
memory_size                                1787.0 

=== epoch 5/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:09,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                      699
episode_length                          14.304721
returns                                 68.182571
return_std                               3.209769
average_reward                           4.766453
round_time                 0 days 00:10:31.147966
episodes_test                               704.0
episode_length_test                     14.197443
returns_test                            67.717157
return_std_test                          2.650125
average_reward_test                      4.769682
round_time_test            0 days 00:00:13.272990
round_time_total           0 days 00:10:31.149116
loss_total             221942866450814102142976.0
loss_critic            277428578563295690096640.0
loss_actor                   -1218306672197.63208
memory_size                                1787.0 

=== epoch 5/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      699
episode_length                          14.284692
returns                                 68.095432
return_std                               3.118769
average_reward                           4.767052
round_time                 0 days 00:10:26.281613
episodes_test                               703.0
episode_length_test                     14.224751
returns_test                            67.875853
return_std_test                          3.509137
average_reward_test                      4.771672
round_time_test            0 days 00:00:13.069681
round_time_total           0 days 00:10:26.282700
loss_total             227849088105885527441408.0
loss_critic            284811355485767972421632.0
loss_actor                  -1236518067273.728027
memory_size                                1787.0 

=== epoch 5/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:43,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                      700
episode_length                          14.261429
returns                                 67.994057
return_std                               3.113002
average_reward                           4.767674
round_time                 0 days 00:10:31.757085
episodes_test                               706.0
episode_length_test                     14.155807
returns_test                            67.541291
return_std_test                          2.428919
average_reward_test                      4.771294
round_time_test            0 days 00:00:13.507512
round_time_total           0 days 00:10:31.758297
loss_total             232128427639132974481408.0
loss_critic            290160529897823722799104.0
loss_actor                  -1250508420743.167969
memory_size                                1787.0 

=== epoch 5/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:20,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      701
episode_length                          14.242511
returns                                 67.927635
return_std                               2.779953
average_reward                           4.769354
round_time                 0 days 00:10:26.944023
episodes_test                               697.0
episode_length_test                     14.335725
returns_test                            68.396673
return_std_test                           2.59103
average_reward_test                      4.771086
round_time_test            0 days 00:00:12.667018
round_time_total           0 days 00:10:26.945470
loss_total             235703773611929930563584.0
loss_critic            294629712318783905857536.0
loss_actor                  -1262841411403.775879
memory_size                                1787.0 

=== epoch 5/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:02,  2.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      702
episode_length                          14.226496
returns                                 67.873293
return_std                               2.721817
average_reward                             4.7709
round_time                 0 days 00:10:27.093001
episodes_test                               702.0
episode_length_test                     14.226496
returns_test                            67.859657
return_std_test                          2.503365
average_reward_test                      4.769954
round_time_test            0 days 00:00:13.069661
round_time_total           0 days 00:10:27.094273
loss_total             242943633758690815770624.0
loss_critic            303679537139695254241280.0
loss_actor                  -1284696930385.919922
memory_size                                1787.0 

=== epoch 5/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      702
episode_length                          14.206553
returns                                 67.789379
return_std                               2.629423
average_reward                           4.771729
round_time                 0 days 00:10:25.794165
episodes_test                               705.0
episode_length_test                     14.180142
returns_test                            67.660576
return_std_test                          2.394595
average_reward_test                      4.771528
round_time_test            0 days 00:00:12.705837
round_time_total           0 days 00:10:25.795367
loss_total             254212072952900379213824.0
loss_critic            317765085599906538717184.0
loss_actor                   -1302000560373.76001
memory_size                                1787.0 

=== epoch 5/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:57,  2.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      702
episode_length                          14.225071
returns                                 67.871977
return_std                                2.72615
average_reward                           4.771302
round_time                 0 days 00:10:27.405943
episodes_test                               704.0
episode_length_test                     14.198864
returns_test                            67.743966
return_std_test                          2.428111
average_reward_test                       4.77111
round_time_test            0 days 00:00:13.026907
round_time_total           0 days 00:10:27.407197
loss_total             257953350964498436980736.0
loss_critic            322441683535490648637440.0
loss_actor                  -1317908237713.407959
memory_size                                1787.0 

=== epoch 5/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:33,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      701
episode_length                          14.249643
returns                                 67.992529
return_std                               2.696912
average_reward                           4.771538
round_time                 0 days 00:10:28.688699
episodes_test                               697.0
episode_length_test                     14.345768
returns_test                            68.437738
return_std_test                          2.488193
average_reward_test                      4.770601
round_time_test            0 days 00:00:13.166087
round_time_total           0 days 00:10:28.689774
loss_total             263313221849007542763520.0
loss_critic            329141521860777989898240.0
loss_actor                  -1334092915146.751953
memory_size                                1787.0 

=== epoch 5/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      700
episode_length                          14.257143
returns                                 68.018362
return_std                               2.684548
average_reward                           4.770814
round_time                 0 days 00:10:29.533657
episodes_test                               697.0
episode_length_test                     14.341463
returns_test                            68.414948
return_std_test                          2.608058
average_reward_test                      4.770444
round_time_test            0 days 00:00:12.904543
round_time_total           0 days 00:10:29.534991
loss_total             270446347742945656963072.0
loss_critic            338057928898311937327104.0
loss_actor                  -1353949710712.832031
memory_size                                1787.0 

=== epoch 5/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:49,  2.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      699
episode_length                          14.290415
returns                                 68.174721
return_std                               2.695279
average_reward                            4.77067
round_time                 0 days 00:10:25.809339
episodes_test                               697.0
episode_length_test                     14.341463
returns_test                            68.409581
return_std_test                          2.640101
average_reward_test                      4.770073
round_time_test            0 days 00:00:12.873351
round_time_total           0 days 00:10:25.810620
loss_total             274825930900700517105664.0
loss_critic            343532407651850747117568.0
loss_actor                  -1371068390703.104004
memory_size                                1787.0 

=== epoch 5/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:18,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.319484
returns                                 68.308002
return_std                               2.665283
average_reward                            4.77031
round_time                 0 days 00:10:28.408618
episodes_test                               697.0
episode_length_test                     14.327116
returns_test                            68.335732
return_std_test                          2.685987
average_reward_test                      4.769663
round_time_test            0 days 00:00:13.050600
round_time_total           0 days 00:10:28.409730
loss_total             283560838288587049926656.0
loss_critic            354451041467874104311808.0
loss_actor                   -1388090259210.23999
memory_size                                1787.0 

=== epoch 5/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:55,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      696
episode_length                          14.344828
returns                                 68.439626
return_std                               3.351602
average_reward                           4.771026
round_time                 0 days 00:10:29.606902
episodes_test                               698.0
episode_length_test                      14.32235
returns_test                            68.302422
return_std_test                          2.627103
average_reward_test                      4.768965
round_time_test            0 days 00:00:13.409368
round_time_total           0 days 00:10:29.607982
loss_total             293753658793116504686592.0
loss_critic            367192066961176140972032.0
loss_actor                  -1404265633808.384033
memory_size                                1787.0 

=== epoch 5/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      697
episode_length                          14.335725
returns                                 68.388722
return_std                               3.404797
average_reward                           4.770521
round_time                 0 days 00:10:28.868180
episodes_test                               698.0
episode_length_test                     14.318052
returns_test                            68.305229
return_std_test                          2.478134
average_reward_test                       4.77059
round_time_test            0 days 00:00:12.839675
round_time_total           0 days 00:10:28.869256
loss_total             298467928736101237260288.0
loss_critic            373084904278442983292928.0
loss_actor                  -1423914951311.360107
memory_size                                1787.0 

=== epoch 5/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:22,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      697
episode_length                          14.338594
returns                                 68.410698
return_std                               3.435244
average_reward                           4.771104
round_time                 0 days 00:10:28.831472
episodes_test                               695.0
episode_length_test                     14.388489
returns_test                            68.673902
return_std_test                          4.114326
average_reward_test                      4.772836
round_time_test            0 days 00:00:13.121996
round_time_total           0 days 00:10:28.832887
loss_total             306975964910354748669952.0
loss_critic            383719949182133874982912.0
loss_actor                  -1445063174520.832031
memory_size                                1787.0 

=== epoch 5/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:53,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      699
episode_length                          14.293276
returns                                 68.197688
return_std                               3.429152
average_reward                           4.771321
round_time                 0 days 00:10:29.541133
episodes_test                               698.0
episode_length_test                      14.32235
returns_test                            68.291026
return_std_test                           2.81706
average_reward_test                      4.768173
round_time_test            0 days 00:00:12.735694
round_time_total           0 days 00:10:29.542209
loss_total             315573636664841025355776.0
loss_critic            394467039023860423852032.0
loss_actor                  -1464673669349.375977
memory_size                                1787.0 

=== epoch 5/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      700
episode_length                          14.268571
returns                                 68.078541
return_std                               3.414779
average_reward                           4.771223
round_time                 0 days 00:10:29.447120
episodes_test                               709.0
episode_length_test                     14.098731
returns_test                             67.27172
return_std_test                          2.332711
average_reward_test                      4.771493
round_time_test            0 days 00:00:13.093153
round_time_total           0 days 00:10:29.448186
loss_total             322282414040913482350592.0
loss_critic            402853010223785257730048.0
loss_actor                  -1485978393640.959961
memory_size                                1787.0 

=== epoch 5/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      703
episode_length                          14.217639
returns                                 67.837052
return_std                                2.66369
average_reward                            4.77135
round_time                 0 days 00:10:26.183541
episodes_test                               703.0
episode_length_test                     14.210526
returns_test                            67.827692
return_std_test                          4.876162
average_reward_test                      4.773059
round_time_test            0 days 00:00:13.008819
round_time_total           0 days 00:10:26.184701
loss_total             332112296567160552030208.0
loss_critic            415140363131644345843712.0
loss_actor                  -1502997286682.624023
memory_size                                1787.0 

=== epoch 5/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      703
episode_length                          14.197724
returns                                  67.74724
return_std                                2.75039
average_reward                           4.771686
round_time                 0 days 00:10:28.120139
episodes_test                               707.0
episode_length_test                     14.142857
returns_test                            67.479441
return_std_test                          2.493961
average_reward_test                      4.771288
round_time_test            0 days 00:00:13.018635
round_time_total           0 days 00:10:28.121207
loss_total             342328833081370977763328.0
loss_critic            427911033920774293422080.0
loss_actor                    -1519367059406.8479
memory_size                                1787.0 

=== epoch 5/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:42,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      705
episode_length                          14.165957
returns                                 67.597161
return_std                               2.656642
average_reward                            4.77179
round_time                 0 days 00:10:29.307599
episodes_test                               709.0
episode_length_test                     14.101551
returns_test                            67.283934
return_std_test                          2.455486
average_reward_test                      4.771413
round_time_test            0 days 00:00:12.948586
round_time_total           0 days 00:10:29.308675
loss_total             348226426150304694140928.0
loss_critic            435283024807707455520768.0
loss_actor                  -1539456353239.040039
memory_size                                1787.0 

=== epoch 5/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:22,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      705
episode_length                          14.170213
returns                                 67.612418
return_std                               2.601579
average_reward                           4.771418
round_time                 0 days 00:10:29.607608
episodes_test                               710.0
episode_length_test                     14.067606
returns_test                            67.116553
return_std_test                          2.291987
average_reward_test                      4.771006
round_time_test            0 days 00:00:13.200585
round_time_total           0 days 00:10:29.608976
loss_total             357521913747028232372224.0
loss_critic            446902384399313329455104.0
loss_actor                  -1562376902868.991943
memory_size                                1787.0 

=== epoch 5/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      706
episode_length                          14.141643
returns                                 67.483968
return_std                               2.590804
average_reward                           4.771998
round_time                 0 days 00:10:28.770446
episodes_test                               709.0
episode_length_test                     14.093089
returns_test                            67.238731
return_std_test                          2.405955
average_reward_test                      4.771058
round_time_test            0 days 00:00:13.104097
round_time_total           0 days 00:10:28.771968
loss_total             367827699166910928125952.0
loss_critic            459784615764339091046400.0
loss_actor                  -1580124066217.983887
memory_size                                1787.0 

=== epoch 5/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      706
episode_length                          14.134561
returns                                 67.445011
return_std                               2.553713
average_reward                           4.771625
round_time                 0 days 00:10:29.897391
episodes_test                               709.0
episode_length_test                      14.09732
returns_test                             67.27998
return_std_test                          3.645283
average_reward_test                      4.772557
round_time_test            0 days 00:00:13.197916
round_time_total           0 days 00:10:29.898477
loss_total             373171131656074902372352.0
loss_critic            466463906434340922327040.0
loss_actor                  -1597438160994.303955
memory_size                                1787.0 

=== epoch 5/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:08,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      706
episode_length                          14.148725
returns                                 67.515609
return_std                               2.325426
average_reward                           4.771851
round_time                 0 days 00:10:28.715463
episodes_test                               702.0
episode_length_test                     14.242165
returns_test                            67.951346
return_std_test                           2.35701
average_reward_test                      4.771165
round_time_test            0 days 00:00:12.988562
round_time_total           0 days 00:10:28.716616
loss_total             387580682895026502500352.0
loss_critic            484475845293879240687616.0
loss_actor                   -1614202735951.87207
memory_size                                1787.0 

=== epoch 5/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      705
episode_length                          14.150355
returns                                 67.521881
return_std                                2.34241
average_reward                           4.771752
round_time                 0 days 00:10:25.733777
episodes_test                               704.0
episode_length_test                     14.196023
returns_test                            67.722618
return_std_test                          2.674757
average_reward_test                      4.770556
round_time_test            0 days 00:00:12.922646
round_time_total           0 days 00:10:25.734852
loss_total             383364409447103332352000.0
loss_critic            479205503263298902032384.0
loss_actor                  -1616497773117.439941
memory_size                                1787.0 

=== epoch 5/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      706
episode_length                          14.145892
returns                                 67.502125
return_std                                2.33112
average_reward                           4.771859
round_time                 0 days 00:10:26.638168
episodes_test                               703.0
episode_length_test                     14.207681
returns_test                            67.775394
return_std_test                          2.643604
average_reward_test                      4.770321
round_time_test            0 days 00:00:13.053391
round_time_total           0 days 00:10:26.639498
loss_total             393829699926845344972800.0
loss_critic            492287115685184625704960.0
loss_actor                  -1638048197443.583984
memory_size                                1787.0 

=== epoch 5/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:59,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      704
episode_length                           14.18892
returns                                  67.70419
return_std                               2.381351
average_reward                           4.771612
round_time                 0 days 00:10:28.523147
episodes_test                               702.0
episode_length_test                     14.240741
returns_test                            67.910757
return_std_test                          2.767504
average_reward_test                      4.768802
round_time_test            0 days 00:00:13.025698
round_time_total           0 days 00:10:28.524549
loss_total             398190715626042093993984.0
loss_critic            497738386137842844696576.0
loss_actor                   -1655428483514.36792
memory_size                                1787.0 

=== epoch 5/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:44,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      699
episode_length                          14.283262
returns                                 68.139642
return_std                               2.810092
average_reward                           4.770609
round_time                 0 days 00:10:27.619192
episodes_test                               698.0
episode_length_test                     14.316619
returns_test                            68.251191
return_std_test                          3.075596
average_reward_test                      4.767274
round_time_test            0 days 00:00:13.070566
round_time_total           0 days 00:10:27.620397
loss_total             411438113971942583173120.0
loss_critic            514297633813513394716672.0
loss_actor                  -1674004533280.768066
memory_size                                1787.0 

=== epoch 5/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:23,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      696
episode_length                          14.341954
returns                                 68.406049
return_std                               3.078326
average_reward                           4.769662
round_time                 0 days 00:10:28.367150
episodes_test                               681.0
episode_length_test                     14.681351
returns_test                            69.915772
return_std_test                          4.835439
average_reward_test                      4.762247
round_time_test            0 days 00:00:12.960644
round_time_total           0 days 00:10:28.368259
loss_total             421156936605478789578752.0
loss_critic            526446162281074025037824.0
loss_actor                  -1698976572637.184082
memory_size                                1787.0 

=== epoch 5/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      695
episode_length                          14.382734
returns                                 68.587808
return_std                               3.161153
average_reward                           4.768772
round_time                 0 days 00:10:27.848170
episodes_test                               696.0
episode_length_test                     14.364943
returns_test                            68.513088
return_std_test                          2.662006
average_reward_test                      4.769492
round_time_test            0 days 00:00:13.808676
round_time_total           0 days 00:10:27.849546
loss_total             437221061789715171115008.0
loss_critic            546526318443865621659648.0
loss_actor                  -1721046856105.983887
memory_size                                1787.0 

=== epoch 5/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      694
episode_length                          14.393372
returns                                 68.642026
return_std                               3.156556
average_reward                           4.769015
round_time                 0 days 00:10:29.658821
episodes_test                               698.0
episode_length_test                     14.313754
returns_test                            68.256512
return_std_test                          3.531689
average_reward_test                      4.768595
round_time_test            0 days 00:00:13.076519
round_time_total           0 days 00:10:29.659893
loss_total             450927652720071501938688.0
loss_critic            563659557313976735891456.0
loss_actor                  -1741241936445.439941
memory_size                                1787.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:04,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      140
episode_length                               14.2
returns                                 67.755129
return_std                               2.262017
average_reward                           4.771548
round_time                 0 days 00:10:26.721849
episodes_test                               700.0
episode_length_test                         14.27
returns_test                            68.075573
return_std_test                          2.558846
average_reward_test                      4.770546
round_time_test            0 days 00:00:13.224662
round_time_total           0 days 00:10:26.723133
loss_total             454707176587734709960704.0
loss_critic            568383962171073722580992.0
loss_actor                  -1762799278948.352051
memory_size                                1787.0 

=== epoch 6/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      280
episode_length                          14.246429
returns                                 67.953689
return_std                               2.612464
average_reward                           4.769897
round_time                 0 days 00:10:26.305763
episodes_test                               702.0
episode_length_test                     14.242165
returns_test                            67.933713
return_std_test                          2.646442
average_reward_test                      4.769929
round_time_test            0 days 00:00:13.025531
round_time_total           0 days 00:10:26.306850
loss_total             472375959278152247672832.0
loss_critic            590469939372166985285632.0
loss_actor                  -1787191060660.224121
memory_size                                1787.0 

=== epoch 6/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:31,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      419
episode_length                          14.315036
returns                                 68.274664
return_std                               2.663123
average_reward                           4.769479
round_time                 0 days 00:10:28.780338
episodes_test                               693.0
episode_length_test                     14.411255
returns_test                            68.745145
return_std_test                          2.582409
average_reward_test                      4.770233
round_time_test            0 days 00:00:13.147067
round_time_total           0 days 00:10:28.781418
loss_total             479279666082997025636352.0
loss_critic            599099573461438999560192.0
loss_actor                  -1809175329177.600098
memory_size                                1787.0 

=== epoch 6/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:43,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      557
episode_length                          14.344704
returns                                 68.400183
return_std                               2.720131
average_reward                           4.768325
round_time                 0 days 00:10:28.245167
episodes_test                               699.0
episode_length_test                      14.30186
returns_test                            68.204682
return_std_test                           2.68719
average_reward_test                      4.768967
round_time_test            0 days 00:00:13.455103
round_time_total           0 days 00:10:28.246249
loss_total             496474809280865285177344.0
loss_critic            620593501627860287225856.0
loss_actor                  -1836212552138.751953
memory_size                                1787.0 

=== epoch 6/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:15,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      697
episode_length                          14.338594
returns                                 68.379251
return_std                               2.670269
average_reward                            4.76891
round_time                 0 days 00:10:26.796468
episodes_test                               698.0
episode_length_test                     14.323782
returns_test                            68.311477
return_std_test                          2.646467
average_reward_test                      4.769119
round_time_test            0 days 00:00:13.103904
round_time_total           0 days 00:10:26.797771
loss_total             508397390768012060000256.0
loss_critic            635496727982390548365312.0
loss_actor                  -1860943573614.592041
memory_size                                1787.0 

=== epoch 6/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:19,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                      694
episode_length                          14.400576
returns                                 68.662132
return_std                               2.727174
average_reward                           4.768031
round_time                 0 days 00:10:29.185513
episodes_test                               696.0
episode_length_test                     14.359195
returns_test                            68.505746
return_std_test                          2.545324
average_reward_test                      4.770889
round_time_test            0 days 00:00:12.973488
round_time_total           0 days 00:10:29.186592
loss_total             523398258936480266190848.0
loss_critic            654247813069126817021952.0
loss_actor                  -1880250745880.575928
memory_size                                1787.0 

=== epoch 6/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:33,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      693
episode_length                           14.41847
returns                                 68.749056
return_std                               2.636116
average_reward                           4.768141
round_time                 0 days 00:10:27.758739
episodes_test                               695.0
episode_length_test                     14.372662
returns_test                            68.549983
return_std_test                          2.559613
average_reward_test                       4.76948
round_time_test            0 days 00:00:12.959793
round_time_total           0 days 00:10:27.760083
loss_total             531610108031465636757504.0
loss_critic            664512623874908627140608.0
loss_actor                   -1904222208196.60791
memory_size                                1787.0 

=== epoch 6/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      694
episode_length                          14.380403
returns                                 68.564699
return_std                               2.760628
average_reward                           4.767911
round_time                 0 days 00:10:27.239076
episodes_test                               696.0
episode_length_test                     14.360632
returns_test                            68.488577
return_std_test                          3.324988
average_reward_test                      4.769207
round_time_test            0 days 00:00:13.149486
round_time_total           0 days 00:10:27.240163
loss_total             533802525551199367725056.0
loss_critic            667253146015518355881984.0
loss_actor                  -1919022401912.832031
memory_size                                1787.0 

=== epoch 6/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      696
episode_length                          14.354885
returns                                 68.455575
return_std                               2.735871
average_reward                           4.768818
round_time                 0 days 00:10:29.976681
episodes_test                               697.0
episode_length_test                     14.332855
returns_test                            68.343692
return_std_test                          2.581092
average_reward_test                       4.76834
round_time_test            0 days 00:00:12.792779
round_time_total           0 days 00:10:29.978070
loss_total             548965771346506011377664.0
loss_critic            686207202698953461071872.0
loss_actor                  -1936927119835.135986
memory_size                                1787.0 

=== epoch 6/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:08,  2.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                      696
episode_length                          14.346264
returns                                  68.41072
return_std                               2.683933
average_reward                           4.768543
round_time                 0 days 00:10:32.426551
episodes_test                               698.0
episode_length_test                     14.310888
returns_test                            68.253105
return_std_test                          2.529407
average_reward_test                      4.769302
round_time_test            0 days 00:00:13.315326
round_time_total           0 days 00:10:32.427986
loss_total             568858183825932444237824.0
loss_critic            711072718516661090516992.0
loss_actor                  -1964306231001.087891
memory_size                                1787.0 

=== epoch 6/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:31,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      697
episode_length                          14.322812
returns                                 68.301821
return_std                               2.740466
average_reward                           4.768743
round_time                 0 days 00:10:27.379145
episodes_test                               697.0
episode_length_test                     14.347202
returns_test                             68.40213
return_std_test                          2.854999
average_reward_test                      4.767628
round_time_test            0 days 00:00:12.795540
round_time_total           0 days 00:10:27.380225
loss_total             586114925449056849231872.0
loss_critic            732643643807177090531328.0
loss_actor                  -1992736572768.256104
memory_size                                1787.0 

=== epoch 6/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                      698
episode_length                          14.309456
returns                                  68.24173
return_std                               2.725719
average_reward                           4.768993
round_time                 0 days 00:10:31.134573
episodes_test                               697.0
episode_length_test                      14.33142
returns_test                            68.341544
return_std_test                          2.668048
average_reward_test                      4.768678
round_time_test            0 days 00:00:13.196469
round_time_total           0 days 00:10:31.135848
loss_total             598017293069708081233920.0
loss_critic            747521602734012424519680.0
loss_actor                  -2021418986242.048096
memory_size                                1787.0 

=== epoch 6/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                      698
episode_length                          14.316619
returns                                 68.282984
return_std                                2.61059
average_reward                            4.76947
round_time                 0 days 00:10:32.486021
episodes_test                               695.0
episode_length_test                     14.382734
returns_test                            68.606834
return_std_test                          2.596063
average_reward_test                      4.770103
round_time_test            0 days 00:00:12.851093
round_time_total           0 days 00:10:32.487173
loss_total             612338261233170705809408.0
loss_critic            765422812994635630116864.0
loss_actor                    -2044228016078.8479
memory_size                                1787.0 

=== epoch 6/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:01,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:26<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.308023
returns                                 68.247113
return_std                               2.609759
average_reward                           4.769841
round_time                 0 days 00:10:26.926718
episodes_test                               701.0
episode_length_test                     14.248217
returns_test                            67.966481
return_std_test                          2.522897
average_reward_test                       4.77017
round_time_test            0 days 00:00:12.979610
round_time_total           0 days 00:10:26.928093
loss_total             621620519816737176158208.0
loss_critic            777025635341388214173696.0
loss_actor                  -2055663355625.471924
memory_size                                1787.0 

=== epoch 6/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.309456
returns                                 68.248759
return_std                               2.735324
average_reward                           4.769503
round_time                 0 days 00:10:28.212471
episodes_test                               703.0
episode_length_test                     14.217639
returns_test                             67.82937
return_std_test                          2.580063
average_reward_test                      4.770812
round_time_test            0 days 00:00:12.809139
round_time_total           0 days 00:10:28.213618
loss_total             630720265972164765679616.0
loss_critic            788400318373442720104448.0
loss_actor                  -2085389134856.191895
memory_size                                1787.0 

=== epoch 6/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      699
episode_length                          14.286123
returns                                 68.147192
return_std                                2.60867
average_reward                            4.77018
round_time                 0 days 00:10:27.565923
episodes_test                               699.0
episode_length_test                     14.297568
returns_test                            68.195874
return_std_test                          2.687203
average_reward_test                      4.769771
round_time_test            0 days 00:00:12.897921
round_time_total           0 days 00:10:27.567044
loss_total             656026835532353854504960.0
loss_critic            820033528729404803383296.0
loss_actor                  -2116081521524.736084
memory_size                                1787.0 

=== epoch 6/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      699
episode_length                          14.283262
returns                                  68.13604
return_std                               2.620813
average_reward                           4.770335
round_time                 0 days 00:10:27.861217
episodes_test                               695.0
episode_length_test                     14.382734
returns_test                            68.612453
return_std_test                          3.354059
average_reward_test                      4.770496
round_time_test            0 days 00:00:12.728276
round_time_total           0 days 00:10:27.862495
loss_total             686021210339382564749312.0
loss_critic            857526496769816286724096.0
loss_actor                  -2141564404105.216064
memory_size                                1787.0 

=== epoch 6/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                      699
episode_length                          14.293276
returns                                  68.18287
return_std                               2.657589
average_reward                           4.770292
round_time                 0 days 00:10:26.516538
episodes_test                               696.0
episode_length_test                     14.363506
returns_test                            68.510756
return_std_test                          2.635322
average_reward_test                      4.769803
round_time_test            0 days 00:00:12.942149
round_time_total           0 days 00:10:26.517616
loss_total             704053009857104702341120.0
loss_critic            880066246392149094432768.0
loss_actor                  -2166807692312.575928
memory_size                                1787.0 

=== epoch 6/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:27<00:00,  3.19it/s]
episodes                                      698
episode_length                          14.295129
returns                                 68.190556
return_std                               2.638293
average_reward                           4.770189
round_time                 0 days 00:10:28.469575
episodes_test                               698.0
episode_length_test                     14.320917
returns_test                            68.296962
return_std_test                           2.79121
average_reward_test                       4.76906
round_time_test            0 days 00:00:13.080521
round_time_total           0 days 00:10:28.470811
loss_total             708825575497048149458944.0
loss_critic            886031952212595562250240.0
loss_actor                  -2189501451141.120117
memory_size                                1787.0 

=== epoch 6/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                      699
episode_length                          14.290415
returns                                 68.172939
return_std                               2.560212
average_reward                            4.77053
round_time                 0 days 00:10:29.651491
episodes_test                               697.0
episode_length_test                     14.337159
returns_test                            68.366458
return_std_test                           2.74365
average_reward_test                      4.768498
round_time_test            0 days 00:00:13.318019
round_time_total           0 days 00:10:29.652596
loss_total             721892609073385735979008.0
loss_critic            902365744953133065306112.0
loss_actor                  -2204527579430.912109
memory_size                                1787.0 

=== epoch 6/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:22<00:00,  3.21it/s]
episodes                                      699
episode_length                          14.294707
returns                                 68.181506
return_std                                2.60852
average_reward                           4.769699
round_time                 0 days 00:10:22.858497
episodes_test                               699.0
episode_length_test                     14.290415
returns_test                            68.153123
return_std_test                          2.518474
average_reward_test                      4.769139
round_time_test            0 days 00:00:13.241802
round_time_total           0 days 00:10:22.859590
loss_total             737588400255417368707072.0
loss_critic            921985483831593448505344.0
loss_actor                  -2223981293731.839844
memory_size                                1787.0 

=== epoch 6/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                      697
episode_length                          14.315638
returns                                 68.266838
return_std                               2.668549
average_reward                            4.76869
round_time                 0 days 00:10:17.945582
episodes_test                               699.0
episode_length_test                     14.294707
returns_test                             68.19134
return_std_test                          2.618129
average_reward_test                      4.770404
round_time_test            0 days 00:00:13.103318
round_time_total           0 days 00:10:17.946648
loss_total             746548999521820357951488.0
loss_critic            933186233540597661040640.0
loss_actor                   -2250204651716.60791
memory_size                                1787.0 

=== epoch 6/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                      698
episode_length                          14.316619
returns                                 68.269495
return_std                               2.618033
average_reward                           4.768564
round_time                 0 days 00:10:19.224796
episodes_test                               694.0
episode_length_test                     14.396254
returns_test                            68.637472
return_std_test                          2.699256
average_reward_test                      4.767769
round_time_test            0 days 00:00:13.302160
round_time_total           0 days 00:10:19.226262
loss_total             765013324639668950532096.0
loss_critic            956266638983145090711552.0
loss_actor                  -2269250929295.359863
memory_size                                1787.0 

=== epoch 6/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:19<00:00,  3.23it/s]
episodes                                      696
episode_length                          14.346264
returns                                 68.401874
return_std                                2.60585
average_reward                           4.767924
round_time                 0 days 00:10:20.286353
episodes_test                               699.0
episode_length_test                     14.291845
returns_test                             68.16664
return_std_test                          2.590563
average_reward_test                      4.769604
round_time_test            0 days 00:00:12.939334
round_time_total           0 days 00:10:20.287479
loss_total             774324608004099516923904.0
loss_critic            967905743211201342472192.0
loss_actor                   -2291402748592.12793
memory_size                                1787.0 

=== epoch 6/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                      696
episode_length                          14.347701
returns                                 68.401911
return_std                               2.700717
average_reward                           4.767432
round_time                 0 days 00:10:18.461622
episodes_test                               700.0
episode_length_test                     14.281429
returns_test                            68.113842
return_std_test                          2.520414
average_reward_test                      4.769424
round_time_test            0 days 00:00:12.891318
round_time_total           0 days 00:10:18.462730
loss_total             787208046865250105950208.0
loss_critic            984010042080373711568896.0
loss_actor                  -2304222099341.312012
memory_size                                1787.0 

=== epoch 6/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:19<00:00,  3.23it/s]
episodes                                      694
episode_length                          14.380403
returns                                 68.553502
return_std                               2.780422
average_reward                           4.767159
round_time                 0 days 00:10:19.607708
episodes_test                               697.0
episode_length_test                     14.328551
returns_test                            68.309804
return_std_test                          2.683142
average_reward_test                      4.767418
round_time_test            0 days 00:00:12.974346
round_time_total           0 days 00:10:19.609156
loss_total             789919220470225808916480.0
loss_critic            987399009284751588589568.0
loss_actor                  -2315506655756.288086
memory_size                                1787.0 

=== epoch 6/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                       696
episode_length                           14.362069
returns                                  68.481857
return_std                                2.795383
average_reward                            4.768272
round_time                  0 days 00:10:17.909431
episodes_test                                697.0
episode_length_test                      14.340029
returns_test                              68.36998
return_std_test                           3.376302
average_reward_test                        4.76779
round_time_test             0 days 00:00:12.552626
round_time_total            0 days 00:10:17.910497
loss_total              803476374813229914783744.0
loss_critic            1004345451272254303240192.0
loss_actor                   -2341583315271.680176
memory_size                                 1787.0 

=== epoch 6/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:39,  2.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:19<00:00,  3.23it/s]
episodes                                       696
episode_length                           14.347701
returns                                  68.415816
return_std                                 2.79487
average_reward                            4.768409
round_time                  0 days 00:10:20.247422
episodes_test                                697.0
episode_length_test                       14.33429
returns_test                             68.364012
return_std_test                           2.732641
average_reward_test                       4.769275
round_time_test             0 days 00:00:12.653975
round_time_total            0 days 00:10:20.248614
loss_total              821725040415206897876992.0
loss_critic            1027156283130610421071872.0
loss_actor                    -2362143338004.47998
memory_size                                 1787.0 

=== epoch 6/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                       697
episode_length                           14.328551
returns                                  68.331708
return_std                                2.782005
average_reward                            4.768924
round_time                  0 days 00:10:19.537950
episodes_test                                696.0
episode_length_test                      14.357759
returns_test                             68.454004
return_std_test                            2.59233
average_reward_test                       4.767762
round_time_test             0 days 00:00:12.928844
round_time_total            0 days 00:10:19.539202
loss_total              837322851662047788138496.0
loss_critic            1046653547175650741190656.0
loss_actor                   -2382644291043.328125
memory_size                                 1787.0 

=== epoch 6/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:11,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                       695
episode_length                           14.352518
returns                                  68.453695
return_std                                 2.72258
average_reward                            4.769432
round_time                  0 days 00:10:17.231488
episodes_test                                696.0
episode_length_test                      14.364943
returns_test                             68.507879
return_std_test                             2.8454
average_reward_test                       4.769128
round_time_test             0 days 00:00:13.360876
round_time_total            0 days 00:10:17.232917
loss_total              849247829280369141612544.0
loss_critic            1061559768640106155474944.0
loss_actor                   -2401339689664.512207
memory_size                                 1787.0 

=== epoch 6/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                       696
episode_length                           14.356322
returns                                  68.483309
return_std                                2.749329
average_reward                            4.770247
round_time                  0 days 00:10:18.766736
episodes_test                                696.0
episode_length_test                      14.359195
returns_test                             68.497154
return_std_test                           2.946074
average_reward_test                       4.770287
round_time_test             0 days 00:00:12.605557
round_time_total            0 days 00:10:18.768172
loss_total              858592874455867003502592.0
loss_critic            1073241075866083243589632.0
loss_actor                   -2414177010843.647949
memory_size                                 1787.0 

=== epoch 6/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                       695
episode_length                           14.361151
returns                                  68.510323
return_std                                2.855496
average_reward                            4.770528
round_time                  0 days 00:10:17.745228
episodes_test                                691.0
episode_length_test                      14.457308
returns_test                             68.976449
return_std_test                           3.902743
average_reward_test                       4.771058
round_time_test             0 days 00:00:13.263530
round_time_total            0 days 00:10:17.746688
loss_total              870747963893465617530880.0
loss_critic            1088434936474131148832768.0
loss_actor                   -2439584362201.087891
memory_size                                 1787.0 

=== epoch 6/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<13:40,  2.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:19<00:00,  3.23it/s]
episodes                                       695
episode_length                           14.355396
returns                                  68.488835
return_std                                2.837709
average_reward                            4.770939
round_time                  0 days 00:10:19.934360
episodes_test                                702.0
episode_length_test                      14.226496
returns_test                             67.859735
return_std_test                            2.55753
average_reward_test                       4.769939
round_time_test             0 days 00:00:13.113724
round_time_total            0 days 00:10:19.935467
loss_total              899083570994229442248704.0
loss_critic            1123854445462675813040128.0
loss_actor                   -2460454577897.472168
memory_size                                 1787.0 

=== epoch 6/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:36,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:20<00:00,  3.22it/s]
episodes                                       696
episode_length                           14.353448
returns                                  68.475893
return_std                                2.914488
average_reward                            4.770737
round_time                  0 days 00:10:21.394275
episodes_test                                699.0
episode_length_test                       14.30186
returns_test                             68.207629
return_std_test                           2.642767
average_reward_test                       4.769176
round_time_test             0 days 00:00:12.613389
round_time_total            0 days 00:10:21.395401
loss_total              907158977539708909780992.0
loss_critic            1133948703419345213063168.0
loss_actor                   -2482725249744.895996
memory_size                                 1787.0 

=== epoch 6/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:20<00:00,  3.23it/s]
episodes                                       696
episode_length                           14.344828
returns                                  68.453367
return_std                                3.702009
average_reward                             4.77198
round_time                  0 days 00:10:20.539577
episodes_test                                700.0
episode_length_test                      14.277143
returns_test                             68.074973
return_std_test                           2.793075
average_reward_test                       4.768134
round_time_test             0 days 00:00:13.116904
round_time_total            0 days 00:10:20.540678
loss_total              919452251879437175357440.0
loss_critic            1149315296780854688219136.0
loss_actor                   -2501252163174.399902
memory_size                                 1787.0 

=== epoch 6/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:01<00:00,  3.33it/s]
episodes                                       696
episode_length                           14.350575
returns                                  68.484807
return_std                                4.306138
average_reward                            4.772293
round_time                  0 days 00:10:01.560900
episodes_test                                695.0
episode_length_test                      14.371223
returns_test                             68.542527
return_std_test                           3.044485
average_reward_test                       4.769441
round_time_test             0 days 00:00:12.887179
round_time_total            0 days 00:10:01.561975
loss_total              946307856256824953012224.0
loss_critic            1182884800234976714948608.0
loss_actor                    -2522405976539.13623
memory_size                              1787.6225 

=== epoch 6/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:43<00:00,  3.43it/s]
episodes                                       695
episode_length                           14.364029
returns                                  68.543567
return_std                                4.235287
average_reward                            4.771912
round_time                  0 days 00:09:43.693739
episodes_test                                693.0
episode_length_test                      14.427128
returns_test                             68.781751
return_std_test                           3.586554
average_reward_test                       4.767558
round_time_test             0 days 00:00:12.250493
round_time_total            0 days 00:09:43.695053
loss_total              957726094465566894981120.0
loss_critic            1197157599058753694466048.0
loss_actor                   -2540097793818.624023
memory_size                                 1788.0 

=== epoch 6/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:24,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.381295
returns                                  68.618035
return_std                                4.261695
average_reward                             4.77136
round_time                  0 days 00:09:39.645499
episodes_test                                697.0
episode_length_test                      14.338594
returns_test                             68.384188
return_std_test                           2.939583
average_reward_test                       4.769263
round_time_test             0 days 00:00:12.009124
round_time_total            0 days 00:09:39.646933
loss_total              968655035165654254616576.0
loss_critic            1210818775420251767046144.0
loss_actor                   -2555908574412.799805
memory_size                                 1788.0 

=== epoch 6/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       694
episode_length                           14.387608
returns                                  68.654394
return_std                                4.203629
average_reward                             4.77177
round_time                  0 days 00:09:40.131759
episodes_test                                697.0
episode_length_test                      14.329986
returns_test                             68.332635
return_std_test                           2.768159
average_reward_test                       4.768499
round_time_test             0 days 00:00:12.149906
round_time_total            0 days 00:09:40.133044
loss_total              976590886471723749736448.0
loss_critic            1220738588751197955424256.0
loss_actor                   -2567904382025.728027
memory_size                                 1788.0 

=== epoch 6/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.382734
returns                                  68.617539
return_std                                3.500676
average_reward                            4.770853
round_time                  0 days 00:09:40.751509
episodes_test                                697.0
episode_length_test                      14.332855
returns_test                             68.353455
return_std_test                           2.867337
average_reward_test                       4.769018
round_time_test             0 days 00:00:12.082809
round_time_total            0 days 00:09:40.752586
loss_total              998755770061945026117632.0
loss_critic            1248444692802125307052032.0
loss_actor                   -2589226208854.016113
memory_size                                 1788.0 

=== epoch 6/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       697
episode_length                           14.341463
returns                                  68.408649
return_std                                2.631054
average_reward                             4.77001
round_time                  0 days 00:09:41.586620
episodes_test                                699.0
episode_length_test                      14.293276
returns_test                             68.159592
return_std_test                           2.790353
average_reward_test                       4.768654
round_time_test             0 days 00:00:12.454293
round_time_total            0 days 00:09:41.587692
loss_total             1014653089635153305665536.0
loss_critic            1268316340669857736425472.0
loss_actor                   -2614639269511.167969
memory_size                                 1788.0 

=== epoch 6/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.312321
returns                                  68.259216
return_std                                2.682473
average_reward                            4.769097
round_time                  0 days 00:09:40.709814
episodes_test                                698.0
episode_length_test                      14.308023
returns_test                             68.221562
return_std_test                           2.658109
average_reward_test                       4.768076
round_time_test             0 days 00:00:12.374295
round_time_total            0 days 00:09:40.711112
loss_total             1039535763735095581605888.0
loss_critic            1299419682353533398548480.0
loss_actor                   -2632502955999.231934
memory_size                                 1788.0 

=== epoch 6/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.312769
returns                                  68.262755
return_std                                2.726317
average_reward                            4.769315
round_time                  0 days 00:09:40.979702
episodes_test                                698.0
episode_length_test                       14.32235
returns_test                             68.287631
return_std_test                           2.918739
average_reward_test                       4.767939
round_time_test             0 days 00:00:12.229807
round_time_total            0 days 00:09:40.980919
loss_total             1052709899395755625414656.0
loss_critic            1315887352537344225837056.0
loss_actor                    -2658055374372.86377
memory_size                                 1788.0 

=== epoch 6/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:13,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.312321
returns                                   68.24758
return_std                                2.790094
average_reward                             4.76846
round_time                  0 days 00:09:40.637700
episodes_test                                697.0
episode_length_test                      14.329986
returns_test                             68.345122
return_std_test                           2.825878
average_reward_test                       4.769391
round_time_test             0 days 00:00:12.160547
round_time_total            0 days 00:09:40.638761
loss_total             1069883950966553790382080.0
loss_critic            1337354916865734139183104.0
loss_actor                   -2678088485699.583984
memory_size                                 1788.0 

=== epoch 6/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:42<00:00,  3.44it/s]
episodes                                       697
episode_length                           14.319943
returns                                  68.271622
return_std                                2.922558
average_reward                            4.767587
round_time                  0 days 00:09:42.648765
episodes_test                                698.0
episode_length_test                      14.312321
returns_test                             68.248728
return_std_test                           2.938792
average_reward_test                        4.76855
round_time_test             0 days 00:00:12.314160
round_time_total            0 days 00:09:42.649833
loss_total             1087339289641900597837824.0
loss_critic            1359174090295485845733376.0
loss_actor                   -2699971266740.224121
memory_size                                 1788.0 

=== epoch 6/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.305158
returns                                  68.202579
return_std                                 2.90073
average_reward                            4.767722
round_time                  0 days 00:09:40.036804
episodes_test                                699.0
episode_length_test                      14.296137
returns_test                             68.170239
return_std_test                           2.788219
average_reward_test                       4.768449
round_time_test             0 days 00:00:12.461785
round_time_total            0 days 00:09:40.038072
loss_total             1097913915135696123723776.0
loss_critic            1372392371915032342036480.0
loss_actor                   -2715072530350.080078
memory_size                                 1788.0 

=== epoch 6/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       698
episode_length                           14.308023
returns                                  68.219492
return_std                                2.890215
average_reward                            4.767911
round_time                  0 days 00:09:41.988324
episodes_test                                699.0
episode_length_test                      14.306152
returns_test                             68.204351
return_std_test                           2.839377
average_reward_test                       4.767484
round_time_test             0 days 00:00:12.303502
round_time_total            0 days 00:09:41.989417
loss_total             1118814334713856419430400.0
loss_critic            1398517894406148886364160.0
loss_actor                   -2738729657171.967773
memory_size                                 1788.0 

=== epoch 6/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.293276
returns                                  68.142896
return_std                                2.881625
average_reward                             4.76749
round_time                  0 days 00:09:40.823971
episodes_test                                699.0
episode_length_test                      14.288984
returns_test                             68.156036
return_std_test                           3.043765
average_reward_test                       4.769844
round_time_test             0 days 00:00:12.161754
round_time_total            0 days 00:09:40.825046
loss_total             1137357256544315406024704.0
loss_critic            1421696548090338481274880.0
loss_actor                   -2761756825223.167969
memory_size                                 1788.0 

=== epoch 6/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       699
episode_length                           14.270386
returns                                  68.044696
return_std                                2.893399
average_reward                            4.768253
round_time                  0 days 00:09:42.177266
episodes_test                                702.0
episode_length_test                      14.240741
returns_test                             67.913064
return_std_test                            2.79948
average_reward_test                       4.768954
round_time_test             0 days 00:00:11.972612
round_time_total            0 days 00:09:42.178329
loss_total             1158472323030992642113536.0
loss_critic            1448090378365920757153792.0
loss_actor                   -2778622077698.047852
memory_size                                 1788.0 

=== epoch 6/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       700
episode_length                           14.268571
returns                                  68.038211
return_std                                2.837112
average_reward                            4.768411
round_time                  0 days 00:09:40.936459
episodes_test                                700.0
episode_length_test                      14.275714
returns_test                              68.08464
return_std_test                            2.85781
average_reward_test                       4.769264
round_time_test             0 days 00:00:12.448048
round_time_total            0 days 00:09:40.937603
loss_total             1172265606596851188891648.0
loss_critic            1465331982742179184377856.0
loss_actor                   -2795417610223.616211
memory_size                                 1788.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:01<10:17,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       139
episode_length                           14.316547
returns                                    68.2706
return_std                                3.013828
average_reward                            4.768753
round_time                  0 days 00:09:40.051955
episodes_test                                691.0
episode_length_test                      14.460203
returns_test                             68.983557
return_std_test                           7.551268
average_reward_test                       4.770604
round_time_test             0 days 00:00:12.422062
round_time_total            0 days 00:09:40.053045
loss_total             1186663978100087844765696.0
loss_critic            1483329945923267491528704.0
loss_actor                   -2820332743557.120117
memory_size                                 1788.0 

=== epoch 7/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       278
episode_length                           14.341727
returns                                  68.386404
return_std                                3.119852
average_reward                            4.768398
round_time                  0 days 00:09:40.687922
episodes_test                                699.0
episode_length_test                      14.298999
returns_test                             68.191915
return_std_test                           2.973366
average_reward_test                       4.769028
round_time_test             0 days 00:00:12.391766
round_time_total            0 days 00:09:40.689331
loss_total             1193008389541544031420416.0
loss_critic            1491260462085074482888704.0
loss_actor                   -2842833316085.759766
memory_size                                 1788.0 

=== epoch 7/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:01,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       418
episode_length                           14.339713
returns                                   68.39295
return_std                                3.051112
average_reward                            4.769508
round_time                  0 days 00:09:41.354424
episodes_test                                698.0
episode_length_test                      14.320917
returns_test                             68.296986
return_std_test                           2.623763
average_reward_test                       4.769056
round_time_test             0 days 00:00:12.291003
round_time_total            0 days 00:09:41.355505
loss_total             1218546930835598744748032.0
loss_critic            1523183635838353679581184.0
loss_actor                   -2859346931744.768066
memory_size                                 1788.0 

=== epoch 7/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       558
episode_length                           14.318996
returns                                  68.289549
return_std                                2.945864
average_reward                            4.769151
round_time                  0 days 00:09:41.279242
episodes_test                                700.0
episode_length_test                      14.282857
returns_test                             68.116304
return_std_test                           2.717856
average_reward_test                       4.769124
round_time_test             0 days 00:00:12.227512
round_time_total            0 days 00:09:41.280315
loss_total             1225321571419673757483008.0
loss_critic            1531651937604275110150144.0
loss_actor                   -2873746022727.680176
memory_size                                 1788.0 

=== epoch 7/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       698
episode_length                           14.309456
returns                                   68.23659
return_std                                2.917933
average_reward                            4.768668
round_time                  0 days 00:09:41.153293
episodes_test                                702.0
episode_length_test                       14.24359
returns_test                              67.94695
return_std_test                           2.553775
average_reward_test                        4.77037
round_time_test             0 days 00:00:12.506836
round_time_total            0 days 00:09:41.154362
loss_total             1253185678426862537146368.0
loss_critic            1566482070777793087537152.0
loss_actor                   -2895565161103.359863
memory_size                                 1788.0 

=== epoch 7/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:31,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.293276
returns                                  68.170301
return_std                                2.775876
average_reward                            4.769405
round_time                  0 days 00:09:40.959504
episodes_test                                701.0
episode_length_test                      14.253923
returns_test                             67.979092
return_std_test                           2.691004
average_reward_test                       4.769167
round_time_test             0 days 00:00:12.432759
round_time_total            0 days 00:09:40.960844
loss_total             1263444886484491641552896.0
loss_critic            1579306079147468982845440.0
loss_actor                   -2916566655565.824219
memory_size                                 1788.0 

=== epoch 7/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       701
episode_length                           14.262482
returns                                  68.027751
return_std                                2.567656
average_reward                            4.769714
round_time                  0 days 00:09:40.518802
episodes_test                                703.0
episode_length_test                      14.217639
returns_test                             67.823801
return_std_test                           2.537831
average_reward_test                       4.770421
round_time_test             0 days 00:00:12.143641
round_time_total            0 days 00:09:40.519881
loss_total             1296155328219620973740032.0
loss_critic            1620194130496725264105472.0
loss_actor                   -2953550120747.007812
memory_size                                 1788.0 

=== epoch 7/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       702
episode_length                           14.232194
returns                                  67.884593
return_std                                2.486993
average_reward                            4.769792
round_time                  0 days 00:09:40.983377
episodes_test                                701.0
episode_length_test                      14.249643
returns_test                             67.960838
return_std_test                           2.614653
average_reward_test                       4.769297
round_time_test             0 days 00:00:12.189086
round_time_total            0 days 00:09:40.984464
loss_total             1315101044092430977073152.0
loss_critic            1643876275486356588724224.0
loss_actor                        -2971328757760.0
memory_size                                 1788.0 

=== epoch 7/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       702
episode_length                           14.222222
returns                                  67.831569
return_std                                2.607788
average_reward                            4.769411
round_time                  0 days 00:09:41.487470
episodes_test                                706.0
episode_length_test                      14.151558
returns_test                             67.502746
return_std_test                           2.657141
average_reward_test                       4.770005
round_time_test             0 days 00:00:12.438196
round_time_total            0 days 00:09:41.488695
loss_total             1328335966026211667214336.0
loss_critic            1660419927160488818900992.0
loss_actor                   -2986660737843.200195
memory_size                                 1788.0 

=== epoch 7/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       703
episode_length                           14.216216
returns                                  67.808825
return_std                                2.548836
average_reward                            4.769846
round_time                  0 days 00:09:40.103945
episodes_test                                702.0
episode_length_test                      14.230769
returns_test                              67.86425
return_std_test                           2.694284
average_reward_test                        4.76886
round_time_test             0 days 00:00:12.102688
round_time_total            0 days 00:09:40.105032
loss_total             1355047306920995179200512.0
loss_critic            1693809103116838427951104.0
loss_actor                   -3012596608663.551758
memory_size                                 1788.0 

=== epoch 7/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       700
episode_length                           14.257143
returns                                  67.976997
return_std                                2.873694
average_reward                            4.767916
round_time                  0 days 00:09:42.054124
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.101711
return_std_test                           2.633175
average_reward_test                        4.76761
round_time_test             0 days 00:00:12.190371
round_time_total            0 days 00:09:42.055217
loss_total             1361417884582531112632320.0
loss_critic            1701772324815456148914176.0
loss_actor                   -3030693844877.312012
memory_size                                 1788.0 

=== epoch 7/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.274678
returns                                  68.054446
return_std                                2.942978
average_reward                            4.767512
round_time                  0 days 00:09:40.945471
episodes_test                                696.0
episode_length_test                      14.367816
returns_test                             68.490896
return_std_test                           3.007797
average_reward_test                       4.766966
round_time_test             0 days 00:00:12.374828
round_time_total            0 days 00:09:40.946839
loss_total             1402548768906802387484672.0
loss_critic            1753185931355702299721728.0
loss_actor                   -3054024543436.799805
memory_size                                 1788.0 

=== epoch 7/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       698
episode_length                           14.297994
returns                                  68.152292
return_std                                3.005179
average_reward                            4.766594
round_time                  0 days 00:09:42.148570
episodes_test                                700.0
episode_length_test                      14.281429
returns_test                             68.088504
return_std_test                            2.76553
average_reward_test                       4.767654
round_time_test             0 days 00:00:12.236194
round_time_total            0 days 00:09:42.149644
loss_total             1410839904215501276971008.0
loss_critic            1763549845438537022832640.0
loss_actor                   -3080983285727.231934
memory_size                                 1788.0 

=== epoch 7/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       698
episode_length                           14.319484
returns                                  68.269104
return_std                                2.905527
average_reward                            4.767585
round_time                  0 days 00:09:41.913645
episodes_test                                697.0
episode_length_test                      14.328551
returns_test                             68.306401
return_std_test                           2.851545
average_reward_test                       4.767154
round_time_test             0 days 00:00:12.138697
round_time_total            0 days 00:09:41.914716
loss_total             1432045578280807851622400.0
loss_critic            1790056942325611620204544.0
loss_actor                   -3096912455925.759766
memory_size                                 1788.0 

=== epoch 7/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.318508
returns                                  68.264948
return_std                                 2.91315
average_reward                            4.767619
round_time                  0 days 00:09:40.741284
episodes_test                                701.0
episode_length_test                      14.262482
returns_test                             68.022869
return_std_test                           2.654791
average_reward_test                       4.769384
round_time_test             0 days 00:00:12.178884
round_time_total            0 days 00:09:40.742500
loss_total             1443482144842996688879616.0
loss_critic            1804352646961496512790528.0
loss_actor                   -3120735539953.664062
memory_size                                 1788.0 

=== epoch 7/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                               14.26
returns                                  68.008452
return_std                                2.734566
average_reward                            4.769153
round_time                  0 days 00:09:39.925692
episodes_test                                702.0
episode_length_test                      14.237892
returns_test                             67.898671
return_std_test                           2.780225
average_reward_test                       4.768895
round_time_test             0 days 00:00:12.173404
round_time_total            0 days 00:09:39.926775
loss_total             1486582165234085011652608.0
loss_critic            1858227672099076104519680.0
loss_actor                   -3141127756840.959961
memory_size                                 1788.0 

=== epoch 7/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       701
episode_length                           14.241084
returns                                  67.918178
return_std                                2.746364
average_reward                            4.769172
round_time                  0 days 00:09:41.726138
episodes_test                                702.0
episode_length_test                       14.22792
returns_test                              67.86207
return_std_test                            2.62542
average_reward_test                       4.769663
round_time_test             0 days 00:00:11.999392
round_time_total            0 days 00:09:41.727215
loss_total             1499025311425062296879104.0
loss_critic            1873781604540560539910144.0
loss_actor                   -3161169026088.959961
memory_size                                 1788.0 

=== epoch 7/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       701
episode_length                            14.24679
returns                                  67.949016
return_std                                2.679335
average_reward                            4.769431
round_time                  0 days 00:09:39.688612
episodes_test                                701.0
episode_length_test                      14.263909
returns_test                             68.024303
return_std_test                           2.688298
average_reward_test                       4.768996
round_time_test             0 days 00:00:12.226099
round_time_total            0 days 00:09:39.689872
loss_total             1523819911887698249383936.0
loss_critic            1904774855082826543398912.0
loss_actor                   -3188609653145.600098
memory_size                                 1788.0 

=== epoch 7/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.264286
returns                                  68.040017
return_std                                2.937692
average_reward                            4.769665
round_time                  0 days 00:09:40.071397
episodes_test                                700.0
episode_length_test                      14.275714
returns_test                             68.067197
return_std_test                           2.843774
average_reward_test                       4.768066
round_time_test             0 days 00:00:12.138972
round_time_total            0 days 00:09:40.072452
loss_total             1547743038332466009997312.0
loss_critic            1934678761796713452666880.0
loss_actor                    -3211327278415.87207
memory_size                                 1788.0 

=== epoch 7/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.293276
returns                                  68.179674
return_std                                3.007726
average_reward                            4.770068
round_time                  0 days 00:09:40.112456
episodes_test                                692.0
episode_length_test                      14.434971
returns_test                             68.894642
return_std_test                           6.608565
average_reward_test                       4.772764
round_time_test             0 days 00:00:12.544573
round_time_total            0 days 00:09:40.113536
loss_total             1589949771610962878529536.0
loss_critic            1987437177485107449036800.0
loss_actor                    -3239528174452.73584
memory_size                                 1788.0 

=== epoch 7/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.312321
returns                                  68.270191
return_std                                3.091997
average_reward                            4.770042
round_time                  0 days 00:09:38.399186
episodes_test                                704.0
episode_length_test                      14.193182
returns_test                             67.702391
return_std_test                            2.76602
average_reward_test                       4.770089
round_time_test             0 days 00:00:12.231836
round_time_total            0 days 00:09:38.400250
loss_total             1604531825991852725633024.0
loss_critic            2005664747902170669514752.0
loss_actor                   -3261631030755.328125
memory_size                                 1788.0 

=== epoch 7/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.315186
returns                                  68.298673
return_std                                3.019142
average_reward                             4.77107
round_time                  0 days 00:09:39.143508
episodes_test                                702.0
episode_length_test                      14.245014
returns_test                             67.960219
return_std_test                           2.574936
average_reward_test                       4.770807
round_time_test             0 days 00:00:12.272341
round_time_total            0 days 00:09:39.144644
loss_total             1610940405927764283097088.0
loss_critic            2013675470471181245612032.0
loss_actor                   -3275909177802.751953
memory_size                                 1788.0 

=== epoch 7/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       697
episode_length                           14.324247
returns                                  68.342926
return_std                                3.069914
average_reward                            4.771129
round_time                  0 days 00:09:39.352242
episodes_test                                701.0
episode_length_test                      14.259629
returns_test                             68.027238
return_std_test                           3.145412
average_reward_test                       4.770633
round_time_test             0 days 00:00:12.217419
round_time_total            0 days 00:09:39.353642
loss_total             1651406237209878572564480.0
loss_critic            2064257761591436467240960.0
loss_actor                   -3302290054184.959961
memory_size                                 1788.0 

=== epoch 7/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.319943
returns                                   68.31008
return_std                                3.160485
average_reward                            4.770317
round_time                  0 days 00:09:40.468213
episodes_test                                696.0
episode_length_test                      14.360632
returns_test                             68.501543
return_std_test                           3.498593
average_reward_test                        4.77012
round_time_test             0 days 00:00:12.213242
round_time_total            0 days 00:09:40.469283
loss_total             1683268746035024971694080.0
loss_critic            2104085897397689766641664.0
loss_actor                    -3330215776747.52002
memory_size                                 1788.0 

=== epoch 7/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       696
episode_length                            14.33046
returns                                  68.348605
return_std                                3.117366
average_reward                            4.769405
round_time                  0 days 00:09:39.376569
episodes_test                                697.0
episode_length_test                      14.347202
returns_test                             68.439985
return_std_test                            2.71488
average_reward_test                       4.770267
round_time_test             0 days 00:00:12.189494
round_time_total            0 days 00:09:39.377937
loss_total             1682035579464315377287168.0
loss_critic            2102544442102635340759040.0
loss_actor                   -3349656409210.879883
memory_size                                 1788.0 

=== epoch 7/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:38,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       695
episode_length                           14.381295
returns                                  68.585087
return_std                                  3.1373
average_reward                            4.769037
round_time                  0 days 00:09:41.200001
episodes_test                                688.0
episode_length_test                       14.53343
returns_test                             69.320804
return_std_test                            3.30219
average_reward_test                       4.769761
round_time_test             0 days 00:00:12.051318
round_time_total            0 days 00:09:41.201182
loss_total             1716356981466282651025408.0
loss_critic            2145446191646229454127104.0
loss_actor                   -3379227915059.200195
memory_size                                 1788.0 

=== epoch 7/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:38,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       692
episode_length                           14.419075
returns                                  68.753785
return_std                                3.286054
average_reward                            4.768231
round_time                  0 days 00:09:39.831266
episodes_test                                695.0
episode_length_test                      14.385612
returns_test                             68.605895
return_std_test                           3.289662
average_reward_test                       4.769095
round_time_test             0 days 00:00:12.211929
round_time_total            0 days 00:09:39.832338
loss_total             1735600383328906888871936.0
loss_critic            2169500443330494840438784.0
loss_actor                   -3398192401481.728027
memory_size                                 1788.0 

=== epoch 7/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       692
episode_length                           14.429191
returns                                  68.807281
return_std                                3.406259
average_reward                            4.768609
round_time                  0 days 00:09:38.117070
episodes_test                                694.0
episode_length_test                       14.39049
returns_test                             68.641492
return_std_test                           3.168962
average_reward_test                       4.769924
round_time_test             0 days 00:00:12.032779
round_time_total            0 days 00:09:38.118161
loss_total             1732507223843082241835008.0
loss_critic            2165633992225817727336448.0
loss_actor                   -3415738603274.240234
memory_size                                 1788.0 

=== epoch 7/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       692
episode_length                           14.430636
returns                                  68.821833
return_std                                3.181649
average_reward                            4.769159
round_time                  0 days 00:09:40.558266
episodes_test                                693.0
episode_length_test                      14.422799
returns_test                             68.807588
return_std_test                           4.705388
average_reward_test                       4.770765
round_time_test             0 days 00:00:12.123522
round_time_total            0 days 00:09:40.559314
loss_total             1775460054348557249937408.0
loss_critic            2219325030799013870305280.0
loss_actor                   -3448458291052.543945
memory_size                                 1788.0 

=== epoch 7/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       693
episode_length                           14.425685
returns                                  68.809904
return_std                                3.289415
average_reward                            4.769966
round_time                  0 days 00:09:40.894597
episodes_test                                700.0
episode_length_test                      14.274286
returns_test                             68.085525
return_std_test                           2.901004
average_reward_test                       4.769812
round_time_test             0 days 00:00:12.138660
round_time_total            0 days 00:09:40.895667
loss_total             1806458881067737996591104.0
loss_critic            2258073563306277149343744.0
loss_actor                   -3471850023682.047852
memory_size                                 1788.0 

=== epoch 7/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.384173
returns                                  68.619388
return_std                                3.229183
average_reward                            4.770508
round_time                  0 days 00:09:39.820852
episodes_test                                700.0
episode_length_test                      14.277143
returns_test                              68.08407
return_std_test                           3.266779
average_reward_test                       4.768785
round_time_test             0 days 00:00:12.338401
round_time_total            0 days 00:09:39.822036
loss_total             1841097924707626631823360.0
loss_critic            2301372368693807695265792.0
loss_actor                   -3503659902566.399902
memory_size                                 1788.0 

=== epoch 7/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.349138
returns                                  68.455588
return_std                                3.108592
average_reward                            4.770729
round_time                  0 days 00:09:39.830035
episodes_test                                700.0
episode_length_test                      14.267143
returns_test                              68.05676
return_std_test                           2.734772
average_reward_test                       4.770196
round_time_test             0 days 00:00:12.195643
round_time_total            0 days 00:09:39.831093
loss_total             1843944532362036363919360.0
loss_critic            2304930630117302909009920.0
loss_actor                   -3510848262766.591797
memory_size                                 1788.0 

=== epoch 7/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       697
episode_length                           14.340029
returns                                  68.407169
return_std                                2.950281
average_reward                            4.770351
round_time                  0 days 00:09:38.617704
episodes_test                                702.0
episode_length_test                      14.245014
returns_test                             67.935986
return_std_test                           2.729707
average_reward_test                       4.769106
round_time_test             0 days 00:00:12.389527
round_time_total            0 days 00:09:38.618764
loss_total             1865686969714342233112576.0
loss_critic            2332108673024661609512960.0
loss_actor                   -3537912200298.496094
memory_size                                 1788.0 

=== epoch 7/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.324247
returns                                  68.329169
return_std                                2.918241
average_reward                            4.770165
round_time                  0 days 00:09:39.726272
episodes_test                                702.0
episode_length_test                       14.24359
returns_test                             67.949281
return_std_test                           2.598026
average_reward_test                       4.770533
round_time_test             0 days 00:00:12.188630
round_time_total            0 days 00:09:39.727332
loss_total             1892068332657061838979072.0
loss_critic            2365085377189449520119808.0
loss_actor                   -3562805083897.855957
memory_size                                 1788.0 

=== epoch 7/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.311334
returns                                  68.267966
return_std                                2.695281
average_reward                            4.770171
round_time                  0 days 00:09:39.435136
episodes_test                                700.0
episode_length_test                      14.267143
returns_test                             68.051577
return_std_test                           2.723378
average_reward_test                       4.769821
round_time_test             0 days 00:00:12.094692
round_time_total            0 days 00:09:39.436200
loss_total             1931987014847032742379520.0
loss_critic            2414983730701532580020224.0
loss_actor                   -3585855763251.200195
memory_size                                 1788.0 

=== epoch 7/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       697
episode_length                           14.322812
returns                                  68.318822
return_std                                 2.68892
average_reward                            4.769922
round_time                  0 days 00:09:39.107971
episodes_test                                699.0
episode_length_test                      14.291845
returns_test                             68.168022
return_std_test                           2.718171
average_reward_test                       4.769727
round_time_test             0 days 00:00:12.343291
round_time_total            0 days 00:09:39.109045
loss_total             1952550089232657340694528.0
loss_critic            2440687573161145383518208.0
loss_actor                   -3615044046290.943848
memory_size                                 1788.0 

=== epoch 7/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.368345
returns                                  68.530286
return_std                                2.700988
average_reward                            4.769519
round_time                  0 days 00:09:40.149172
episodes_test                                698.0
episode_length_test                      14.325215
returns_test                              68.33655
return_std_test                           2.674151
average_reward_test                       4.770379
round_time_test             0 days 00:00:11.931217
round_time_total            0 days 00:09:40.150235
loss_total             1982358318449641213919232.0
loss_critic            2477947860294865277419520.0
loss_actor                    -3641687261577.21582
memory_size                                 1788.0 

=== epoch 7/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.350575
returns                                  68.451284
return_std                                2.652992
average_reward                            4.769927
round_time                  0 days 00:09:39.542572
episodes_test                                697.0
episode_length_test                       14.33429
returns_test                             68.371641
return_std_test                           2.657821
average_reward_test                       4.769818
round_time_test             0 days 00:00:12.037747
round_time_total            0 days 00:09:39.543739
loss_total             2021943076777669974556672.0
loss_critic            2527428804484927980568576.0
loss_actor                   -3662152437727.231934
memory_size                                 1788.0 

=== epoch 7/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.347701
returns                                  68.442261
return_std                                2.523823
average_reward                            4.770257
round_time                  0 days 00:09:40.615118
episodes_test                                697.0
episode_length_test                      14.328551
returns_test                             68.330284
return_std_test                           2.578574
average_reward_test                       4.768825
round_time_test             0 days 00:00:12.151672
round_time_total            0 days 00:09:40.616181
loss_total             2062787806056555623743488.0
loss_critic            2578484716696024222007296.0
loss_actor                   -3694961240965.120117
memory_size                                 1788.0 

=== epoch 7/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                       696
episode_length                           14.346264
returns                                  68.440299
return_std                                2.516856
average_reward                              4.7706
round_time                  0 days 00:09:41.614975
episodes_test                                697.0
episode_length_test                      14.340029
returns_test                             68.417787
return_std_test                           2.571493
average_reward_test                       4.771125
round_time_test             0 days 00:00:12.117346
round_time_total            0 days 00:09:41.616339
loss_total             2092362630278099813531648.0
loss_critic            2615453245261586689425408.0
loss_actor                   -3716624923492.352051
memory_size                                 1788.0 

=== epoch 7/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.341954
returns                                  68.420692
return_std                                  2.4926
average_reward                            4.770675
round_time                  0 days 00:09:39.382957
episodes_test                                694.0
episode_length_test                      14.409222
returns_test                             68.725172
return_std_test                           2.627944
average_reward_test                       4.769527
round_time_test             0 days 00:00:12.279768
round_time_total            0 days 00:09:39.384015
loss_total             2107870566231610480394240.0
loss_critic            2634838165446669627293696.0
loss_actor                   -3740080284106.751953
memory_size                                 1788.0 

=== epoch 7/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.288984
returns                                  68.179617
return_std                                2.444579
average_reward                            4.771478
round_time                  0 days 00:09:40.430323
episodes_test                                700.0
episode_length_test                      14.275714
returns_test                             68.108424
return_std_test                           2.550539
average_reward_test                       4.770961
round_time_test             0 days 00:00:12.094116
round_time_total            0 days 00:09:40.431720
loss_total             2140587717213291771068416.0
loss_critic            2675734601381538980429824.0
loss_actor                   -3770104082989.056152
memory_size                                 1788.0 

=== epoch 7/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.296137
returns                                  68.205264
return_std                                2.528351
average_reward                            4.770902
round_time                  0 days 00:09:39.057565
episodes_test                                699.0
episode_length_test                      14.306152
returns_test                             68.228672
return_std_test                           2.769076
average_reward_test                       4.769184
round_time_test             0 days 00:00:12.327621
round_time_total            0 days 00:09:39.058641
loss_total             2201984555297256291434496.0
loss_critic            2752480643636218570997760.0
loss_actor                   -3796550922665.983887
memory_size                                 1788.0 

=== epoch 7/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.293696
returns                                  68.192326
return_std                                2.628776
average_reward                            4.770797
round_time                  0 days 00:09:40.316724
episodes_test                                696.0
episode_length_test                      14.353448
returns_test                             68.459664
return_std_test                            2.67061
average_reward_test                       4.769603
round_time_test             0 days 00:00:12.133203
round_time_total            0 days 00:09:40.317780
loss_total             2237416422235235931914240.0
loss_critic            2796770477362736359538688.0
loss_actor                   -3822050925346.815918
memory_size                                 1788.0 

=== epoch 7/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.308023
returns                                   68.25449
return_std                                2.708737
average_reward                            4.770383
round_time                  0 days 00:09:40.506803
episodes_test                                696.0
episode_length_test                      14.347701
returns_test                             68.430359
return_std_test                           2.682245
average_reward_test                       4.769459
round_time_test             0 days 00:00:12.289085
round_time_total            0 days 00:09:40.507867
loss_total             2251470199083602062344192.0
loss_critic            2814337700711022504968192.0
loss_actor                   -3848564634812.416016
memory_size                                 1788.0 

=== epoch 7/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.312321
returns                                  68.273409
return_std                                2.818358
average_reward                            4.770263
round_time                  0 days 00:09:40.020395
episodes_test                                698.0
episode_length_test                      14.325215
returns_test                             68.322891
return_std_test                           2.631818
average_reward_test                       4.769431
round_time_test             0 days 00:00:12.243056
round_time_total            0 days 00:09:40.021448
loss_total             2265702200807160905990144.0
loss_critic            2832127701856664913182720.0
loss_actor                    -3866398981750.78418
memory_size                                 1788.0 

=== epoch 7/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:32,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                            14.33429
returns                                   68.37225
return_std                                2.988266
average_reward                            4.769853
round_time                  0 days 00:09:40.082617
episodes_test                                701.0
episode_length_test                      14.258203
returns_test                             68.009193
return_std_test                           2.779966
average_reward_test                        4.76985
round_time_test             0 days 00:00:12.330966
round_time_total            0 days 00:09:40.084038
loss_total             2275523797691878443319296.0
loss_critic            2844404697683338360520704.0
loss_actor                   -3876711249084.416016
memory_size                                 1788.0 

=== epoch 7/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.309456
returns                                  68.262036
return_std                                3.029466
average_reward                            4.770422
round_time                  0 days 00:09:40.129484
episodes_test                                698.0
episode_length_test                      14.308023
returns_test                             68.243258
return_std_test                           2.523944
average_reward_test                       4.769605
round_time_test             0 days 00:00:11.950220
round_time_total            0 days 00:09:40.130545
loss_total             2260229551379762078810112.0
loss_critic            2825286890027380157448192.0
loss_actor                   -3893130115940.352051
memory_size                                 1788.0 

=== epoch 7/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       699
episode_length                           14.300429
returns                                  68.213388
return_std                                3.011562
average_reward                            4.770033
round_time                  0 days 00:09:41.262094
episodes_test                                701.0
episode_length_test                      14.262482
returns_test                             68.023849
return_std_test                           2.697915
average_reward_test                       4.769451
round_time_test             0 days 00:00:12.045111
round_time_total            0 days 00:09:41.263155
loss_total             2277099627053362502959104.0
loss_critic            2846374489654405342691328.0
loss_actor                    -3911144849801.21582
memory_size                                 1788.0 

=== epoch 7/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:14,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       699
episode_length                           14.281831
returns                                  68.128541
return_std                                2.910973
average_reward                            4.770132
round_time                  0 days 00:09:40.337339
episodes_test                                697.0
episode_length_test                      14.332855
returns_test                             68.351707
return_std_test                           2.566834
average_reward_test                       4.768894
round_time_test             0 days 00:00:12.149483
round_time_total            0 days 00:09:40.338396
loss_total             2326764393480465292460032.0
loss_critic            2908455440230322563710976.0
loss_actor                   -3934046513790.976074
memory_size                                 1788.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       140
episode_length                           14.264286
returns                                  68.054344
return_std                                2.562492
average_reward                              4.7711
round_time                  0 days 00:09:38.466028
episodes_test                                696.0
episode_length_test                      14.364943
returns_test                             68.523331
return_std_test                           3.543248
average_reward_test                       4.770204
round_time_test             0 days 00:00:12.241561
round_time_total            0 days 00:09:38.467276
loss_total             2372336378241889557544960.0
loss_critic            2965420418867252693041152.0
loss_actor                   -3973655066968.063965
memory_size                                 1788.0 

=== epoch 8/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       279
episode_length                           14.326165
returns                                  68.351709
return_std                                 2.59309
average_reward                            4.771167
round_time                  0 days 00:09:39.802199
episodes_test                                698.0
episode_length_test                      14.310888
returns_test                             68.261237
return_std_test                           3.055071
average_reward_test                       4.769874
round_time_test             0 days 00:00:12.289123
round_time_total            0 days 00:09:39.803286
loss_total             2418316681359125993488384.0
loss_critic            3022895795710157008142336.0
loss_actor                   -4000209988616.191895
memory_size                                 1788.0 

=== epoch 8/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       418
episode_length                           14.351675
returns                                  68.461232
return_std                                 2.62911
average_reward                            4.770287
round_time                  0 days 00:09:39.125155
episodes_test                                698.0
episode_length_test                      14.326648
returns_test                             68.336494
return_std_test                           2.792313
average_reward_test                       4.769887
round_time_test             0 days 00:00:12.259162
round_time_total            0 days 00:09:39.126230
loss_total             2458113041099474341986304.0
loss_critic            3072641244809131328536576.0
loss_actor                   -4019103628394.496094
memory_size                                 1788.0 

=== epoch 8/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       558
episode_length                           14.335125
returns                                  68.383081
return_std                                2.611586
average_reward                            4.770332
round_time                  0 days 00:09:40.364517
episodes_test                                693.0
episode_length_test                      14.412698
returns_test                             68.721705
return_std_test                           3.479852
average_reward_test                       4.768124
round_time_test             0 days 00:00:12.167283
round_time_total            0 days 00:09:40.365586
loss_total             2475355776707901639360512.0
loss_critic            3094194664283636982874112.0
loss_actor                   -4049498342424.576172
memory_size                                 1788.0 

=== epoch 8/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:13,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                            14.33142
returns                                   68.36142
return_std                                2.678247
average_reward                            4.770054
round_time                  0 days 00:09:40.411815
episodes_test                                695.0
episode_length_test                      14.369784
returns_test                             68.525714
return_std_test                           2.929302
average_reward_test                       4.768754
round_time_test             0 days 00:00:12.342634
round_time_total            0 days 00:09:40.413009
loss_total             2520942568476238298480640.0
loss_critic            3151178155020878455242752.0
loss_actor                   -4076509030252.543945
memory_size                                 1788.0 

=== epoch 8/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.343391
returns                                  68.407321
return_std                                2.859062
average_reward                            4.769268
round_time                  0 days 00:09:39.754533
episodes_test                                697.0
episode_length_test                      14.347202
returns_test                             68.423628
return_std_test                           2.815545
average_reward_test                       4.769127
round_time_test             0 days 00:00:12.349270
round_time_total            0 days 00:09:39.755612
loss_total             2525428062732386624864256.0
loss_critic            3156785024435338275192832.0
loss_actor                   -4097879970021.375977
memory_size                                 1788.0 

=== epoch 8/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.346264
returns                                  68.406591
return_std                                2.939632
average_reward                            4.768246
round_time                  0 days 00:09:39.495313
episodes_test                                698.0
episode_length_test                      14.315186
returns_test                             68.256008
return_std_test                           2.826529
average_reward_test                       4.768096
round_time_test             0 days 00:00:12.127966
round_time_total            0 days 00:09:39.496383
loss_total             2556757097159270013599744.0
loss_critic            3195946313577832063696896.0
loss_actor                   -4123146208018.432129
memory_size                                 1788.0 

=== epoch 8/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.341954
returns                                   68.39181
return_std                                2.898299
average_reward                            4.768657
round_time                  0 days 00:09:39.721083
episodes_test                                700.0
episode_length_test                      14.278571
returns_test                             68.098711
return_std_test                           2.501692
average_reward_test                       4.769319
round_time_test             0 days 00:00:12.140534
round_time_total            0 days 00:09:39.722149
loss_total             2592928324191386800750592.0
loss_critic            3241160343557932960448512.0
loss_actor                   -4149160643264.512207
memory_size                                 1788.0 

=== epoch 8/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.368345
returns                                  68.509403
return_std                                3.162308
average_reward                            4.768049
round_time                  0 days 00:09:39.935830
episodes_test                                696.0
episode_length_test                      14.354885
returns_test                             68.481182
return_std_test                           3.085629
average_reward_test                       4.770594
round_time_test             0 days 00:00:11.986195
round_time_total            0 days 00:09:39.936888
loss_total             2646045293207508970635264.0
loss_critic            3307556555314474747166720.0
loss_actor                   -4193035024334.848145
memory_size                                 1788.0 

=== epoch 8/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:30,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.371223
returns                                  68.531096
return_std                                3.078993
average_reward                            4.768661
round_time                  0 days 00:09:39.721435
episodes_test                                697.0
episode_length_test                      14.329986
returns_test                             68.343865
return_std_test                             3.4581
average_reward_test                        4.76929
round_time_test             0 days 00:00:12.014496
round_time_total            0 days 00:09:39.722574
loss_total             2717108967523431152091136.0
loss_critic            3396386145525231622553600.0
loss_actor                   -4222110076108.799805
memory_size                                 1788.0 

=== epoch 8/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       695
episode_length                           14.369784
returns                                  68.527466
return_std                                2.949567
average_reward                            4.768866
round_time                  0 days 00:09:38.270995
episodes_test                                700.0
episode_length_test                          14.28
returns_test                             68.125484
return_std_test                           2.711977
average_reward_test                       4.770727
round_time_test             0 days 00:00:12.229465
round_time_total            0 days 00:09:38.272056
loss_total             2743106083057713083842560.0
loss_critic            3428882543095604125368320.0
loss_actor                   -4255125312634.879883
memory_size                                 1788.0 

=== epoch 8/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       696
episode_length                            14.33477
returns                                  68.370526
return_std                                2.877774
average_reward                            4.769526
round_time                  0 days 00:09:38.417913
episodes_test                                697.0
episode_length_test                      14.342898
returns_test                             68.410766
return_std_test                           2.767606
average_reward_test                       4.769685
round_time_test             0 days 00:00:12.235927
round_time_total            0 days 00:09:38.418999
loss_total             2779096483178887621115904.0
loss_critic            3473870542670611047514112.0
loss_actor                   -4280182500622.335938
memory_size                                 1788.0 

=== epoch 8/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.295129
returns                                  68.180249
return_std                                2.893402
average_reward                            4.769482
round_time                  0 days 00:09:39.544899
episodes_test                                699.0
episode_length_test                       14.30186
returns_test                             68.221488
return_std_test                           2.589137
average_reward_test                        4.77014
round_time_test             0 days 00:00:12.106009
round_time_total            0 days 00:09:39.545980
loss_total             2787095886652796156510208.0
loss_critic            3483869796292421329354752.0
loss_actor                   -4285981172170.751953
memory_size                                 1788.0 

=== epoch 8/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       702
episode_length                           14.245014
returns                                  67.954106
return_std                                2.552133
average_reward                            4.770378
round_time                  0 days 00:09:40.330326
episodes_test                                702.0
episode_length_test                      14.225071
returns_test                             67.868661
return_std_test                           4.464182
average_reward_test                       4.770992
round_time_test             0 days 00:00:12.117363
round_time_total            0 days 00:09:40.331450
loss_total             2808729042178560055312384.0
loss_critic            3510911240879770150371328.0
loss_actor                   -4313818816053.248047
memory_size                                 1788.0 

=== epoch 8/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       701
episode_length                           14.245364
returns                                  67.942997
return_std                                2.633417
average_reward                            4.769485
round_time                  0 days 00:09:40.098854
episodes_test                                700.0
episode_length_test                          14.28
returns_test                             68.117271
return_std_test                           2.457481
average_reward_test                       4.770137
round_time_test             0 days 00:00:12.373365
round_time_total            0 days 00:09:40.099918
loss_total             2823314029411613802496000.0
loss_critic            3529142471786581681242112.0
loss_actor                   -4327052011503.616211
memory_size                                 1788.0 

=== epoch 8/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       701
episode_length                           14.249643
returns                                  67.964192
return_std                                2.725068
average_reward                            4.769519
round_time                  0 days 00:09:38.790240
episodes_test                                697.0
episode_length_test                       14.33142
returns_test                             68.355496
return_std_test                           3.521766
average_reward_test                       4.769633
round_time_test             0 days 00:00:12.316211
round_time_total            0 days 00:09:38.791313
loss_total             2838540896901330840322048.0
loss_critic            3548176054851691341676544.0
loss_actor                   -4368524358057.983887
memory_size                                 1788.0 

=== epoch 8/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.274286
returns                                  68.085461
return_std                                2.695088
average_reward                            4.769815
round_time                  0 days 00:09:39.903440
episodes_test                                699.0
episode_length_test                      14.304721
returns_test                             68.231587
return_std_test                           2.558195
average_reward_test                       4.769882
round_time_test             0 days 00:00:12.181976
round_time_total            0 days 00:09:39.904660
loss_total             2903815427074642122113024.0
loss_critic            3629769214199638343548928.0
loss_actor                    -4385855181684.73584
memory_size                                 1788.0 

=== epoch 8/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       700
episode_length                           14.277143
returns                                  68.100772
return_std                                2.672061
average_reward                            4.769935
round_time                  0 days 00:09:39.322351
episodes_test                                698.0
episode_length_test                      14.325215
returns_test                             68.314016
return_std_test                           2.622095
average_reward_test                       4.768812
round_time_test             0 days 00:00:12.342684
round_time_total            0 days 00:09:39.323741
loss_total             2993372284998364239495168.0
loss_critic            3741715288711975458045952.0
loss_actor                   -4429049626755.072266
memory_size                                 1788.0 

=== epoch 8/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.297568
returns                                  68.198726
return_std                                2.692101
average_reward                            4.769989
round_time                  0 days 00:09:39.016247
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                             68.138228
return_std_test                           2.545991
average_reward_test                       4.769676
round_time_test             0 days 00:00:12.365996
round_time_total            0 days 00:09:39.017547
loss_total             3049744877722363196604416.0
loss_critic            3812181028175821903233024.0
loss_actor                   -4458949713592.320312
memory_size                                 1788.0 

=== epoch 8/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.291845
returns                                  68.181454
return_std                                 2.63315
average_reward                            4.770659
round_time                  0 days 00:09:40.140065
episodes_test                                701.0
episode_length_test                      14.249643
returns_test                             67.975716
return_std_test                           2.713364
average_reward_test                       4.770348
round_time_test             0 days 00:00:12.299796
round_time_total            0 days 00:09:40.141137
loss_total             3089387505756365209468928.0
loss_critic            3861734314010958352089088.0
loss_actor                   -4479384900337.664062
memory_size                                 1788.0 

=== epoch 8/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:16,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.275714
returns                                  68.109668
return_std                                2.428605
average_reward                            4.771023
round_time                  0 days 00:09:39.870422
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                             68.143093
return_std_test                           2.602289
average_reward_test                       4.770016
round_time_test             0 days 00:00:12.408315
round_time_total            0 days 00:09:39.871492
loss_total             3121466106911322084999168.0
loss_critic            3901832564662020513726464.0
loss_actor                   -4517305009307.648438
memory_size                                 1788.0 

=== epoch 8/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.275714
returns                                  68.103238
return_std                                 2.57772
average_reward                            4.770559
round_time                  0 days 00:09:40.041271
episodes_test                                700.0
episode_length_test                      14.274286
returns_test                              68.07295
return_std_test                           2.635174
average_reward_test                        4.76893
round_time_test             0 days 00:00:12.091507
round_time_total            0 days 00:09:40.042327
loss_total             3157179479954893008011264.0
loss_critic            3946474281452873241853952.0
loss_actor                   -4548522190962.688477
memory_size                                 1788.0 

=== epoch 8/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.299427
returns                                  68.207827
return_std                                 2.63169
average_reward                            4.769941
round_time                  0 days 00:09:39.419233
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.129166
return_std_test                            2.67639
average_reward_test                       4.769532
round_time_test             0 days 00:00:12.060281
round_time_total            0 days 00:09:39.420314
loss_total             3236476204277219022864384.0
loss_critic            4045595188044730652950528.0
loss_actor                   -4570151089733.631836
memory_size                                 1788.0 

=== epoch 8/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.308023
returns                                   68.24521
return_std                                2.625121
average_reward                             4.76972
round_time                  0 days 00:09:40.603248
episodes_test                                699.0
episode_length_test                      14.306152
returns_test                             68.239814
return_std_test                           2.642388
average_reward_test                       4.769963
round_time_test             0 days 00:00:12.172344
round_time_total            0 days 00:09:40.604613
loss_total             3223830859037022942658560.0
loss_critic            4029788508331954434736128.0
loss_actor                   -4586543720693.759766
memory_size                                 1788.0 

=== epoch 8/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       698
episode_length                           14.296562
returns                                  68.189364
return_std                                2.636087
average_reward                            4.769621
round_time                  0 days 00:09:41.099923
episodes_test                                698.0
episode_length_test                      14.310888
returns_test                             68.248596
return_std_test                           2.698021
average_reward_test                       4.769017
round_time_test             0 days 00:00:12.175807
round_time_total            0 days 00:09:41.100989
loss_total             3265673817658498446524416.0
loss_critic            4082092202771732074856448.0
loss_actor                   -4620435393609.727539
memory_size                                 1788.0 

=== epoch 8/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.303725
returns                                   68.22535
return_std                                2.683758
average_reward                            4.769761
round_time                  0 days 00:09:38.894331
episodes_test                                698.0
episode_length_test                      14.310888
returns_test                              68.24834
return_std_test                            2.87276
average_reward_test                       4.768998
round_time_test             0 days 00:00:12.451401
round_time_total            0 days 00:09:38.895397
loss_total             3317709565646615628218368.0
loss_critic            4147136885288906189176832.0
loss_actor                   -4646861075644.416016
memory_size                                 1788.0 

=== epoch 8/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.310888
returns                                   68.25756
return_std                                2.765424
average_reward                             4.76965
round_time                  0 days 00:09:39.040458
episodes_test                                700.0
episode_length_test                      14.277143
returns_test                              68.09543
return_std_test                           2.499794
average_reward_test                       4.769564
round_time_test             0 days 00:00:12.225519
round_time_total            0 days 00:09:39.041516
loss_total             3353764570156646348619776.0
loss_critic            4192205644781525495971840.0
loss_actor                   -4683592575746.047852
memory_size                                 1788.0 

=== epoch 8/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.294707
returns                                  68.184853
return_std                                2.686188
average_reward                            4.769958
round_time                  0 days 00:09:39.695893
episodes_test                                702.0
episode_length_test                      14.226496
returns_test                               67.8695
return_std_test                           2.550598
average_reward_test                       4.770651
round_time_test             0 days 00:00:12.399072
round_time_total            0 days 00:09:39.697084
loss_total             3333782025687718149750784.0
loss_critic            4167227462754213801492480.0
loss_actor                   -4693486657470.463867
memory_size                                 1788.0 

=== epoch 8/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.283262
returns                                  68.132736
return_std                                2.665656
average_reward                            4.770086
round_time                  0 days 00:09:39.630930
episodes_test                                701.0
episode_length_test                       14.24679
returns_test                             67.958749
return_std_test                           2.729578
average_reward_test                       4.770118
round_time_test             0 days 00:00:12.446289
round_time_total            0 days 00:09:39.632039
loss_total             3408973481952269377208320.0
loss_critic            4261216779067691343282176.0
loss_actor                   -4730229717991.423828
memory_size                                 1788.0 

=== epoch 8/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       699
episode_length                           14.288984
returns                                  68.150404
return_std                                2.680804
average_reward                            4.769448
round_time                  0 days 00:09:41.145371
episodes_test                                700.0
episode_length_test                      14.271429
returns_test                             68.047864
return_std_test                           3.016672
average_reward_test                       4.768127
round_time_test             0 days 00:00:12.266776
round_time_total            0 days 00:09:41.146433
loss_total             3487828559372014057422848.0
loss_critic            4359785625410026961108992.0
loss_actor                   -4762148258316.288086
memory_size                                 1788.0 

=== epoch 8/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.284692
returns                                  68.128622
return_std                                2.655336
average_reward                             4.76914
round_time                  0 days 00:09:40.813950
episodes_test                                699.0
episode_length_test                      14.293276
returns_test                             68.138952
return_std_test                           2.716316
average_reward_test                       4.767203
round_time_test             0 days 00:00:12.137468
round_time_total            0 days 00:09:40.815022
loss_total             3532324272777597034692608.0
loss_critic            4415405270103352416403456.0
loss_actor                   -4788972905431.040039
memory_size                                 1788.0 

=== epoch 8/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.268571
returns                                  68.046994
return_std                                2.589619
average_reward                             4.76902
round_time                  0 days 00:09:39.954229
episodes_test                                699.0
episode_length_test                      14.287554
returns_test                             68.116734
return_std_test                           2.837615
average_reward_test                       4.767574
round_time_test             0 days 00:00:12.172216
round_time_total            0 days 00:09:39.955416
loss_total             3575257468345301099085824.0
loss_critic            4469071758113827645816832.0
loss_actor                   -4833138444206.080078
memory_size                                 1788.0 

=== epoch 8/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.277539
returns                                  68.084403
return_std                                  2.6907
average_reward                            4.768634
round_time                  0 days 00:09:38.218784
episodes_test                                700.0
episode_length_test                      14.274286
returns_test                             68.057993
return_std_test                            2.95088
average_reward_test                       4.767912
round_time_test             0 days 00:00:12.382424
round_time_total            0 days 00:09:38.219981
loss_total             3601563689845130314907648.0
loss_critic            4501954535204787385794560.0
loss_actor                   -4850578536792.064453
memory_size                                 1788.0 

=== epoch 8/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.290415
returns                                  68.136952
return_std                                2.768375
average_reward                            4.768015
round_time                  0 days 00:09:40.338508
episodes_test                                701.0
episode_length_test                      14.258203
returns_test                             68.002903
return_std_test                           2.731047
average_reward_test                       4.769406
round_time_test             0 days 00:00:12.254513
round_time_total            0 days 00:09:40.339567
loss_total             3637481470250975903088640.0
loss_critic            4546851763828585589112832.0
loss_actor                   -4885576326709.248047
memory_size                                 1788.0 

=== epoch 8/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.288984
returns                                  68.134733
return_std                                2.769517
average_reward                            4.768366
round_time                  0 days 00:09:38.237779
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                               68.1325
return_std_test                           2.647928
average_reward_test                       4.769275
round_time_test             0 days 00:00:12.200697
round_time_total            0 days 00:09:38.238856
loss_total             3697963718259363033382912.0
loss_critic            4622454568795037092544512.0
loss_actor                   -4912716153094.143555
memory_size                                 1788.0 

=== epoch 8/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                               14.27
returns                                  68.054551
return_std                                2.675227
average_reward                            4.769073
round_time                  0 days 00:09:39.912286
episodes_test                                701.0
episode_length_test                      14.248217
returns_test                             67.975733
return_std_test                           2.613549
average_reward_test                       4.770831
round_time_test             0 days 00:00:12.213589
round_time_total            0 days 00:09:39.913756
loss_total             3737581184502103113990144.0
loss_critic            4671976404012392056356864.0
loss_actor                   -4950793957539.839844
memory_size                                 1788.0 

=== epoch 8/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.284692
returns                                  68.125787
return_std                                2.646086
average_reward                            4.769126
round_time                  0 days 00:09:40.784273
episodes_test                                699.0
episode_length_test                      14.300429
returns_test                             68.200057
return_std_test                           2.752519
average_reward_test                       4.769111
round_time_test             0 days 00:00:12.314955
round_time_total            0 days 00:09:40.785329
loss_total             3775593930503519129829376.0
loss_critic            4719492342008553481961472.0
loss_actor                   -4975500064063.488281
memory_size                                 1788.0 

=== epoch 8/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.297568
returns                                   68.20683
return_std                                3.032156
average_reward                            4.770503
round_time                  0 days 00:09:40.851518
episodes_test                                698.0
episode_length_test                       14.32235
returns_test                             68.297785
return_std_test                           2.797078
average_reward_test                       4.768643
round_time_test             0 days 00:00:12.229889
round_time_total            0 days 00:09:40.852576
loss_total             3816880140664289080901632.0
loss_critic            4771100095702316666060800.0
loss_actor                   -5000019702317.055664
memory_size                                 1788.0 

=== epoch 8/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.283262
returns                                  68.140234
return_std                                3.036185
average_reward                            4.770621
round_time                  0 days 00:09:38.028104
episodes_test                                699.0
episode_length_test                      14.298999
returns_test                             68.218359
return_std_test                           3.521491
average_reward_test                       4.770869
round_time_test             0 days 00:00:12.215780
round_time_total            0 days 00:09:38.029162
loss_total             3887477610777524276559872.0
loss_critic            4859346933668119551410176.0
loss_actor                   -5032268162924.543945
memory_size                                 1788.0 

=== epoch 8/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.290415
returns                                  68.172998
return_std                                3.062409
average_reward                             4.77054
round_time                  0 days 00:09:40.650846
episodes_test                                702.0
episode_length_test                      14.230769
returns_test                             67.885315
return_std_test                           2.505208
average_reward_test                       4.770345
round_time_test             0 days 00:00:12.091137
round_time_total            0 days 00:09:40.651904
loss_total             3933376287916925764239360.0
loss_critic            4916720275949060483973120.0
loss_actor                    -5071080167702.52832
memory_size                                 1788.0 

=== epoch 8/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.287554
returns                                  68.155306
return_std                                3.085028
average_reward                            4.770249
round_time                  0 days 00:09:38.761220
episodes_test                                698.0
episode_length_test                      14.313754
returns_test                             68.298807
return_std_test                            4.08175
average_reward_test                       4.771569
round_time_test             0 days 00:00:12.138215
round_time_total            0 days 00:09:38.762583
loss_total             4025349247056802813575168.0
loss_critic            5031686481232989068984320.0
loss_actor                   -5096235612241.919922
memory_size                                 1788.0 

=== epoch 8/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.288984
returns                                  68.169535
return_std                                3.536165
average_reward                            4.770765
round_time                  0 days 00:09:39.466778
episodes_test                                700.0
episode_length_test                      14.268571
returns_test                              68.05894
return_std_test                           2.926098
average_reward_test                       4.769846
round_time_test             0 days 00:00:12.239893
round_time_total            0 days 00:09:39.467840
loss_total             4045569308415717095243776.0
loss_critic            5056961554815140568760320.0
loss_actor                   -5127963302690.816406
memory_size                                 1788.0 

=== epoch 8/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       700
episode_length                           14.275714
returns                                  68.087481
return_std                                3.232182
average_reward                            4.769478
round_time                  0 days 00:09:38.849724
episodes_test                                697.0
episode_length_test                       14.33142
returns_test                             68.342034
return_std_test                           3.857391
average_reward_test                       4.768709
round_time_test             0 days 00:00:12.103085
round_time_total            0 days 00:09:38.850785
loss_total             4027671005859157374926848.0
loss_critic            5034588674475727714779136.0
loss_actor                   -5142150208290.816406
memory_size                                 1788.0 

=== epoch 8/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       701
episode_length                           14.249643
returns                                  67.964862
return_std                                3.198352
average_reward                            4.769575
round_time                  0 days 00:09:38.996193
episodes_test                                700.0
episode_length_test                      14.268571
returns_test                             68.056186
return_std_test                           3.175464
average_reward_test                       4.769657
round_time_test             0 days 00:00:12.512124
round_time_total            0 days 00:09:38.997252
loss_total             4140603457969247108464640.0
loss_critic            5175754241432794328727552.0
loss_actor                   -5188741389615.103516
memory_size                                 1788.0 

=== epoch 8/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:58,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       701
episode_length                           14.243937
returns                                  67.933352
return_std                                3.245754
average_reward                            4.769279
round_time                  0 days 00:09:40.045498
episodes_test                                704.0
episode_length_test                      14.203125
returns_test                             67.734329
return_std_test                             2.8214
average_reward_test                       4.768986
round_time_test             0 days 00:00:12.286651
round_time_total            0 days 00:09:40.046551
loss_total             4177123461086359056285696.0
loss_critic            5221404238663857265967104.0
loss_actor                   -5227884860932.095703
memory_size                                 1788.0 

=== epoch 8/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:02,  3.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.273247
returns                                  68.065872
return_std                                3.350068
average_reward                            4.768779
round_time                  0 days 00:09:40.215673
episodes_test                                701.0
episode_length_test                      14.249643
returns_test                             67.939321
return_std_test                           2.948974
average_reward_test                       4.767812
round_time_test             0 days 00:00:12.304237
round_time_total            0 days 00:09:40.216727
loss_total             4231533055213084329639936.0
loss_critic            5289416229773024862142464.0
loss_actor                   -5247729348116.480469
memory_size                                 1788.0 

=== epoch 8/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       701
episode_length                           14.249643
returns                                  67.957857
return_std                                2.874753
average_reward                            4.769089
round_time                  0 days 00:09:40.937993
episodes_test                                697.0
episode_length_test                      14.338594
returns_test                             68.349513
return_std_test                           3.065931
average_reward_test                       4.766838
round_time_test             0 days 00:00:12.254227
round_time_total            0 days 00:09:40.939053
loss_total             4317215654375642265288704.0
loss_critic            5396519475663774905532416.0
loss_actor                   -5279630610923.519531
memory_size                                 1788.0 

=== epoch 8/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.257143
returns                                  67.999514
return_std                                2.929679
average_reward                            4.769514
round_time                  0 days 00:09:40.007893
episodes_test                                696.0
episode_length_test                      14.350575
returns_test                             68.426934
return_std_test                           3.035283
average_reward_test                        4.76823
round_time_test             0 days 00:00:12.339781
round_time_total            0 days 00:09:40.009101
loss_total             4295901829283616955301888.0
loss_critic            5369877201324358697484288.0
loss_actor                   -5301246368940.032227
memory_size                                 1788.0 

=== epoch 8/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:12,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.290831
returns                                  68.159657
return_std                                2.961555
average_reward                            4.769475
round_time                  0 days 00:09:38.881404
episodes_test                                702.0
episode_length_test                      14.235043
returns_test                             67.894004
return_std_test                           2.748861
average_reward_test                        4.76953
round_time_test             0 days 00:00:12.191144
round_time_total            0 days 00:09:38.882793
loss_total             4377956692116927269568512.0
loss_critic            5472445778226685822369792.0
loss_actor                   -5336091655667.711914
memory_size                                 1788.0 

=== epoch 8/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       698
episode_length                           14.295129
returns                                  68.177662
return_std                                2.946908
average_reward                            4.769281
round_time                  0 days 00:09:41.871770
episodes_test                                700.0
episode_length_test                      14.272857
returns_test                             68.052954
return_std_test                           2.930591
average_reward_test                       4.768012
round_time_test             0 days 00:00:12.418594
round_time_total            0 days 00:09:41.872818
loss_total             4463179609225807125282816.0
loss_critic            5578974415965874684952576.0
loss_actor                   -5373005352402.944336
memory_size                                 1788.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:01,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       140
episode_length                           14.264286
returns                                  68.004007
return_std                                 3.10087
average_reward                            4.767599
round_time                  0 days 00:09:39.837623
episodes_test                                699.0
episode_length_test                       14.30186
returns_test                             68.200688
return_std_test                           2.924891
average_reward_test                       4.768685
round_time_test             0 days 00:00:12.140262
round_time_total            0 days 00:09:39.838726
loss_total             4498962195101661335126016.0
loss_critic            5623702652526062188101632.0
loss_actor                   -5400759668834.303711
memory_size                                 1788.0 

=== epoch 9/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       280
episode_length                           14.239286
returns                                  67.881941
return_std                                3.254649
average_reward                            4.767223
round_time                  0 days 00:09:40.749056
episodes_test                                701.0
episode_length_test                       14.25107
returns_test                             67.969489
return_std_test                           4.070077
average_reward_test                       4.769456
round_time_test             0 days 00:00:12.379119
round_time_total            0 days 00:09:40.750121
loss_total             4511377869082710375399424.0
loss_critic            5639222238859462958907392.0
loss_actor                   -5414812571140.095703
memory_size                                 1788.0 

=== epoch 9/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       420
episode_length                           14.278571
returns                                  68.078691
return_std                                 3.25766
average_reward                            4.767933
round_time                  0 days 00:09:40.240123
episodes_test                                700.0
episode_length_test                          14.27
returns_test                             68.049925
return_std_test                           2.939549
average_reward_test                       4.768767
round_time_test             0 days 00:00:12.444213
round_time_total            0 days 00:09:40.241192
loss_total             4590939826888280891195392.0
loss_critic            5738674690439881917726720.0
loss_actor                   -5473762521382.912109
memory_size                                 1788.0 

=== epoch 9/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       559
episode_length                           14.293381
returns                                  68.156592
return_std                                3.135958
average_reward                            4.768387
round_time                  0 days 00:09:39.513349
episodes_test                                700.0
episode_length_test                      14.268571
returns_test                             68.049431
return_std_test                           2.748814
average_reward_test                       4.769201
round_time_test             0 days 00:00:12.273179
round_time_total            0 days 00:09:39.514425
loss_total             4653979130422478445215744.0
loss_critic            5817473813984934990184448.0
loss_actor                   -5501383879950.335938
memory_size                                 1788.0 

=== epoch 9/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.298999
returns                                  68.192979
return_std                                3.021256
average_reward                             4.76911
round_time                  0 days 00:09:39.214113
episodes_test                                698.0
episode_length_test                      14.308023
returns_test                             68.241588
return_std_test                           3.324909
average_reward_test                       4.769456
round_time_test             0 days 00:00:12.552426
round_time_total            0 days 00:09:39.215180
loss_total             4673091507674035049725952.0
loss_critic            5841364279856830554832896.0
loss_actor                   -5529945495044.095703
memory_size                                 1788.0 

=== epoch 9/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.321377
returns                                  68.303605
return_std                                3.023027
average_reward                            4.769344
round_time                  0 days 00:09:39.431964
episodes_test                                697.0
episode_length_test                      14.344333
returns_test                             68.389629
return_std_test                           3.132652
average_reward_test                       4.767743
round_time_test             0 days 00:00:12.087737
round_time_total            0 days 00:09:39.433048
loss_total             4712853506822723508109312.0
loss_critic            5891066783566506743562240.0
loss_actor                   -5554373071208.448242
memory_size                                 1788.0 

=== epoch 9/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       697
episode_length                           14.332855
returns                                  68.363603
return_std                                2.876606
average_reward                             4.76971
round_time                  0 days 00:09:39.184657
episodes_test                                700.0
episode_length_test                      14.267143
returns_test                             68.060311
return_std_test                           2.663394
average_reward_test                       4.770431
round_time_test             0 days 00:00:12.124342
round_time_total            0 days 00:09:39.185725
loss_total             4835725908145924095344640.0
loss_critic            6044657274321797243731968.0
loss_actor                   -5593244583067.648438
memory_size                                 1788.0 

=== epoch 9/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       697
episode_length                           14.319943
returns                                  68.299139
return_std                                2.853566
average_reward                            4.769514
round_time                  0 days 00:09:38.264335
episodes_test                                698.0
episode_length_test                      14.312321
returns_test                             68.249411
return_std_test                           2.894006
average_reward_test                       4.768587
round_time_test             0 days 00:00:12.198167
round_time_total            0 days 00:09:38.265408
loss_total             4872136902678891160666112.0
loss_critic            6090171025990800881221632.0
loss_actor                   -5621785532825.599609
memory_size                                 1788.0 

=== epoch 9/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:36,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.305158
returns                                   68.22888
return_std                                2.835674
average_reward                            4.769526
round_time                  0 days 00:09:40.258797
episodes_test                                696.0
episode_length_test                      14.353448
returns_test                             68.458101
return_std_test                            2.68766
average_reward_test                       4.769488
round_time_test             0 days 00:00:12.273506
round_time_total            0 days 00:09:40.259889
loss_total             4973969183195835249197056.0
loss_critic            6217461361630988566790144.0
loss_actor                   -5660865439203.328125
memory_size                                 1788.0 

=== epoch 9/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.291845
returns                                  68.164758
return_std                                 2.86727
average_reward                            4.769479
round_time                  0 days 00:09:39.476524
episodes_test                                697.0
episode_length_test                      14.329986
returns_test                             68.342381
return_std_test                           2.809653
average_reward_test                       4.769219
round_time_test             0 days 00:00:12.343133
round_time_total            0 days 00:09:39.477579
loss_total             5066365883934981958402048.0
loss_critic            6332957239950835636502528.0
loss_actor                   -5702545271947.263672
memory_size                                 1788.0 

=== epoch 9/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.276109
returns                                  68.091183
return_std                                2.762155
average_reward                            4.769582
round_time                  0 days 00:09:38.969913
episodes_test                                699.0
episode_length_test                      14.294707
returns_test                             68.166788
return_std_test                           2.841735
average_reward_test                       4.768682
round_time_test             0 days 00:00:12.181830
round_time_total            0 days 00:09:38.970986
loss_total             5156857189277815348920320.0
loss_critic            6446071365612569769279488.0
loss_actor                   -5739429473288.192383
memory_size                                 1788.0 

=== epoch 9/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       699
episode_length                           14.277539
returns                                  68.103208
return_std                                 2.78176
average_reward                            4.769948
round_time                  0 days 00:09:39.227978
episodes_test                                700.0
episode_length_test                      14.267143
returns_test                             68.049584
return_std_test                           2.663304
average_reward_test                       4.769665
round_time_test             0 days 00:00:12.182225
round_time_total            0 days 00:09:39.229061
loss_total             5103355109387106483437568.0
loss_critic            6379193772810826644717568.0
loss_actor                   -5763851387928.576172
memory_size                                 1788.0 

=== epoch 9/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:21,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       699
episode_length                           14.284692
returns                                  68.138518
return_std                                2.721004
average_reward                            4.770041
round_time                  0 days 00:09:41.336023
episodes_test                                696.0
episode_length_test                      14.356322
returns_test                             68.473498
return_std_test                           2.871642
average_reward_test                       4.769602
round_time_test             0 days 00:00:12.280060
round_time_total            0 days 00:09:41.337275
loss_total             5171703870092749817511936.0
loss_critic            6464629715946689055948800.0
loss_actor                   -5785314244427.776367
memory_size                                 1788.0 

=== epoch 9/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                            14.30659
returns                                  68.233745
return_std                                2.769652
average_reward                             4.76939
round_time                  0 days 00:09:39.318812
episodes_test                                698.0
episode_length_test                       14.32235
returns_test                              68.31032
return_std_test                           3.413348
average_reward_test                       4.769515
round_time_test             0 days 00:00:12.479169
round_time_total            0 days 00:09:39.319885
loss_total             5271254675100884000571392.0
loss_critic            6589068222134799313141760.0
loss_actor                   -5819385133400.064453
memory_size                                 1788.0 

=== epoch 9/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.329986
returns                                  68.333373
return_std                                2.806968
average_reward                            4.768573
round_time                  0 days 00:09:40.143134
episodes_test                                698.0
episode_length_test                      14.313754
returns_test                             68.266373
return_std_test                           2.826235
average_reward_test                       4.769293
round_time_test             0 days 00:00:12.081560
round_time_total            0 days 00:09:40.144401
loss_total             5315692540021453883441152.0
loss_critic            6644615541396009251766272.0
loss_actor                   -5858361923076.095703
memory_size                                 1788.0 

=== epoch 9/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:18,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.350575
returns                                  68.433566
return_std                                2.819964
average_reward                            4.768705
round_time                  0 days 00:09:40.286965
episodes_test                                696.0
episode_length_test                      14.349138
returns_test                             68.441606
return_std_test                            2.50859
average_reward_test                        4.76973
round_time_test             0 days 00:00:12.322127
round_time_total            0 days 00:09:40.288063
loss_total             5345158946242893824655360.0
loss_critic            6681448557513475411148800.0
loss_actor                   -5887775540510.719727
memory_size                                 1788.0 

=== epoch 9/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.364943
returns                                  68.503657
return_std                                2.782127
average_reward                            4.768818
round_time                  0 days 00:09:39.929200
episodes_test                                696.0
episode_length_test                      14.350575
returns_test                             68.435515
return_std_test                           3.002829
average_reward_test                       4.768846
round_time_test             0 days 00:00:12.312848
round_time_total            0 days 00:09:39.930258
loss_total             5425974679811645109174272.0
loss_critic            6782468227176574937464832.0
loss_actor                   -5927398873759.744141
memory_size                                 1788.0 

=== epoch 9/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       694
episode_length                           14.383285
returns                                  68.595084
return_std                                2.769818
average_reward                            4.769082
round_time                  0 days 00:09:38.825725
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                             68.111837
return_std_test                            3.08742
average_reward_test                       4.767829
round_time_test             0 days 00:00:12.135711
round_time_total            0 days 00:09:38.826963
loss_total             5524538066998598823837696.0
loss_critic            6905672455954105293078528.0
loss_actor                   -5962392253759.488281
memory_size                                 1788.0 

=== epoch 9/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       695
episode_length                           14.374101
returns                                  68.567843
return_std                                 2.71636
average_reward                            4.770237
round_time                  0 days 00:09:38.516852
episodes_test                                697.0
episode_length_test                      14.328551
returns_test                             68.340819
return_std_test                           2.902856
average_reward_test                        4.76957
round_time_test             0 days 00:00:12.313256
round_time_total            0 days 00:09:38.517913
loss_total             5544585022021454878736384.0
loss_critic            6930731147354775576641536.0
loss_actor                   -6015707357904.896484
memory_size                                 1788.0 

=== epoch 9/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.362069
returns                                  68.524002
return_std                                2.667725
average_reward                            4.771187
round_time                  0 days 00:09:39.618380
episodes_test                                698.0
episode_length_test                      14.326648
returns_test                             68.308941
return_std_test                            2.98533
average_reward_test                       4.767964
round_time_test             0 days 00:00:12.392398
round_time_total            0 days 00:09:39.619438
loss_total             5663458439517510295355392.0
loss_critic            7079322919080729635717120.0
loss_actor                   -6044193365950.463867
memory_size                                 1788.0 

=== epoch 9/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.332855
returns                                  68.387433
return_std                                2.695161
average_reward                            4.771387
round_time                  0 days 00:09:39.598396
episodes_test                                698.0
episode_length_test                      14.308023
returns_test                             68.232067
return_std_test                           2.842222
average_reward_test                       4.768816
round_time_test             0 days 00:00:12.257567
round_time_total            0 days 00:09:39.599466
loss_total             5756352715093195599380480.0
loss_critic            7195440761947054233616384.0
loss_actor                   -6074298355941.375977
memory_size                                 1788.0 

=== epoch 9/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       697
episode_length                           14.322812
returns                                  68.341602
return_std                                2.654054
average_reward                            4.771485
round_time                  0 days 00:09:41.114734
episodes_test                                699.0
episode_length_test                      14.294707
returns_test                             68.174904
return_std_test                           2.713766
average_reward_test                       4.769256
round_time_test             0 days 00:00:12.289529
round_time_total            0 days 00:09:41.115791
loss_total             5762527601911378200231936.0
loss_critic            7203159369947364942938112.0
loss_actor                   -6084861386817.536133
memory_size                                 1788.0 

=== epoch 9/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                            14.30186
returns                                  68.238596
return_std                                2.573303
average_reward                            4.771332
round_time                  0 days 00:09:40.209042
episodes_test                                701.0
episode_length_test                      14.261056
returns_test                             68.022517
return_std_test                           2.722756
average_reward_test                       4.769837
round_time_test             0 days 00:00:12.293780
round_time_total            0 days 00:09:40.210105
loss_total             5822312771960801389969408.0
loss_critic            7277890829770954907844608.0
loss_actor                   -6130116576673.791992
memory_size                                 1788.0 

=== epoch 9/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.308023
returns                                  68.254236
return_std                                2.688082
average_reward                            4.770342
round_time                  0 days 00:09:40.439539
episodes_test                                697.0
episode_length_test                      14.347202
returns_test                             68.419447
return_std_test                           2.718782
average_reward_test                       4.768835
round_time_test             0 days 00:00:12.270715
round_time_total            0 days 00:09:40.440592
loss_total             5903594676225458174951424.0
loss_critic            7379493202391613539614720.0
loss_actor                   -6168348457697.280273
memory_size                                 1788.0 

=== epoch 9/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.305158
returns                                  68.228734
return_std                                2.759885
average_reward                            4.769534
round_time                  0 days 00:09:40.005023
episodes_test                                699.0
episode_length_test                      14.306152
returns_test                             68.219529
return_std_test                           2.685161
average_reward_test                       4.768545
round_time_test             0 days 00:00:12.401107
round_time_total            0 days 00:09:40.006296
loss_total             5968149761834674527666176.0
loss_critic            7460187065239799651631104.0
loss_actor                   -6206888527003.648438
memory_size                                 1788.0 

=== epoch 9/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                            14.30659
returns                                  68.231075
return_std                                2.754843
average_reward                            4.769209
round_time                  0 days 00:09:39.272487
episodes_test                                701.0
episode_length_test                      14.256776
returns_test                             68.009373
return_std_test                           2.617094
average_reward_test                       4.770325
round_time_test             0 days 00:00:12.403744
round_time_total            0 days 00:09:39.273550
loss_total             6094686364664931318497280.0
loss_critic            7618357824326055082590208.0
loss_actor                   -6233748505886.719727
memory_size                                 1788.0 

=== epoch 9/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.286123
returns                                  68.135576
return_std                                2.711562
average_reward                            4.769372
round_time                  0 days 00:09:40.347898
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.137526
return_std_test                           2.636361
average_reward_test                       4.770122
round_time_test             0 days 00:00:12.143595
round_time_total            0 days 00:09:40.348950
loss_total             6081843631336811030118400.0
loss_critic            7602304398442532515610624.0
loss_actor                   -6282460071985.152344
memory_size                                 1788.0 

=== epoch 9/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:15,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.289398
returns                                  68.155171
return_std                                2.778527
average_reward                            4.769608
round_time                  0 days 00:09:39.975614
episodes_test                                700.0
episode_length_test                      14.272857
returns_test                             68.079272
return_std_test                           2.751357
average_reward_test                       4.769859
round_time_test             0 days 00:00:12.077057
round_time_total            0 days 00:09:39.976675
loss_total             6224065978106561079279616.0
loss_critic            7780082327905523824852992.0
loss_actor                    -6319900606070.78418
memory_size                                 1788.0 

=== epoch 9/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                       700
episode_length                               14.27
returns                                   68.07429
return_std                                2.601298
average_reward                            4.770446
round_time                  0 days 00:09:41.513025
episodes_test                                700.0
episode_length_test                      14.281429
returns_test                             68.127319
return_std_test                           2.880264
average_reward_test                       4.770373
round_time_test             0 days 00:00:12.407786
round_time_total            0 days 00:09:41.514076
loss_total             6366337783479476651294720.0
loss_critic            7957922094457529944768512.0
loss_actor                   -6346156929449.984375
memory_size                                 1788.0 

=== epoch 9/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                            14.27897
returns                                  68.124607
return_std                                2.460016
average_reward                            4.770937
round_time                  0 days 00:09:39.858908
episodes_test                                698.0
episode_length_test                      14.326648
returns_test                             68.344727
return_std_test                            2.78811
average_reward_test                       4.770462
round_time_test             0 days 00:00:12.262438
round_time_total            0 days 00:09:39.860003
loss_total             6334554579981321004646400.0
loss_critic            7918193094300204590432256.0
loss_actor                   -6374074479017.984375
memory_size                                 1788.0 

=== epoch 9/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.290415
returns                                  68.177022
return_std                                2.395791
average_reward                            4.770826
round_time                  0 days 00:09:40.068316
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.143969
return_std_test                           2.658411
average_reward_test                        4.77057
round_time_test             0 days 00:00:12.243967
round_time_total            0 days 00:09:40.069374
loss_total             6446171410087901497655296.0
loss_critic            8057714120332156872949760.0
loss_actor                   -6423057424449.536133
memory_size                                 1788.0 

=== epoch 9/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       698
episode_length                            14.30659
returns                                  68.245451
return_std                                2.417931
average_reward                            4.770199
round_time                  0 days 00:09:38.483238
episodes_test                                701.0
episode_length_test                       14.25107
returns_test                             67.989048
return_std_test                           2.485601
average_reward_test                       4.770824
round_time_test             0 days 00:00:12.153675
round_time_total            0 days 00:09:38.484297
loss_total             6498363829371912847360000.0
loss_critic            8122954653804659015155712.0
loss_actor                   -6447218137235.456055
memory_size                                 1788.0 

=== epoch 9/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.319484
returns                                   68.30957
return_std                                2.484059
average_reward                            4.770422
round_time                  0 days 00:09:38.710327
episodes_test                                696.0
episode_length_test                      14.362069
returns_test                             68.518234
return_std_test                           2.562006
average_reward_test                       4.770803
round_time_test             0 days 00:00:12.118238
round_time_total            0 days 00:09:38.711535
loss_total             6623418598189084755099648.0
loss_critic            8279273107440220038496256.0
loss_actor                   -6491550792548.351562
memory_size                                 1788.0 

=== epoch 9/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.328551
returns                                  68.340903
return_std                                2.685715
average_reward                            4.769556
round_time                  0 days 00:09:39.988152
episodes_test                                694.0
episode_length_test                      14.391931
returns_test                             68.660346
return_std_test                           5.239475
average_reward_test                       4.770761
round_time_test             0 days 00:00:12.177429
round_time_total            0 days 00:09:39.989340
loss_total             6630960105484725979709440.0
loss_critic            8288699993757529420070912.0
loss_actor                   -6532402727616.511719
memory_size                                 1788.0 

=== epoch 9/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.316619
returns                                  68.281389
return_std                                2.853206
average_reward                            4.769372
round_time                  0 days 00:09:39.150280
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.134825
return_std_test                            2.63372
average_reward_test                       4.769927
round_time_test             0 days 00:00:12.287372
round_time_total            0 days 00:09:39.151515
loss_total             6728763889978082276147200.0
loss_critic            8410954724806568546664448.0
loss_actor                   -6538166590242.816406
memory_size                                 1788.0 

=== epoch 9/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                       698
episode_length                           14.299427
returns                                  68.207039
return_std                                 2.88511
average_reward                            4.769913
round_time                  0 days 00:09:38.333544
episodes_test                                701.0
episode_length_test                      14.262482
returns_test                             68.018326
return_std_test                           2.927325
average_reward_test                       4.769069
round_time_test             0 days 00:00:12.277789
round_time_total            0 days 00:09:38.334749
loss_total             6764012483829945946079232.0
loss_critic            8455015464203066372259840.0
loss_actor                   -6578096331292.671875
memory_size                                 1788.0 

=== epoch 9/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.297994
returns                                  68.201719
return_std                                2.914619
average_reward                            4.770029
round_time                  0 days 00:09:40.383085
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                             68.132542
return_std_test                           2.702848
average_reward_test                       4.769278
round_time_test             0 days 00:00:12.010073
round_time_total            0 days 00:09:40.384140
loss_total             6872485966264397172047872.0
loss_critic            8590607320632836671668224.0
loss_actor                   -6616399687974.912109
memory_size                                 1788.0 

=== epoch 9/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.274678
returns                                   68.08463
return_std                                2.905936
average_reward                            4.769599
round_time                  0 days 00:09:39.451250
episodes_test                                698.0
episode_length_test                      14.309456
returns_test                             68.243987
return_std_test                           2.753164
average_reward_test                       4.769158
round_time_test             0 days 00:00:12.383040
round_time_total            0 days 00:09:39.452303
loss_total             6876702942976423634141184.0
loss_critic            8595878535109744305111040.0
loss_actor                   -6679002808582.143555
memory_size                                 1788.0 

=== epoch 9/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       701
episode_length                           14.261056
returns                                  68.029971
return_std                                2.719495
average_reward                            4.770364
round_time                  0 days 00:09:38.864439
episodes_test                                698.0
episode_length_test                      14.326648
returns_test                             68.322646
return_std_test                           2.812045
average_reward_test                       4.768921
round_time_test             0 days 00:00:12.240828
round_time_total            0 days 00:09:38.865502
loss_total             6895794969866386324586496.0
loss_critic            8619743571856703455494144.0
loss_actor                   -6689002664230.912109
memory_size                                 1788.0 

=== epoch 9/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       700
episode_length                           14.254286
returns                                   67.99171
return_std                                2.579162
average_reward                            4.769899
round_time                  0 days 00:09:39.024329
episodes_test                                699.0
episode_length_test                      14.287554
returns_test                             68.168416
return_std_test                           3.737534
average_reward_test                       4.771175
round_time_test             0 days 00:00:12.248454
round_time_total            0 days 00:09:39.025547
loss_total             7123260306973103950397440.0
loss_critic            8904075241042342636945408.0
loss_actor                   -6723182809841.664062
memory_size                                 1788.0 

=== epoch 9/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                           14.265714
returns                                  68.044672
return_std                                2.559233
average_reward                            4.769806
round_time                  0 days 00:09:39.525898
episodes_test                                699.0
episode_length_test                      14.294707
returns_test                             68.171835
return_std_test                           2.534307
average_reward_test                       4.769031
round_time_test             0 days 00:00:12.163947
round_time_total            0 days 00:09:39.526955
loss_total             7152079912773301328412672.0
loss_critic            8940099752075614002085888.0
loss_actor                   -6772362228531.200195
memory_size                                 1788.0 

=== epoch 9/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       700
episode_length                               14.27
returns                                  68.056328
return_std                                2.667783
average_reward                            4.769206
round_time                  0 days 00:09:39.818565
episodes_test                                700.0
episode_length_test                      14.284286
returns_test                             68.130441
return_std_test                            2.61543
average_reward_test                       4.769619
round_time_test             0 days 00:00:12.226335
round_time_total            0 days 00:09:39.819973
loss_total             7274456244615160019288064.0
loss_critic            9093070164031662001750016.0
loss_actor                   -6813453930725.375977
memory_size                                 1788.0 

=== epoch 9/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       700
episode_length                           14.275714
returns                                   68.08403
return_std                                2.638389
average_reward                            4.769229
round_time                  0 days 00:09:38.997007
episodes_test                                700.0
episode_length_test                      14.285714
returns_test                             68.127668
return_std_test                           2.803732
average_reward_test                       4.768937
round_time_test             0 days 00:00:12.212924
round_time_total            0 days 00:09:38.998079
loss_total             7351534646217299107774464.0
loss_critic            9189418155261726270947328.0
loss_actor                   -6843487876022.272461
memory_size                                 1788.0 

=== epoch 9/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       699
episode_length                           14.274678
returns                                   68.07445
return_std                                2.721684
average_reward                            4.768875
round_time                  0 days 00:09:40.026784
episodes_test                                702.0
episode_length_test                      14.226496
returns_test                              67.85784
return_std_test                           2.602974
average_reward_test                       4.769823
round_time_test             0 days 00:00:12.438311
round_time_total            0 days 00:09:40.028214
loss_total             7365802886137245689970688.0
loss_critic            9207253458944682799136768.0
loss_actor                   -6872298153312.255859
memory_size                                 1788.0 

=== epoch 9/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       698
episode_length                           14.310888
returns                                  68.255973
return_std                                2.800604
average_reward                            4.769516
round_time                  0 days 00:09:39.524950
episodes_test                                697.0
episode_length_test                      14.335725
returns_test                             68.369337
return_std_test                             3.0423
average_reward_test                       4.769176
round_time_test             0 days 00:00:12.227005
round_time_total            0 days 00:09:39.526010
loss_total             7456176793374375326777344.0
loss_critic            9320220840613194791649280.0
loss_actor                   -6916670736302.080078
memory_size                                 1788.0 

=== epoch 9/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                       697
episode_length                           14.332855
returns                                  68.358162
return_std                                2.820243
average_reward                            4.769349
round_time                  0 days 00:09:40.554629
episodes_test                                697.0
episode_length_test                      14.342898
returns_test                             68.422263
return_std_test                           2.644701
average_reward_test                        4.77049
round_time_test             0 days 00:00:12.513468
round_time_total            0 days 00:09:40.555884
loss_total             7476688830158776605605888.0
loss_critic            9345860881153347215687680.0
loss_actor                   -6921513667657.727539
memory_size                                 1788.0 

=== epoch 9/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                       696
episode_length                            14.33908
returns                                  68.394467
return_std                                2.785823
average_reward                            4.769786
round_time                  0 days 00:09:38.970404
episodes_test                                692.0
episode_length_test                      14.442197
returns_test                             68.918411
return_std_test                           4.868673
average_reward_test                       4.772058
round_time_test             0 days 00:00:12.163147
round_time_total            0 days 00:09:38.971672
loss_total             7442000303956572730556416.0
loss_critic            9302500229561517067993088.0
loss_actor                   -6931567725248.511719
memory_size                                 1788.0 

=== epoch 9/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       696
episode_length                           14.337644
returns                                  68.390558
return_std                                2.724797
average_reward                            4.769981
round_time                  0 days 00:09:40.003222
episodes_test                                697.0
episode_length_test                      14.342898
returns_test                              68.41036
return_std_test                           2.576976
average_reward_test                       4.769664
round_time_test             0 days 00:00:12.249156
round_time_total            0 days 00:09:40.004272
loss_total             7608523366854155445469184.0
loss_critic            9510654061858432688848896.0
loss_actor                   -6982407460487.167969
memory_size                                 1788.0 

=== epoch 9/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                       695
episode_length                           14.368345
returns                                  68.535771
return_std                                2.812858
average_reward                            4.769928
round_time                  0 days 00:09:40.163386
episodes_test                                700.0
episode_length_test                          14.27
returns_test                             68.071507
return_std_test                           2.355977
average_reward_test                       4.770259
round_time_test             0 days 00:00:12.201345
round_time_total            0 days 00:09:40.164586
loss_total             7677697186723302094143488.0
loss_critic            9597121335938262144909312.0
loss_actor                   -7021796342890.496094
memory_size                                 1788.0 

=== epoch 9/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                       697
episode_length                            14.33142
returns                                  68.358975
return_std                                2.720627
average_reward                            4.769868
round_time                  0 days 00:09:39.219688
episodes_test                                696.0
episode_length_test                      14.350575
returns_test                             68.453191
return_std_test                           2.781718
average_reward_test                       4.770074
round_time_test             0 days 00:00:12.090287
round_time_total            0 days 00:09:39.220873
loss_total             7847646024582887941079040.0
loss_critic            9809557377101820562767872.0
loss_actor                   -7062000211853.311523
memory_size                                 1788.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:06,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.45it/s]
episodes                                       139
episode_length                           14.323741
returns                                  68.315062
return_std                                2.577454
average_reward                             4.76951
round_time                  0 days 00:09:38.978841
episodes_test                                698.0
episode_length_test                      14.308023
returns_test                             68.238036
return_std_test                           2.676888
average_reward_test                       4.769236
round_time_test             0 days 00:00:12.219003
round_time_total            0 days 00:09:38.979927
loss_total             7897273511725484946227200.0
loss_critic            9871591739921177266618368.0
loss_actor                   -7104860188573.696289
memory_size                                 1788.0 

=== epoch 10/10 ==== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                        279
episode_length                            14.308244
returns                                   68.238338
return_std                                 2.658004
average_reward                             4.769245
round_time                   0 days 00:09:39.261496
episodes_test                                 700.0
episode_length_test                           14.28
returns_test                              68.099875
return_std_test                            2.832076
average_reward_test                        4.768918
round_time_test              0 days 00:00:12.228369
round_time_total             0 days 00:09:39.262570
loss_total              8101607588832734110285824.0
loss_critic            10127009324415734964551680.0
loss_actor                    -7152418706423.807617
memory_size                                  1788.0 

=== epoch 10/10 ==== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        420
episode_length                            14.283333
returns                                    68.12627
return_std                                 2.723346
average_reward                             4.769655
round_time                   0 days 00:09:39.716671
episodes_test                                 697.0
episode_length_test                       14.342898
returns_test                              68.398247
return_std_test                              2.8122
average_reward_test                        4.768817
round_time_test              0 days 00:00:12.299621
round_time_total             0 days 00:09:39.717787
loss_total              8204588687966939511783424.0
loss_critic            10255735696423964289007616.0
loss_actor                    -7189668410687.488281
memory_size                                  1788.0 

=== epoch 10/10 ==== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                        559
episode_length                            14.307692
returns                                   68.233968
return_std                                 2.885103
average_reward                             4.769073
round_time                   0 days 00:09:38.047360
episodes_test                                 695.0
episode_length_test                       14.388489
returns_test                              68.624419
return_std_test                             3.05933
average_reward_test                        4.769397
round_time_test              0 days 00:00:12.080954
round_time_total             0 days 00:09:38.048692
loss_total              8254323678698772864434176.0
loss_critic            10317904431668221859332096.0
loss_actor                    -7230302733467.648438
memory_size                                  1788.0 

=== epoch 10/10 ==== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:32,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        699
episode_length                            14.300429
returns                                   68.207885
return_std                                  2.87281
average_reward                             4.769669
round_time                   0 days 00:09:40.203796
episodes_test                                 700.0
episode_length_test                       14.277143
returns_test                              68.098842
return_std_test                             2.65434
average_reward_test                          4.7698
round_time_test              0 days 00:00:12.203655
round_time_total             0 days 00:09:40.204870
loss_total              8364014278748358683131904.0
loss_critic            10455017683891932845047808.0
loss_actor                    -7266711229431.807617
memory_size                                  1788.0 

=== epoch 10/10 ==== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:37<00:00,  3.46it/s]
episodes                                        698
episode_length                            14.299427
returns                                   68.210057
return_std                                 2.902234
average_reward                             4.769867
round_time                   0 days 00:09:38.468932
episodes_test                                 703.0
episode_length_test                       14.220484
returns_test                              67.846994
return_std_test                            2.873742
average_reward_test                        4.771101
round_time_test              0 days 00:00:12.175563
round_time_total             0 days 00:09:38.470062
loss_total              8445595424077766920765440.0
loss_critic            10556994106474435073540096.0
loss_actor                    -7308926951620.608398
memory_size                                  1788.0 

=== epoch 10/10 ==== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                        687
episode_length                            14.525473
returns                                   69.359246
return_std                                 9.187458
average_reward                             4.774994
round_time                   0 days 00:09:39.124225
episodes_test                                 698.0
episode_length_test                       14.310888
returns_test                              68.280508
return_std_test                            2.726695
average_reward_test                        4.771226
round_time_test              0 days 00:00:11.928060
round_time_total             0 days 00:09:39.125371
loss_total              8690208595932757142536192.0
loss_critic            10862760560088217355812864.0
loss_actor                     -7326974260281.34375
memory_size                                1800.325 

=== epoch 10/10 ==== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        680
episode_length                            14.666176
returns                                   70.048623
return_std                                 9.500523
average_reward                             4.776145
round_time                   0 days 00:09:40.342560
episodes_test                                 650.0
episode_length_test                       15.383077
returns_test                              73.500222
return_std_test                            9.385915
average_reward_test                        4.778008
round_time_test              0 days 00:00:12.343556
round_time_total             0 days 00:09:40.343627
loss_total              8905013089651952065708032.0
loss_critic            11131266170319682190442496.0
loss_actor                    -7348091292483.583984
memory_size                                  1813.0 

=== epoch 10/10 ==== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                        678
episode_length                            14.725664
returns                                   70.346527
return_std                                 9.612756
average_reward                             4.777112
round_time                   0 days 00:09:40.914883
episodes_test                                 689.0
episode_length_test                        14.50508
returns_test                               69.22154
return_std_test                            4.015018
average_reward_test                        4.772254
round_time_test              0 days 00:00:12.172661
round_time_total             0 days 00:09:40.915944
loss_total              8869744826815103721537536.0
loss_critic            11087180849195555296903168.0
loss_actor                     -7360972183830.52832
memory_size                                  1813.0 

=== epoch 10/10 ==== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        675
episode_length                            14.791111
returns                                   70.637862
return_std                                10.481259
average_reward                             4.775697
round_time                   0 days 00:09:40.010926
episodes_test                                 677.0
episode_length_test                       14.756278
returns_test                              70.373157
return_std_test                            7.435413
average_reward_test                        4.769031
round_time_test              0 days 00:00:12.222346
round_time_total             0 days 00:09:40.011999
loss_total              8889648567946270241456128.0
loss_critic            11112060525177167543271424.0
loss_actor                    -7402514739101.696289
memory_size                               1823.1605 

=== epoch 10/10 ==== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        676
episode_length                            14.778107
returns                                   70.575828
return_std                                10.474111
average_reward                             4.775721
round_time                   0 days 00:09:41.693452
episodes_test                                 696.0
episode_length_test                       14.353448
returns_test                              68.460371
return_std_test                            2.938762
average_reward_test                        4.769608
round_time_test              0 days 00:00:12.314682
round_time_total             0 days 00:09:41.694519
loss_total              8872487769748599906238464.0
loss_critic            11090609516693497560694784.0
loss_actor                     -7427098460553.21582
memory_size                                  1827.0 

=== epoch 10/10 ==== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:27,  2.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        688
episode_length                            14.513081
returns                                   69.237087
return_std                                 5.906889
average_reward                             4.770651
round_time                   0 days 00:09:41.590814
episodes_test                                 689.0
episode_length_test                       14.510885
returns_test                              69.190452
return_std_test                            6.772427
average_reward_test                        4.768202
round_time_test              0 days 00:00:11.950331
round_time_total             0 days 00:09:41.592276
loss_total              8931199925553518610481152.0
loss_critic            11163999717358368278446080.0
loss_actor                    -7455773915086.847656
memory_size                                  1827.0 

=== epoch 10/10 ==== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        694
episode_length                            14.404899
returns                                    68.69946
return_std                                 5.337356
average_reward                             4.769212
round_time                   0 days 00:09:41.796389
episodes_test                                 698.0
episode_length_test                       14.308023
returns_test                              68.209066
return_std_test                            3.130749
average_reward_test                         4.76722
round_time_test              0 days 00:00:12.233852
round_time_total             0 days 00:09:41.797754
loss_total              8906862600420919710056448.0
loss_critic            11133578051755274974265344.0
loss_actor                    -7463477205532.671875
memory_size                                  1827.0 

=== epoch 10/10 ==== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        695
episode_length                            14.361151
returns                                   68.460649
return_std                                 5.184355
average_reward                             4.767091
round_time                   0 days 00:09:42.451793
episodes_test                                 696.0
episode_length_test                       14.366379
returns_test                              68.455042
return_std_test                            3.474462
average_reward_test                         4.76496
round_time_test              0 days 00:00:12.265538
round_time_total             0 days 00:09:42.452862
loss_total              8900095186096633394757632.0
loss_critic            11125118797468803109224448.0
loss_actor                    -7465683602898.944336
memory_size                                  1827.0 

=== epoch 10/10 ==== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        698
episode_length                            14.303725
returns                                   68.193133
return_std                                 3.102358
average_reward                              4.76752
round_time                   0 days 00:09:42.139106
episodes_test                                 698.0
episode_length_test                       14.316619
returns_test                              68.273244
return_std_test                            3.009439
average_reward_test                        4.768831
round_time_test              0 days 00:00:12.077655
round_time_total             0 days 00:09:42.140175
loss_total              8920620085738031290515456.0
loss_critic            11150774911319997285400576.0
loss_actor                    -7507773831249.919922
memory_size                                  1827.0 

=== epoch 10/10 ==== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:42<00:00,  3.44it/s]
episodes                                        698
episode_length                            14.302292
returns                                   68.183843
return_std                                 3.170683
average_reward                             4.767333
round_time                   0 days 00:09:42.679830
episodes_test                                 702.0
episode_length_test                       14.230769
returns_test                              67.869517
return_std_test                            2.727295
average_reward_test                        4.769229
round_time_test              0 days 00:00:12.063222
round_time_total             0 days 00:09:42.681076
loss_total              9011542440040746006347776.0
loss_critic            11264427863349706220371968.0
loss_actor                    -7547589976653.824219
memory_size                                  1827.0 

=== epoch 10/10 ==== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        697
episode_length                            14.325681
returns                                   68.298304
return_std                                 3.098987
average_reward                             4.767554
round_time                   0 days 00:09:41.372199
episodes_test                                 700.0
episode_length_test                       14.275714
returns_test                              68.079319
return_std_test                            2.741908
average_reward_test                        4.768905
round_time_test              0 days 00:00:12.138223
round_time_total             0 days 00:09:41.373267
loss_total              9060514673469571540713472.0
loss_critic            11325643146921171876839424.0
loss_actor                    -7567572868530.175781
memory_size                                  1827.0 

=== epoch 10/10 ==== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        698
episode_length                            14.302292
returns                                   68.188924
return_std                                 3.130883
average_reward                              4.76769
round_time                   0 days 00:09:41.139120
episodes_test                                 703.0
episode_length_test                       14.211949
returns_test                              67.821056
return_std_test                            2.732802
average_reward_test                        4.772123
round_time_test              0 days 00:00:12.300386
round_time_total             0 days 00:09:41.140211
loss_total              9185477547237066019962880.0
loss_critic            11481846729366736061071360.0
loss_actor                    -7620999999651.839844
memory_size                                  1827.0 

=== epoch 10/10 ==== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        701
episode_length                            14.253923
returns                                   67.981426
return_std                                 2.765385
average_reward                              4.76932
round_time                   0 days 00:09:42.289215
episodes_test                                 705.0
episode_length_test                       14.177305
returns_test                              67.621386
return_std_test                            2.792984
average_reward_test                        4.769715
round_time_test              0 days 00:00:12.216180
round_time_total             0 days 00:09:42.290280
loss_total              9197217895933582279442432.0
loss_critic            11496522167759397187485696.0
loss_actor                    -7658648178262.015625
memory_size                                  1827.0 

=== epoch 10/10 ==== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        701
episode_length                            14.241084
returns                                   67.925858
return_std                                 2.641059
average_reward                             4.769726
round_time                   0 days 00:09:42.403183
episodes_test                                 704.0
episode_length_test                       14.204545
returns_test                              67.754328
return_std_test                            2.805075
average_reward_test                        4.769905
round_time_test              0 days 00:00:12.227341
round_time_total             0 days 00:09:42.404263
loss_total              9357485334433225943023616.0
loss_critic            11696856460695804145827840.0
loss_actor                     -7654096942137.34375
memory_size                                  1827.0 

=== epoch 10/10 ==== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:08,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        700
episode_length                            14.264286
returns                                   68.031995
return_std                                 2.727265
average_reward                             4.769386
round_time                   0 days 00:09:41.411669
episodes_test                                 703.0
episode_length_test                       14.213371
returns_test                              67.787464
return_std_test                            2.786189
average_reward_test                        4.769282
round_time_test              0 days 00:00:12.322884
round_time_total             0 days 00:09:41.412724
loss_total              9306278468047802889928704.0
loss_critic            11632847881497050508951552.0
loss_actor                    -7700092984229.887695
memory_size                                  1827.0 

=== epoch 10/10 ==== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        699
episode_length                            14.287554
returns                                   68.140612
return_std                                 3.013127
average_reward                             4.769227
round_time                   0 days 00:09:41.096379
episodes_test                                 694.0
episode_length_test                       14.402017
returns_test                              68.718278
return_std_test                            4.769362
average_reward_test                        4.771448
round_time_test              0 days 00:00:12.109285
round_time_total             0 days 00:09:41.097443
loss_total              9491373739107760937631744.0
loss_critic            11864216965422080870842368.0
loss_actor                    -7715036675964.927734
memory_size                                  1827.0 

=== epoch 10/10 ==== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        692
episode_length                            14.434971
returns                                   68.835472
return_std                                 3.696791
average_reward                             4.768644
round_time                   0 days 00:09:41.859405
episodes_test                                 661.0
episode_length_test                       15.119516
returns_test                               72.09846
return_std_test                             5.19045
average_reward_test                         4.76859
round_time_test              0 days 00:00:12.250230
round_time_total             0 days 00:09:41.860684
loss_total              9478564095203283927302144.0
loss_critic            11848204913712020070596608.0
loss_actor                    -7748753384800.255859
memory_size                                  1827.0 

=== epoch 10/10 ==== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        689
episode_length                            14.492017
returns                                   69.106396
return_std                                 3.676592
average_reward                             4.768626
round_time                   0 days 00:09:41.282707
episodes_test                                 687.0
episode_length_test                       14.548763
returns_test                               69.37561
return_std_test                            3.294013
average_reward_test                         4.76851
round_time_test              0 days 00:00:12.050463
round_time_total             0 days 00:09:41.283947
loss_total              9585957624679269322981376.0
loss_critic            11982446805705134155235328.0
loss_actor                    -7754153045786.624023
memory_size                                  1827.0 

=== epoch 10/10 ==== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        687
episode_length                            14.537118
returns                                   69.324226
return_std                                 3.641696
average_reward                             4.768788
round_time                   0 days 00:09:39.789077
episodes_test                                 695.0
episode_length_test                       14.384173
returns_test                              68.620799
return_std_test                            2.615267
average_reward_test                        4.770607
round_time_test              0 days 00:00:11.977457
round_time_total             0 days 00:09:39.790151
loss_total              9642255113214154879533056.0
loss_critic            12052818679199992257708032.0
loss_actor                    -7809333667823.616211
memory_size                                  1827.0 

=== epoch 10/10 ==== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        686
episode_length                            14.562682
returns                                   69.450867
return_std                                 3.563251
average_reward                             4.769099
round_time                   0 days 00:09:41.611632
episodes_test                                 691.0
episode_length_test                       14.460203
returns_test                              68.967061
return_std_test                            2.938404
average_reward_test                        4.769454
round_time_test              0 days 00:00:12.084669
round_time_total             0 days 00:09:41.612700
loss_total              9717115362728491385618432.0
loss_critic            12146394000136141283524608.0
loss_actor                    -7758694667517.952148
memory_size                                  1827.0 

=== epoch 10/10 ==== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        686
episode_length                            14.568513
returns                                   69.472813
return_std                                 3.422658
average_reward                             4.768696
round_time                   0 days 00:09:41.706437
episodes_test                                 689.0
episode_length_test                       14.510885
returns_test                              69.227517
return_std_test                            3.721704
average_reward_test                        4.770752
round_time_test              0 days 00:00:12.140898
round_time_total             0 days 00:09:41.707483
loss_total              9779119750885763337682944.0
loss_critic            12223899473695431435747328.0
loss_actor                    -7821223812661.248047
memory_size                                  1827.0 

=== epoch 10/10 ==== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:20,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.44it/s]
episodes                                        692
episode_length                            14.445087
returns                                   68.894726
return_std                                 2.828226
average_reward                             4.769446
round_time                   0 days 00:09:41.541092
episodes_test                                 697.0
episode_length_test                       14.335725
returns_test                              68.364546
return_std_test                            3.013544
average_reward_test                        4.768843
round_time_test              0 days 00:00:12.223254
round_time_total             0 days 00:09:41.542142
loss_total              9908412440675244513951744.0
loss_critic            12385515328834610005016576.0
loss_actor                     -7881256782790.65625
memory_size                                  1827.0 

=== epoch 10/10 ==== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        691
episode_length                            14.435601
returns                                   68.854469
return_std                                 2.817662
average_reward                             4.769742
round_time                   0 days 00:09:41.945626
episodes_test                                 696.0
episode_length_test                       14.366379
returns_test                              68.527358
return_std_test                            2.762848
average_reward_test                        4.769993
round_time_test              0 days 00:00:12.143351
round_time_total             0 days 00:09:41.946961
loss_total             10016728570941595975680000.0
loss_critic            12520910487632321257144320.0
loss_actor                    -7925687669751.807617
memory_size                                  1827.0 

=== epoch 10/10 ==== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:28,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:42<00:00,  3.44it/s]
episodes                                        693
episode_length                            14.414141
returns                                   68.752714
return_std                                 2.768485
average_reward                             4.769824
round_time                   0 days 00:09:42.641786
episodes_test                                 693.0
episode_length_test                       14.421356
returns_test                              68.855067
return_std_test                            8.475288
average_reward_test                        4.774539
round_time_test              0 days 00:00:12.368065
round_time_total             0 days 00:09:42.642842
loss_total              9937555172470889010692096.0
loss_critic            12421943721529540115169280.0
loss_actor                    -7935292975349.759766
memory_size                                  1827.0 

=== epoch 10/10 ==== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                        696
episode_length                            14.352011
returns                                   68.457532
return_std                                 2.654637
average_reward                             4.769895
round_time                   0 days 00:09:40.873388
episodes_test                                 701.0
episode_length_test                       14.258203
returns_test                              68.016339
return_std_test                             2.67473
average_reward_test                        4.770355
round_time_test              0 days 00:00:12.198661
round_time_total             0 days 00:09:40.874513
loss_total              9849123166088073683402752.0
loss_critic            12311403741725540413865984.0
loss_actor                    -7992997231984.639648
memory_size                                  1827.0 

=== epoch 10/10 ==== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        696
episode_length                            14.329023
returns                                    68.34332
return_std                                 2.685462
average_reward                             4.769554
round_time                   0 days 00:09:41.690922
episodes_test                                 699.0
episode_length_test                       14.296137
returns_test                              68.192146
return_std_test                            2.875278
average_reward_test                        4.769988
round_time_test              0 days 00:00:12.099661
round_time_total             0 days 00:09:41.691971
loss_total             10063773702272423926169600.0
loss_critic            12579716895346703491661824.0
loss_actor                    -8019625875341.311523
memory_size                                  1827.0 

=== epoch 10/10 ==== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        696
episode_length                            14.341954
returns                                   68.396814
return_std                                  2.77965
average_reward                             4.768998
round_time                   0 days 00:09:41.763510
episodes_test                                 700.0
episode_length_test                       14.267143
returns_test                              68.008324
return_std_test                            3.001735
average_reward_test                        4.766802
round_time_test              0 days 00:00:12.199309
round_time_total             0 days 00:09:41.764662
loss_total             10228375655093300180811776.0
loss_critic            12785469357881990196494336.0
loss_actor                    -8071240626208.767578
memory_size                                  1827.0 

=== epoch 10/10 ==== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                        699
episode_length                            14.300429
returns                                    68.19998
return_std                                 2.725601
average_reward                             4.769112
round_time                   0 days 00:09:40.932521
episodes_test                                 700.0
episode_length_test                       14.284286
returns_test                              68.141193
return_std_test                            2.588957
average_reward_test                        4.770378
round_time_test              0 days 00:00:12.366806
round_time_total             0 days 00:09:40.933807
loss_total             10340708211808294748028928.0
loss_critic            12925885038751726276968448.0
loss_actor                    -8095053401554.944336
memory_size                                  1827.0 

=== epoch 10/10 ==== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        699
episode_length                            14.286123
returns                                   68.123243
return_std                                 2.780887
average_reward                             4.768509
round_time                   0 days 00:09:42.368295
episodes_test                                 696.0
episode_length_test                       14.366379
returns_test                              68.512153
return_std_test                            2.650537
average_reward_test                        4.768937
round_time_test              0 days 00:00:12.147245
round_time_total             0 days 00:09:42.369353
loss_total             10351352133739196956278784.0
loss_critic            12939189916341513050128384.0
loss_actor                    -8118287386542.080078
memory_size                                  1827.0 

=== epoch 10/10 ==== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        699
episode_length                            14.284692
returns                                   68.111698
return_std                                 2.828034
average_reward                             4.768147
round_time                   0 days 00:09:41.631567
episodes_test                                 702.0
episode_length_test                        14.24359
returns_test                              67.935188
return_std_test                            2.750987
average_reward_test                        4.769538
round_time_test              0 days 00:00:12.317627
round_time_total             0 days 00:09:41.632625
loss_total             10404251408909453521584128.0
loss_critic            13005314020608568037736448.0
loss_actor                    -8143103666159.616211
memory_size                                  1827.0 

=== epoch 10/10 ==== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:39<00:00,  3.45it/s]
episodes                                        701
episode_length                             14.25535
returns                                    67.98554
return_std                                 2.806783
average_reward                             4.769129
round_time                   0 days 00:09:40.529361
episodes_test                                 704.0
episode_length_test                       14.193182
returns_test                              67.680712
return_std_test                            2.946173
average_reward_test                         4.76855
round_time_test              0 days 00:00:12.319804
round_time_total             0 days 00:09:40.530512
loss_total             10561225236594968187895808.0
loss_critic            13201531296568549742477312.0
loss_actor                    -8179649368293.375977
memory_size                                  1827.0 

=== epoch 10/10 ==== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        702
episode_length                            14.220798
returns                                    67.82676
return_std                                  3.02768
average_reward                             4.769549
round_time                   0 days 00:09:41.757348
episodes_test                                 703.0
episode_length_test                       14.211949
returns_test                               67.77691
return_std_test                            3.334322
average_reward_test                        4.769018
round_time_test              0 days 00:00:12.078344
round_time_total             0 days 00:09:41.758501
loss_total             10572378967877185683062784.0
loss_critic            13215473477172511874482176.0
loss_actor                    -8209718294872.064453
memory_size                                  1827.0 

=== epoch 10/10 ==== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        702
episode_length                            14.213675
returns                                   67.790597
return_std                                 3.071969
average_reward                             4.769409
round_time                   0 days 00:09:41.833064
episodes_test                                 704.0
episode_length_test                       14.190341
returns_test                              67.678439
return_std_test                             3.41825
average_reward_test                        4.769341
round_time_test              0 days 00:00:12.114939
round_time_total             0 days 00:09:41.834111
loss_total             10672442490337017148211200.0
loss_critic            13340552863025534615945216.0
loss_actor                    -8221809100455.935547
memory_size                                  1827.0 

=== epoch 10/10 ==== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        702
episode_length                            14.217949
returns                                   67.811668
return_std                                 3.108493
average_reward                             4.769444
round_time                   0 days 00:09:42.334298
episodes_test                                 702.0
episode_length_test                       14.239316
returns_test                              67.913746
return_std_test                            2.897649
average_reward_test                        4.769487
round_time_test              0 days 00:00:12.109204
round_time_total             0 days 00:09:42.335427
loss_total             10635162830843397732827136.0
loss_critic            13293953297089250365276160.0
loss_actor                    -8196606786273.280273
memory_size                                  1827.0 

=== epoch 10/10 ==== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:42<00:00,  3.44it/s]
episodes                                        705
episode_length                            14.167376
returns                                   67.579378
return_std                                 3.172143
average_reward                             4.770074
round_time                   0 days 00:09:42.516167
episodes_test                                 708.0
episode_length_test                       14.118644
returns_test                              67.334579
return_std_test                            3.004555
average_reward_test                         4.76922
round_time_test              0 days 00:00:12.094344
round_time_total             0 days 00:09:42.517212
loss_total             10744409126650449921835008.0
loss_critic            13430511168577448141914112.0
loss_actor                    -8278881927430.143555
memory_size                                  1827.0 

=== epoch 10/10 ==== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:41<00:00,  3.44it/s]
episodes                                        705
episode_length                            14.157447
returns                                   67.518429
return_std                                 3.261774
average_reward                             4.769087
round_time                   0 days 00:09:41.542214
episodes_test                                 714.0
episode_length_test                       14.005602
returns_test                               66.80075
return_std_test                            2.800749
average_reward_test                        4.769574
round_time_test              0 days 00:00:12.150875
round_time_total             0 days 00:09:41.543260
loss_total             10741150218485380348379136.0
loss_critic            13426437527678559373492224.0
loss_actor                    -8318983019167.744141
memory_size                                  1827.0 

=== epoch 10/10 ==== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:40<00:00,  3.45it/s]
episodes                                        707
episode_length                            14.130127
returns                                   67.379585
return_std                                 2.983362
average_reward                             4.768529
round_time                   0 days 00:09:41.046258
episodes_test                                 712.0
episode_length_test                       14.044944
returns_test                              66.965196
return_std_test                            2.821639
average_reward_test                        4.767922
round_time_test              0 days 00:00:12.060685
round_time_total             0 days 00:09:41.047315
loss_total             10935755889749447002816512.0
loss_critic            13669694599969222541967360.0
loss_actor                    -8357903946219.519531
memory_size                                  1827.0 

=== epoch 10/10 ==== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:38<00:00,  3.46it/s]
episodes                                        707
episode_length                             14.12447
returns                                   67.345702
return_std                                  2.99724
average_reward                             4.768028
round_time                   0 days 00:09:38.992291
episodes_test                                 707.0
episode_length_test                       14.142857
returns_test                              67.451118
return_std_test                            2.560968
average_reward_test                        4.769287
round_time_test              0 days 00:00:12.159423
round_time_total             0 days 00:09:38.993442
loss_total             11031845935977787046232064.0
loss_critic            13789807184127728384212992.0
loss_actor                    -8376714898964.480469
memory_size                                  1827.0 

=== epoch 10/10 ==== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:36<00:00,  3.47it/s]
episodes                                        709
episode_length                            14.091678
returns                                   67.192266
return_std                                 2.885735
average_reward                             4.768235
round_time                   0 days 00:09:37.478817
episodes_test                                 708.0
episode_length_test                       14.117232
returns_test                              67.354973
return_std_test                            2.332336
average_reward_test                        4.771168
round_time_test              0 days 00:00:12.186338
round_time_total             0 days 00:09:37.479862
loss_total             11035331078759172016177152.0
loss_critic            13794163598048826704789504.0
loss_actor                    -8414409615212.543945
memory_size                                  1827.0 

=== epoch 10/10 ==== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:33<00:00,  3.49it/s]
episodes                                        706
episode_length                            14.128895
returns                                   67.366719
return_std                                  2.88479
average_reward                             4.767771
round_time                   0 days 00:09:34.125668
episodes_test                                 707.0
episode_length_test                       14.138614
returns_test                              67.432232
return_std_test                            2.667239
average_reward_test                        4.769389
round_time_test              0 days 00:00:12.046221
round_time_total             0 days 00:09:34.126910
loss_total             11237083825866562026340352.0
loss_critic            14046354539174851524952064.0
loss_actor                    -8464781125091.328125
memory_size                                  1827.0 

=== epoch 10/10 ==== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:31<00:00,  3.50it/s]
episodes                                        706
episode_length                            14.150142
returns                                   67.476974
return_std                                 2.702644
average_reward                             4.768659
round_time                   0 days 00:09:32.358732
episodes_test                                 701.0
episode_length_test                        14.24679
returns_test                              67.942478
return_std_test                             2.76055
average_reward_test                         4.76897
round_time_test              0 days 00:00:12.160869
round_time_total             0 days 00:09:32.359789
loss_total             11216697989599437510410240.0
loss_critic            14020872228528707791224832.0
loss_actor                    -8447446693969.919922
memory_size                                  1827.0 

=== epoch 10/10 ==== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:31<00:00,  3.50it/s]
episodes                                        704
episode_length                            14.184659
returns                                   67.648069
return_std                                 2.754884
average_reward                             4.769103
round_time                   0 days 00:09:32.287359
episodes_test                                 698.0
episode_length_test                       14.309456
returns_test                              68.249939
return_std_test                            2.707125
average_reward_test                        4.769586
round_time_test              0 days 00:00:12.092697
round_time_total             0 days 00:09:32.288402
loss_total             11272100431735572167393280.0
loss_critic            14090125291431053190234112.0
loss_actor                    -8439981421101.055664
memory_size                                  1827.0 

=== epoch 10/10 ==== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:33<00:00,  3.49it/s]
episodes                                        703
episode_length                            14.207681
returns                                   67.765825
return_std                                 2.700257
average_reward                             4.769665
round_time                   0 days 00:09:33.787512
episodes_test                                 701.0
episode_length_test                       14.263909
returns_test                              68.034462
return_std_test                            2.747519
average_reward_test                        4.769712
round_time_test              0 days 00:00:11.976951
round_time_total             0 days 00:09:33.788561
loss_total             11150241368647164342829056.0
loss_critic            13937801455004496271769600.0
loss_actor                    -8469145413746.688477
memory_size                                  1827.0 

=== epoch 10/10 ==== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:31<00:00,  3.50it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
episodes                                        699
episode_length                            14.276109
returns                                   68.084886
return_std                                 2.904232
average_reward                             4.769158
round_time                   0 days 00:09:31.697426
episodes_test                                 698.0
episode_length_test                       14.318052
returns_test                              68.275022
return_std_test                            2.957671
average_reward_test                         4.76847
round_time_test              0 days 00:00:12.183797
round_time_total             0 days 00:09:31.698479
loss_total             11168536950876562190761984.0
loss_critic            13960670936250007588700160.0
loss_actor                    -8445761029406.719727
memory_size                                  1827.0 


