/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Humanoid-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 4
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>

<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
 85%|████████▌ | 1702/2000 [00:02<00:00, 830.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 834.57it/s]
episodes                                   72
episode_length                      27.138889
returns                            135.297752
return_std                          65.902445
average_reward                       4.989549
round_time             0 days 00:00:02.407023
episodes_test                           313.0
episode_length_test                  31.84984
returns_test                       159.967014
return_std_test                     76.879375
average_reward_test                  5.021937
round_time_test        0 days 00:00:12.531482
round_time_total       0 days 00:00:14.190170 

=== epoch 1/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 62%|██████▏   | 1248/2000 [00:01<00:00, 778.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 819.05it/s]
episodes                                  145
episode_length                      27.544828
returns                            136.768537
return_std                          69.395395
average_reward                       4.965242
round_time             0 days 00:00:02.891731
episodes_test                           314.0
episode_length_test                 31.770701
returns_test                       159.590285
return_std_test                     80.386754
average_reward_test                  5.022863
round_time_test        0 days 00:00:12.668842
round_time_total       0 days 00:00:14.288522 

=== epoch 1/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 63%|██████▎   | 1255/2000 [00:01<00:00, 828.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 831.19it/s]
episodes                                  218
episode_length                      27.490826
returns                            135.953906
return_std                          67.279394
average_reward                       4.945377
round_time             0 days 00:00:02.885670
episodes_test                           314.0
episode_length_test                 31.738854
returns_test                       157.769213
return_std_test                     76.470024
average_reward_test                  4.970517
round_time_test        0 days 00:00:12.567023
round_time_total       0 days 00:00:14.186528 

=== epoch 1/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 59%|█████▉    | 1177/2000 [00:01<00:00, 828.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 827.93it/s]
episodes                                  293
episode_length                      27.215017
returns                            134.763756
return_std                          65.449719
average_reward                       4.950863
round_time             0 days 00:00:02.944804
episodes_test                           305.0
episode_length_test                 32.734426
returns_test                        164.52977
return_std_test                     80.956386
average_reward_test                  5.025975
round_time_test        0 days 00:00:12.532787
round_time_total       0 days 00:00:14.177365 

=== epoch 1/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
 62%|██████▏   | 1246/2000 [00:01<00:00, 824.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 824.49it/s]
episodes                                  371
episode_length                       26.90566
returns                            133.264372
return_std                          64.391236
average_reward                       4.953172
round_time             0 days 00:00:02.917296
episodes_test                           299.0
episode_length_test                 33.314381
returns_test                       166.014786
return_std_test                      76.69964
average_reward_test                  4.981161
round_time_test        0 days 00:00:12.564055
round_time_total       0 days 00:00:14.226636 

=== epoch 1/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 3/2000 [00:01<13:37,  2.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:16<00:00,  2.72it/s]
starting training
episodes                                  374
episode_length                      26.716578
returns                             131.95313
return_std                          60.229702
average_reward                       4.939151
round_time             0 days 00:12:16.982985
episodes_test                           299.0
episode_length_test                 33.391304
returns_test                       167.971481
return_std_test                     82.135642
average_reward_test                  5.030301
round_time_test        0 days 00:00:12.601390
round_time_total       0 days 00:12:16.984390
loss_total                       26376.221827
loss_critic                      33235.462328
loss_actor                       -1060.742344
memory_size                          1293.494 

=== epoch 1/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<15:09,  2.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                  384
episode_length                      25.859375
returns                            127.403618
return_std                          53.594622
average_reward                       4.926332
round_time             0 days 00:15:09.631700
episodes_test                           366.0
episode_length_test                 27.295082
returns_test                       136.320628
return_std_test                     47.600509
average_reward_test                   4.99419
round_time_test        0 days 00:00:14.321244
round_time_total       0 days 00:15:09.632789
loss_total                   140684712.390771
loss_critic                  175877911.046113
loss_actor                      -88096.102294
memory_size                          1380.452 

=== epoch 1/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:01,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:32<00:00,  2.14it/s]
episodes                                  390
episode_length                      25.584615
returns                            125.708292
return_std                          49.901947
average_reward                       4.913436
round_time             0 days 00:15:33.355633
episodes_test                           397.0
episode_length_test                 25.178841
returns_test                       122.169746
return_std_test                     43.312052
average_reward_test                  4.852101
round_time_test        0 days 00:00:14.592637
round_time_total       0 days 00:15:33.356988
loss_total                508127328347.263977
loss_critic               635159769795.264038
loss_actor                    -2490467.232031
memory_size                         1450.0275 

=== epoch 1/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:25,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:39<00:00,  2.13it/s]
episodes                                  402
episode_length                      24.818408
returns                            121.522557
return_std                          45.873468
average_reward                       4.895511
round_time             0 days 00:15:40.316138
episodes_test                           426.0
episode_length_test                 23.441315
returns_test                        113.92453
return_std_test                     38.178537
average_reward_test                  4.859924
round_time_test        0 days 00:00:14.324844
round_time_total       0 days 00:15:40.317386
loss_total              17111236187455.488281
loss_critic             21389048152260.609375
loss_actor                    -12831862.52075
memory_size                         1520.5425 

=== epoch 1/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:01,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:58<00:00,  2.09it/s]
episodes                                  414
episode_length                      24.060386
returns                            116.844796
return_std                          39.496419
average_reward                       4.857315
round_time             0 days 00:15:59.089172
episodes_test                           449.0
episode_length_test                 22.262806
returns_test                       107.664131
return_std_test                     32.777228
average_reward_test                  4.836092
round_time_test        0 days 00:00:14.174569
round_time_total       0 days 00:15:59.090471
loss_total              159248744412348.40625
loss_critic              199060935501938.6875
loss_actor                      -36723512.584
memory_size                          1561.909 

=== epoch 1/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:31,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:00<00:00,  2.08it/s]
episodes                                  425
episode_length                      23.428235
returns                            113.085592
return_std                          36.971177
average_reward                       4.826094
round_time             0 days 00:16:00.677453
episodes_test                           455.0
episode_length_test                 21.934066
returns_test                       104.530984
return_std_test                     29.637866
average_reward_test                  4.764988
round_time_test        0 days 00:00:14.387414
round_time_total       0 days 00:16:00.678546
loss_total                 810332168977907.75
loss_critic               1012915194570997.75
loss_actor                      -78457062.744
memory_size                           1588.89 

=== epoch 1/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                  429
episode_length                       23.25641
returns                            111.673406
return_std                          34.723698
average_reward                       4.802205
round_time             0 days 00:16:03.968537
episodes_test                           446.0
episode_length_test                  22.38565
returns_test                       106.923601
return_std_test                     30.596811
average_reward_test                  4.776523
round_time_test        0 days 00:00:14.508699
round_time_total       0 days 00:16:03.969635
loss_total                 2394548312263885.0
loss_critic                2993185338295320.5
loss_actor                      -130210880.84
memory_size                         1612.7265 

=== epoch 1/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:48,  1.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                  430
episode_length                      23.167442
returns                            110.802476
return_std                          32.791416
average_reward                       4.784584
round_time             0 days 00:16:10.494770
episodes_test                           425.0
episode_length_test                 23.491765
returns_test                       112.108542
return_std_test                     32.565349
average_reward_test                  4.772438
round_time_test        0 days 00:00:14.153087
round_time_total       0 days 00:16:10.496391
loss_total                 5403035816849048.0
loss_critic                6753794656389038.0
loss_actor                     -191271627.048
memory_size                          1633.103 

=== epoch 1/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<21:20,  1.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                  426
episode_length                      23.415493
returns                            110.966073
return_std                          30.535799
average_reward                       4.738797
round_time             0 days 00:16:05.867109
episodes_test                           420.0
episode_length_test                      23.8
returns_test                       112.235498
return_std_test                     30.083604
average_reward_test                  4.715863
round_time_test        0 days 00:00:14.334187
round_time_total       0 days 00:16:05.868214
loss_total                 9773710915504440.0
loss_critic               12217138434497576.0
loss_actor                     -246841417.768
memory_size                         1677.9715 

=== epoch 1/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:16,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                  426
episode_length                      23.450704
returns                            110.835939
return_std                          30.079976
average_reward                       4.726558
round_time             0 days 00:16:02.027343
episodes_test                           418.0
episode_length_test                 23.906699
returns_test                       111.893831
return_std_test                     26.699243
average_reward_test                  4.680563
round_time_test        0 days 00:00:14.319221
round_time_total       0 days 00:16:02.028463
loss_total                16128069027481256.0
loss_critic               20160085926460000.0
loss_actor                     -310103411.424
memory_size                         1692.2095 

=== epoch 1/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:21,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                  436
episode_length                      22.869266
returns                            107.669458
return_std                          27.993122
average_reward                       4.707666
round_time             0 days 00:16:09.118830
episodes_test                           457.0
episode_length_test                 21.868709
returns_test                       102.427637
return_std_test                     22.762833
average_reward_test                  4.683877
round_time_test        0 days 00:00:14.380109
round_time_total       0 days 00:16:09.120075
loss_total                24538077478080480.0
loss_critic               30672596319588056.0
loss_actor                       -383285575.2
memory_size                            1696.0 

=== epoch 1/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:18,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:02<00:00,  2.08it/s]
episodes                                  450
episode_length                      22.193333
returns                             104.34768
return_std                          26.905857
average_reward                        4.70193
round_time             0 days 00:16:03.511985
episodes_test                           547.0
episode_length_test                 18.255941
returns_test                         86.58178
return_std_test                      15.00066
average_reward_test                  4.742761
round_time_test        0 days 00:00:14.184663
round_time_total       0 days 00:16:03.513326
loss_total                37947147874184728.0
loss_critic               47433934039706240.0
loss_actor                     -471290715.504
memory_size                            1696.0 

=== epoch 1/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:39,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:02<00:00,  2.08it/s]
episodes                                  468
episode_length                      21.320513
returns                            100.118261
return_std                          23.005486
average_reward                       4.696084
round_time             0 days 00:16:02.882871
episodes_test                           496.0
episode_length_test                 20.149194
returns_test                        95.041538
return_std_test                     20.170414
average_reward_test                  4.717001
round_time_test        0 days 00:00:14.144368
round_time_total       0 days 00:16:02.883953
loss_total                55781770580885240.0
loss_critic               69727211991840328.0
loss_actor                       -570293778.0
memory_size                            1696.0 

=== epoch 1/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:23,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                  487
episode_length                      20.513347
returns                             96.716525
return_std                           20.73483
average_reward                       4.714373
round_time             0 days 00:16:06.582011
episodes_test                           490.0
episode_length_test                 20.385714
returns_test                        96.536577
return_std_test                     21.042051
average_reward_test                  4.735594
round_time_test        0 days 00:00:13.859751
round_time_total       0 days 00:16:06.583456
loss_total                82692000224301280.0
loss_critic              103364998435419712.0
loss_actor                     -681162865.824
memory_size                            1696.0 

=== epoch 1/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                  492
episode_length                      20.288618
returns                               95.7757
return_std                          19.885492
average_reward                       4.720737
round_time             0 days 00:16:02.304223
episodes_test                           505.0
episode_length_test                  19.79604
returns_test                        93.266344
return_std_test                     19.100177
average_reward_test                  4.711425
round_time_test        0 days 00:00:14.406054
round_time_total       0 days 00:16:02.305415
loss_total               125192487608430624.0
loss_critic              156490606883628896.0
loss_actor                     -829251883.872
memory_size                            1696.0 

=== epoch 1/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:18,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  494
episode_length                      20.218623
returns                             95.800576
return_std                           20.22865
average_reward                        4.73865
round_time             0 days 00:16:05.053990
episodes_test                           484.0
episode_length_test                 20.632231
returns_test                        97.578185
return_std_test                     22.892632
average_reward_test                  4.729456
round_time_test        0 days 00:00:14.455505
round_time_total       0 days 00:16:05.055263
loss_total               177992003284621056.0
loss_critic              222490000311172736.0
loss_actor                     -991969055.456
memory_size                            1696.0 

=== epoch 1/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:06,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:07<00:00,  2.07it/s]
episodes                                  492
episode_length                       20.29878
returns                             96.166992
return_std                          20.585817
average_reward                       4.737859
round_time             0 days 00:16:07.890197
episodes_test                           473.0
episode_length_test                  21.12685
returns_test                        99.765565
return_std_test                     21.773481
average_reward_test                  4.722306
round_time_test        0 days 00:00:14.569891
round_time_total       0 days 00:16:07.891284
loss_total               256675258780983744.0
loss_critic              320844068275024320.0
loss_actor                    -1173690000.896
memory_size                            1696.0 

=== epoch 1/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:43,  1.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                  490
episode_length                      20.383673
returns                             96.498131
return_std                          20.820257
average_reward                         4.7341
round_time             0 days 00:16:07.597171
episodes_test                           502.0
episode_length_test                 19.916335
returns_test                        94.295889
return_std_test                      20.28157
average_reward_test                  4.734637
round_time_test        0 days 00:00:14.361991
round_time_total       0 days 00:16:07.598300
loss_total               365173552242537536.0
loss_critic              456466932428349376.0
loss_actor                    -1392243240.064
memory_size                            1696.0 

=== epoch 1/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:20,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
episodes                                  487
episode_length                      20.449692
returns                             96.677296
return_std                          20.692278
average_reward                       4.727976
round_time             0 days 00:16:11.535782
episodes_test                           498.0
episode_length_test                 20.044177
returns_test                         94.74231
return_std_test                     20.515395
average_reward_test                  4.726698
round_time_test        0 days 00:00:14.196970
round_time_total       0 days 00:16:11.537066
loss_total               457598024864753536.0
loss_critic              571997522095870336.0
loss_actor                     -1584429839.36
memory_size                         1698.1045 

=== epoch 1/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:10,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                  489
episode_length                      20.396728
returns                             96.431584
return_std                          20.753376
average_reward                       4.728351
round_time             0 days 00:16:09.201808
episodes_test                           489.0
episode_length_test                 20.425358
returns_test                        95.801163
return_std_test                     21.170005
average_reward_test                  4.690485
round_time_test        0 days 00:00:14.395473
round_time_total       0 days 00:16:09.202905
loss_total               560159446502618432.0
loss_critic              700199296826066560.0
loss_actor                    -1762299208.448
memory_size                            1699.0 

=== epoch 1/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:48,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                  489
episode_length                      20.388548
returns                             96.256099
return_std                          20.361536
average_reward                       4.720874
round_time             0 days 00:16:04.361100
episodes_test                           506.0
episode_length_test                 19.747036
returns_test                        93.608448
return_std_test                     19.876613
average_reward_test                  4.740458
round_time_test        0 days 00:00:14.067885
round_time_total       0 days 00:16:04.362192
loss_total               718647814807168768.0
loss_critic              898309752709923712.0
loss_actor                    -1989684981.504
memory_size                            1699.0 

=== epoch 1/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:18,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                  491
episode_length                      20.281059
returns                             95.777366
return_std                          20.081085
average_reward                       4.722823
round_time             0 days 00:16:03.644908
episodes_test                           486.0
episode_length_test                 20.534979
returns_test                        96.976467
return_std_test                       21.3553
average_reward_test                  4.722522
round_time_test        0 days 00:00:14.381650
round_time_total       0 days 00:16:03.646089
loss_total               925756404884712704.0
loss_critic             1157195488376265984.0
loss_actor                    -2248794218.112
memory_size                            1699.0 

=== epoch 1/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:03,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                  496
episode_length                      20.100806
returns                             95.060806
return_std                          19.556219
average_reward                       4.729267
round_time             0 days 00:16:11.689102
episodes_test                           505.0
episode_length_test                 19.774257
returns_test                        93.769188
return_std_test                     19.986295
average_reward_test                  4.742082
round_time_test        0 days 00:00:14.802949
round_time_total       0 days 00:16:11.690211
loss_total              1201079090532988160.0
loss_critic             1501348837293352192.0
loss_actor                    -2529063561.856
memory_size                            1699.0 

=== epoch 1/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:10,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.07it/s]
episodes                                  498
episode_length                      20.034137
returns                             94.790653
return_std                          19.125164
average_reward                       4.731697
round_time             0 days 00:16:08.802052
episodes_test                           498.0
episode_length_test                 20.076305
returns_test                        95.077678
return_std_test                     19.415941
average_reward_test                  4.735851
round_time_test        0 days 00:00:14.287779
round_time_total       0 days 00:16:08.803145
loss_total              1554915769189744384.0
loss_critic             1943644677737327616.0
loss_actor                    -2869380557.824
memory_size                            1699.0 

=== epoch 1/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:19,  1.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                  499
episode_length                       19.96994
returns                             94.395309
return_std                          18.287924
average_reward                       4.727181
round_time             0 days 00:16:06.286737
episodes_test                           487.0
episode_length_test                 20.519507
returns_test                        96.694688
return_std_test                     19.908195
average_reward_test                  4.712446
round_time_test        0 days 00:00:14.044313
round_time_total       0 days 00:16:06.287880
loss_total              2033963426807411712.0
loss_critic             2542454241581793792.0
loss_actor                    -3266638587.648
memory_size                            1699.0 

=== epoch 1/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:21,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.07it/s]
episodes                                  503
episode_length                      19.842942
returns                             93.804522
return_std                          17.977305
average_reward                       4.727136
round_time             0 days 00:16:04.458300
episodes_test                           499.0
episode_length_test                 19.997996
returns_test                        93.887683
return_std_test                     17.781645
average_reward_test                  4.694397
round_time_test        0 days 00:00:14.237006
round_time_total       0 days 00:16:04.459606
loss_total              2575551710742494208.0
loss_critic             3219439581098894336.0
loss_actor                    -3696912907.136
memory_size                            1699.0 

=== epoch 1/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:57,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                  502
episode_length                      19.914343
returns                             94.008059
return_std                           17.72526
average_reward                       4.720656
round_time             0 days 00:16:04.395837
episodes_test                           505.0
episode_length_test                  19.79802
returns_test                        93.631634
return_std_test                     18.295919
average_reward_test                  4.729376
round_time_test        0 days 00:00:14.138545
round_time_total       0 days 00:16:04.396927
loss_total              3238239347501237760.0
loss_critic             4047799113629846016.0
loss_actor                    -4139597798.912
memory_size                            1699.0 

=== epoch 1/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:09,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                  499
episode_length                      20.018036
returns                             94.415692
return_std                          18.141483
average_reward                       4.716571
round_time             0 days 00:16:11.736286
episodes_test                           492.0
episode_length_test                 20.292683
returns_test                        95.782675
return_std_test                     20.396115
average_reward_test                  4.720087
round_time_test        0 days 00:00:13.941856
round_time_total       0 days 00:16:11.737355
loss_total              3992981467671807488.0
loss_critic             4991226753122820096.0
loss_actor                     -4590843340.16
memory_size                            1699.0 

=== epoch 1/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:50,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                  501
episode_length                      19.944112
returns                             93.998305
return_std                          18.452898
average_reward                       4.713189
round_time             0 days 00:16:11.655050
episodes_test                           474.0
episode_length_test                 21.094937
returns_test                        99.615853
return_std_test                     21.645995
average_reward_test                  4.722281
round_time_test        0 days 00:00:14.102824
round_time_total       0 days 00:16:11.656404
loss_total              4816574843383429120.0
loss_critic             6020718450256717824.0
loss_actor                    -5063295754.624
memory_size                            1699.0 

=== epoch 1/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:44,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                  502
episode_length                      19.912351
returns                             94.039184
return_std                          19.053245
average_reward                       4.722771
round_time             0 days 00:16:09.485296
episodes_test                           492.0
episode_length_test                 20.296748
returns_test                        95.857035
return_std_test                     20.157772
average_reward_test                  4.722958
round_time_test        0 days 00:00:14.108605
round_time_total       0 days 00:16:09.486396
loss_total              5612328973769438208.0
loss_critic             7015411094135215104.0
loss_actor                    -5510293481.472
memory_size                            1699.0 

=== epoch 1/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:08,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                  500
episode_length                         19.968
returns                             94.321218
return_std                          19.479641
average_reward                        4.72373
round_time             0 days 00:16:09.659457
episodes_test                           503.0
episode_length_test                 19.860835
returns_test                        93.700745
return_std_test                     18.088463
average_reward_test                  4.717995
round_time_test        0 days 00:00:14.447133
round_time_total       0 days 00:16:09.660536
loss_total              6791462758204335104.0
loss_critic             8489328304853266432.0
loss_actor                    -6044701688.832
memory_size                            1699.0 

=== epoch 1/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:40,  2.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                  499
episode_length                      19.991984
returns                             94.511308
return_std                          20.167024
average_reward                       4.727477
round_time             0 days 00:16:12.260966
episodes_test                           498.0
episode_length_test                 20.064257
returns_test                        94.769731
return_std_test                     19.602459
average_reward_test                  4.723417
round_time_test        0 days 00:00:14.378642
round_time_total       0 days 00:16:12.262255
loss_total              8052870817963907072.0
loss_critic            10066088344059121664.0
loss_actor                     -6631506900.48
memory_size                          1704.754 

=== epoch 1/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:57,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                  497
episode_length                       20.05835
returns                             94.800373
return_std                          20.041942
average_reward                       4.726389
round_time             0 days 00:16:06.770650
episodes_test                           510.0
episode_length_test                 19.560784
returns_test                        92.624304
return_std_test                     17.690218
average_reward_test                  4.735034
round_time_test        0 days 00:00:14.333282
round_time_total       0 days 00:16:06.771731
loss_total              9773529207396407296.0
loss_critic            12216911302331166720.0
loss_actor                    -7322197579.008
memory_size                         1706.2535 

=== epoch 1/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:23,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
episodes                                  492
episode_length                      20.296748
returns                             96.018234
return_std                          20.965829
average_reward                       4.730938
round_time             0 days 00:16:11.356629
episodes_test                           480.0
episode_length_test                 20.833333
returns_test                        97.922445
return_std_test                     21.554412
average_reward_test                  4.700277
round_time_test        0 days 00:00:14.096806
round_time_total       0 days 00:16:11.357710
loss_total             11784351919375880192.0
loss_critic            14730439637123764224.0
loss_actor                    -7992079755.008
memory_size                           1709.15 

=== epoch 1/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.07it/s]
episodes                                  489
episode_length                      20.376278
returns                             96.128016
return_std                          20.743881
average_reward                         4.7177
round_time             0 days 00:16:08.711366
episodes_test                           487.0
episode_length_test                 20.474333
returns_test                        96.591825
return_std_test                     21.293416
average_reward_test                  4.716928
round_time_test        0 days 00:00:13.796460
round_time_total       0 days 00:16:08.712492
loss_total             14267324083312478208.0
loss_critic            17834154798132768768.0
loss_actor                 -8702152883.200001
memory_size                          1713.573 

=== epoch 1/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:13,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:13<00:00,  2.06it/s]
episodes                                  486
episode_length                       20.50823
returns                             96.749038
return_std                          21.112472
average_reward                       4.717096
round_time             0 days 00:16:13.853038
episodes_test                           479.0
episode_length_test                 20.874739
returns_test                        98.455327
return_std_test                     24.181416
average_reward_test                  4.716502
round_time_test        0 days 00:00:13.978319
round_time_total       0 days 00:16:13.854300
loss_total             16729804786481170432.0
loss_critic            20912255648300171264.0
loss_actor                    -9425286312.704
memory_size                            1717.0 

=== epoch 1/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:18,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                  483
episode_length                      20.656315
returns                             97.372924
return_std                          21.601066
average_reward                       4.713991
round_time             0 days 00:16:12.498683
episodes_test                           497.0
episode_length_test                 20.102616
returns_test                         94.81562
return_std_test                      20.02547
average_reward_test                  4.716712
round_time_test        0 days 00:00:14.358470
round_time_total       0 days 00:16:12.499975
loss_total             18571042977573011456.0
loss_critic            23213803343047069696.0
loss_actor                -10037145039.360001
memory_size                         1724.7875 

=== epoch 1/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:42,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                  486
episode_length                      20.549383
returns                             96.915394
return_std                          21.927474
average_reward                       4.716326
round_time             0 days 00:16:12.767880
episodes_test                           491.0
episode_length_test                 20.340122
returns_test                        96.045965
return_std_test                      21.74191
average_reward_test                  4.722136
round_time_test        0 days 00:00:14.066170
round_time_total       0 days 00:16:12.769418
loss_total             19943051192208596992.0
loss_critic            24928813529977692160.0
loss_actor                -10632233324.544001
memory_size                         1736.8695 

=== epoch 1/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:36,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
episodes                                  487
episode_length                      20.466119
returns                             96.438514
return_std                          21.440617
average_reward                       4.712097
round_time             0 days 00:16:11.445388
episodes_test                           503.0
episode_length_test                 19.848907
returns_test                        93.870219
return_std_test                     19.445627
average_reward_test                  4.729368
round_time_test        0 days 00:00:14.100108
round_time_total       0 days 00:16:11.446865
loss_total             21098785877029703680.0
loss_critic            26373481879819321344.0
loss_actor                    -11121300689.92
memory_size                          1742.122 

=== epoch 1/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:06,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                  495
episode_length                      20.165657
returns                             95.145155
return_std                          21.393127
average_reward                       4.717845
round_time             0 days 00:16:13.533199
episodes_test                           499.0
episode_length_test                 20.036072
returns_test                        94.697717
return_std_test                     18.942109
average_reward_test                    4.7264
round_time_test        0 days 00:00:14.206471
round_time_total       0 days 00:16:13.534451
loss_total             21515775577201356800.0
loss_critic            26894719000360964096.0
loss_actor                -11384404054.016001
memory_size                         1753.2325 

=== epoch 1/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:07,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                  494
episode_length                      20.198381
returns                             95.205325
return_std                          21.236805
average_reward                       4.713388
round_time             0 days 00:16:13.622337
episodes_test                           499.0
episode_length_test                      20.0
returns_test                        94.194118
return_std_test                     18.674003
average_reward_test                  4.709657
round_time_test        0 days 00:00:13.855911
round_time_total       0 days 00:16:13.623432
loss_total             23244219897916575744.0
loss_critic            29055274315493081088.0
loss_actor                -11827539595.775999
memory_size                            1755.0 

=== epoch 1/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:37,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                  499
episode_length                       19.98998
returns                              94.23816
return_std                          20.382111
average_reward                       4.714164
round_time             0 days 00:16:16.178845
episodes_test                           490.0
episode_length_test                 20.379592
returns_test                        96.117586
return_std_test                      19.63389
average_reward_test                  4.716342
round_time_test        0 days 00:00:14.238293
round_time_total       0 days 00:16:16.179931
loss_total             25336240457564692480.0
loss_critic            31670300022474932224.0
loss_actor                -12457457444.351999
memory_size                          1757.807 

=== epoch 1/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                  501
episode_length                      19.926148
returns                             93.980128
return_std                           19.79293
average_reward                       4.716174
round_time             0 days 00:16:15.652948
episodes_test                           503.0
episode_length_test                 19.862823
returns_test                        93.702676
return_std_test                     17.801138
average_reward_test                  4.717611
round_time_test        0 days 00:00:14.251162
round_time_total       0 days 00:16:15.654319
loss_total             27975643116888084480.0
loss_critic            34969553325051252736.0
loss_actor                -12977325332.992001
memory_size                            1762.0 

=== epoch 1/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:03,  1.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:13<00:00,  2.05it/s]
episodes                                  499
episode_length                      20.038076
returns                             94.477453
return_std                          19.521643
average_reward                       4.714868
round_time             0 days 00:16:13.849842
episodes_test                           481.0
episode_length_test                 20.769231
returns_test                        97.373718
return_std_test                     19.954089
average_reward_test                   4.68852
round_time_test        0 days 00:00:14.258276
round_time_total       0 days 00:16:13.850908
loss_total             29634073712536109056.0
loss_critic            37042591528929353728.0
loss_actor                -13484003959.808001
memory_size                            1762.0 

=== epoch 1/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:40,  1.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  485
episode_length                       20.54433
returns                             96.775008
return_std                          19.980513
average_reward                       4.710745
round_time             0 days 00:16:15.693646
episodes_test                           461.0
episode_length_test                 21.655098
returns_test                       101.822863
return_std_test                     25.426124
average_reward_test                  4.702294
round_time_test        0 days 00:00:14.086123
round_time_total       0 days 00:16:15.694743
loss_total             30937096397758828544.0
loss_critic            38671369849036431360.0
loss_actor                   -13849409832.448
memory_size                         1762.1945 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:02<18:23,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:51<00:00,  2.10it/s]
episodes                                   96
episode_length                      20.770833
returns                             97.039916
return_std                          21.947408
average_reward                       4.672526
round_time             0 days 00:15:51.857509
episodes_test                           464.0
episode_length_test                 21.538793
returns_test                       101.449263
return_std_test                     23.177331
average_reward_test                  4.710174
round_time_test        0 days 00:00:14.345663
round_time_total       0 days 00:15:51.859003
loss_total             32911634602873479168.0
loss_critic            41139542540558557184.0
loss_actor                -14292343641.087999
memory_size                         1766.8865 

=== epoch 2/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:58,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  185
episode_length                      21.540541
returns                             101.03399
return_std                          24.082427
average_reward                       4.690927
round_time             0 days 00:16:04.683788
episodes_test                           452.0
episode_length_test                 22.106195
returns_test                       103.674594
return_std_test                     25.604836
average_reward_test                  4.689968
round_time_test        0 days 00:00:14.311522
round_time_total       0 days 00:16:04.685200
loss_total             36991372386135900160.0
loss_critic            46239214669443579904.0
loss_actor                -14952171935.743999
memory_size                          1775.541 

=== epoch 2/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:49,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:59<00:00,  2.08it/s]
episodes                                  275
episode_length                      21.734545
returns                            102.222904
return_std                          24.637323
average_reward                       4.703172
round_time             0 days 00:16:00.414743
episodes_test                           457.0
episode_length_test                  21.85558
returns_test                       102.893492
return_std_test                     26.146175
average_reward_test                  4.708032
round_time_test        0 days 00:00:14.048968
round_time_total       0 days 00:16:00.415856
loss_total             40416154556310290432.0
loss_critic            50520192301759782912.0
loss_actor                   -15392031476.736
memory_size                         1781.0175 

=== epoch 2/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:57<00:00,  2.09it/s]
episodes                                  362
episode_length                      22.060773
returns                            104.129147
return_std                           26.08128
average_reward                       4.720231
round_time             0 days 00:15:57.766004
episodes_test                           441.0
episode_length_test                 22.664399
returns_test                       108.120049
return_std_test                     29.571943
average_reward_test                  4.770546
round_time_test        0 days 00:00:13.650947
round_time_total       0 days 00:15:57.767280
loss_total             46350029836154871808.0
loss_critic            57937536230041698304.0
loss_actor                   -16319616902.656
memory_size                          1801.533 

=== epoch 2/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                  457
episode_length                      21.866521
returns                            103.507138
return_std                          25.746102
average_reward                       4.733689
round_time             0 days 00:16:01.941420
episodes_test                           435.0
episode_length_test                 22.921839
returns_test                       108.971876
return_std_test                     31.702014
average_reward_test                  4.753692
round_time_test        0 days 00:00:13.835556
round_time_total       0 days 00:16:01.942983
loss_total             51235592251568889856.0
loss_critic            64044489219622412288.0
loss_actor                   -16882635314.688
memory_size                         1815.4615 

=== epoch 2/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:44,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:02<00:00,  2.08it/s]
episodes                                  449
episode_length                       22.22049
returns                            105.408943
return_std                          27.387665
average_reward                       4.743747
round_time             0 days 00:16:03.387482
episodes_test                           459.0
episode_length_test                 21.738562
returns_test                       102.962973
return_std_test                      20.61032
average_reward_test                  4.735858
round_time_test        0 days 00:00:13.898616
round_time_total       0 days 00:16:03.388850
loss_total             55316152852708687872.0
loss_critic            69145189869067452416.0
loss_actor                   -17431349445.632
memory_size                         1839.4675 

=== epoch 2/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:45,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  449
episode_length                       22.25167
returns                            105.705879
return_std                          27.348332
average_reward                       4.750459
round_time             0 days 00:16:04.578733
episodes_test                           415.0
episode_length_test                 24.086747
returns_test                       114.104913
return_std_test                     34.013499
average_reward_test                  4.737307
round_time_test        0 days 00:00:14.266560
round_time_total       0 days 00:16:04.579814
loss_total             61329548761820282880.0
loss_critic            76661934721646968832.0
loss_actor                -18331368783.360001
memory_size                         1852.6855 

=== epoch 2/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:59,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  448
episode_length                      22.287946
returns                            106.074524
return_std                          27.543038
average_reward                       4.759254
round_time             0 days 00:16:05.369570
episodes_test                           434.0
episode_length_test                 22.990783
returns_test                       109.538039
return_std_test                      29.33379
average_reward_test                  4.764507
round_time_test        0 days 00:00:13.970356
round_time_total       0 days 00:16:05.371117
loss_total             68546411487029149696.0
loss_critic            85683012919250845696.0
loss_actor                -19171451876.863998
memory_size                         1864.8655 

=== epoch 2/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:57,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  446
episode_length                      22.376682
returns                            106.467779
return_std                          27.084491
average_reward                       4.758131
round_time             0 days 00:16:05.040332
episodes_test                           451.0
episode_length_test                 22.148559
returns_test                       105.460489
return_std_test                     22.547092
average_reward_test                  4.761575
round_time_test        0 days 00:00:13.802369
round_time_total       0 days 00:16:05.041443
loss_total             71678714402534490112.0
loss_critic            89598391457529643008.0
loss_actor                -19685680891.391998
memory_size                          1873.386 

=== epoch 2/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:15,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                  435
episode_length                      22.894253
returns                            108.583365
return_std                          27.220139
average_reward                       4.742812
round_time             0 days 00:16:04.790840
episodes_test                           415.0
episode_length_test                 24.086747
returns_test                       114.908652
return_std_test                     30.093331
average_reward_test                  4.770673
round_time_test        0 days 00:00:13.897965
round_time_total       0 days 00:16:04.791933
loss_total             78281327131662041088.0
loss_critic            97851657104231645184.0
loss_actor                -20439577731.071999
memory_size                          1889.433 

=== epoch 2/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:13,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                   435
episode_length                       22.951724
returns                             108.888969
return_std                           26.060693
average_reward                        4.744356
round_time              0 days 00:16:09.948281
episodes_test                            424.0
episode_length_test                  23.540094
returns_test                        111.944632
return_std_test                      28.309099
average_reward_test                   4.755429
round_time_test         0 days 00:00:14.226987
round_time_total        0 days 00:16:09.949397
loss_total              86397964415650365440.0
loss_critic            107997453560233246720.0
loss_actor                 -21249830769.664001
memory_size                            1897.83 

=== epoch 2/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:05,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                   434
episode_length                       23.036866
returns                             109.312474
return_std                           26.877572
average_reward                        4.745147
round_time              0 days 00:16:09.657047
episodes_test                            462.0
episode_length_test                  21.599567
returns_test                         102.51856
return_std_test                      24.290835
average_reward_test                   4.746414
round_time_test         0 days 00:00:14.409379
round_time_total        0 days 00:16:09.658140
loss_total              91814489196275138560.0
loss_critic            114768109525568847872.0
loss_actor                 -21938981942.271999
memory_size                           1907.315 

=== epoch 2/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:03,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                   428
episode_length                       23.221963
returns                             110.160583
return_std                           28.336304
average_reward                        4.742399
round_time              0 days 00:16:05.387087
episodes_test                            431.0
episode_length_test                  23.167053
returns_test                        110.305877
return_std_test                      27.825815
average_reward_test                   4.761312
round_time_test         0 days 00:00:13.918410
round_time_total        0 days 00:16:05.388522
loss_total             100509094548659372032.0
loss_critic            125636366062667251712.0
loss_actor                 -22251718107.136002
memory_size                           1960.572 

=== epoch 2/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:01,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   448
episode_length                        22.28125
returns                             105.739477
return_std                           28.186763
average_reward                        4.745842
round_time              0 days 00:16:18.214530
episodes_test                            453.0
episode_length_test                  22.066225
returns_test                        105.365692
return_std_test                      32.280566
average_reward_test                    4.77503
round_time_test         0 days 00:00:14.255769
round_time_total        0 days 00:16:18.215619
loss_total              94537071858024628224.0
loss_critic            118171337718065528832.0
loss_actor                 -21123163329.535999
memory_size                           2009.679 

=== epoch 2/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:42,  1.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   471
episode_length                       21.193206
returns                             100.837565
return_std                            26.24273
average_reward                        4.758015
round_time              0 days 00:16:22.810587
episodes_test                            523.0
episode_length_test                   19.08413
returns_test                         91.576043
return_std_test                      13.537184
average_reward_test                    4.79841
round_time_test         0 days 00:00:14.171763
round_time_total        0 days 00:16:22.811719
loss_total              84640704969405661184.0
loss_critic            105800879251327827968.0
loss_actor                 -20103513508.863998
memory_size                             2010.0 

=== epoch 2/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:07,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.03it/s]
episodes                                  487
episode_length                      20.474333
returns                             97.590039
return_std                          24.732483
average_reward                       4.766532
round_time             0 days 00:16:23.387642
episodes_test                           506.0
episode_length_test                 19.743083
returns_test                        94.708022
return_std_test                     17.793143
average_reward_test                  4.797034
round_time_test        0 days 00:00:14.568139
round_time_total       0 days 00:16:23.388980
loss_total             73037519935530926080.0
loss_critic            91296898350410563584.0
loss_actor                   -18362270125.056
memory_size                            2010.0 

=== epoch 2/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:42,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:18<00:00,  2.04it/s]
episodes                                  501
episode_length                      19.892216
returns                              95.21898
return_std                          22.770788
average_reward                       4.786594
round_time             0 days 00:16:19.082868
episodes_test                           523.0
episode_length_test                 19.082218
returns_test                        91.970938
return_std_test                     18.115871
average_reward_test                  4.820113
round_time_test        0 days 00:00:13.882441
round_time_total       0 days 00:16:19.084137
loss_total             70376559416154275840.0
loss_critic            87970697773207764992.0
loss_actor                    -18607412999.68
memory_size                         2014.5785 

=== epoch 2/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:39,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:45<00:00,  1.99it/s]
episodes                                  519
episode_length                      19.238921
returns                             92.288641
return_std                          17.838282
average_reward                       4.796164
round_time             0 days 00:16:45.606193
episodes_test                           486.0
episode_length_test                 20.537037
returns_test                        98.802875
return_std_test                     28.985526
average_reward_test                  4.810616
round_time_test        0 days 00:00:13.732071
round_time_total       0 days 00:16:45.607811
loss_total             66419033377812783104.0
loss_critic            83023790287953051648.0
loss_actor                -17424666600.448002
memory_size                         2021.7135 

=== epoch 2/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:37,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:54<00:00,  1.97it/s]
episodes                                  521
episode_length                      19.145873
returns                             91.983463
return_std                          20.188273
average_reward                       4.804208
round_time             0 days 00:16:54.718056
episodes_test                           539.0
episode_length_test                 18.541744
returns_test                         89.53559
return_std_test                     17.208191
average_reward_test                  4.828883
round_time_test        0 days 00:00:14.315978
round_time_total       0 days 00:16:54.719155
loss_total             62609668528027213824.0
loss_critic            78262084437926838272.0
loss_actor                -16400781329.408001
memory_size                         2039.0035 

=== epoch 2/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:26,  1.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:25<00:00,  2.03it/s]
episodes                                  523
episode_length                       19.07457
returns                             91.792478
return_std                          22.324927
average_reward                       4.812787
round_time             0 days 00:16:26.120876
episodes_test                           525.0
episode_length_test                 19.024762
returns_test                        91.804798
return_std_test                     30.584972
average_reward_test                  4.825545
round_time_test        0 days 00:00:13.807880
round_time_total       0 days 00:16:26.122398
loss_total             60875647963722981376.0
loss_critic            76094558777626525696.0
loss_actor                   -15963957391.872
memory_size                         2051.1635 

=== epoch 2/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:49,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:25<00:00,  2.03it/s]
episodes                                  528
episode_length                      18.901515
returns                             91.186087
return_std                          22.964395
average_reward                       4.824168
round_time             0 days 00:16:26.190679
episodes_test                           549.0
episode_length_test                 18.198543
returns_test                         87.73397
return_std_test                     22.004066
average_reward_test                  4.820964
round_time_test        0 days 00:00:14.225700
round_time_total       0 days 00:16:26.191858
loss_total             62141916252867493888.0
loss_critic            77677394117066932224.0
loss_actor                -16180853056.511999
memory_size                          2059.442 

=== epoch 2/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:26<00:00,  2.03it/s]
episodes                                  542
episode_length                      18.442804
returns                             88.970397
return_std                          21.190764
average_reward                       4.824156
round_time             0 days 00:16:26.749207
episodes_test                           567.0
episode_length_test                 17.613757
returns_test                        85.098718
return_std_test                     16.034022
average_reward_test                  4.831351
round_time_test        0 days 00:00:14.382002
round_time_total       0 days 00:16:26.750332
loss_total             64467394317640261632.0
loss_critic            80584241615844409344.0
loss_actor                   -16718918860.288
memory_size                            2060.0 

=== epoch 2/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:37,  1.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.03it/s]
episodes                                  544
episode_length                      18.327206
returns                             88.411166
return_std                          22.698931
average_reward                       4.824058
round_time             0 days 00:16:27.834978
episodes_test                           517.0
episode_length_test                 19.313346
returns_test                         92.62271
return_std_test                     31.852754
average_reward_test                  4.795794
round_time_test        0 days 00:00:14.528955
round_time_total       0 days 00:16:27.836153
loss_total             67136052698242875392.0
loss_critic            83920064410728005632.0
loss_actor                -17052344773.120001
memory_size                         2071.4295 

=== epoch 2/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:19,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                  542
episode_length                      18.424354
returns                             88.919248
return_std                          24.605865
average_reward                       4.826169
round_time             0 days 00:16:24.377099
episodes_test                           554.0
episode_length_test                 18.036101
returns_test                        87.141115
return_std_test                      22.49897
average_reward_test                  4.831501
round_time_test        0 days 00:00:15.069855
round_time_total       0 days 00:16:24.378186
loss_total             72470736204965888000.0
loss_critic            90588418691602300928.0
loss_actor                -17880451433.984001
memory_size                         2152.2355 

=== epoch 2/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:11,  1.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                  543
episode_length                      18.403315
returns                             88.894159
return_std                          24.099091
average_reward                       4.830652
round_time             0 days 00:16:25.586659
episodes_test                           558.0
episode_length_test                 17.921147
returns_test                        86.399779
return_std_test                     21.829812
average_reward_test                  4.821108
round_time_test        0 days 00:00:14.024651
round_time_total       0 days 00:16:25.588075
loss_total             78772033776934420480.0
loss_critic            98465040406973841408.0
loss_actor                -18467215926.271999
memory_size                          2188.412 

=== epoch 2/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:13,  1.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                   541
episode_length                       18.471349
returns                               89.17212
return_std                           25.038039
average_reward                        4.827613
round_time              0 days 00:16:33.686017
episodes_test                            566.0
episode_length_test                  17.651943
returns_test                         85.336135
return_std_test                      15.071733
average_reward_test                   4.834382
round_time_test         0 days 00:00:13.836109
round_time_total        0 days 00:16:33.687344
loss_total              82356217605213454336.0
loss_critic            102945270020249075712.0
loss_actor                 -18852672195.071999
memory_size                          2223.7465 

=== epoch 2/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:42,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                   537
episode_length                       18.579143
returns                              89.644364
return_std                           26.005183
average_reward                         4.82492
round_time              0 days 00:16:31.984882
episodes_test                            561.0
episode_length_test                  17.800357
returns_test                         86.065122
return_std_test                      17.168718
average_reward_test                   4.834973
round_time_test         0 days 00:00:13.983982
round_time_total        0 days 00:16:31.986003
loss_total              82680592407424958464.0
loss_critic            103350738671447523328.0
loss_actor                    -18790068385.792
memory_size                          2246.3175 

=== epoch 2/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:17,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                   546
episode_length                       18.305861
returns                               88.49633
return_std                           24.107685
average_reward                        4.834438
round_time              0 days 00:16:34.382390
episodes_test                            573.0
episode_length_test                  17.429319
returns_test                          84.31681
return_std_test                       13.19508
average_reward_test                   4.837596
round_time_test         0 days 00:00:13.892581
round_time_total        0 days 00:16:34.383731
loss_total              85218889127020773376.0
loss_critic            106523609442849177600.0
loss_actor                 -19059867578.880001
memory_size                          2270.6205 

=== epoch 2/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:21,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:37<00:00,  2.00it/s]
episodes                                   558
episode_length                       17.903226
returns                               86.57075
return_std                           19.462041
average_reward                         4.83545
round_time              0 days 00:16:38.493848
episodes_test                            580.0
episode_length_test                  17.239655
returns_test                         83.329595
return_std_test                      11.024162
average_reward_test                   4.833606
round_time_test         0 days 00:00:13.950929
round_time_total        0 days 00:16:38.495285
loss_total              89497980988043689984.0
loss_critic            111872474143783501824.0
loss_actor                 -19905906575.872002
memory_size                             2275.0 

=== epoch 2/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:41,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.02it/s]
episodes                                   563
episode_length                        17.71048
returns                              85.636557
return_std                           18.376149
average_reward                         4.83526
round_time              0 days 00:16:28.255335
episodes_test                            579.0
episode_length_test                  17.267703
returns_test                         83.560595
return_std_test                      10.682151
average_reward_test                   4.839141
round_time_test         0 days 00:00:14.307311
round_time_total        0 days 00:16:28.256466
loss_total              94517715384627167232.0
loss_critic            118147142206033297408.0
loss_actor                    -20275990703.104
memory_size                           2278.672 

=== epoch 2/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:50,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:25<00:00,  2.03it/s]
episodes                                   573
episode_length                       17.417103
returns                              84.295383
return_std                           15.185681
average_reward                        4.839671
round_time              0 days 00:16:26.426311
episodes_test                            593.0
episode_length_test                  16.860034
returns_test                         81.499374
return_std_test                       8.383297
average_reward_test                   4.833893
round_time_test         0 days 00:00:13.936634
round_time_total        0 days 00:16:26.427444
loss_total              98250860043283677184.0
loss_critic            122813572756675051520.0
loss_actor                 -20728515733.504002
memory_size                             2284.0 

=== epoch 2/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:17,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   582
episode_length                       17.152921
returns                              82.980382
return_std                           13.151622
average_reward                        4.837989
round_time              0 days 00:16:31.298706
episodes_test                            585.0
episode_length_test                   17.08547
returns_test                         82.603011
return_std_test                      10.876195
average_reward_test                    4.83471
round_time_test         0 days 00:00:13.992747
round_time_total        0 days 00:16:31.300167
loss_total              95979126380654985216.0
loss_critic            119973905784492064768.0
loss_actor                 -20242949932.543999
memory_size                             2284.0 

=== epoch 2/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:13,  1.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                   587
episode_length                       16.996593
returns                              82.163181
return_std                             10.5256
average_reward                        4.834071
round_time              0 days 00:16:19.672795
episodes_test                            598.0
episode_length_test                  16.698997
returns_test                         80.564668
return_std_test                      10.270232
average_reward_test                   4.824494
round_time_test         0 days 00:00:14.008647
round_time_total        0 days 00:16:19.674245
loss_total              90282560601986465792.0
loss_critic            112853198836584071168.0
loss_actor                 -18951146022.400002
memory_size                             2284.0 

=== epoch 2/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:33,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                   589
episode_length                       16.955857
returns                               81.86414
return_std                           10.367551
average_reward                        4.828011
round_time              0 days 00:16:21.018593
episodes_test                            593.0
episode_length_test                  16.856661
returns_test                         81.102901
return_std_test                       8.259908
average_reward_test                   4.811357
round_time_test         0 days 00:00:13.973146
round_time_total        0 days 00:16:21.019925
loss_total              84834496585405644800.0
loss_critic            106043118861487767552.0
loss_actor                 -17846209878.015999
memory_size                             2284.0 

=== epoch 2/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:35,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                   594
episode_length                       16.818182
returns                              81.132111
return_std                             8.74527
average_reward                        4.824081
round_time              0 days 00:16:21.296818
episodes_test                            594.0
episode_length_test                   16.83165
returns_test                         81.027716
return_std_test                        6.84433
average_reward_test                    4.81403
round_time_test         0 days 00:00:14.050079
round_time_total        0 days 00:16:21.298261
loss_total              84061416954892795904.0
loss_critic            105076769269470658560.0
loss_actor                 -18059861318.144001
memory_size                             2284.0 

=== epoch 2/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:06,  1.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:28<00:00,  2.02it/s]
episodes                                   597
episode_length                       16.726968
returns                              80.633524
return_std                            8.129046
average_reward                        4.820674
round_time              0 days 00:16:29.182107
episodes_test                            603.0
episode_length_test                   16.58209
returns_test                         79.867405
return_std_test                       7.572508
average_reward_test                   4.816496
round_time_test         0 days 00:00:13.775808
round_time_total        0 days 00:16:29.183200
loss_total              86628978616822104064.0
loss_critic            108286221360076423168.0
loss_actor                    -18529216314.368
memory_size                             2284.0 

=== epoch 2/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:11,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                   591
episode_length                       16.878173
returns                              81.364544
return_std                           10.803571
average_reward                        4.820784
round_time              0 days 00:16:30.008168
episodes_test                            568.0
episode_length_test                  17.602113
returns_test                         85.089284
return_std_test                      17.188065
average_reward_test                   4.834045
round_time_test         0 days 00:00:13.431855
round_time_total        0 days 00:16:30.009266
loss_total              85517411162905919488.0
loss_critic            106896761971762692096.0
loss_actor                 -18834678443.007999
memory_size                          2289.2695 

=== epoch 2/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:13,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                   591
episode_length                       16.896785
returns                              81.442712
return_std                           11.013744
average_reward                        4.819971
round_time              0 days 00:16:34.069680
episodes_test                            591.0
episode_length_test                  16.900169
returns_test                         81.500505
return_std_test                      14.887781
average_reward_test                   4.822459
round_time_test         0 days 00:00:14.122445
round_time_total        0 days 00:16:34.070806
loss_total              88421842145132265472.0
loss_critic            110527300565954953216.0
loss_actor                     -19434968924.16
memory_size                          2302.0675 

=== epoch 2/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:58,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                   592
episode_length                       16.888514
returns                              81.463834
return_std                           12.372964
average_reward                        4.823633
round_time              0 days 00:16:31.973436
episodes_test                            588.0
episode_length_test                  16.991497
returns_test                         81.956901
return_std_test                       9.290285
average_reward_test                   4.823434
round_time_test         0 days 00:00:14.097647
round_time_total        0 days 00:16:31.974650
loss_total              91034149193615949824.0
loss_critic            113792684443629780992.0
loss_actor                 -20115703505.919998
memory_size                           2308.765 

=== epoch 2/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:11,  1.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   590
episode_length                       16.922034
returns                              81.650895
return_std                           12.291407
average_reward                        4.825358
round_time              0 days 00:16:24.284931
episodes_test                            587.0
episode_length_test                  17.015332
returns_test                         82.097859
return_std_test                       9.508034
average_reward_test                   4.825002
round_time_test         0 days 00:00:14.067060
round_time_total        0 days 00:16:24.286028
loss_total              99009088488876294144.0
loss_critic            123761358656163676160.0
loss_actor                 -21199912370.175999
memory_size                             2320.0 

=== epoch 2/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:39,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:26<00:00,  2.03it/s]
episodes                                   588
episode_length                       16.984694
returns                              81.960585
return_std                           12.354776
average_reward                        4.825648
round_time              0 days 00:16:26.686884
episodes_test                            597.0
episode_length_test                  16.730318
returns_test                         80.643326
return_std_test                      10.282772
average_reward_test                   4.820178
round_time_test         0 days 00:00:14.102929
round_time_total        0 days 00:16:26.688021
loss_total             109362623709745545216.0
loss_critic            136703277419466981376.0
loss_actor                 -22649061800.959999
memory_size                             2320.0 

=== epoch 2/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:37,  1.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.02it/s]
episodes                                   587
episode_length                       17.013629
returns                              82.129503
return_std                            10.90758
average_reward                        4.827324
round_time              0 days 00:16:28.581080
episodes_test                            579.0
episode_length_test                  17.271157
returns_test                          83.45803
return_std_test                       9.742764
average_reward_test                    4.83222
round_time_test         0 days 00:00:14.070018
round_time_total        0 days 00:16:28.582168
loss_total             124314416017596874752.0
loss_critic            155393017296856514560.0
loss_actor                 -24225049172.992001
memory_size                          2322.6075 

=== epoch 2/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:06,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   587
episode_length                       16.994889
returns                              82.028589
return_std                           10.155806
average_reward                        4.826693
round_time              0 days 00:16:24.531939
episodes_test                            589.0
episode_length_test                  16.977929
returns_test                         81.967617
return_std_test                       6.635244
average_reward_test                   4.827893
round_time_test         0 days 00:00:13.903117
round_time_total        0 days 00:16:24.533212
loss_total             143207338160523280384.0
loss_critic            179009169666001993728.0
loss_actor                    -25970255454.208
memory_size                             2325.0 

=== epoch 2/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:59,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   587
episode_length                       17.003407
returns                              82.107645
return_std                            8.300736
average_reward                        4.829062
round_time              0 days 00:16:23.846272
episodes_test                            594.0
episode_length_test                  16.833333
returns_test                         81.233716
return_std_test                       6.151017
average_reward_test                   4.825774
round_time_test         0 days 00:00:13.560836
round_time_total        0 days 00:16:23.847503
loss_total             158934850952094515200.0
loss_critic            198668560139795103744.0
loss_actor                 -27440456240.127998
memory_size                             2325.0 

=== epoch 2/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:30,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:21<00:00,  2.04it/s]
episodes                                   584
episode_length                       17.101027
returns                              82.607996
return_std                            8.653344
average_reward                        4.830536
round_time              0 days 00:16:21.634551
episodes_test                            582.0
episode_length_test                  17.178694
returns_test                         82.881237
return_std_test                       8.678993
average_reward_test                   4.824668
round_time_test         0 days 00:00:13.702198
round_time_total        0 days 00:16:21.635708
loss_total             176739984119371661312.0
loss_critic            220924975866616774656.0
loss_actor                     -28680848276.48
memory_size                             2325.0 

=== epoch 2/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:36,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.02it/s]
episodes                                   579
episode_length                       17.240069
returns                              83.388527
return_std                           11.454309
average_reward                        4.836885
round_time              0 days 00:16:28.420716
episodes_test                            581.0
episode_length_test                  17.194492
returns_test                         83.110676
return_std_test                       7.313406
average_reward_test                   4.833547
round_time_test         0 days 00:00:13.928741
round_time_total        0 days 00:16:28.421829
loss_total             178216982080306872320.0
loss_critic            222771223793874337792.0
loss_actor                 -28262614158.335999
memory_size                          2334.0355 

=== epoch 2/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:59,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   580
episode_length                       17.225862
returns                              83.340489
return_std                           13.175578
average_reward                        4.838252
round_time              0 days 00:16:30.863016
episodes_test                            572.0
episode_length_test                  17.480769
returns_test                         84.501641
return_std_test                      10.011755
average_reward_test                   4.833982
round_time_test         0 days 00:00:13.879228
round_time_total        0 days 00:16:30.864103
loss_total             183757049273857146880.0
loss_critic            229696307535123546112.0
loss_actor                    -29158188941.312
memory_size                           2343.743 

=== epoch 2/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:30,  1.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                   576
episode_length                       17.317708
returns                              83.816216
return_std                           13.390166
average_reward                        4.839957
round_time              0 days 00:16:36.439114
episodes_test                            580.0
episode_length_test                  17.231034
returns_test                         83.256868
return_std_test                      11.073492
average_reward_test                   4.831845
round_time_test         0 days 00:00:13.888801
round_time_total        0 days 00:16:36.440375
loss_total             196234581026244755456.0
loss_critic            245293221852873588736.0
loss_actor                    -30491147086.848
memory_size                             2363.0 

=== epoch 2/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:50,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:32<00:00,  2.02it/s]
episodes                                   571
episode_length                       17.467601
returns                              84.505589
return_std                           13.663242
average_reward                        4.837696
round_time              0 days 00:16:33.037391
episodes_test                            565.0
episode_length_test                  17.683186
returns_test                         85.538673
return_std_test                       9.862823
average_reward_test                   4.837297
round_time_test         0 days 00:00:14.548084
round_time_total        0 days 00:16:33.038560
loss_total             214828989379623223296.0
loss_critic            268536232380358590464.0
loss_actor                 -32616224236.543999
memory_size                             2363.0 

=== epoch 2/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:56,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   573
episode_length                       17.415358
returns                              84.222932
return_std                           13.784827
average_reward                         4.83603
round_time              0 days 00:16:29.950715
episodes_test                            592.0
episode_length_test                  16.878378
returns_test                          81.36937
return_std_test                       7.266034
average_reward_test                    4.82093
round_time_test         0 days 00:00:13.942831
round_time_total        0 days 00:16:29.951833
loss_total             252205003583702728704.0
loss_critic            315256249340511059968.0
loss_actor                 -35379681207.295998
memory_size                             2363.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<16:21,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                   117
episode_length                       16.965812
returns                              81.708771
return_std                             7.27966
average_reward                        4.816493
round_time              0 days 00:16:08.884307
episodes_test                            589.0
episode_length_test                  16.972835
returns_test                         81.728944
return_std_test                       7.861081
average_reward_test                   4.815312
round_time_test         0 days 00:00:13.725070
round_time_total        0 days 00:16:08.885454
loss_total             275547120773188386816.0
loss_critic            344433895345781997568.0
loss_actor                 -36763062846.463997
memory_size                             2363.0 

=== epoch 3/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:38,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                   236
episode_length                       16.940678
returns                              81.605997
return_std                           12.440082
average_reward                        4.817209
round_time              0 days 00:16:15.419311
episodes_test                            596.0
episode_length_test                  16.753356
returns_test                         80.672836
return_std_test                        6.94126
average_reward_test                   4.815263
round_time_test         0 days 00:00:13.979976
round_time_total        0 days 00:16:15.420559
loss_total             302091155276016517120.0
loss_critic            377613937473761640448.0
loss_actor                    -38580559126.528
memory_size                            2373.41 

=== epoch 3/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:52,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                   350
episode_length                       17.117143
returns                              82.546028
return_std                           12.094124
average_reward                        4.822464
round_time              0 days 00:16:15.785808
episodes_test                            577.0
episode_length_test                  17.310225
returns_test                         83.429253
return_std_test                      11.050997
average_reward_test                   4.819635
round_time_test         0 days 00:00:14.083786
round_time_total        0 days 00:16:15.787018
loss_total             293740307772238266368.0
loss_critic            367175378571226841088.0
loss_actor                 -37848093542.400002
memory_size                             2378.0 

=== epoch 3/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:34,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   463
episode_length                       17.274298
returns                              83.270245
return_std                           12.527413
average_reward                        4.820502
round_time              0 days 00:16:17.710957
episodes_test                            576.0
episode_length_test                  17.331597
returns_test                         83.648499
return_std_test                      13.102096
average_reward_test                   4.826478
round_time_test         0 days 00:00:14.010000
round_time_total        0 days 00:16:17.712091
loss_total             303668450102697263104.0
loss_critic            379585555345206542336.0
loss_actor                    -38994425806.848
memory_size                           2378.941 

=== epoch 3/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:14,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:16<00:00,  2.05it/s]
episodes                                   580
episode_length                       17.225862
returns                              83.024758
return_std                           11.832114
average_reward                        4.819791
round_time              0 days 00:16:17.447256
episodes_test                            537.0
episode_length_test                  18.595903
returns_test                         89.349077
return_std_test                      17.069937
average_reward_test                   4.804921
round_time_test         0 days 00:00:13.894389
round_time_total        0 days 00:16:17.448542
loss_total             346061303185667981312.0
loss_critic            432576621080994447360.0
loss_actor                      -41733223936.0
memory_size                             2382.0 

=== epoch 3/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                   580
episode_length                       17.227586
returns                               83.02859
return_std                           11.821531
average_reward                        4.819602
round_time              0 days 00:16:16.148600
episodes_test                            567.0
episode_length_test                   17.62963
returns_test                         84.922793
return_std_test                      10.470954
average_reward_test                   4.817101
round_time_test         0 days 00:00:14.028721
round_time_total        0 days 00:16:16.149981
loss_total             386063104609117339648.0
loss_critic            482578872176409903104.0
loss_actor                 -43979064133.632004
memory_size                             2382.0 

=== epoch 3/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:53,  1.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                   579
episode_length                       17.231434
returns                              83.055676
return_std                           10.038295
average_reward                        4.819987
round_time              0 days 00:16:16.506073
episodes_test                            581.0
episode_length_test                  17.196213
returns_test                         82.791323
return_std_test                       9.613251
average_reward_test                   4.814542
round_time_test         0 days 00:00:13.762989
round_time_total        0 days 00:16:16.507164
loss_total             399088669720059314176.0
loss_critic            498860828655247294464.0
loss_actor                 -44238923366.400002
memory_size                             2382.0 

=== epoch 3/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:03,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                   579
episode_length                       17.229706
returns                              83.070092
return_std                            9.720585
average_reward                        4.821156
round_time              0 days 00:16:15.043584
episodes_test                            586.0
episode_length_test                  17.040956
returns_test                          82.39441
return_std_test                       9.985475
average_reward_test                   4.835041
round_time_test         0 days 00:00:14.070138
round_time_total        0 days 00:16:15.044741
loss_total             425995060567963009024.0
loss_critic            532493817069991362560.0
loss_actor                 -46463795118.080002
memory_size                             2382.0 

=== epoch 3/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:32,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                   572
episode_length                       17.423077
returns                              83.983572
return_std                           10.657955
average_reward                        4.820027
round_time              0 days 00:16:12.732298
episodes_test                            560.0
episode_length_test                  17.857143
returns_test                         86.289575
return_std_test                      13.039925
average_reward_test                   4.832216
round_time_test         0 days 00:00:13.890580
round_time_total        0 days 00:16:12.733403
loss_total             470622477898573938688.0
loss_critic            588278087677723869184.0
loss_actor                 -48774297131.008003
memory_size                          2383.5135 

=== epoch 3/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:10,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
episodes                                   569
episode_length                       17.553603
returns                              84.637385
return_std                           11.054449
average_reward                        4.821609
round_time              0 days 00:16:11.385350
episodes_test                            578.0
episode_length_test                  17.301038
returns_test                         83.730455
return_std_test                       9.213291
average_reward_test                    4.83962
round_time_test         0 days 00:00:14.306977
round_time_total        0 days 00:16:11.386754
loss_total             513181019150707392512.0
loss_critic            641476263616169115648.0
loss_actor                 -50812378906.624001
memory_size                             2384.0 

=== epoch 3/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:04,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                   560
episode_length                       17.796429
returns                              85.772382
return_std                           12.697629
average_reward                        4.819044
round_time              0 days 00:16:12.882428
episodes_test                            568.0
episode_length_test                  17.588028
returns_test                         84.912515
return_std_test                      10.774843
average_reward_test                   4.827854
round_time_test         0 days 00:00:13.864353
round_time_total        0 days 00:16:12.883562
loss_total             554238546667039227904.0
loss_critic            692798171208384774144.0
loss_actor                 -53158744084.480003
memory_size                          2387.2065 

=== epoch 3/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   560
episode_length                       17.810714
returns                              85.803197
return_std                           12.723824
average_reward                        4.817769
round_time              0 days 00:16:18.119586
episodes_test                            569.0
episode_length_test                  17.550088
returns_test                         84.595321
return_std_test                        9.77376
average_reward_test                   4.820203
round_time_test         0 days 00:00:13.731164
round_time_total        0 days 00:16:18.120672
loss_total             591552532014996652032.0
loss_critic            739440651993931186176.0
loss_actor                 -54655356731.391998
memory_size                             2395.0 

=== epoch 3/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:24,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                   561
episode_length                       17.809269
returns                              85.837183
return_std                           12.711516
average_reward                        4.819694
round_time              0 days 00:16:15.169517
episodes_test                            590.0
episode_length_test                  16.940678
returns_test                         81.788471
return_std_test                       8.310677
average_reward_test                   4.827983
round_time_test         0 days 00:00:13.795240
round_time_total        0 days 00:16:15.170636
loss_total             638534853117806510080.0
loss_critic            798168551034881703936.0
loss_actor                 -56522430578.688004
memory_size                             2395.0 

=== epoch 3/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                   570
episode_length                       17.531579
returns                              84.497464
return_std                           11.978322
average_reward                        4.819748
round_time              0 days 00:16:15.444444
episodes_test                            543.0
episode_length_test                  18.388582
returns_test                         88.705728
return_std_test                      17.930338
average_reward_test                   4.824061
round_time_test         0 days 00:00:13.866966
round_time_total        0 days 00:16:15.445717
loss_total             676604140510581751808.0
loss_critic            845755160201084010496.0
loss_actor                 -57677979174.912003
memory_size                          2398.8595 

=== epoch 3/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:59,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                   571
episode_length                       17.486865
returns                              84.347147
return_std                           11.880461
average_reward                        4.823375
round_time              0 days 00:16:13.119995
episodes_test                            560.0
episode_length_test                  17.857143
returns_test                         86.358424
return_std_test                      13.863905
average_reward_test                   4.836072
round_time_test         0 days 00:00:13.970081
round_time_total        0 days 00:16:13.121077
loss_total             706364001896511700992.0
loss_critic            882954985658062864384.0
loss_actor                      -58533241600.0
memory_size                             2401.0 

=== epoch 3/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:34,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                   575
episode_length                       17.361739
returns                              83.856308
return_std                            10.40529
average_reward                        4.829878
round_time              0 days 00:16:13.075581
episodes_test                            564.0
episode_length_test                  17.705674
returns_test                          85.67501
return_std_test                      11.151543
average_reward_test                   4.838823
round_time_test         0 days 00:00:14.011545
round_time_total        0 days 00:16:13.076662
loss_total             759985097321183969280.0
loss_critic            949981354046099685376.0
loss_actor                 -61166237499.391998
memory_size                             2401.0 

=== epoch 3/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:34,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   566
episode_length                       17.662544
returns                               85.35215
return_std                           12.943889
average_reward                        4.832338
round_time              0 days 00:16:18.466050
episodes_test                            565.0
episode_length_test                  17.693805
returns_test                         85.637603
return_std_test                      12.982613
average_reward_test                   4.839999
round_time_test         0 days 00:00:14.174411
round_time_total        0 days 00:16:18.467547
loss_total             786576841084896542720.0
loss_critic            983221034731504926720.0
loss_actor                 -61320442148.863998
memory_size                           2401.966 

=== epoch 3/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:08,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                    560
episode_length                        17.808929
returns                               86.005559
return_std                            13.981119
average_reward                         4.829317
round_time               0 days 00:16:20.628964
episodes_test                             540.0
episode_length_test                   18.512963
returns_test                           89.54989
return_std_test                       18.549754
average_reward_test                    4.837171
round_time_test          0 days 00:00:13.327953
round_time_total         0 days 00:16:20.630269
loss_total              852582535942685982720.0
loss_critic            1065728151280640196608.0
loss_actor                  -64682411366.400002
memory_size                           2404.6515 

=== epoch 3/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:00,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                    550
episode_length                            18.12
returns                               87.610464
return_std                            15.307665
average_reward                         4.835552
round_time               0 days 00:16:20.523612
episodes_test                             543.0
episode_length_test                   18.405157
returns_test                          89.019873
return_std_test                       15.953725
average_reward_test                    4.836694
round_time_test          0 days 00:00:14.264216
round_time_total         0 days 00:16:20.524718
loss_total              901161919419986214912.0
loss_critic            1126452380930730885120.0
loss_actor                  -67247093825.536003
memory_size                              2405.0 

=== epoch 3/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:01,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:16<00:00,  2.05it/s]
episodes                                    548
episode_length                         18.20438
returns                               88.070613
return_std                            15.896706
average_reward                         4.837837
round_time               0 days 00:16:16.904090
episodes_test                             571.0
episode_length_test                   17.509632
returns_test                          84.622159
return_std_test                       10.718758
average_reward_test                    4.832912
round_time_test          0 days 00:00:14.164221
round_time_total         0 days 00:16:16.905200
loss_total              953586392502737829888.0
loss_critic            1191982972451296116736.0
loss_actor                  -69324127873.024002
memory_size                            2408.715 

=== epoch 3/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:32,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                    543
episode_length                        18.399632
returns                               88.915516
return_std                            16.825003
average_reward                         4.832525
round_time               0 days 00:16:15.831144
episodes_test                             559.0
episode_length_test                   17.874776
returns_test                          86.121121
return_std_test                       14.849549
average_reward_test                    4.818054
round_time_test          0 days 00:00:13.968515
round_time_total         0 days 00:16:15.832252
loss_total             1029245110489618317312.0
loss_critic            1286556367010195832832.0
loss_actor                  -71781299415.039993
memory_size                              2409.0 

=== epoch 3/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    538
episode_length                        18.585502
returns                               89.876566
return_std                            16.858912
average_reward                         4.835847
round_time               0 days 00:16:18.112352
episodes_test                             468.0
episode_length_test                   21.356838
returns_test                         103.073116
return_std_test                       30.472989
average_reward_test                    4.826274
round_time_test          0 days 00:00:14.083570
round_time_total         0 days 00:16:18.113441
loss_total             1100894785731233579008.0
loss_critic            1376118459017123069952.0
loss_actor                  -74095799861.248001
memory_size                           2410.9115 

=== epoch 3/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:20,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                    540
episode_length                        18.505556
returns                               89.534147
return_std                            16.830706
average_reward                          4.83822
round_time               0 days 00:16:16.176036
episodes_test                             561.0
episode_length_test                   17.796791
returns_test                          85.918937
return_std_test                       16.000304
average_reward_test                    4.827874
round_time_test          0 days 00:00:14.262756
round_time_total         0 days 00:16:16.177558
loss_total             1156824752536801771520.0
loss_critic            1446030917093074927616.0
loss_actor                  -75214795851.776001
memory_size                              2411.0 

=== epoch 3/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:27,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:13<00:00,  2.05it/s]
episodes                                    544
episode_length                        18.354779
returns                                88.70741
return_std                            15.991489
average_reward                         4.832934
round_time               0 days 00:16:13.964198
episodes_test                             499.0
episode_length_test                    20.04008
returns_test                          96.668407
return_std_test                       24.239638
average_reward_test                    4.823754
round_time_test          0 days 00:00:14.483902
round_time_total         0 days 00:16:13.965324
loss_total             1238986836039138279424.0
loss_critic            1548733518256023535616.0
loss_actor                  -78378135545.856003
memory_size                              2411.0 

=== epoch 3/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:49,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                    540
episode_length                        18.487037
returns                               89.277853
return_std                            16.946023
average_reward                          4.82934
round_time               0 days 00:16:23.656533
episodes_test                             508.0
episode_length_test                   19.685039
returns_test                          95.000062
return_std_test                       24.326966
average_reward_test                    4.826003
round_time_test          0 days 00:00:13.789465
round_time_total         0 days 00:16:23.657629
loss_total             1324824383987263995904.0
loss_critic            1656030449795888578560.0
loss_actor                  -80766620336.128006
memory_size                             2412.67 

=== epoch 3/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                    544
episode_length                        18.327206
returns                               88.585589
return_std                            16.815079
average_reward                         4.834115
round_time               0 days 00:16:20.361698
episodes_test                             515.0
episode_length_test                   19.392233
returns_test                          93.645991
return_std_test                       21.634637
average_reward_test                    4.828974
round_time_test          0 days 00:00:13.821032
round_time_total         0 days 00:16:20.363221
loss_total             1434756132345004687360.0
loss_critic            1793445134776871682048.0
loss_actor                  -84045091799.039993
memory_size                            2419.808 

=== epoch 3/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:41,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:18<00:00,  2.04it/s]
episodes                                    555
episode_length                        17.971171
returns                               86.786056
return_std                            15.334424
average_reward                         4.829099
round_time               0 days 00:16:19.047342
episodes_test                             571.0
episode_length_test                   17.481611
returns_test                          84.464581
return_std_test                       11.906577
average_reward_test                    4.831357
round_time_test          0 days 00:00:13.969917
round_time_total         0 days 00:16:19.048782
loss_total             1559868629385597157376.0
loss_critic            1949835753324435210240.0
loss_actor                  -87704437112.832001
memory_size                              2420.0 

=== epoch 3/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    541
episode_length                        18.439926
returns                               88.877478
return_std                            17.955069
average_reward                         4.819908
round_time               0 days 00:16:17.658806
episodes_test                             530.0
episode_length_test                   18.815094
returns_test                          90.759185
return_std_test                       19.961619
average_reward_test                    4.823633
round_time_test          0 days 00:00:13.977734
round_time_total         0 days 00:16:17.659907
loss_total             1619345146276297572352.0
loss_critic            2024181399745673822208.0
loss_actor                  -89295102812.160004
memory_size                           2424.4505 

=== epoch 3/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:22,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                    525
episode_length                            19.04
returns                               91.695717
return_std                             22.39173
average_reward                         4.815963
round_time               0 days 00:16:19.788501
episodes_test                             461.0
episode_length_test                    21.67679
returns_test                         104.414209
return_std_test                       31.048094
average_reward_test                    4.816883
round_time_test          0 days 00:00:14.024830
round_time_total         0 days 00:16:19.789606
loss_total             1727443616022574399488.0
loss_critic            2159304486391958208512.0
loss_actor                  -90774644092.927994
memory_size                           2476.5755 

=== epoch 3/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                    526
episode_length                        18.977186
returns                               91.375705
return_std                            21.611423
average_reward                          4.81513
round_time               0 days 00:16:15.973378
episodes_test                             566.0
episode_length_test                   17.664311
returns_test                           85.28036
return_std_test                       15.096553
average_reward_test                    4.827851
round_time_test          0 days 00:00:13.907883
round_time_total         0 days 00:16:15.974471
loss_total             1806646615810894987264.0
loss_critic            2258308232907988926464.0
loss_actor                  -91864953159.679993
memory_size                              2494.0 

=== epoch 3/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:09,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                    509
episode_length                        19.618861
returns                               94.410115
return_std                            23.051778
average_reward                         4.812217
round_time               0 days 00:16:15.203350
episodes_test                             530.0
episode_length_test                   18.867925
returns_test                          91.298314
return_std_test                       17.725448
average_reward_test                    4.838811
round_time_test          0 days 00:00:13.947411
round_time_total         0 days 00:16:15.204852
loss_total             1846011067155372048384.0
loss_critic            2307513796895071010816.0
loss_actor                  -93894974750.720001
memory_size                           2496.8115 

=== epoch 3/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:03,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                    494
episode_length                        20.200405
returns                               97.130435
return_std                            24.543497
average_reward                         4.808354
round_time               0 days 00:16:22.873634
episodes_test                             488.0
episode_length_test                   20.479508
returns_test                          98.665601
return_std_test                       24.681183
average_reward_test                    4.817819
round_time_test          0 days 00:00:14.290005
round_time_total         0 days 00:16:22.874717
loss_total             1955381625763458187264.0
loss_critic            2444226993739008049152.0
loss_actor                  -95867034718.207993
memory_size                            2505.343 

=== epoch 3/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:27,  1.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                    489
episode_length                        20.402863
returns                               98.003648
return_std                            24.745941
average_reward                         4.803655
round_time               0 days 00:16:22.873007
episodes_test                             485.0
episode_length_test                   20.587629
returns_test                          99.211956
return_std_test                       25.623059
average_reward_test                     4.81902
round_time_test          0 days 00:00:14.247331
round_time_total         0 days 00:16:22.874098
loss_total             2080059729403814346752.0
loss_critic            2600074618416417734656.0
loss_actor                  -98082959470.591995
memory_size                            2514.109 

=== epoch 3/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:54,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                    488
episode_length                        20.436475
returns                                98.15766
return_std                            24.115606
average_reward                         4.803056
round_time               0 days 00:16:21.499208
episodes_test                             485.0
episode_length_test                   20.585567
returns_test                           98.85619
return_std_test                       23.921905
average_reward_test                    4.802305
round_time_test          0 days 00:00:14.069327
round_time_total         0 days 00:16:21.500468
loss_total             2169659784750091468800.0
loss_critic            2712074683781759369216.0
loss_actor                 -100401446576.128006
memory_size                           2526.4075 

=== epoch 3/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:05,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    484
episode_length                         20.63843
returns                               99.155558
return_std                            25.195037
average_reward                         4.804396
round_time               0 days 00:16:17.929072
episodes_test                             486.0
episode_length_test                    20.54321
returns_test                          99.346809
return_std_test                       26.632792
average_reward_test                    4.835796
round_time_test          0 days 00:00:14.047700
round_time_total         0 days 00:16:17.930142
loss_total             2237124036177164763136.0
loss_critic            2796404998602162700288.0
loss_actor                 -101297819525.119995
memory_size                           2539.3025 

=== epoch 3/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:21,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                    486
episode_length                        20.539095
returns                               98.746972
return_std                            26.519452
average_reward                         4.808141
round_time               0 days 00:16:22.827248
episodes_test                             503.0
episode_length_test                   19.874751
returns_test                          95.635533
return_std_test                       26.026941
average_reward_test                    4.811931
round_time_test          0 days 00:00:13.993070
round_time_total         0 days 00:16:22.828402
loss_total             2379042705913655328768.0
loss_critic            2973803326888722038784.0
loss_actor                 -103631982264.320007
memory_size                             2564.48 

=== epoch 3/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:51,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                    497
episode_length                        20.062374
returns                               96.670049
return_std                            25.471682
average_reward                         4.818072
round_time               0 days 00:16:22.849548
episodes_test                             482.0
episode_length_test                   20.732365
returns_test                          99.977936
return_std_test                       26.793402
average_reward_test                    4.822334
round_time_test          0 days 00:00:13.895887
round_time_total         0 days 00:16:22.850990
loss_total             2396884262039675469824.0
loss_critic            2996105274192494067712.0
loss_actor                 -103196920827.904007
memory_size                              2571.0 

=== epoch 3/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:51,  1.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                    500
episode_length                           19.966
returns                               96.485919
return_std                             25.74008
average_reward                         4.832388
round_time               0 days 00:16:21.318193
episodes_test                             519.0
episode_length_test                   19.248555
returns_test                          92.772621
return_std_test                       23.495501
average_reward_test                    4.819728
round_time_test          0 days 00:00:14.040400
round_time_total         0 days 00:16:21.319531
loss_total             2410907683923428900864.0
loss_critic            3013634548679659487232.0
loss_actor                 -103498625376.255997
memory_size                           2577.6845 

=== epoch 3/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:07,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.03it/s]
episodes                                    498
episode_length                        20.024096
returns                               96.784778
return_std                            25.056031
average_reward                         4.832653
round_time               0 days 00:16:28.237053
episodes_test                             466.0
episode_length_test                   21.427039
returns_test                          103.00934
return_std_test                       30.264792
average_reward_test                    4.807361
round_time_test          0 days 00:00:14.078034
round_time_total         0 days 00:16:28.238152
loss_total             2537022519020771868672.0
loss_critic            3171278092709668061184.0
loss_actor                 -105565172813.824005
memory_size                           2594.1695 

=== epoch 3/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:50,  1.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    479
episode_length                        20.826722
returns                              100.790842
return_std                            26.906431
average_reward                         4.839677
round_time               0 days 00:16:30.140199
episodes_test                             449.0
episode_length_test                   22.249443
returns_test                         107.620496
return_std_test                       34.595438
average_reward_test                    4.836998
round_time_test          0 days 00:00:14.220865
round_time_total         0 days 00:16:30.141263
loss_total             2585997087542360408064.0
loss_critic            3232496302904256757760.0
loss_actor                    -105960042172.416
memory_size                           2615.1175 

=== epoch 3/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:24,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                    482
episode_length                        20.717842
returns                              100.281338
return_std                            25.322423
average_reward                         4.840286
round_time               0 days 00:16:23.622777
episodes_test                             517.0
episode_length_test                   19.307544
returns_test                           93.53642
return_std_test                       20.938202
average_reward_test                    4.844843
round_time_test          0 days 00:00:13.968992
round_time_total         0 days 00:16:23.623870
loss_total             2545512570211776593920.0
loss_critic            3181890650857818554368.0
loss_actor                 -103351041273.856003
memory_size                            2626.589 

=== epoch 3/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:54,  1.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.03it/s]
episodes                                    469
episode_length                        21.283582
returns                              103.167467
return_std                            25.933053
average_reward                         4.847586
round_time               0 days 00:16:27.927306
episodes_test                             479.0
episode_length_test                   20.868476
returns_test                         101.216964
return_std_test                       25.252719
average_reward_test                    4.850266
round_time_test          0 days 00:00:14.055012
round_time_total         0 days 00:16:27.928420
loss_total             2517894090967165173760.0
loss_critic            3147367554898278219776.0
loss_actor                 -102817755463.679993
memory_size                           2637.6175 

=== epoch 3/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:20,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    470
episode_length                        21.238298
returns                              102.882156
return_std                            26.385474
average_reward                         4.844078
round_time               0 days 00:16:30.354865
episodes_test                             456.0
episode_length_test                    21.89693
returns_test                          105.67352
return_std_test                       29.467836
average_reward_test                    4.825869
round_time_test          0 days 00:00:13.801319
round_time_total         0 days 00:16:30.355957
loss_total             2528219275337886859264.0
loss_critic            3160274038053285134336.0
loss_actor                    -103514061778.944
memory_size                            2649.524 

=== epoch 3/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:46,  1.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:28<00:00,  2.02it/s]
episodes                                    478
episode_length                        20.861925
returns                              101.094526
return_std                            26.065516
average_reward                         4.845523
round_time               0 days 00:16:29.128171
episodes_test                             455.0
episode_length_test                   21.949451
returns_test                         105.635645
return_std_test                       30.483334
average_reward_test                    4.812744
round_time_test          0 days 00:00:14.123520
round_time_total         0 days 00:16:29.129270
loss_total             2585645792803229794304.0
loss_critic            3232057179633696112640.0
loss_actor                 -105584166301.695999
memory_size                            2676.328 

=== epoch 3/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                    484
episode_length                        20.595041
returns                               99.756277
return_std                            24.937847
average_reward                         4.843335
round_time               0 days 00:16:33.316094
episodes_test                             470.0
episode_length_test                   21.197872
returns_test                         102.287449
return_std_test                        27.63253
average_reward_test                    4.827042
round_time_test          0 days 00:00:14.418937
round_time_total         0 days 00:16:33.317561
loss_total             2730263908299232509952.0
loss_critic            3412829821919025430528.0
loss_actor                 -107713796874.240005
memory_size                            2681.348 

=== epoch 3/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:59,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                    476
episode_length                        20.976891
returns                              101.389834
return_std                            26.171849
average_reward                         4.833658
round_time               0 days 00:16:33.378294
episodes_test                             454.0
episode_length_test                   22.008811
returns_test                         106.094408
return_std_test                       31.318616
average_reward_test                    4.820549
round_time_test          0 days 00:00:14.088538
round_time_total         0 days 00:16:33.379367
loss_total             2820789134879998083072.0
loss_critic            3525986354458886930432.0
loss_actor                 -109283067338.751999
memory_size                           2687.6365 

=== epoch 3/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:55,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                    474
episode_length                        21.042194
returns                              101.531312
return_std                            27.112477
average_reward                         4.824329
round_time               0 days 00:16:36.301706
episodes_test                             496.0
episode_length_test                   20.135081
returns_test                           97.41278
return_std_test                       20.613615
average_reward_test                    4.837985
round_time_test          0 days 00:00:14.243758
round_time_total         0 days 00:16:36.303188
loss_total             2956171568347698692096.0
loss_critic            3695214390716790210560.0
loss_actor                    -109967772483.584
memory_size                            2700.096 

=== epoch 3/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:51,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                    470
episode_length                        21.234043
returns                              102.365763
return_std                            26.188286
average_reward                         4.820743
round_time               0 days 00:16:35.801899
episodes_test                             469.0
episode_length_test                    21.30064
returns_test                         103.035009
return_std_test                       26.206301
average_reward_test                    4.837158
round_time_test          0 days 00:00:14.218313
round_time_total         0 days 00:16:35.803325
loss_total             3022485299427638509568.0
loss_critic            3778106558894392737792.0
loss_actor                 -111651658063.871994
memory_size                              2725.0 

=== epoch 3/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:53,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                    460
episode_length                        21.695652
returns                              104.691612
return_std                            26.982912
average_reward                         4.825663
round_time               0 days 00:16:34.706532
episodes_test                             480.0
episode_length_test                   20.833333
returns_test                         100.397718
return_std_test                       22.498421
average_reward_test                     4.81909
round_time_test          0 days 00:00:13.828421
round_time_total         0 days 00:16:34.707717
loss_total             3126260331299747659776.0
loss_critic            3907825344424443904000.0
loss_actor                 -113968356249.600006
memory_size                           2734.0795 

=== epoch 3/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:47,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    457
episode_length                        21.840263
returns                              105.437411
return_std                            29.027815
average_reward                         4.827709
round_time               0 days 00:16:36.014393
episodes_test                             438.0
episode_length_test                   22.803653
returns_test                         110.282742
return_std_test                       35.758179
average_reward_test                    4.836113
round_time_test          0 days 00:00:14.070535
round_time_total         0 days 00:16:36.015487
loss_total             3079523953980407808000.0
loss_critic            3849404871790106771456.0
loss_actor                 -112244184547.328003
memory_size                            2773.242 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:02<17:57,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:16<00:00,  2.05it/s]
episodes                                     94
episode_length                        21.212766
returns                              103.266873
return_std                            28.839993
average_reward                          4.86812
round_time               0 days 00:16:16.884708
episodes_test                             470.0
episode_length_test                   21.259574
returns_test                         102.685859
return_std_test                       29.085846
average_reward_test                    4.830104
round_time_test          0 days 00:00:13.683907
round_time_total         0 days 00:16:16.886288
loss_total             3080070949298830835712.0
loss_critic            3850088617151995838464.0
loss_actor                 -112583179653.119995
memory_size                           2805.5845 

=== epoch 4/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:14,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                    185
episode_length                        21.518919
returns                              105.127318
return_std                            29.552188
average_reward                         4.885375
round_time               0 days 00:16:19.710329
episodes_test                             464.0
episode_length_test                   21.519397
returns_test                         104.093296
return_std_test                       26.381177
average_reward_test                    4.836984
round_time_test          0 days 00:00:14.200443
round_time_total         0 days 00:16:19.711402
loss_total             3100387653380756471808.0
loss_critic            3875484503341299400704.0
loss_actor                 -112113103581.184006
memory_size                           2821.8705 

=== epoch 4/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:22,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:18<00:00,  2.04it/s]
episodes                                    261
episode_length                        22.904215
returns                              111.868691
return_std                            37.577949
average_reward                         4.884031
round_time               0 days 00:16:19.390389
episodes_test                             443.0
episode_length_test                   22.530474
returns_test                         109.866076
return_std_test                       25.888023
average_reward_test                    4.875961
round_time_test          0 days 00:00:14.059191
round_time_total         0 days 00:16:19.391540
loss_total             3132793566693441929216.0
loss_critic            3915991885376822181888.0
loss_actor                 -111517922562.048004
memory_size                            2901.501 

=== epoch 4/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:02,  1.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                    338
episode_length                        23.636095
returns                              115.464959
return_std                             39.29323
average_reward                         4.885006
round_time               0 days 00:16:25.217683
episodes_test                             400.0
episode_length_test                       24.96
returns_test                         121.964047
return_std_test                       39.006616
average_reward_test                    4.886159
round_time_test          0 days 00:00:14.301830
round_time_total         0 days 00:16:25.218769
loss_total             3053177399568063004672.0
loss_critic            3816471683647710691328.0
loss_actor                 -107721872846.848007
memory_size                              3008.4 

=== epoch 4/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:57,  1.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                    424
episode_length                        23.554245
returns                              115.022049
return_std                            40.508103
average_reward                         4.883205
round_time               0 days 00:16:33.507157
episodes_test                             388.0
episode_length_test                   25.742268
returns_test                         124.463633
return_std_test                       48.521932
average_reward_test                    4.834943
round_time_test          0 days 00:00:14.093425
round_time_total         0 days 00:16:33.508459
loss_total             3105113472365321781248.0
loss_critic            3881391774204479668224.0
loss_actor                 -108567362580.479996
memory_size                           3143.8995 

=== epoch 4/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:11,  1.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    424
episode_length                        23.542453
returns                              114.754575
return_std                             39.75251
average_reward                         4.874483
round_time               0 days 00:16:29.844831
episodes_test                             440.0
episode_length_test                   22.681818
returns_test                         110.376809
return_std_test                       28.718009
average_reward_test                    4.865998
round_time_test          0 days 00:00:14.402241
round_time_total         0 days 00:16:29.846211
loss_total             3175899033062379356160.0
loss_critic            3969873723756388024320.0
loss_actor                 -110059776290.815994
memory_size                              3175.0 

=== epoch 4/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:02,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.03it/s]
episodes                                    439
episode_length                         22.76082
returns                              110.596278
return_std                            39.582998
average_reward                         4.859217
round_time               0 days 00:16:27.824272
episodes_test                             532.0
episode_length_test                   18.793233
returns_test                           90.71227
return_std_test                        17.34276
average_reward_test                    4.826872
round_time_test          0 days 00:00:13.763511
round_time_total         0 days 00:16:27.825421
loss_total             3181899248440613273600.0
loss_critic            3977373990709787885568.0
loss_actor                 -110992975368.192001
memory_size                           3175.4655 

=== epoch 4/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:51,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                    469
episode_length                        21.272921
returns                              103.180003
return_std                            33.612657
average_reward                         4.850588
round_time               0 days 00:16:30.815956
episodes_test                             533.0
episode_length_test                   18.746717
returns_test                          90.384402
return_std_test                       18.978368
average_reward_test                    4.821363
round_time_test          0 days 00:00:13.629368
round_time_total         0 days 00:16:30.817384
loss_total             3296523581152809713664.0
loss_critic            4120654406969469173760.0
loss_actor                 -111980717166.591995
memory_size                              3189.0 

=== epoch 4/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:21,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    507
episode_length                        19.712032
returns                                95.42119
return_std                            27.562208
average_reward                         4.840698
round_time               0 days 00:16:30.570301
episodes_test                             561.0
episode_length_test                   17.796791
returns_test                               86.0
return_std_test                       15.940633
average_reward_test                    4.832419
round_time_test          0 days 00:00:13.977232
round_time_total         0 days 00:16:30.571560
loss_total             3301711108960066469888.0
loss_critic            4127138814547109347328.0
loss_actor                 -113490962202.623993
memory_size                              3189.0 

=== epoch 4/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:28,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                    530
episode_length                        18.833962
returns                               91.066439
return_std                             19.82159
average_reward                         4.835504
round_time               0 days 00:16:25.397628
episodes_test                             535.0
episode_length_test                   18.672897
returns_test                          90.377283
return_std_test                        17.87949
average_reward_test                    4.839998
round_time_test          0 days 00:00:13.775388
round_time_total         0 days 00:16:25.398728
loss_total             3256705566519623417856.0
loss_critic            4070881892565859172352.0
loss_actor                 -111908652380.160004
memory_size                              3189.0 

=== epoch 4/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:54,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                    542
episode_length                        18.426199
returns                               89.172266
return_std                            17.975352
average_reward                         4.839339
round_time               0 days 00:16:30.870708
episodes_test                             528.0
episode_length_test                    18.92803
returns_test                          91.794045
return_std_test                       18.795409
average_reward_test                     4.84965
round_time_test          0 days 00:00:13.947858
round_time_total         0 days 00:16:30.871913
loss_total             3215262969237280391168.0
loss_critic            4019078642884298145792.0
loss_actor                 -112129169055.744003
memory_size                              3189.0 

=== epoch 4/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:27,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                    535
episode_length                        18.678505
returns                               90.407826
return_std                            19.082226
average_reward                          4.84024
round_time               0 days 00:16:30.193933
episodes_test                             487.0
episode_length_test                   20.529774
returns_test                          99.648647
return_std_test                       23.514325
average_reward_test                    4.853881
round_time_test          0 days 00:00:14.236884
round_time_total         0 days 00:16:30.195225
loss_total             3307285843104329891840.0
loss_critic            4134107235165333356544.0
loss_actor                 -113509282676.735992
memory_size                           3200.2215 

=== epoch 4/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:34,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                    536
episode_length                        18.621269
returns                               90.132413
return_std                             18.81285
average_reward                         4.840212
round_time               0 days 00:16:34.001376
episodes_test                             495.0
episode_length_test                    20.19798
returns_test                          97.923966
return_std_test                        22.20247
average_reward_test                    4.848215
round_time_test          0 days 00:00:13.783772
round_time_total         0 days 00:16:34.002498
loss_total             3427396790771261636608.0
loss_critic            4284245917937003134976.0
loss_actor                 -115825750073.343994
memory_size                           3223.5425 

=== epoch 4/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:42,  1.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                    536
episode_length                        18.652985
returns                                 90.2627
return_std                            18.505332
average_reward                         4.839058
round_time               0 days 00:16:31.786071
episodes_test                             551.0
episode_length_test                   18.125227
returns_test                          87.658838
return_std_test                       16.035959
average_reward_test                    4.836324
round_time_test          0 days 00:00:14.361806
round_time_total         0 days 00:16:31.787543
loss_total             3479050569851471396864.0
loss_critic            4348813140978024579072.0
loss_actor                 -117110990090.240005
memory_size                              3224.0 

=== epoch 4/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:06,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                    539
episode_length                        18.528757
returns                               89.545299
return_std                            18.296288
average_reward                         4.832985
round_time               0 days 00:16:30.719075
episodes_test                             568.0
episode_length_test                   17.584507
returns_test                           85.08343
return_std_test                       12.623316
average_reward_test                    4.838513
round_time_test          0 days 00:00:14.072145
round_time_total         0 days 00:16:30.720176
loss_total             3590378096358909280256.0
loss_critic            4487972543817074081792.0
loss_actor                 -119364747288.576004
memory_size                            3226.147 

=== epoch 4/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:19,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                    548
episode_length                        18.164234
returns                               87.771449
return_std                            17.305106
average_reward                         4.831466
round_time               0 days 00:16:34.609537
episodes_test                             587.0
episode_length_test                   17.011925
returns_test                          82.090023
return_std_test                       12.354747
average_reward_test                    4.825382
round_time_test          0 days 00:00:13.902675
round_time_total         0 days 00:16:34.610661
loss_total             3798148614541849657344.0
loss_critic            4747685686760674689024.0
loss_actor                 -123029886316.544006
memory_size                              3228.0 

=== epoch 4/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:08,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:27<00:00,  2.02it/s]
episodes                                    565
episode_length                        17.681416
returns                               85.499037
return_std                            13.642041
average_reward                         4.835467
round_time               0 days 00:16:28.553211
episodes_test                             558.0
episode_length_test                   17.890681
returns_test                          86.829138
return_std_test                       18.108289
average_reward_test                    4.853407
round_time_test          0 days 00:00:14.060589
round_time_total         0 days 00:16:28.554312
loss_total             3960709115672633278464.0
loss_critic            4950886312892679847936.0
loss_actor                 -124957138735.104004
memory_size                              3228.0 

=== epoch 4/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:28,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                    565
episode_length                        17.663717
returns                               85.401304
return_std                            13.502404
average_reward                           4.8348
round_time               0 days 00:16:30.709441
episodes_test                             554.0
episode_length_test                   18.027076
returns_test                          87.312265
return_std_test                       15.450885
average_reward_test                    4.843344
round_time_test          0 days 00:00:14.076986
round_time_total         0 days 00:16:30.710658
loss_total             4038990293356476104704.0
loss_critic            5048737789940887191552.0
loss_actor                 -127728703385.600006
memory_size                              3228.0 

=== epoch 4/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:32<00:00,  2.02it/s]
episodes                                    564
episode_length                        17.695035
returns                               85.584836
return_std                            16.321376
average_reward                         4.836613
round_time               0 days 00:16:32.810303
episodes_test                             572.0
episode_length_test                   17.473776
returns_test                          84.495944
return_std_test                       15.605289
average_reward_test                     4.83563
round_time_test          0 days 00:00:13.873438
round_time_total         0 days 00:16:32.811553
loss_total             4193356134395814084608.0
loss_critic            5241695079207004012544.0
loss_actor                 -129846053748.735992
memory_size                           3244.4235 

=== epoch 4/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:12,  1.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                    563
episode_length                        17.719361
returns                               85.751627
return_std                            16.434229
average_reward                         4.839405
round_time               0 days 00:16:32.214943
episodes_test                             569.0
episode_length_test                   17.551845
returns_test                          84.650353
return_std_test                       11.727282
average_reward_test                    4.822862
round_time_test          0 days 00:00:13.868103
round_time_total         0 days 00:16:32.216417
loss_total             4522696679126974070784.0
loss_critic            5653370754403494002688.0
loss_actor                 -134957074096.128006
memory_size                              3261.0 

=== epoch 4/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:09,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                    554
episode_length                        18.046931
returns                               87.314792
return_std                            18.073299
average_reward                         4.838186
round_time               0 days 00:16:35.836413
episodes_test                             531.0
episode_length_test                    18.79661
returns_test                          91.042468
return_std_test                       18.310964
average_reward_test                    4.843677
round_time_test          0 days 00:00:14.341213
round_time_total         0 days 00:16:35.837959
loss_total             4783732718432318128128.0
loss_critic            5979665789496609210368.0
loss_actor                 -139320144777.216003
memory_size                           3274.7025 

=== epoch 4/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:46,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                    556
episode_length                        17.964029
returns                               86.866657
return_std                             18.03295
average_reward                          4.83558
round_time               0 days 00:16:34.977731
episodes_test                             576.0
episode_length_test                   17.361111
returns_test                          83.843153
return_std_test                       14.417386
average_reward_test                    4.829366
round_time_test          0 days 00:00:13.844175
round_time_total         0 days 00:16:34.979078
loss_total             5025783126443339808768.0
loss_critic            6282228798982620971008.0
loss_actor                 -143811976409.088013
memory_size                              3275.0 

=== epoch 4/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:23,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                    557
episode_length                        17.928187
returns                               86.754447
return_std                            19.311376
average_reward                         4.839187
round_time               0 days 00:16:33.937767
episodes_test                             576.0
episode_length_test                   17.355903
returns_test                          83.989683
return_std_test                       17.115352
average_reward_test                    4.839279
round_time_test          0 days 00:00:13.751683
round_time_total         0 days 00:16:33.938901
loss_total             5266560701513451175936.0
loss_critic            6583200754415014248448.0
loss_actor                 -146809311637.503998
memory_size                            3289.677 

=== epoch 4/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:25,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                    563
episode_length                        17.731794
returns                               85.772008
return_std                            16.953187
average_reward                         4.837112
round_time               0 days 00:16:35.005362
episodes_test                             582.0
episode_length_test                    17.16323
returns_test                          83.123227
return_std_test                       14.542249
average_reward_test                    4.843054
round_time_test          0 days 00:00:14.330537
round_time_total         0 days 00:16:35.006543
loss_total             5492359474233814810624.0
loss_critic            6865449220843234131968.0
loss_actor                 -151149420748.799988
memory_size                              3296.0 

=== epoch 4/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:16,  1.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                    572
episode_length                        17.466783
returns                               84.474922
return_std                            15.927507
average_reward                         4.836416
round_time               0 days 00:16:36.922816
episodes_test                             594.0
episode_length_test                   16.818182
returns_test                          81.238062
return_std_test                       11.461054
average_reward_test                    4.830365
round_time_test          0 days 00:00:13.652974
round_time_total         0 days 00:16:36.923950
loss_total             5683168657941767127040.0
loss_critic            7103960694919044005888.0
loss_actor                 -154064391364.608002
memory_size                              3296.0 

=== epoch 4/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:52,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                    584
episode_length                        17.107877
returns                               82.753565
return_std                            13.856536
average_reward                         4.837174
round_time               0 days 00:16:22.935306
episodes_test                             591.0
episode_length_test                   16.900169
returns_test                          81.779658
return_std_test                       12.997802
average_reward_test                    4.838954
round_time_test          0 days 00:00:14.007636
round_time_total         0 days 00:16:22.936698
loss_total             5716582698231126818816.0
loss_critic            7145728249925081235456.0
loss_actor                 -155106818498.559998
memory_size                              3296.0 

=== epoch 4/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:20,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    581
episode_length                        17.165232
returns                               83.043753
return_std                            14.241354
average_reward                         4.838163
round_time               0 days 00:16:17.844752
episodes_test                             586.0
episode_length_test                   17.037543
returns_test                          82.381915
return_std_test                       12.642438
average_reward_test                    4.835451
round_time_test          0 days 00:00:13.894946
round_time_total         0 days 00:16:17.845872
loss_total             5959134488610963718144.0
loss_critic            7448917977942700589056.0
loss_actor                 -157748790091.776001
memory_size                            3296.358 

=== epoch 4/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:20,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                    583
episode_length                         17.12693
returns                                82.86688
return_std                            11.553602
average_reward                         4.838358
round_time               0 days 00:16:15.461923
episodes_test                             543.0
episode_length_test                   18.388582
returns_test                          89.220793
return_std_test                       20.108529
average_reward_test                    4.852051
round_time_test          0 days 00:00:13.765494
round_time_total         0 days 00:16:15.463003
loss_total             6167403891353776553984.0
loss_critic            7709254735311865380864.0
loss_actor                 -160563213819.903992
memory_size                              3297.0 

=== epoch 4/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:35,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    580
episode_length                        17.218966
returns                               83.362278
return_std                            12.442181
average_reward                         4.841226
round_time               0 days 00:16:18.301953
episodes_test                             558.0
episode_length_test                   17.908602
returns_test                          86.906945
return_std_test                       14.340093
average_reward_test                    4.852809
round_time_test          0 days 00:00:13.710045
round_time_total         0 days 00:16:18.303429
loss_total             6338845791424272662528.0
loss_critic            7923557099035433107456.0
loss_actor                 -162712233623.552002
memory_size                            3300.094 

=== epoch 4/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:45,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                    575
episode_length                        17.368696
returns                               84.145214
return_std                            12.861995
average_reward                         4.844679
round_time               0 days 00:16:16.384249
episodes_test                             575.0
episode_length_test                   17.368696
returns_test                          83.972205
return_std_test                       14.194409
average_reward_test                    4.834635
round_time_test          0 days 00:00:13.780904
round_time_total         0 days 00:16:16.385323
loss_total             6466839044782513192960.0
loss_critic            8083548667985033953280.0
loss_actor                 -164192085516.287994
memory_size                              3304.0 

=== epoch 4/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:59,  1.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:14<00:00,  2.05it/s]
episodes                                    574
episode_length                        17.390244
returns                               84.284823
return_std                            12.464189
average_reward                         4.846574
round_time               0 days 00:16:15.462992
episodes_test                             586.0
episode_length_test                   17.042662
returns_test                          82.406129
return_std_test                       15.303736
average_reward_test                    4.835243
round_time_test          0 days 00:00:13.655178
round_time_total         0 days 00:16:15.464270
loss_total             6636866761793456832512.0
loss_critic            8296083304115615039488.0
loss_actor                 -167746589196.287994
memory_size                              3304.0 

=== epoch 4/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:18<00:00,  2.04it/s]
episodes                                    577
episode_length                        17.320624
returns                               83.932957
return_std                            13.200755
average_reward                         4.845847
round_time               0 days 00:16:19.404405
episodes_test                             587.0
episode_length_test                   17.027257
returns_test                          82.426757
return_std_test                       13.690627
average_reward_test                    4.840924
round_time_test          0 days 00:00:13.847637
round_time_total         0 days 00:16:19.405567
loss_total             6911127200318009376768.0
loss_critic            8638908861735900282880.0
loss_actor                  -170510765277.18399
memory_size                           3313.7955 

=== epoch 4/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:01,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    582
episode_length                        17.128866
returns                               82.923154
return_std                            12.657642
average_reward                         4.841223
round_time               0 days 00:16:18.154023
episodes_test                             587.0
episode_length_test                   17.017036
returns_test                          82.477215
return_std_test                       12.229852
average_reward_test                    4.846723
round_time_test          0 days 00:00:13.695094
round_time_total         0 days 00:16:18.155122
loss_total             7222524383716565843968.0
loss_critic            9028155326277028413440.0
loss_actor                 -174765243998.208008
memory_size                              3317.0 

=== epoch 4/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:58,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:18<00:00,  2.04it/s]
episodes                                    586
episode_length                        17.032423
returns                               82.459882
return_std                            11.623696
average_reward                          4.84131
round_time               0 days 00:16:18.638438
episodes_test                             589.0
episode_length_test                   16.957555
returns_test                          82.054757
return_std_test                       14.187848
average_reward_test                     4.83879
round_time_test          0 days 00:00:13.706036
round_time_total         0 days 00:16:18.639654
loss_total             7426210954440813314048.0
loss_critic            9282763545593312706560.0
loss_actor                    -176948271013.888
memory_size                              3317.0 

=== epoch 4/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:23,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                    587
episode_length                        16.993186
returns                                82.22776
return_std                            12.722254
average_reward                          4.83877
round_time               0 days 00:16:18.151759
episodes_test                             595.0
episode_length_test                   16.794958
returns_test                          81.083788
return_std_test                       11.094319
average_reward_test                    4.827891
round_time_test          0 days 00:00:13.644588
round_time_total         0 days 00:16:18.152832
loss_total             7556979798908137897984.0
loss_critic            9446224593542459686912.0
loss_actor                    -178742093914.112
memory_size                           3319.1675 

=== epoch 4/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:28,  1.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                    590
episode_length                        16.927119
returns                               81.905765
return_std                            13.043791
average_reward                         4.838675
round_time               0 days 00:16:04.673537
episodes_test                             603.0
episode_length_test                   16.580431
returns_test                          79.924213
return_std_test                       10.594681
average_reward_test                    4.820411
round_time_test          0 days 00:00:13.458318
round_time_total         0 days 00:16:04.674980
loss_total             7730183816071648116736.0
loss_critic            9662729616333853425664.0
loss_actor                 -180279225737.216003
memory_size                              3323.0 

=== epoch 4/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:47,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:00<00:00,  2.08it/s]
episodes                                    586
episode_length                        16.991468
returns                               82.247188
return_std                            15.459526
average_reward                         4.840529
round_time               0 days 00:16:01.154720
episodes_test                             584.0
episode_length_test                   17.121575
returns_test                          82.789961
return_std_test                       16.284292
average_reward_test                    4.835426
round_time_test          0 days 00:00:13.437616
round_time_total         0 days 00:16:01.156025
loss_total             7979745872898965897216.0
loss_critic            9974682171253559656448.0
loss_actor                 -184805797052.415985
memory_size                            3333.935 

=== epoch 4/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:05,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                     588
episode_length                         16.986395
returns                                82.236359
return_std                             16.424813
average_reward                          4.841249
round_time                0 days 00:16:01.622869
episodes_test                              590.0
episode_length_test                    16.940678
returns_test                           81.897851
return_std_test                        12.161889
average_reward_test                     4.834447
round_time_test           0 days 00:00:13.841701
round_time_total          0 days 00:16:01.623965
loss_total              8331139977059225305088.0
loss_critic            10413924800855749427200.0
loss_actor                  -188338789851.135986
memory_size                             3359.646 

=== epoch 4/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:22,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                     588
episode_length                         16.982993
returns                                82.210574
return_std                             16.290858
average_reward                          4.840707
round_time                0 days 00:16:03.677529
episodes_test                              599.0
episode_length_test                    16.669449
returns_test                           80.677654
return_std_test                        11.663394
average_reward_test                     4.839794
round_time_test           0 days 00:00:13.563972
round_time_total          0 days 00:16:03.678862
loss_total              8709689613221875417088.0
loss_critic            10887111829135378350080.0
loss_actor                  -190784877084.671997
memory_size                               3367.0 

=== epoch 4/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                     592
episode_length                            16.875
returns                                81.682886
return_std                             15.011121
average_reward                          4.840452
round_time                0 days 00:16:06.714570
episodes_test                              584.0
episode_length_test                    17.106164
returns_test                           82.797313
return_std_test                        16.378881
average_reward_test                     4.840205
round_time_test           0 days 00:00:14.342615
round_time_total          0 days 00:16:06.715866
loss_total              8935271700594192023552.0
loss_critic            11169089434867520765952.0
loss_actor                  -195262598873.088013
memory_size                               3367.0 

=== epoch 4/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:05,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                     590
episode_length                         16.923729
returns                                81.895516
return_std                             15.116595
average_reward                          4.839025
round_time                0 days 00:16:01.675052
episodes_test                              598.0
episode_length_test                    16.714047
returns_test                           80.763783
return_std_test                        11.122936
average_reward_test                     4.832125
round_time_test           0 days 00:00:13.505497
round_time_total          0 days 00:16:01.676174
loss_total              9220535397904630677504.0
loss_critic            11525669061466565640192.0
loss_actor                  -197829219024.895996
memory_size                               3367.0 

=== epoch 4/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:28,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                     592
episode_length                         16.858108
returns                                81.541425
return_std                             11.796123
average_reward                          4.836995
round_time                0 days 00:16:04.561677
episodes_test                              589.0
episode_length_test                    16.976231
returns_test                           82.068084
return_std_test                        11.447815
average_reward_test                       4.8343
round_time_test           0 days 00:00:13.548026
round_time_total          0 days 00:16:04.562758
loss_total              9447290411067411267584.0
loss_critic            11809112802798400962560.0
loss_actor                  -199630509064.191986
memory_size                            3367.8085 

=== epoch 4/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:40,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                     588
episode_length                         16.991497
returns                                82.194885
return_std                             10.912072
average_reward                          4.837384
round_time                0 days 00:16:06.588071
episodes_test                              572.0
episode_length_test                    17.465035
returns_test                            84.54548
return_std_test                        17.353953
average_reward_test                     4.840846
round_time_test           0 days 00:00:13.761724
round_time_total          0 days 00:16:06.589527
loss_total              9502712556165938544640.0
loss_critic            11878390484101188550656.0
loss_actor                  -198947713171.455994
memory_size                               3370.0 

=== epoch 4/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:42,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                     580
episode_length                         17.196552
returns                                83.187434
return_std                             12.489117
average_reward                          4.837304
round_time                0 days 00:16:03.664328
episodes_test                              582.0
episode_length_test                    17.154639
returns_test                           83.040951
return_std_test                        13.007456
average_reward_test                     4.840943
round_time_test           0 days 00:00:13.464677
round_time_total          0 days 00:16:03.665408
loss_total              9732854690298191675392.0
loss_critic            12166068137692758016000.0
loss_actor                  -202467172868.096008
memory_size                              3373.54 

=== epoch 4/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:06,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                     577
episode_length                          17.30156
returns                                83.717834
return_std                             12.652881
average_reward                          4.838647
round_time                0 days 00:16:06.084310
episodes_test                              567.0
episode_length_test                    17.626102
returns_test                           85.376661
return_std_test                        14.820885
average_reward_test                     4.843788
round_time_test           0 days 00:00:13.530240
round_time_total          0 days 00:16:06.085383
loss_total              9809171387761844813824.0
loss_critic            12261464012125968007168.0
loss_actor                  -204087679582.208008
memory_size                               3375.0 

=== epoch 4/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:24,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:02<00:00,  2.08it/s]
episodes                                     577
episode_length                         17.294627
returns                                83.629423
return_std                              12.89842
average_reward                          4.835456
round_time                0 days 00:16:03.227540
episodes_test                              597.0
episode_length_test                    16.728643
returns_test                           80.890129
return_std_test                        11.742167
average_reward_test                      4.83539
round_time_test           0 days 00:00:13.543301
round_time_total          0 days 00:16:03.228780
loss_total             10067851664048642850816.0
loss_critic            12584814357765941100544.0
loss_actor                  -207231998312.447998
memory_size                             3378.655 

=== epoch 4/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:22,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                     579
episode_length                         17.246978
returns                                83.431171
return_std                             12.104096
average_reward                          4.837426
round_time                0 days 00:16:02.364976
episodes_test                              593.0
episode_length_test                    16.844857
returns_test                           81.441834
return_std_test                         9.481925
average_reward_test                     4.834793
round_time_test           0 days 00:00:13.660710
round_time_total          0 days 00:16:02.366258
loss_total             10468466859551968198656.0
loss_critic            13085583346163754467328.0
loss_actor                  -210868238884.864014
memory_size                               3380.0 

=== epoch 4/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:19,  1.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:04<00:00,  2.07it/s]
episodes                                     580
episode_length                         17.206897
returns                                83.265873
return_std                              13.47403
average_reward                          4.839195
round_time                0 days 00:16:04.587819
episodes_test                              592.0
episode_length_test                    16.873311
returns_test                           81.501402
return_std_test                        10.222054
average_reward_test                     4.830184
round_time_test           0 days 00:00:13.586464
round_time_total          0 days 00:16:04.588906
loss_total             10766030513867317051392.0
loss_critic            13457537896817595777024.0
loss_actor                  -215903222202.368011
memory_size                             3390.473 

=== epoch 4/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:51,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:02<00:00,  2.08it/s]
episodes                                     586
episode_length                         17.059727
returns                                82.538053
return_std                             12.313326
average_reward                          4.838169
round_time                0 days 00:16:03.085113
episodes_test                              598.0
episode_length_test                    16.715719
returns_test                           80.702748
return_std_test                        10.101001
average_reward_test                     4.827993
round_time_test           0 days 00:00:13.660806
round_time_total          0 days 00:16:03.086218
loss_total             11342179781457911742464.0
loss_critic            14177724461321117761536.0
loss_actor                  -221189651652.608002
memory_size                               3392.0 

=== epoch 4/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:29,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                     586
episode_length                         17.037543
returns                                82.404508
return_std                             12.515413
average_reward                          4.836596
round_time                0 days 00:16:06.670668
episodes_test                              582.0
episode_length_test                    17.156357
returns_test                           82.918667
return_std_test                        15.301899
average_reward_test                      4.83323
round_time_test           0 days 00:00:13.902239
round_time_total          0 days 00:16:06.672162
loss_total             11874297749275198619648.0
loss_critic            14842871909622621405184.0
loss_actor                   -226506104586.23999
memory_size                               3392.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<16:13,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                     116
episode_length                         17.155172
returns                                82.951962
return_std                             10.958118
average_reward                          4.835314
round_time                0 days 00:15:47.266826
episodes_test                              574.0
episode_length_test                     17.39547
returns_test                           83.996975
return_std_test                        15.020876
average_reward_test                     4.828628
round_time_test           0 days 00:00:13.374673
round_time_total          0 days 00:15:47.267988
loss_total             12179819019530355605504.0
loss_critic            15224773498004517683200.0
loss_actor                  -229193705840.640015
memory_size                               3392.0 

=== epoch 5/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:06,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:51<00:00,  2.10it/s]
episodes                                     233
episode_length                         17.154506
returns                                82.880246
return_std                             10.456885
average_reward                          4.831469
round_time                0 days 00:15:51.848321
episodes_test                              572.0
episode_length_test                    17.458042
returns_test                           84.451472
return_std_test                        16.315438
average_reward_test                     4.837365
round_time_test           0 days 00:00:13.529415
round_time_total          0 days 00:15:51.849401
loss_total             12597807005407498993664.0
loss_critic            15747258498435713597440.0
loss_actor                  -233719902011.391998
memory_size                               3392.0 

=== epoch 5/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:38,  1.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                     352
episode_length                         17.005682
returns                                82.130569
return_std                              9.560072
average_reward                           4.82955
round_time                0 days 00:15:48.033235
episodes_test                              591.0
episode_length_test                     16.91709
returns_test                            81.64841
return_std_test                         7.967093
average_reward_test                     4.826404
round_time_test           0 days 00:00:13.266126
round_time_total          0 days 00:15:48.034373
loss_total             12955642864960702251008.0
loss_critic            16194553289311327879168.0
loss_actor                  -237014761357.312012
memory_size                               3392.0 

=== epoch 5/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:42,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:29<00:00,  2.15it/s]
episodes                                     474
episode_length                         16.873418
returns                                81.466113
return_std                              8.780275
average_reward                          4.828094
round_time                0 days 00:15:29.862827
episodes_test                              588.0
episode_length_test                    16.984694
returns_test                           81.977266
return_std_test                        12.092459
average_reward_test                     4.826504
round_time_test           0 days 00:00:13.681038
round_time_total          0 days 00:15:29.863901
loss_total             13471061136049254170624.0
loss_critic            16838826136757002567680.0
loss_actor                  -241486363877.376007
memory_size                               3392.0 

=== epoch 5/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:13,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     596
episode_length                         16.763423
returns                                80.890231
return_std                              8.133377
average_reward                           4.82544
round_time                0 days 00:15:09.100655
episodes_test                              593.0
episode_length_test                    16.863406
returns_test                            81.41344
return_std_test                        12.663578
average_reward_test                     4.827817
round_time_test           0 days 00:00:12.915837
round_time_total          0 days 00:15:09.101760
loss_total             13779237803798527213568.0
loss_critic            17224046963069714169856.0
loss_actor                  -245906837520.384003
memory_size                               3392.0 

=== epoch 5/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:58,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     600
episode_length                         16.641667
returns                                80.265061
return_std                              7.142746
average_reward                          4.823122
round_time                0 days 00:15:06.556500
episodes_test                              614.0
episode_length_test                     16.28013
returns_test                           78.373835
return_std_test                         7.435496
average_reward_test                     4.814104
round_time_test           0 days 00:00:13.283014
round_time_total          0 days 00:15:06.557664
loss_total             14603470590829276430336.0
loss_critic            18254337935810257485824.0
loss_actor                  -252426040639.488007
memory_size                               3392.0 

=== epoch 5/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:54,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:05<00:00,  2.21it/s]
episodes                                     604
episode_length                         16.521523
returns                                79.658812
return_std                              6.342005
average_reward                          4.821461
round_time                0 days 00:15:05.589151
episodes_test                              593.0
episode_length_test                    16.858347
returns_test                           81.396609
return_std_test                        13.971224
average_reward_test                     4.828297
round_time_test           0 days 00:00:13.030078
round_time_total          0 days 00:15:05.590227
loss_total             15319191481080839405568.0
loss_critic            19148989026599293681664.0
loss_actor                  -258862638571.519989
memory_size                               3392.0 

=== epoch 5/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:40,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     606
episode_length                         16.480198
returns                                79.424723
return_std                              6.534657
average_reward                          4.819398
round_time                0 days 00:15:07.951417
episodes_test                              605.0
episode_length_test                    16.507438
returns_test                           79.519869
return_std_test                         7.903533
average_reward_test                     4.817206
round_time_test           0 days 00:00:13.117333
round_time_total          0 days 00:15:07.952512
loss_total             15860841604613931008000.0
loss_critic            19826051691219129466880.0
loss_actor                  -264585563316.223999
memory_size                               3392.0 

=== epoch 5/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:47,  2.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     604
episode_length                         16.528146
returns                                79.661258
return_std                              8.864822
average_reward                          4.819828
round_time                0 days 00:15:08.268032
episodes_test                              605.0
episode_length_test                    16.502479
returns_test                           79.503088
return_std_test                        11.689179
average_reward_test                      4.81783
round_time_test           0 days 00:00:13.219775
round_time_total          0 days 00:15:08.269115
loss_total             16263199133600795590656.0
loss_critic            20328998585775314436096.0
loss_actor                  -268157280935.936005
memory_size                             3398.466 

=== epoch 5/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:02,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:05<00:00,  2.21it/s]
episodes                                     601
episode_length                         16.613977
returns                                80.092119
return_std                              9.413427
average_reward                          4.820743
round_time                0 days 00:15:05.989777
episodes_test                              600.0
episode_length_test                    16.661667
returns_test                           80.318907
return_std_test                        13.783448
average_reward_test                     4.820615
round_time_test           0 days 00:00:13.227698
round_time_total          0 days 00:15:05.990884
loss_total             16502671747472466903040.0
loss_critic            20628339348963149742080.0
loss_actor                   -269939473252.35199
memory_size                               3400.0 

=== epoch 5/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     596
episode_length                         16.753356
returns                                 80.78957
return_std                             11.774529
average_reward                          4.822257
round_time                0 days 00:15:10.146398
episodes_test                              588.0
episode_length_test                    17.003401
returns_test                           82.013215
return_std_test                        11.802839
average_reward_test                     4.823363
round_time_test           0 days 00:00:13.185639
round_time_total          0 days 00:15:10.147665
loss_total             16497378736354738831360.0
loss_critic            20621723089358481784832.0
loss_actor                  -268950920142.847992
memory_size                            3416.2765 

=== epoch 5/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:30,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     590
episode_length                         16.915254
returns                                81.643777
return_std                              14.13219
average_reward                          4.826568
round_time                0 days 00:15:09.309362
episodes_test                              597.0
episode_length_test                    16.743719
returns_test                            80.83459
return_std_test                         8.661314
average_reward_test                     4.827795
round_time_test           0 days 00:00:13.098462
round_time_total          0 days 00:15:09.310521
loss_total             16447278121981531127808.0
loss_critic            20559097332791710318592.0
loss_actor                   -270427959697.40799
memory_size                            3421.5905 

=== epoch 5/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:15,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     587
episode_length                         17.015332
returns                                82.136291
return_std                             14.408129
average_reward                          4.827172
round_time                0 days 00:15:07.716474
episodes_test                              591.0
episode_length_test                    16.893401
returns_test                           81.450913
return_std_test                        12.267992
average_reward_test                     4.821446
round_time_test           0 days 00:00:13.062553
round_time_total          0 days 00:15:07.717538
loss_total             16828703121999409446912.0
loss_critic            21035878549388903055360.0
loss_actor                  -272565276237.824005
memory_size                               3428.0 

=== epoch 5/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:32,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     581
episode_length                         17.160069
returns                                82.858603
return_std                             14.062195
average_reward                          4.828628
round_time                0 days 00:15:08.039763
episodes_test                              587.0
episode_length_test                    17.028961
returns_test                           82.135772
return_std_test                        11.620695
average_reward_test                     4.823342
round_time_test           0 days 00:00:12.980616
round_time_total          0 days 00:15:08.040830
loss_total             17225355913472260440064.0
loss_critic            21531694542811356463104.0
loss_actor                  -276909502029.823975
memory_size                               3428.0 

=== epoch 5/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:16,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     580
episode_length                         17.215517
returns                                83.171699
return_std                             14.118841
average_reward                          4.831372
round_time                0 days 00:15:06.734857
episodes_test                              586.0
episode_length_test                    17.054608
returns_test                           82.442853
return_std_test                        11.549757
average_reward_test                     4.834094
round_time_test           0 days 00:00:12.980716
round_time_total          0 days 00:15:06.736238
loss_total             17630484942470160318464.0
loss_critic            22038105804922249281536.0
loss_actor                  -277908367818.752014
memory_size                               3428.0 

=== epoch 5/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:05<00:00,  2.21it/s]
episodes                                     580
episode_length                         17.205172
returns                                83.128044
return_std                             12.564795
average_reward                          4.831471
round_time                0 days 00:15:05.878614
episodes_test                              585.0
episode_length_test                     17.08547
returns_test                           82.583517
return_std_test                        13.398613
average_reward_test                      4.83358
round_time_test           0 days 00:00:13.085037
round_time_total          0 days 00:15:05.879901
loss_total             18102718389542214172672.0
loss_critic            22628397598070087352320.0
loss_actor                  -283070670790.656006
memory_size                               3428.0 

=== epoch 5/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:34,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:05<00:00,  2.21it/s]
episodes                                     583
episode_length                         17.132075
returns                                82.735155
return_std                             10.321135
average_reward                          4.829233
round_time                0 days 00:15:06.149538
episodes_test                              589.0
episode_length_test                    16.977929
returns_test                           81.995613
return_std_test                        13.193248
average_reward_test                     4.829542
round_time_test           0 days 00:00:13.034981
round_time_total          0 days 00:15:06.150613
loss_total             18806436913756107702272.0
loss_critic            23508045728075076337664.0
loss_actor                  -288873528606.719971
memory_size                               3428.0 

=== epoch 5/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:58,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:05<00:00,  2.21it/s]
episodes                                     586
episode_length                         17.032423
returns                                82.258431
return_std                              9.751222
average_reward                          4.829406
round_time                0 days 00:15:06.375572
episodes_test                              600.0
episode_length_test                    16.656667
returns_test                           80.273519
return_std_test                         9.767868
average_reward_test                     4.819347
round_time_test           0 days 00:00:13.203520
round_time_total          0 days 00:15:06.376645
loss_total             19627078551836087549952.0
loss_critic            24533847759842084651008.0
loss_actor                  -295583632867.328003
memory_size                               3428.0 

=== epoch 5/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     589
episode_length                         16.957555
returns                                81.918651
return_std                               9.08191
average_reward                          4.831026
round_time                0 days 00:15:08.641215
episodes_test                              593.0
episode_length_test                    16.851602
returns_test                           81.271155
return_std_test                         8.320856
average_reward_test                     4.822796
round_time_test           0 days 00:00:13.283758
round_time_total          0 days 00:15:08.642328
loss_total             20413570891670072852480.0
loss_critic            25516963168449751678976.0
loss_actor                  -301050182377.471985
memory_size                               3428.0 

=== epoch 5/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:29,  2.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     588
episode_length                         16.964286
returns                                81.957771
return_std                             10.255547
average_reward                          4.831132
round_time                0 days 00:15:10.767112
episodes_test                              569.0
episode_length_test                    17.567663
returns_test                           85.190306
return_std_test                        17.102273
average_reward_test                     4.849287
round_time_test           0 days 00:00:12.984094
round_time_total          0 days 00:15:10.768445
loss_total             21066952101603015917568.0
loss_critic            26333689643570498109440.0
loss_actor                  -303940612161.536011
memory_size                            3428.1705 

=== epoch 5/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     586
episode_length                         17.049488
returns                                82.458424
return_std                             13.945908
average_reward                          4.836423
round_time                0 days 00:15:10.811305
episodes_test                              599.0
episode_length_test                    16.684474
returns_test                            80.56544
return_std_test                        10.989344
average_reward_test                     4.828794
round_time_test           0 days 00:00:13.342399
round_time_total          0 days 00:15:10.812396
loss_total             21005256434060789547008.0
loss_critic            26256570068853602648064.0
loss_actor                  -306862152679.424011
memory_size                            3436.5745 

=== epoch 5/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:29,  2.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     585
episode_length                         17.061538
returns                                  82.5647
return_std                             14.243679
average_reward                          4.839456
round_time                0 days 00:15:06.936881
episodes_test                              581.0
episode_length_test                     17.20654
returns_test                           83.307829
return_std_test                        16.199439
average_reward_test                     4.841667
round_time_test           0 days 00:00:13.182689
round_time_total          0 days 00:15:06.938008
loss_total             21241068237021184524288.0
loss_critic            26551334812280258822144.0
loss_actor                  -307947498848.255981
memory_size                               3453.0 

=== epoch 5/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:38,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     584
episode_length                         17.092466
returns                                82.732473
return_std                             14.396855
average_reward                          4.840182
round_time                0 days 00:15:06.569198
episodes_test                              592.0
episode_length_test                    16.881757
returns_test                           81.597581
return_std_test                          9.60022
average_reward_test                     4.833513
round_time_test           0 days 00:00:13.128637
round_time_total          0 days 00:15:06.570418
loss_total             21821738279279278424064.0
loss_critic            27277172348636589195264.0
loss_actor                  -310970060529.664001
memory_size                               3453.0 

=== epoch 5/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:59,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     587
episode_length                         17.010221
returns                                82.318851
return_std                               14.3022
average_reward                          4.839301
round_time                0 days 00:15:08.047255
episodes_test                              597.0
episode_length_test                    16.726968
returns_test                           80.749646
return_std_test                        12.577494
average_reward_test                     4.827583
round_time_test           0 days 00:00:13.095517
round_time_total          0 days 00:15:08.048590
loss_total             22042938663281325244416.0
loss_critic            27553672802321240686592.0
loss_actor                  -312434371665.919983
memory_size                               3453.0 

=== epoch 5/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:28,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     590
episode_length                         16.922034
returns                                81.896228
return_std                             13.546806
average_reward                           4.83954
round_time                0 days 00:15:07.606381
episodes_test                              604.0
episode_length_test                    16.534768
returns_test                           79.819696
return_std_test                        10.776777
average_reward_test                      4.82734
round_time_test           0 days 00:00:13.052787
round_time_total          0 days 00:15:07.607475
loss_total             22507416703222090301440.0
loss_critic            28134270364772831592448.0
loss_actor                  -316170279854.080017
memory_size                               3453.0 

=== epoch 5/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:22,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     595
episode_length                         16.763025
returns                                80.995009
return_std                              9.735965
average_reward                          4.831835
round_time                0 days 00:15:09.540446
episodes_test                              597.0
episode_length_test                    16.726968
returns_test                           80.678079
return_std_test                        11.143756
average_reward_test                     4.823166
round_time_test           0 days 00:00:13.007045
round_time_total          0 days 00:15:09.541772
loss_total             23441761719486039195648.0
loss_critic            29302201626658517024768.0
loss_actor                  -323210976690.176025
memory_size                               3453.0 

=== epoch 5/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:09,  1.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     600
episode_length                         16.633333
returns                                80.274894
return_std                              8.834019
average_reward                          4.826063
round_time                0 days 00:15:08.773500
episodes_test                              595.0
episode_length_test                    16.793277
returns_test                           80.843214
return_std_test                         9.879366
average_reward_test                     4.814041
round_time_test           0 days 00:00:13.312310
round_time_total          0 days 00:15:08.774587
loss_total             24281548783049043345408.0
loss_critic            30351935452453101633536.0
loss_actor                  -331548307243.007996
memory_size                               3453.0 

=== epoch 5/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:01,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     602
episode_length                         16.598007
returns                                80.090716
return_std                              10.24765
average_reward                          4.825333
round_time                0 days 00:15:07.406000
episodes_test                              604.0
episode_length_test                    16.528146
returns_test                           79.611744
return_std_test                        13.310698
average_reward_test                     4.817072
round_time_test           0 days 00:00:12.984168
round_time_total          0 days 00:15:07.407097
loss_total             24951853877492803698688.0
loss_critic            31189816840351784108032.0
loss_actor                     -333669794906.112
memory_size                             3453.374 

=== epoch 5/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:44,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     602
episode_length                         16.591362
returns                                80.024517
return_std                             10.463611
average_reward                          4.823196
round_time                0 days 00:15:08.395036
episodes_test                              604.0
episode_length_test                    16.549669
returns_test                           79.812708
return_std_test                        10.045605
average_reward_test                     4.822657
round_time_test           0 days 00:00:13.140433
round_time_total          0 days 00:15:08.396097
loss_total             26059200231767053172736.0
loss_critic            32573999703537177591808.0
loss_actor                   -337886607605.76001
memory_size                               3461.0 

=== epoch 5/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     600
episode_length                            16.645
returns                                80.233346
return_std                             11.780235
average_reward                          4.820278
round_time                0 days 00:15:08.981299
episodes_test                              594.0
episode_length_test                    16.821549
returns_test                           80.987315
return_std_test                        11.174668
average_reward_test                     4.814529
round_time_test           0 days 00:00:13.157056
round_time_total          0 days 00:15:08.982506
loss_total             27368616660611004104704.0
loss_critic            34210770238888428437504.0
loss_actor                  -338769241784.320007
memory_size                            3466.6875 

=== epoch 5/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:05,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     603
episode_length                         16.568823
returns                                79.865993
return_std                             11.339722
average_reward                          4.820269
round_time                0 days 00:15:08.814272
episodes_test                              604.0
episode_length_test                    16.554636
returns_test                           79.831881
return_std_test                        10.123802
average_reward_test                     4.822337
round_time_test           0 days 00:00:12.996668
round_time_total          0 days 00:15:08.815336
loss_total             27878940791941546115072.0
loss_critic            34848675372511570952192.0
loss_actor                  -343290202767.359985
memory_size                               3473.0 

=== epoch 5/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:49,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     604
episode_length                         16.549669
returns                                 79.78295
return_std                             11.295872
average_reward                          4.820805
round_time                0 days 00:15:08.489337
episodes_test                              607.0
episode_length_test                      16.4514
returns_test                           79.370348
return_std_test                        11.661528
average_reward_test                     4.824481
round_time_test           0 days 00:00:13.443746
round_time_total          0 days 00:15:08.490411
loss_total             28880257817072904437760.0
loss_critic            36100321673910492856320.0
loss_actor                  -349813789933.567993
memory_size                               3473.0 

=== epoch 5/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     603
episode_length                         16.560531
returns                                79.855017
return_std                             10.231007
average_reward                          4.821982
round_time                0 days 00:15:09.226319
episodes_test                              615.0
episode_length_test                    16.255285
returns_test                           78.443774
return_std_test                         8.567568
average_reward_test                     4.825771
round_time_test           0 days 00:00:13.000706
round_time_total          0 days 00:15:09.227392
loss_total             29602556329140554825728.0
loss_critic            37003194799639834394624.0
loss_actor                  -354261712306.176025
memory_size                               3473.0 

=== epoch 5/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:08,  1.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     604
episode_length                         16.546358
returns                                79.809534
return_std                             11.915156
average_reward                          4.823407
round_time                0 days 00:15:10.430283
episodes_test                              606.0
episode_length_test                     16.50165
returns_test                           79.704836
return_std_test                         8.285772
average_reward_test                     4.830113
round_time_test           0 days 00:00:12.833986
round_time_total          0 days 00:15:10.431354
loss_total             30350902370045727342592.0
loss_critic            37938627368785694162944.0
loss_actor                  -358409627746.304016
memory_size                             3473.793 

=== epoch 5/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:14,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     606
episode_length                         16.478548
returns                                79.471545
return_std                             10.390152
average_reward                          4.822711
round_time                0 days 00:15:09.416389
episodes_test                              609.0
episode_length_test                    16.395731
returns_test                           78.990237
return_std_test                         9.569684
average_reward_test                     4.817719
round_time_test           0 days 00:00:13.150031
round_time_total          0 days 00:15:09.417465
loss_total             31393502376374017458176.0
loss_critic            39241877315897461309440.0
loss_actor                  -366320975675.392029
memory_size                               3485.0 

=== epoch 5/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:31,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     603
episode_length                         16.560531
returns                                79.925818
return_std                             10.622162
average_reward                          4.826235
round_time                0 days 00:15:09.889699
episodes_test                              588.0
episode_length_test                    16.984694
returns_test                           82.058929
return_std_test                        11.694568
average_reward_test                     4.831318
round_time_test           0 days 00:00:12.982872
round_time_total          0 days 00:15:09.890772
loss_total             32738115500281276923904.0
loss_critic            40922643713040979066880.0
loss_actor                  -373031618084.864014
memory_size                               3485.0 

=== epoch 5/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:53,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                     598
episode_length                         16.685619
returns                                80.606416
return_std                             12.757396
average_reward                          4.830827
round_time                0 days 00:15:14.009100
episodes_test                              605.0
episode_length_test                    16.517355
returns_test                           79.701184
return_std_test                        12.017856
average_reward_test                     4.825331
round_time_test           0 days 00:00:13.128304
round_time_total          0 days 00:15:14.010258
loss_total             33850259330947086286848.0
loss_critic            42312823540216779243520.0
loss_actor                  -377970155585.536011
memory_size                            3491.0045 

=== epoch 5/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:08,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     592
episode_length                         16.858108
returns                                81.514191
return_std                             15.575103
average_reward                          4.835256
round_time                0 days 00:15:11.735955
episodes_test                              597.0
episode_length_test                    16.740369
returns_test                           80.738197
return_std_test                        10.066388
average_reward_test                     4.823006
round_time_test           0 days 00:00:13.219799
round_time_total          0 days 00:15:11.737153
loss_total             34705888097807441592320.0
loss_critic            43382359412660886306816.0
loss_actor                  -380072049934.335999
memory_size                             3510.875 

=== epoch 5/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:01,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     590
episode_length                         16.911864
returns                                81.776285
return_std                             14.173918
average_reward                          4.835326
round_time                0 days 00:15:11.868751
episodes_test                              592.0
episode_length_test                    16.863176
returns_test                           81.509213
return_std_test                         8.952339
average_reward_test                     4.833513
round_time_test           0 days 00:00:13.201404
round_time_total          0 days 00:15:11.869833
loss_total             34916410196580267720704.0
loss_critic            43645512049356240322560.0
loss_actor                  -381040093331.455994
memory_size                               3522.0 

=== epoch 5/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:32,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     589
episode_length                         16.949066
returns                                 81.99107
return_std                              14.88278
average_reward                          4.837481
round_time                0 days 00:15:13.288285
episodes_test                              610.0
episode_length_test                    16.391803
returns_test                           78.995051
return_std_test                         7.230757
average_reward_test                     4.819191
round_time_test           0 days 00:00:13.143827
round_time_total          0 days 00:15:13.289360
loss_total             35850792628749917487104.0
loss_critic            44813490022014307532800.0
loss_actor                       -388109058048.0
memory_size                               3522.0 

=== epoch 5/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:13,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     589
episode_length                         16.947368
returns                                81.972531
return_std                             15.023673
average_reward                          4.836822
round_time                0 days 00:15:09.573144
episodes_test                              605.0
episode_length_test                     16.51405
returns_test                           79.681777
return_std_test                         9.312788
average_reward_test                      4.82508
round_time_test           0 days 00:00:13.141836
round_time_total          0 days 00:15:09.574304
loss_total             36287721545161806708736.0
loss_critic            45359651195535933571072.0
loss_actor                  -388232472592.383972
memory_size                               3522.0 

=== epoch 5/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:47,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.20it/s]
episodes                                     590
episode_length                         16.908475
returns                                81.752659
return_std                             13.622131
average_reward                          4.835342
round_time                0 days 00:15:11.610809
episodes_test                              588.0
episode_length_test                    17.003401
returns_test                           82.118375
return_std_test                        13.147817
average_reward_test                     4.829544
round_time_test           0 days 00:00:13.280058
round_time_total          0 days 00:15:11.611929
loss_total             36383696180560831447040.0
loss_critic            45479619459385412026368.0
loss_actor                  -388122662486.015991
memory_size                               3522.0 

=== epoch 5/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     594
episode_length                         16.799663
returns                                81.176777
return_std                             10.263237
average_reward                          4.831943
round_time                0 days 00:15:09.296902
episodes_test                              592.0
episode_length_test                    16.880068
returns_test                            81.67586
return_std_test                        11.901666
average_reward_test                     4.838618
round_time_test           0 days 00:00:12.812555
round_time_total          0 days 00:15:09.297975
loss_total             37312706438745875808256.0
loss_critic            46640882304916192755712.0
loss_actor                  -395194600325.119995
memory_size                               3522.0 

=== epoch 5/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:02,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     598
episode_length                         16.692308
returns                                80.618717
return_std                              9.822506
average_reward                          4.829609
round_time                0 days 00:15:12.144230
episodes_test                              602.0
episode_length_test                    16.604651
returns_test                           80.168595
return_std_test                         7.648441
average_reward_test                     4.828125
round_time_test           0 days 00:00:13.035208
round_time_total          0 days 00:15:12.145302
loss_total             38746863130967766204416.0
loss_critic            48433578098558177902592.0
loss_actor                  -404766379802.624023
memory_size                               3522.0 

=== epoch 5/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:19,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     600
episode_length                         16.646667
returns                                80.410183
return_std                              8.698096
average_reward                          4.830317
round_time                0 days 00:15:10.437710
episodes_test                              598.0
episode_length_test                    16.717391
returns_test                           80.944487
return_std_test                        10.517452
average_reward_test                      4.84195
round_time_test           0 days 00:00:13.145372
round_time_total          0 days 00:15:10.439263
loss_total             39979217031065396838400.0
loss_critic            49974020419214811070464.0
loss_actor                  -409568310460.416016
memory_size                               3522.0 

=== epoch 5/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     601
episode_length                         16.612313
returns                                80.201138
return_std                              8.478305
average_reward                          4.827755
round_time                0 days 00:15:12.296614
episodes_test                              595.0
episode_length_test                    16.779832
returns_test                           81.064861
return_std_test                         13.35658
average_reward_test                     4.831346
round_time_test           0 days 00:00:13.201089
round_time_total          0 days 00:15:12.297686
loss_total             40597009428774475792384.0
loss_critic            50746260884403748077568.0
loss_actor                  -412503721328.640015
memory_size                               3522.0 

=== epoch 5/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:01,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     597
episode_length                         16.737018
returns                                80.754948
return_std                             10.688402
average_reward                          4.824916
round_time                0 days 00:15:12.311618
episodes_test                              598.0
episode_length_test                    16.722408
returns_test                           80.707501
return_std_test                        11.619462
average_reward_test                     4.826309
round_time_test           0 days 00:00:12.935752
round_time_total          0 days 00:15:12.312695
loss_total             41128181120785738366976.0
loss_critic            51410225529254165086208.0
loss_actor                  -416082808242.176025
memory_size                             3525.105 

=== epoch 5/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:01,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     596
episode_length                         16.765101
returns                                80.846452
return_std                             10.796131
average_reward                          4.822332
round_time                0 days 00:15:12.147984
episodes_test                              581.0
episode_length_test                    17.204819
returns_test                           83.126323
return_std_test                        13.668426
average_reward_test                       4.8316
round_time_test           0 days 00:00:13.165640
round_time_total          0 days 00:15:12.149051
loss_total             42201281418946316075008.0
loss_critic            52751600803157179891712.0
loss_actor                  -420705613316.096008
memory_size                               3534.0 

=== epoch 5/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:26,  1.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     588
episode_length                         17.005102
returns                                 82.00118
return_std                             13.099287
average_reward                          4.822145
round_time                0 days 00:15:13.252020
episodes_test                              577.0
episode_length_test                    17.311958
returns_test                           83.676983
return_std_test                        14.410228
average_reward_test                     4.833466
round_time_test           0 days 00:00:13.078335
round_time_total          0 days 00:15:13.253260
loss_total             43033018800326760726528.0
loss_critic            53791272525097651404800.0
loss_actor                  -427881684844.544006
memory_size                             3537.724 

=== epoch 5/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:49,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                     581
episode_length                         17.168675
returns                                82.734433
return_std                             14.354013
average_reward                          4.818874
round_time                0 days 00:15:12.874896
episodes_test                              582.0
episode_length_test                    17.166667
returns_test                           82.875466
return_std_test                        14.636322
average_reward_test                     4.827705
round_time_test           0 days 00:00:13.338054
round_time_total          0 days 00:15:12.875973
loss_total             43656326988997809143808.0
loss_critic            54570407753618114478080.0
loss_actor                  -429610691772.416016
memory_size                            3557.4275 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<16:02,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     116
episode_length                         17.155172
returns                                83.003633
return_std                             12.337341
average_reward                          4.838321
round_time                0 days 00:15:09.089349
episodes_test                              586.0
episode_length_test                     17.06314
returns_test                            82.44339
return_std_test                        13.186821
average_reward_test                     4.831674
round_time_test           0 days 00:00:13.371273
round_time_total          0 days 00:15:09.090448
loss_total             43996853393706123788288.0
loss_critic            54996065824805708890112.0
loss_actor                  -431121608310.783997
memory_size                               3560.0 

=== epoch 6/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:44,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:06<00:00,  2.21it/s]
episodes                                     232
episode_length                         17.228448
returns                                83.324611
return_std                             13.523047
average_reward                          4.836483
round_time                0 days 00:15:07.503921
episodes_test                              591.0
episode_length_test                    16.895093
returns_test                           81.686924
return_std_test                        10.366349
average_reward_test                     4.834907
round_time_test           0 days 00:00:12.958737
round_time_total          0 days 00:15:07.505045
loss_total             45666279438194598477824.0
loss_critic            57082848231234560065536.0
loss_actor                  -440699546009.599976
memory_size                             3561.793 

=== epoch 6/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:12,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     353
episode_length                          16.96034
returns                                81.928464
return_std                             12.123515
average_reward                           4.83069
round_time                0 days 00:15:12.957530
episodes_test                              597.0
episode_length_test                    16.720268
returns_test                           80.852353
return_std_test                        14.230728
average_reward_test                     4.835646
round_time_test           0 days 00:00:12.948558
round_time_total          0 days 00:15:12.958625
loss_total             47242908100792355389440.0
loss_critic            59053634020638212489216.0
loss_actor                  -445517444775.935974
memory_size                               3564.0 

=== epoch 6/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:55,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     470
episode_length                         16.993617
returns                                82.078639
return_std                             13.053059
average_reward                          4.829956
round_time                0 days 00:15:09.603451
episodes_test                              595.0
episode_length_test                    16.779832
returns_test                           81.076521
return_std_test                        14.789368
average_reward_test                     4.831744
round_time_test           0 days 00:00:13.028348
round_time_total          0 days 00:15:09.604548
loss_total             47116768891017562685440.0
loss_critic            58895960064714717790208.0
loss_actor                  -447145611198.463989
memory_size                             3571.865 

=== epoch 6/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:45,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     588
episode_length                         16.996599
returns                                 82.10044
return_std                             12.459645
average_reward                           4.83042
round_time                0 days 00:15:08.954653
episodes_test                              592.0
episode_length_test                    16.866554
returns_test                           81.427286
return_std_test                        11.082068
average_reward_test                     4.827666
round_time_test           0 days 00:00:13.080911
round_time_total          0 days 00:15:08.955773
loss_total             47872682046649823920128.0
loss_critic            59840851433819750793216.0
loss_actor                  -447798590406.656006
memory_size                               3576.0 

=== epoch 6/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<20:33,  1.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     586
episode_length                         17.040956
returns                                82.325062
return_std                             14.056157
average_reward                          4.830997
round_time                0 days 00:15:11.020660
episodes_test                              590.0
episode_length_test                    16.930508
returns_test                            81.76508
return_std_test                        11.938153
average_reward_test                     4.829439
round_time_test           0 days 00:00:13.446704
round_time_total          0 days 00:15:11.021738
loss_total             49390351564509888380928.0
loss_critic            61737938260776080900096.0
loss_actor                  -456172508676.096008
memory_size                            3579.8775 

=== epoch 6/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:07,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     583
episode_length                         17.111492
returns                                82.712207
return_std                              14.61724
average_reward                          4.833621
round_time                0 days 00:15:12.388339
episodes_test                              585.0
episode_length_test                    17.090598
returns_test                           82.415993
return_std_test                        12.815997
average_reward_test                     4.822312
round_time_test           0 days 00:00:13.418630
round_time_total          0 days 00:15:12.389438
loss_total             50849052875263899074560.0
loss_critic            63561314977750118825984.0
loss_actor                  -461353292431.359985
memory_size                             3596.507 

=== epoch 6/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:38,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     581
episode_length                         17.187608
returns                                83.095431
return_std                             14.662244
average_reward                          4.834766
round_time                0 days 00:15:11.220127
episodes_test                              584.0
episode_length_test                    17.118151
returns_test                             82.6979
return_std_test                        13.980741
average_reward_test                     4.831038
round_time_test           0 days 00:00:13.239585
round_time_total          0 days 00:15:11.221213
loss_total             52202877878584718917632.0
loss_critic            65253596200375943168000.0
loss_actor                   -466986768007.16803
memory_size                               3599.0 

=== epoch 6/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:45,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     576
episode_length                         17.354167
returns                                84.049076
return_std                             17.092017
average_reward                          4.843131
round_time                0 days 00:15:13.342458
episodes_test                              588.0
episode_length_test                    16.986395
returns_test                           82.212613
return_std_test                        16.211489
average_reward_test                     4.839876
round_time_test           0 days 00:00:13.007126
round_time_total          0 days 00:15:13.343522
loss_total             53931632267278837350400.0
loss_critic            67414539142052024680448.0
loss_actor                  -467817379872.768005
memory_size                             3620.999 

=== epoch 6/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:16,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     573
episode_length                         17.410122
returns                                84.351978
return_std                             17.472321
average_reward                          4.845157
round_time                0 days 00:15:11.180580
episodes_test                              576.0
episode_length_test                    17.359375
returns_test                           84.091381
return_std_test                        14.408761
average_reward_test                     4.844156
round_time_test           0 days 00:00:13.022376
round_time_total          0 days 00:15:11.181673
loss_total             55287178045351775961088.0
loss_critic            69108971431352764203008.0
loss_actor                  -476110995849.216003
memory_size                               3622.0 

=== epoch 6/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:23,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.19it/s]
episodes                                     577
episode_length                         17.308492
returns                                83.846131
return_std                             15.835718
average_reward                          4.844463
round_time                0 days 00:15:12.406235
episodes_test                              575.0
episode_length_test                    17.373913
returns_test                           84.372456
return_std_test                        15.813103
average_reward_test                     4.856215
round_time_test           0 days 00:00:13.040652
round_time_total          0 days 00:15:12.407777
loss_total             55519207917311043305472.0
loss_critic            69399008673067073273856.0
loss_actor                  -482622697242.624023
memory_size                               3622.0 

=== epoch 6/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:18,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     581
episode_length                         17.173838
returns                                83.158911
return_std                             15.776717
average_reward                          4.842388
round_time                0 days 00:15:10.989743
episodes_test                              593.0
episode_length_test                    16.849916
returns_test                           81.496771
return_std_test                        11.036501
average_reward_test                     4.836655
round_time_test           0 days 00:00:13.160044
round_time_total          0 days 00:15:10.990830
loss_total             57617638313344827916288.0
loss_critic            72022046686968136335360.0
loss_actor                  -486245874204.671997
memory_size                              3626.08 

=== epoch 6/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:11<00:00,  2.20it/s]
episodes                                     581
episode_length                         17.194492
returns                                83.304166
return_std                             15.619618
average_reward                          4.844771
round_time                0 days 00:15:11.598434
episodes_test                              590.0
episode_length_test                    16.932203
returns_test                           82.020147
return_std_test                        14.424364
average_reward_test                     4.843999
round_time_test           0 days 00:00:13.077526
round_time_total          0 days 00:15:11.599539
loss_total             58167116623305091579904.0
loss_critic            72708894568507485716480.0
loss_actor                  -493501540155.392029
memory_size                               3637.0 

=== epoch 6/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:07,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:07<00:00,  2.20it/s]
episodes                                     587
episode_length                         17.030664
returns                                82.440768
return_std                             11.946782
average_reward                          4.840748
round_time                0 days 00:15:07.729003
episodes_test                              589.0
episode_length_test                     16.96944
returns_test                           82.045351
return_std_test                        10.646653
average_reward_test                     4.834934
round_time_test           0 days 00:00:13.176257
round_time_total          0 days 00:15:07.730088
loss_total             60091579507212015370240.0
loss_critic            75114473128636624928768.0
loss_actor                   -498109773447.16803
memory_size                               3637.0 

=== epoch 6/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:43,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     590
episode_length                         16.923729
returns                                81.877393
return_std                             11.311263
average_reward                          4.838329
round_time                0 days 00:15:08.805865
episodes_test                              594.0
episode_length_test                     16.83165
returns_test                           81.381729
return_std_test                        10.516857
average_reward_test                     4.835059
round_time_test           0 days 00:00:12.957673
round_time_total          0 days 00:15:08.806939
loss_total             60437775818876306587648.0
loss_critic            75547218504424714403840.0
loss_actor                  -499916285820.927979
memory_size                               3637.0 

=== epoch 6/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:16,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     584
episode_length                         17.099315
returns                                82.850276
return_std                             12.573918
average_reward                          4.845164
round_time                0 days 00:15:10.346391
episodes_test                              577.0
episode_length_test                    17.308492
returns_test                           83.689883
return_std_test                        16.953058
average_reward_test                     4.835171
round_time_test           0 days 00:00:13.071682
round_time_total          0 days 00:15:10.347835
loss_total             62026964466892047122432.0
loss_critic            77533704274474948362240.0
loss_actor                  -508840636956.671997
memory_size                               3637.0 

=== epoch 6/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:06,  1.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:08<00:00,  2.20it/s]
episodes                                     563
episode_length                         17.737123
returns                                86.057772
return_std                             18.646784
average_reward                          4.851887
round_time                0 days 00:15:09.439318
episodes_test                              543.0
episode_length_test                    18.388582
returns_test                           89.448548
return_std_test                        19.200078
average_reward_test                     4.864257
round_time_test           0 days 00:00:13.148798
round_time_total          0 days 00:15:09.440532
loss_total             61879297508214667476992.0
loss_critic            77349120645371055308800.0
loss_actor                  -507064750161.919983
memory_size                             3644.419 

=== epoch 6/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:00,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                     540
episode_length                         18.483333
returns                                89.904904
return_std                             22.659721
average_reward                          4.864097
round_time                0 days 00:15:13.714206
episodes_test                              461.0
episode_length_test                    21.644252
returns_test                          105.534404
return_std_test                        35.179283
average_reward_test                     4.876092
round_time_test           0 days 00:00:13.246489
round_time_total          0 days 00:15:13.715280
loss_total             62712859493710736392192.0
loss_critic            78391073090367922372608.0
loss_actor                  -509886909874.176025
memory_size                            3692.8675 

=== epoch 6/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:43,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                     529
episode_length                         18.865784
returns                                91.872245
return_std                             23.395224
average_reward                          4.869629
round_time                0 days 00:15:13.879101
episodes_test                              502.0
episode_length_test                    19.886454
returns_test                            97.26679
return_std_test                        22.160763
average_reward_test                     4.891256
round_time_test           0 days 00:00:12.864580
round_time_total          0 days 00:15:13.880176
loss_total             64881069798404017618944.0
loss_critic            81101335871592379449344.0
loss_actor                  -516305373986.815979
memory_size                               3704.0 

=== epoch 6/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     516
episode_length                         19.335271
returns                                94.307577
return_std                             25.202328
average_reward                          4.877537
round_time                0 days 00:15:13.250597
episodes_test                              531.0
episode_length_test                    18.815443
returns_test                           91.592292
return_std_test                        18.773955
average_reward_test                      4.86794
round_time_test           0 days 00:00:13.139136
round_time_total          0 days 00:15:13.251676
loss_total             65819660140738003337216.0
loss_critic            82274573886485635792896.0
loss_actor                  -517920780025.856018
memory_size                            3713.4875 

=== epoch 6/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:24,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                     515
episode_length                          19.38835
returns                                94.534653
return_std                             25.307424
average_reward                          4.875713
round_time                0 days 00:15:10.338730
episodes_test                              540.0
episode_length_test                    18.512963
returns_test                           90.201699
return_std_test                         19.30848
average_reward_test                      4.87237
round_time_test           0 days 00:00:13.180549
round_time_total          0 days 00:15:10.339983
loss_total             66931657704621012418560.0
loss_critic            83664570761681978785792.0
loss_actor                  -519707048607.744019
memory_size                               3734.0 

=== epoch 6/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:40,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                     527
episode_length                         18.950664
returns                                92.425945
return_std                             22.568694
average_reward                          4.877134
round_time                0 days 00:15:14.826118
episodes_test                              573.0
episode_length_test                     17.43281
returns_test                           84.710308
return_std_test                        13.541598
average_reward_test                     4.859213
round_time_test           0 days 00:00:13.019920
round_time_total          0 days 00:15:14.827673
loss_total             67768471415148836290560.0
loss_critic            84710587926018923692032.0
loss_actor                  -522402323562.495972
memory_size                              3737.74 

=== epoch 6/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:33,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                     546
episode_length                         18.278388
returns                                  89.0236
return_std                             19.341542
average_reward                          4.870237
round_time                0 days 00:15:14.019420
episodes_test                              552.0
episode_length_test                    18.108696
returns_test                           88.258847
return_std_test                        16.536309
average_reward_test                     4.873841
round_time_test           0 days 00:00:13.200280
round_time_total          0 days 00:15:14.020516
loss_total             68073988368703747522560.0
loss_critic            85092484047875306684416.0
loss_actor                  -523901635575.807983
memory_size                               3751.0 

=== epoch 6/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:17,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                     554
episode_length                         18.027076
returns                                87.691741
return_std                             19.845539
average_reward                          4.864342
round_time                0 days 00:15:10.806024
episodes_test                              597.0
episode_length_test                    16.740369
returns_test                            81.10204
return_std_test                          9.81187
average_reward_test                     4.844716
round_time_test           0 days 00:00:13.304892
round_time_total          0 days 00:15:10.807103
loss_total             67612089970926288896000.0
loss_critic            84515111111733545009152.0
loss_actor                  -528183442735.104004
memory_size                             3755.903 

=== epoch 6/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:46,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                     566
episode_length                         17.644876
returns                                85.785572
return_std                             17.430213
average_reward                          4.861719
round_time                0 days 00:15:14.702879
episodes_test                              601.0
episode_length_test                    16.635607
returns_test                           80.332797
return_std_test                        10.317022
average_reward_test                     4.828982
round_time_test           0 days 00:00:13.384504
round_time_total          0 days 00:15:14.703949
loss_total             68904334884270967357440.0
loss_critic            86130417279028625932288.0
loss_actor                  -532947818676.223999
memory_size                               3774.0 

=== epoch 6/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:50,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                     573
episode_length                         17.431065
returns                                84.693172
return_std                               17.2818
average_reward                          4.858701
round_time                0 days 00:15:13.076843
episodes_test                              601.0
episode_length_test                    16.612313
returns_test                           80.404458
return_std_test                         8.034969
average_reward_test                     4.840212
round_time_test           0 days 00:00:13.071818
round_time_total          0 days 00:15:13.077926
loss_total             70744025839103754895360.0
loss_critic            88430030779759244345344.0
loss_actor                  -538143188451.328003
memory_size                             3782.451 

=== epoch 6/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:48,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                     583
episode_length                         17.101201
returns                                82.961836
return_std                             15.608798
average_reward                          4.851937
round_time                0 days 00:15:15.042688
episodes_test                              593.0
episode_length_test                    16.838111
returns_test                           81.551181
return_std_test                         10.56982
average_reward_test                     4.843188
round_time_test           0 days 00:00:13.232425
round_time_total          0 days 00:15:15.044095
loss_total             70563434466421952741376.0
loss_critic            88204291649475386015744.0
loss_actor                  -543065422544.895996
memory_size                             3793.626 

=== epoch 6/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:21,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:20<00:00,  2.17it/s]
episodes                                     588
episode_length                          16.97619
returns                                82.273466
return_std                             15.135778
average_reward                          4.846274
round_time                0 days 00:15:21.071427
episodes_test                              595.0
episode_length_test                    16.801681
returns_test                           81.248958
return_std_test                        11.828721
average_reward_test                     4.835789
round_time_test           0 days 00:00:13.341055
round_time_total          0 days 00:15:21.072514
loss_total             70781151066725397037056.0
loss_critic            88476437453334942580736.0
loss_actor                  -541358621966.335999
memory_size                               3794.0 

=== epoch 6/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:49,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                     595
episode_length                         16.769748
returns                                81.194905
return_std                             12.380673
average_reward                          4.841636
round_time                0 days 00:15:16.880557
episodes_test                              598.0
episode_length_test                    16.707358
returns_test                            80.76012
return_std_test                        10.747717
average_reward_test                     4.833803
round_time_test           0 days 00:00:13.044605
round_time_total          0 days 00:15:16.881640
loss_total             71765732760677450252288.0
loss_critic            89707164491680525058048.0
loss_actor                  -548019955236.864014
memory_size                               3794.0 

=== epoch 6/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:46,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                     591
episode_length                         16.890017
returns                                81.730364
return_std                             12.861384
average_reward                          4.838915
round_time                0 days 00:15:14.631204
episodes_test                              585.0
episode_length_test                    17.092308
returns_test                            82.83958
return_std_test                        15.644775
average_reward_test                     4.846608
round_time_test           0 days 00:00:13.210044
round_time_total          0 days 00:15:14.632518
loss_total             74266388305483281727488.0
loss_critic            92832983784765087809536.0
loss_actor                  -555278465056.767944
memory_size                            3794.4735 

=== epoch 6/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:32,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                     584
episode_length                         17.075342
returns                                82.664706
return_std                             12.579998
average_reward                          4.841464
round_time                0 days 00:15:16.921167
episodes_test                              552.0
episode_length_test                    18.115942
returns_test                           87.919419
return_std_test                        17.204336
average_reward_test                     4.853152
round_time_test           0 days 00:00:13.225849
round_time_total          0 days 00:15:16.922520
loss_total             73053074777370487947264.0
loss_critic            91316342022116976099328.0
loss_actor                  -554630252838.911987
memory_size                               3796.0 

=== epoch 6/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:19,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:17<00:00,  2.18it/s]
episodes                                     572
episode_length                         17.468531
returns                                84.638246
return_std                             16.621808
average_reward                           4.84525
round_time                0 days 00:15:17.884943
episodes_test                              527.0
episode_length_test                    18.967742
returns_test                           91.993346
return_std_test                         28.47645
average_reward_test                     4.850003
round_time_test           0 days 00:00:13.049530
round_time_total          0 days 00:15:17.886120
loss_total             76217037574337338540032.0
loss_critic            95271295385187867688960.0
loss_actor                  -546229893316.607971
memory_size                             3820.503 

=== epoch 6/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                     555
episode_length                         17.998198
returns                                87.281318
return_std                             20.651502
average_reward                          4.849721
round_time                0 days 00:15:17.019702
episodes_test                              538.0
episode_length_test                     18.55948
returns_test                           89.674693
return_std_test                         25.36336
average_reward_test                     4.831767
round_time_test           0 days 00:00:13.051440
round_time_total          0 days 00:15:17.020775
loss_total             77962162240306280398848.0
loss_critic            97452701315602338283520.0
loss_actor                  -536333650739.200012
memory_size                             3845.037 

=== epoch 6/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:14,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.18it/s]
episodes                                     542
episode_length                         18.405904
returns                                89.343054
return_std                             23.983413
average_reward                          4.854134
round_time                0 days 00:15:16.340099
episodes_test                              507.0
episode_length_test                    19.692308
returns_test                           95.830315
return_std_test                        33.984054
average_reward_test                     4.866274
round_time_test           0 days 00:00:13.221160
round_time_total          0 days 00:15:16.341189
loss_total             76183271362127263170560.0
loss_critic            95229087631465766715392.0
loss_actor                       -529283088384.0
memory_size                             3896.949 

=== epoch 6/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:49,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.19it/s]
episodes                                     545
episode_length                          18.32844
returns                                 88.96309
return_std                             23.445307
average_reward                          4.853803
round_time                0 days 00:15:15.727285
episodes_test                              568.0
episode_length_test                     17.59507
returns_test                           85.006089
return_std_test                        19.269812
average_reward_test                     4.831279
round_time_test           0 days 00:00:13.078091
round_time_total          0 days 00:15:15.728609
loss_total             76630853806355329843200.0
loss_critic            95788565705891141124096.0
loss_actor                   -535268909678.59198
memory_size                               3907.0 

=== epoch 6/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:44,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:17<00:00,  2.18it/s]
episodes                                     552
episode_length                         18.101449
returns                                87.734301
return_std                             23.072405
average_reward                          4.846811
round_time                0 days 00:15:18.438360
episodes_test                              577.0
episode_length_test                     17.32409
returns_test                           83.829168
return_std_test                        11.283883
average_reward_test                     4.838892
round_time_test           0 days 00:00:12.985574
round_time_total          0 days 00:15:18.439629
loss_total             77453486053203522879488.0
loss_critic            96816855882439623442432.0
loss_actor                   -541959976337.40802
memory_size                               3907.0 

=== epoch 6/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:47,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.18it/s]
episodes                                     561
episode_length                         17.802139
returns                                86.220075
return_std                             20.403782
average_reward                          4.843159
round_time                0 days 00:15:15.992020
episodes_test                              579.0
episode_length_test                    17.267703
returns_test                           83.599841
return_std_test                        17.240863
average_reward_test                     4.841421
round_time_test           0 days 00:00:12.850443
round_time_total          0 days 00:15:15.993126
loss_total             77422675246058234183680.0
loss_critic            96778342557592634523648.0
loss_actor                  -542222849015.807983
memory_size                              3908.99 

=== epoch 6/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:12,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                     572
episode_length                         17.445804
returns                                84.425174
return_std                             16.747126
average_reward                          4.839226
round_time                0 days 00:15:17.346490
episodes_test                              580.0
episode_length_test                    17.217241
returns_test                           83.175017
return_std_test                        13.059095
average_reward_test                     4.830871
round_time_test           0 days 00:00:13.042840
round_time_total          0 days 00:15:17.347573
loss_total             74080886495998166171648.0
loss_critic            92601106603973476876288.0
loss_actor                   -539429256593.40802
memory_size                            3916.2255 

=== epoch 6/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:49,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:17<00:00,  2.18it/s]
episodes                                     575
episode_length                         17.373913
returns                                 84.03689
return_std                             14.630862
average_reward                          4.836969
round_time                0 days 00:15:18.172228
episodes_test                              572.0
episode_length_test                     17.45979
returns_test                           84.751055
return_std_test                        13.949986
average_reward_test                     4.854135
round_time_test           0 days 00:00:13.134637
round_time_total          0 days 00:15:18.173514
loss_total             76298921066873160204288.0
loss_critic            95373649760146342019072.0
loss_actor                  -545477682855.935974
memory_size                             3923.407 

=== epoch 6/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:00,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                      573
episode_length                          17.408377
returns                                 84.211999
return_std                              15.896909
average_reward                           4.837875
round_time                 0 days 00:15:16.855050
episodes_test                               565.0
episode_length_test                     17.681416
returns_test                            85.676664
return_std_test                         15.251733
average_reward_test                      4.845548
round_time_test            0 days 00:00:13.126651
round_time_total           0 days 00:15:16.856101
loss_total              84352585220765905846272.0
loss_critic            105440729779968095027200.0
loss_actor                   -553968750903.296021
memory_size                             3940.1475 

=== epoch 6/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:02,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:18<00:00,  2.18it/s]
episodes                                      574
episode_length                          17.372822
returns                                 84.052667
return_std                              16.095002
average_reward                           4.838028
round_time                 0 days 00:15:19.437976
episodes_test                               595.0
episode_length_test                     16.796639
returns_test                            81.323283
return_std_test                         11.325578
average_reward_test                       4.84166
round_time_test            0 days 00:00:13.258534
round_time_total           0 days 00:15:19.439254
loss_total              84054606102910998675456.0
loss_critic            105068255909108128940032.0
loss_actor                   -557958165626.880005
memory_size                             3951.2625 

=== epoch 6/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:50,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:19<00:00,  2.18it/s]
episodes                                      578
episode_length                          17.254325
returns                                  83.44139
return_std                              15.170847
average_reward                           4.836247
round_time                 0 days 00:15:19.941146
episodes_test                               597.0
episode_length_test                     16.740369
returns_test                             80.93688
return_std_test                           9.27246
average_reward_test                       4.83486
round_time_test            0 days 00:00:12.913127
round_time_total           0 days 00:15:19.942506
loss_total              86198707757219338780672.0
loss_critic            107748382806138228310016.0
loss_actor                   -563708190900.223999
memory_size                                3952.0 

=== epoch 6/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:31,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:20<00:00,  2.17it/s]
episodes                                      584
episode_length                          17.104452
returns                                 82.692196
return_std                              14.412035
average_reward                           4.834476
round_time                 0 days 00:15:21.322282
episodes_test                               600.0
episode_length_test                     16.653333
returns_test                            80.360087
return_std_test                          9.337078
average_reward_test                      4.825472
round_time_test            0 days 00:00:13.129469
round_time_total           0 days 00:15:21.323518
loss_total              85384646336717684473856.0
loss_critic            106730806188700103344128.0
loss_actor                   -567400321908.735962
memory_size                                3952.0 

=== epoch 6/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:38,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:19<00:00,  2.17it/s]
episodes                                      588
episode_length                          16.965986
returns                                 82.038716
return_std                               12.72557
average_reward                           4.835325
round_time                 0 days 00:15:20.349099
episodes_test                               593.0
episode_length_test                     16.851602
returns_test                            81.481567
return_std_test                          8.394887
average_reward_test                       4.83526
round_time_test            0 days 00:00:13.226385
round_time_total           0 days 00:15:20.350170
loss_total              85778733690380452102144.0
loss_critic            107223415176146242764800.0
loss_actor                   -572031876710.400024
memory_size                             3955.9675 

=== epoch 6/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:48,  1.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:22<00:00,  2.17it/s]
episodes                                      588
episode_length                          16.988095
returns                                 82.162658
return_std                              12.734016
average_reward                           4.836447
round_time                 0 days 00:15:23.106817
episodes_test                               582.0
episode_length_test                      17.17354
returns_test                             83.17944
return_std_test                         10.296584
average_reward_test                      4.843494
round_time_test            0 days 00:00:13.182001
round_time_total           0 days 00:15:23.107895
loss_total              87162930709505381498880.0
loss_critic            108953661534213429526528.0
loss_actor                   -575635463651.328003
memory_size                             3963.1145 

=== epoch 6/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:24,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:21<00:00,  2.17it/s]
episodes                                      584
episode_length                          17.099315
returns                                 82.737896
return_std                              13.528863
average_reward                           4.838603
round_time                 0 days 00:15:22.425524
episodes_test                               573.0
episode_length_test                     17.448517
returns_test                            84.585151
return_std_test                         11.796003
average_reward_test                      4.847712
round_time_test            0 days 00:00:13.010298
round_time_total           0 days 00:15:22.426590
loss_total              91380136877681167826944.0
loss_critic            114225169150420522631168.0
loss_actor                    -581006642642.94397
memory_size                               3974.41 

=== epoch 6/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:42,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:21<00:00,  2.17it/s]
episodes                                      583
episode_length                          17.132075
returns                                  82.84907
return_std                              14.197192
average_reward                           4.836031
round_time                 0 days 00:15:22.427960
episodes_test                               590.0
episode_length_test                     16.937288
returns_test                            81.713673
return_std_test                         13.100841
average_reward_test                      4.824532
round_time_test            0 days 00:00:13.209216
round_time_total           0 days 00:15:22.429041
loss_total              92271128320895015190528.0
loss_critic            115338908288930537275392.0
loss_actor                   -578726319243.264038
memory_size                             3978.6715 

=== epoch 6/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:16,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:22<00:00,  2.17it/s]
episodes                                      579
episode_length                          17.229706
returns                                 83.339443
return_std                              14.400825
average_reward                           4.836888
round_time                 0 days 00:15:23.487776
episodes_test                               583.0
episode_length_test                     17.152659
returns_test                            82.803126
return_std_test                         13.679018
average_reward_test                      4.827422
round_time_test            0 days 00:00:13.120739
round_time_total           0 days 00:15:23.489281
loss_total              91834724608079220965376.0
loss_critic            114793403780766980112384.0
loss_actor                   -574007978409.984009
memory_size                                3982.0 

=== epoch 6/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:21<00:00,  2.17it/s]
episodes                                      576
episode_length                          17.355903
returns                                 83.956666
return_std                              17.551272
average_reward                            4.83737
round_time                 0 days 00:15:21.974294
episodes_test                               591.0
episode_length_test                     16.895093
returns_test                            81.626548
return_std_test                          8.185355
average_reward_test                      4.831313
round_time_test            0 days 00:00:13.067909
round_time_total           0 days 00:15:21.975377
loss_total              90253146250800198057984.0
loss_critic            112816430840360663515136.0
loss_actor                   -574238207459.328003
memory_size                              4002.545 

=== epoch 6/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:23,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:21<00:00,  2.17it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      574
episode_length                          17.398955
returns                                 84.135815
return_std                              18.270329
average_reward                           4.835588
round_time                 0 days 00:15:22.312335
episodes_test                               571.0
episode_length_test                      17.49387
returns_test                            84.713807
return_std_test                         12.012033
average_reward_test                       4.84248
round_time_test            0 days 00:00:12.919772
round_time_total           0 days 00:15:22.313408
loss_total              91650059484088271634432.0
loss_critic            114562572329053461676032.0
loss_actor                   -573565379919.871948
memory_size                              4040.763 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:02<17:16,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                      115
episode_length                          17.391304
returns                                 84.249845
return_std                              10.354084
average_reward                           4.844366
round_time                 0 days 00:15:14.401527
episodes_test                               559.0
episode_length_test                     17.858676
returns_test                            86.468999
return_std_test                         15.163712
average_reward_test                      4.841805
round_time_test            0 days 00:00:12.994899
round_time_total           0 days 00:15:14.402618
loss_total              92964338701088531152896.0
loss_critic            116205421434183310376960.0
loss_actor                   -581619747766.271973
memory_size                                4047.0 

=== epoch 7/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:55,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:12<00:00,  2.19it/s]
episodes                                      234
episode_length                          17.059829
returns                                 82.428826
return_std                               8.682787
average_reward                           4.831829
round_time                 0 days 00:15:13.518874
episodes_test                               587.0
episode_length_test                     17.017036
returns_test                            82.004223
return_std_test                         14.027097
average_reward_test                      4.818957
round_time_test            0 days 00:00:12.888045
round_time_total           0 days 00:15:13.519958
loss_total              92823496051430092963840.0
loss_critic            116029367796162258534400.0
loss_actor                   -581139370213.375977
memory_size                                4047.0 

=== epoch 7/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:52,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.18it/s]
episodes                                      355
episode_length                          16.884507
returns                                 81.507328
return_std                               7.904533
average_reward                           4.827404
round_time                 0 days 00:15:16.065912
episodes_test                               590.0
episode_length_test                     16.949153
returns_test                             81.67581
return_std_test                         13.631526
average_reward_test                      4.818873
round_time_test            0 days 00:00:13.215610
round_time_total           0 days 00:15:16.067217
loss_total              93062810404588747751424.0
loss_critic            116328510975175443873792.0
loss_actor                   -585316914249.728027
memory_size                                4047.0 

=== epoch 7/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:34,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      471
episode_length                          16.974522
returns                                 81.975782
return_std                              12.290041
average_reward                            4.82936
round_time                 0 days 00:15:14.032481
episodes_test                               587.0
episode_length_test                     17.027257
returns_test                            82.306633
return_std_test                         12.063056
average_reward_test                      4.833856
round_time_test            0 days 00:00:13.064304
round_time_total           0 days 00:15:14.033566
loss_total              94230074095086902181888.0
loss_critic            117787590443057064968192.0
loss_actor                   -594241427292.160034
memory_size                             4050.6205 

=== epoch 7/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.19it/s]
episodes                                      589
episode_length                          16.964346
returns                                 81.950285
return_std                              11.740191
average_reward                           4.830747
round_time                 0 days 00:15:15.733642
episodes_test                               596.0
episode_length_test                     16.756711
returns_test                            81.095466
return_std_test                         10.125836
average_reward_test                      4.839774
round_time_test            0 days 00:00:13.031411
round_time_total           0 days 00:15:15.734729
loss_total             100025534693800757166080.0
loss_critic            125031916055215470542848.0
loss_actor                   -606279327023.104004
memory_size                                4073.0 

=== epoch 7/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:42,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      593
episode_length                          16.853288
returns                                 81.339266
return_std                              11.189538
average_reward                           4.826359
round_time                 0 days 00:15:14.437823
episodes_test                               595.0
episode_length_test                     16.784874
returns_test                            80.976202
return_std_test                         10.208462
average_reward_test                      4.824394
round_time_test            0 days 00:00:13.103613
round_time_total           0 days 00:15:14.439009
loss_total             101090588983590154928128.0
loss_critic            126363233891556555489280.0
loss_actor                   -614487924899.839966
memory_size                                4073.0 

=== epoch 7/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:59,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      593
episode_length                          16.838111
returns                                 81.295271
return_std                              11.230904
average_reward                           4.828029
round_time                 0 days 00:15:14.182930
episodes_test                               597.0
episode_length_test                     16.750419
returns_test                            80.873498
return_std_test                          9.701692
average_reward_test                      4.828148
round_time_test            0 days 00:00:12.782642
round_time_total           0 days 00:15:14.184001
loss_total             101119325932646651920384.0
loss_critic            126399155239443809435648.0
loss_actor                   -619146608410.624023
memory_size                                4073.0 

=== epoch 7/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:08,  1.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                      592
episode_length                          16.871622
returns                                 81.535814
return_std                              11.462487
average_reward                           4.832656
round_time                 0 days 00:15:17.207982
episodes_test                               586.0
episode_length_test                     17.037543
returns_test                            82.207202
return_std_test                         14.017038
average_reward_test                       4.82522
round_time_test            0 days 00:00:13.103043
round_time_total           0 days 00:15:17.209057
loss_total             102843818849866077962240.0
loss_critic            128554771306592126107648.0
loss_actor                    -625705927147.52002
memory_size                                4073.0 

=== epoch 7/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:41,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:15<00:00,  2.18it/s]
episodes                                      596
episode_length                          16.758389
returns                                 80.974722
return_std                               7.918916
average_reward                           4.831793
round_time                 0 days 00:15:15.998823
episodes_test                               599.0
episode_length_test                     16.684474
returns_test                            80.565322
return_std_test                          9.757045
average_reward_test                      4.828787
round_time_test            0 days 00:00:13.265357
round_time_total           0 days 00:15:15.999915
loss_total             105534969581502860361728.0
loss_critic            131918709726767619244032.0
loss_actor                   -636089918259.199951
memory_size                                4073.0 

=== epoch 7/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:41,  1.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:17<00:00,  2.18it/s]
episodes                                      594
episode_length                          16.792929
returns                                 81.133098
return_std                               7.924883
average_reward                           4.831777
round_time                 0 days 00:15:17.623896
episodes_test                               596.0
episode_length_test                     16.748322
returns_test                              80.8277
return_std_test                          9.299735
average_reward_test                      4.826309
round_time_test            0 days 00:00:12.983475
round_time_total           0 days 00:15:17.625153
loss_total             109446059528017036705792.0
loss_critic            136807571869428149649408.0
loss_actor                   -645844322615.296021
memory_size                                4073.0 

=== epoch 7/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:02,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      594
episode_length                          16.794613
returns                                 81.203873
return_std                               8.296311
average_reward                           4.834983
round_time                 0 days 00:15:13.836152
episodes_test                               590.0
episode_length_test                     16.945763
returns_test                            81.977314
return_std_test                         10.148107
average_reward_test                      4.837647
round_time_test            0 days 00:00:13.098900
round_time_total           0 days 00:15:13.837242
loss_total             109906341008553369665536.0
loss_critic            137382923970611295289344.0
loss_actor                   -653402229407.744019
memory_size                                4073.0 

=== epoch 7/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:14<00:00,  2.19it/s]
episodes                                      592
episode_length                          16.873311
returns                                 81.623768
return_std                               9.175287
average_reward                           4.837375
round_time                 0 days 00:15:15.364982
episodes_test                               584.0
episode_length_test                     17.109589
returns_test                            82.844262
return_std_test                         13.331766
average_reward_test                      4.841961
round_time_test            0 days 00:00:12.915227
round_time_total           0 days 00:15:15.366044
loss_total             108594565617305555828736.0
loss_critic            135743204725359078539264.0
loss_actor                        -641064263680.0
memory_size                              4073.552 

=== epoch 7/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:56,  1.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      590
episode_length                          16.905085
returns                                 81.775372
return_std                               9.305216
average_reward                           4.837219
round_time                 0 days 00:15:14.159887
episodes_test                               581.0
episode_length_test                     17.203098
returns_test                            83.433491
return_std_test                         13.970447
average_reward_test                      4.849951
round_time_test            0 days 00:00:12.988059
round_time_total           0 days 00:15:14.160988
loss_total             108760749148368871096320.0
loss_critic            135950933987754684448768.0
loss_actor                   -635093214347.264038
memory_size                                4074.0 

=== epoch 7/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:22,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      586
episode_length                          17.013652
returns                                 82.312845
return_std                               9.734734
average_reward                            4.83793
round_time                 0 days 00:15:14.062732
episodes_test                               583.0
episode_length_test                     17.138937
returns_test                            82.999504
return_std_test                         11.008323
average_reward_test                       4.84273
round_time_test            0 days 00:00:12.740855
round_time_total           0 days 00:15:14.063819
loss_total             107419873977812379500544.0
loss_critic            134274840005981729980416.0
loss_actor                   -630508839895.040039
memory_size                                4074.0 

=== epoch 7/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:35,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:16<00:00,  2.18it/s]
episodes                                      583
episode_length                          17.113208
returns                                 82.802697
return_std                              12.200706
average_reward                           4.838869
round_time                 0 days 00:15:16.533070
episodes_test                               573.0
episode_length_test                      17.43281
returns_test                             84.74453
return_std_test                         10.707545
average_reward_test                      4.861198
round_time_test            0 days 00:00:12.998074
round_time_total           0 days 00:15:16.534159
loss_total             106901596859398019350528.0
loss_critic            133626993817381154848768.0
loss_actor                   -640039149846.527954
memory_size                              4086.982 

=== epoch 7/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:45,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                      580
episode_length                          17.227586
returns                                 83.403339
return_std                              12.309658
average_reward                           4.841288
round_time                 0 days 00:15:14.041800
episodes_test                               581.0
episode_length_test                     17.211704
returns_test                            83.469667
return_std_test                          11.69503
average_reward_test                      4.849588
round_time_test            0 days 00:00:12.996099
round_time_total           0 days 00:15:14.042878
loss_total             110011882850058200154112.0
loss_critic            137514850990454413983744.0
loss_actor                    -651377975787.52002
memory_size                                4098.0 

=== epoch 7/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:26,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:10<00:00,  2.20it/s]
episodes                                      579
episode_length                          17.240069
returns                                 83.495802
return_std                              11.975164
average_reward                           4.843308
round_time                 0 days 00:15:11.242138
episodes_test                               583.0
episode_length_test                     17.142367
returns_test                            82.894181
return_std_test                         10.034066
average_reward_test                      4.835645
round_time_test            0 days 00:00:12.915398
round_time_total           0 days 00:15:11.243231
loss_total             110947371609490716622848.0
loss_critic            138684212016306249007104.0
loss_actor                   -662011412414.463989
memory_size                                4098.0 

=== epoch 7/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:48,  2.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:59<00:00,  2.22it/s]
episodes                                      579
episode_length                          17.236615
returns                                 83.473035
return_std                              11.879177
average_reward                           4.842738
round_time                 0 days 00:14:59.857946
episodes_test                               574.0
episode_length_test                     17.414634
returns_test                            84.407172
return_std_test                         17.655076
average_reward_test                      4.846936
round_time_test            0 days 00:00:13.043722
round_time_total           0 days 00:14:59.859031
loss_total             111418336947897869795328.0
loss_critic            139272918591924844101632.0
loss_actor                   -662896865935.359985
memory_size                                4098.0 

=== epoch 7/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:20,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:02<00:00,  2.22it/s]
episodes                                      582
episode_length                           17.16323
returns                                 83.041997
return_std                                11.9733
average_reward                           4.838349
round_time                 0 days 00:15:03.147902
episodes_test                               587.0
episode_length_test                     17.010221
returns_test                            82.370644
return_std_test                         13.278243
average_reward_test                      4.842345
round_time_test            0 days 00:00:13.083197
round_time_total           0 days 00:15:03.149021
loss_total             116948939992338521915392.0
loss_critic            146186172444763454701568.0
loss_actor                   -679035659583.488037
memory_size                               4098.36 

=== epoch 7/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:50,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:00<00:00,  2.22it/s]
episodes                                      585
episode_length                          17.078632
returns                                 82.662484
return_std                                9.18966
average_reward                           4.840179
round_time                 0 days 00:15:00.673644
episodes_test                               574.0
episode_length_test                     17.400697
returns_test                            84.340541
return_std_test                         11.075867
average_reward_test                      4.846988
round_time_test            0 days 00:00:12.786528
round_time_total           0 days 00:15:00.675004
loss_total             120249467036206282833920.0
loss_critic            150311831281686309502976.0
loss_actor                   -686965639249.920044
memory_size                                4107.0 

=== epoch 7/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:51<00:00,  2.24it/s]
episodes                                      584
episode_length                          17.090753
returns                                 82.709509
return_std                              10.132128
average_reward                           4.839316
round_time                 0 days 00:14:52.406154
episodes_test                               568.0
episode_length_test                     17.596831
returns_test                            85.546785
return_std_test                          14.98186
average_reward_test                      4.861524
round_time_test            0 days 00:00:12.868373
round_time_total           0 days 00:14:52.407232
loss_total             122613895285554009866240.0
loss_critic            153267366498232434688000.0
loss_actor                    -698618798571.52002
memory_size                              4111.959 

=== epoch 7/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:16,  2.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:49<00:00,  2.25it/s]
episodes                                      578
episode_length                          17.247405
returns                                 83.531669
return_std                              11.397826
average_reward                           4.843176
round_time                 0 days 00:14:50.064315
episodes_test                               547.0
episode_length_test                     18.272395
returns_test                            88.905812
return_std_test                         14.199718
average_reward_test                      4.865603
round_time_test            0 days 00:00:12.809512
round_time_total           0 days 00:14:50.065408
loss_total             119476824028023651041280.0
loss_critic            149346027444333875036160.0
loss_actor                   -699727539535.871948
memory_size                             4116.0695 

=== epoch 7/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:03,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:57<00:00,  2.23it/s]
episodes                                      572
episode_length                          17.465035
returns                                 84.656908
return_std                              13.889122
average_reward                           4.847226
round_time                 0 days 00:14:57.711386
episodes_test                               543.0
episode_length_test                     18.406998
returns_test                            89.908127
return_std_test                         21.437338
average_reward_test                      4.884469
round_time_test            0 days 00:00:12.870466
round_time_total           0 days 00:14:57.712492
loss_total             123049039836949988245504.0
loss_critic            153811297164959397445632.0
loss_actor                   -703171235250.176025
memory_size                             4121.1005 

=== epoch 7/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:28,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:51<00:00,  2.24it/s]
episodes                                      568
episode_length                          17.591549
returns                                 85.400776
return_std                              14.384219
average_reward                            4.85487
round_time                 0 days 00:14:51.990353
episodes_test                               561.0
episode_length_test                     17.814617
returns_test                            86.599549
return_std_test                            15.618
average_reward_test                      4.861155
round_time_test            0 days 00:00:12.832325
round_time_total           0 days 00:14:51.991414
loss_total             125545852448934784401408.0
loss_critic            156932312864638195728384.0
loss_actor                   -709132249464.832031
memory_size                              4139.286 

=== epoch 7/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:06,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:55<00:00,  2.23it/s]
episodes                                      564
episode_length                          17.687943
returns                                 85.887767
return_std                              15.328206
average_reward                           4.855556
round_time                 0 days 00:14:56.044672
episodes_test                               569.0
episode_length_test                     17.564148
returns_test                             85.16912
return_std_test                           13.9309
average_reward_test                      4.849052
round_time_test            0 days 00:00:13.134729
round_time_total           0 days 00:14:56.045917
loss_total             126945379069295241199616.0
loss_critic            158681721124889112346624.0
loss_actor                   -715544790269.952026
memory_size                             4144.4665 

=== epoch 7/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:16,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:52<00:00,  2.24it/s]
episodes                                      564
episode_length                          17.698582
returns                                 85.921946
return_std                               14.83254
average_reward                           4.854672
round_time                 0 days 00:14:52.549478
episodes_test                               572.0
episode_length_test                     17.473776
returns_test                            84.865342
return_std_test                         15.358146
average_reward_test                      4.856762
round_time_test            0 days 00:00:13.035568
round_time_total           0 days 00:14:52.550545
loss_total             129408916076576678871040.0
loss_critic            161761142184143686205440.0
loss_actor                    -725271237656.57605
memory_size                                4150.0 

=== epoch 7/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:06,  1.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:57<00:00,  2.23it/s]
episodes                                      567
episode_length                          17.613757
returns                                  85.39855
return_std                              14.337611
average_reward                           4.848636
round_time                 0 days 00:14:58.237889
episodes_test                               568.0
episode_length_test                     17.600352
returns_test                            85.390129
return_std_test                         15.086265
average_reward_test                      4.851622
round_time_test            0 days 00:00:12.947384
round_time_total           0 days 00:14:58.239068
loss_total             130214894058164761657344.0
loss_critic            162768614793422036795392.0
loss_actor                   -727622994886.656006
memory_size                                4150.0 

=== epoch 7/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:36,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:52<00:00,  1.97it/s]
episodes                                      572
episode_length                          17.444056
returns                                 84.450936
return_std                               12.16106
average_reward                            4.84136
round_time                 0 days 00:16:53.404293
episodes_test                               553.0
episode_length_test                     18.068716
returns_test                             87.59109
return_std_test                          16.58826
average_reward_test                      4.847671
round_time_test            0 days 00:00:13.110425
round_time_total           0 days 00:16:53.405563
loss_total             129764781394838647996416.0
loss_critic            162205974007611528314880.0
loss_actor                   -733869127106.560059
memory_size                                4150.0 

=== epoch 7/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<20:53,  1.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:58<00:00,  2.23it/s]
episodes                                      574
episode_length                          17.405923
returns                                 84.210386
return_std                              11.635464
average_reward                           4.838002
round_time                 0 days 00:14:58.826660
episodes_test                               584.0
episode_length_test                     17.109589
returns_test                            82.575462
return_std_test                          9.787297
average_reward_test                      4.826273
round_time_test            0 days 00:00:13.270463
round_time_total           0 days 00:14:58.827726
loss_total             132530710471149053018112.0
loss_critic            165663385369888025477120.0
loss_actor                   -740904338489.343994
memory_size                                4150.0 

=== epoch 7/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:02,  1.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                      576
episode_length                          17.345486
returns                                 83.931639
return_std                              13.614407
average_reward                           4.838829
round_time                 0 days 00:16:51.944960
episodes_test                               579.0
episode_length_test                     17.250432
returns_test                            83.167665
return_std_test                          13.61672
average_reward_test                      4.821174
round_time_test            0 days 00:00:13.298651
round_time_total           0 days 00:16:51.946528
loss_total             135078086444057616187392.0
loss_critic            168847605199226890354688.0
loss_actor                   -746076862644.223999
memory_size                             4159.2155 

=== epoch 7/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:22,  1.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [19:49<00:00,  1.68it/s]
episodes                                      576
episode_length                           17.34375
returns                                 83.865201
return_std                              13.773912
average_reward                           4.835818
round_time                 0 days 00:19:50.074866
episodes_test                               572.0
episode_length_test                     17.480769
returns_test                             84.60493
return_std_test                         15.078467
average_reward_test                      4.839892
round_time_test            0 days 00:00:15.295898
round_time_total           0 days 00:19:50.075927
loss_total             137329257611529868541952.0
loss_critic            171661569142241678589952.0
loss_actor                   -755651186393.088013
memory_size                                4172.0 

=== epoch 7/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:20,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:56<00:00,  2.23it/s]
episodes                                      577
episode_length                          17.292894
returns                                 83.628918
return_std                              13.963225
average_reward                           4.835415
round_time                 0 days 00:14:56.729411
episodes_test                               578.0
episode_length_test                     17.276817
returns_test                            83.573325
return_std_test                         12.146258
average_reward_test                      4.837273
round_time_test            0 days 00:00:12.901667
round_time_total           0 days 00:14:56.730581
loss_total             141258695698055702773760.0
loss_critic            176573366676652524830720.0
loss_actor                   -762803156418.560059
memory_size                              4180.685 

=== epoch 7/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:51<00:00,  2.24it/s]
episodes                                      576
episode_length                          17.329861
returns                                 83.858711
return_std                              14.073696
average_reward                           4.838884
round_time                 0 days 00:14:52.111676
episodes_test                               574.0
episode_length_test                     17.397213
returns_test                            84.142234
return_std_test                         10.771155
average_reward_test                      4.836471
round_time_test            0 days 00:00:13.007642
round_time_total           0 days 00:14:52.112772
loss_total             144333062707470248443904.0
loss_critic            180416325352852316749824.0
loss_actor                   -772064350044.160034
memory_size                                4181.0 

=== epoch 7/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:51,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:56<00:00,  2.23it/s]
episodes                                      576
episode_length                          17.331597
returns                                 83.927258
return_std                               14.76514
average_reward                           4.842547
round_time                 0 days 00:14:57.323926
episodes_test                               590.0
episode_length_test                     16.945763
returns_test                            81.851009
return_std_test                          8.571089
average_reward_test                      4.830192
round_time_test            0 days 00:00:13.143741
round_time_total           0 days 00:14:57.325117
loss_total             155349636014107871150080.0
loss_critic            194187041727755312955392.0
loss_actor                   -786230812770.303955
memory_size                              4187.975 

=== epoch 7/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:23,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                      577
episode_length                          17.294627
returns                                 83.745394
return_std                              12.113948
average_reward                           4.842153
round_time                 0 days 00:14:54.060905
episodes_test                               573.0
episode_length_test                     17.439791
returns_test                            84.475004
return_std_test                         13.644402
average_reward_test                      4.843823
round_time_test            0 days 00:00:12.804628
round_time_total           0 days 00:14:54.061980
loss_total             161530335349105722654720.0
loss_critic            201912915732121243353088.0
loss_actor                   -800652760317.952026
memory_size                                4190.0 

=== epoch 7/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:17,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:55<00:00,  2.23it/s]
episodes                                      576
episode_length                          17.324653
returns                                 83.959701
return_std                              12.574295
average_reward                           4.846192
round_time                 0 days 00:14:55.663435
episodes_test                               578.0
episode_length_test                     17.287197
returns_test                            83.780943
return_std_test                         13.032232
average_reward_test                      4.846422
round_time_test            0 days 00:00:12.801178
round_time_total           0 days 00:14:55.664614
loss_total             165160388302980205510656.0
loss_critic            206450481805118941429760.0
loss_actor                   -812365956677.631958
memory_size                             4190.0735 

=== epoch 7/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:24,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [23:34<00:00,  1.41it/s]
episodes                                      577
episode_length                          17.298094
returns                                 83.862033
return_std                              11.798512
average_reward                           4.848173
round_time                 0 days 00:23:34.511340
episodes_test                               553.0
episode_length_test                     18.075949
returns_test                            87.823476
return_std_test                         20.092514
average_reward_test                      4.858612
round_time_test            0 days 00:00:12.830684
round_time_total           0 days 00:23:34.512595
loss_total             168349451035394153906176.0
loss_critic            210436810170533863751680.0
loss_actor                   -823144849047.552002
memory_size                                4196.0 

=== epoch 7/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:12,  1.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:20<00:00,  1.32it/s]
episodes                                      572
episode_length                          17.456294
returns                                 84.672097
return_std                              12.680727
average_reward                           4.850529
round_time                 0 days 00:25:20.674825
episodes_test                               564.0
episode_length_test                     17.725177
returns_test                             86.00167
return_std_test                         15.733774
average_reward_test                       4.85197
round_time_test            0 days 00:00:14.197410
round_time_total           0 days 00:25:20.676417
loss_total             176672097469987505045504.0
loss_critic            220840118018431900450816.0
loss_actor                   -831251587301.375977
memory_size                                4196.0 

=== epoch 7/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<26:56,  1.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:20<00:00,  1.32it/s]
episodes                                      570
episode_length                           17.52807
returns                                 85.020748
return_std                              12.020476
average_reward                           4.850458
round_time                 0 days 00:25:20.807604
episodes_test                               551.0
episode_length_test                     18.141561
returns_test                            88.044166
return_std_test                         17.734838
average_reward_test                      4.853204
round_time_test            0 days 00:00:14.250102
round_time_total           0 days 00:25:20.808923
loss_total             177273723524108938903552.0
loss_critic            221592150917661207822336.0
loss_actor                   -839204667621.375977
memory_size                                4196.0 

=== epoch 7/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:05,  1.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:20<00:00,  1.32it/s]
episodes                                      565
episode_length                          17.681416
returns                                 85.764417
return_std                              13.367797
average_reward                           4.850553
round_time                 0 days 00:25:21.203012
episodes_test                               541.0
episode_length_test                     18.460259
returns_test                            89.468896
return_std_test                         16.934032
average_reward_test                      4.846596
round_time_test            0 days 00:00:14.464804
round_time_total           0 days 00:25:21.204549
loss_total             181920459259248111517696.0
loss_critic            227400570070360051941376.0
loss_actor                    -850560541720.57605
memory_size                             4198.5425 

=== epoch 7/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:36,  1.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:19<00:00,  1.37it/s]
episodes                                      563
episode_length                          17.737123
returns                                 86.049996
return_std                              12.884679
average_reward                           4.851587
round_time                 0 days 00:24:20.255173
episodes_test                               559.0
episode_length_test                      17.86941
returns_test                            86.614695
return_std_test                         15.160799
average_reward_test                      4.847062
round_time_test            0 days 00:00:13.882262
round_time_total           0 days 00:24:20.256416
loss_total             186372371884343207919616.0
loss_critic            232965460703110143934464.0
loss_actor                   -860171355979.776001
memory_size                                4202.0 

=== epoch 7/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:55,  1.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:23<00:00,  1.37it/s]
episodes                                      557
episode_length                          17.924596
returns                                 86.987731
return_std                              14.491404
average_reward                           4.852898
round_time                 0 days 00:24:23.738737
episodes_test                               550.0
episode_length_test                     18.181818
returns_test                            88.261523
return_std_test                         15.243182
average_reward_test                      4.854384
round_time_test            0 days 00:00:13.819422
round_time_total           0 days 00:24:23.740136
loss_total             189601443904544181321728.0
loss_critic            237001800700213889859584.0
loss_actor                        -877016887296.0
memory_size                             4207.3325 

=== epoch 7/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<24:11,  1.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [24:21<00:00,  1.37it/s]
episodes                                      559
episode_length                          17.862254
returns                                 86.674492
return_std                              14.402001
average_reward                           4.852813
round_time                 0 days 00:24:21.925190
episodes_test                               534.0
episode_length_test                     18.709738
returns_test                            91.253929
return_std_test                         18.489562
average_reward_test                      4.877321
round_time_test            0 days 00:00:13.639913
round_time_total           0 days 00:24:21.926252
loss_total             191187720374170274496512.0
loss_critic            238984646048555712118784.0
loss_actor                   -883757808254.975952
memory_size                                4211.0 

=== epoch 7/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:17,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:42<00:00,  2.27it/s]
episodes                                      555
episode_length                          17.969369
returns                                 87.180697
return_std                              14.579728
average_reward                           4.851898
round_time                 0 days 00:14:43.133652
episodes_test                               569.0
episode_length_test                     17.564148
returns_test                             85.07139
return_std_test                         12.198781
average_reward_test                      4.843488
round_time_test            0 days 00:00:12.248982
round_time_total           0 days 00:14:43.134840
loss_total             193890312649279195316224.0
loss_critic            242362886606362851147776.0
loss_actor                   -890026651254.784058
memory_size                                4211.0 

=== epoch 7/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      560
episode_length                          17.830357
returns                                  86.46214
return_std                              13.510425
average_reward                           4.849419
round_time                 0 days 00:14:11.100821
episodes_test                               568.0
episode_length_test                     17.591549
returns_test                            85.342681
return_std_test                         13.000286
average_reward_test                      4.851375
round_time_test            0 days 00:00:12.430380
round_time_total           0 days 00:14:11.101915
loss_total             197881024557362622496768.0
loss_critic            247351276012959652904960.0
loss_actor                   -901697800503.296021
memory_size                                4211.0 

=== epoch 7/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:20,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      564
episode_length                          17.698582
returns                                 85.774226
return_std                              13.655215
average_reward                           4.846302
round_time                 0 days 00:14:10.668369
episodes_test                               571.0
episode_length_test                     17.507881
returns_test                            84.710634
return_std_test                         14.738554
average_reward_test                      4.838453
round_time_test            0 days 00:00:12.477067
round_time_total           0 days 00:14:10.669522
loss_total             201070169284563192250368.0
loss_critic            251337707173036049825792.0
loss_actor                   -902702327529.472046
memory_size                                4211.0 

=== epoch 7/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:24,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      566
episode_length                           17.59894
returns                                 85.262214
return_std                              12.167038
average_reward                           4.844986
round_time                 0 days 00:14:10.423600
episodes_test                               575.0
episode_length_test                     17.368696
returns_test                            84.160847
return_std_test                         12.002104
average_reward_test                      4.845517
round_time_test            0 days 00:00:12.357105
round_time_total           0 days 00:14:10.424778
loss_total             205430688070087867367424.0
loss_critic            256788355480427406819328.0
loss_actor                   -914136259166.208008
memory_size                                4211.0 

=== epoch 7/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:37,  2.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      562
episode_length                          17.740214
returns                                 85.951043
return_std                              12.758406
average_reward                           4.844915
round_time                 0 days 00:14:10.316889
episodes_test                               563.0
episode_length_test                     17.753108
returns_test                             86.06902
return_std_test                         13.174677
average_reward_test                      4.848146
round_time_test            0 days 00:00:12.376387
round_time_total           0 days 00:14:10.318027
loss_total             205110258483320703680512.0
loss_critic            256387818403518780276736.0
loss_actor                   -925858378907.647949
memory_size                                4211.0 

=== epoch 7/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:00<15:06,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      557
episode_length                          17.913824
returns                                 86.716041
return_std                              15.463824
average_reward                           4.841063
round_time                 0 days 00:14:10.954254
episodes_test                               533.0
episode_length_test                     18.756098
returns_test                            91.020523
return_std_test                         17.216389
average_reward_test                      4.852862
round_time_test            0 days 00:00:12.272346
round_time_total           0 days 00:14:10.955329
loss_total             208597984524948566179840.0
loss_critic            260747475884621904740352.0
loss_actor                   -931997418618.880005
memory_size                              4231.146 

=== epoch 7/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:29,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      537
episode_length                          18.571695
returns                                 89.908713
return_std                              19.518261
average_reward                           4.841089
round_time                 0 days 00:14:11.211033
episodes_test                               505.0
episode_length_test                     19.782178
returns_test                            96.000163
return_std_test                         26.098912
average_reward_test                      4.852834
round_time_test            0 days 00:00:12.554213
round_time_total           0 days 00:14:11.212080
loss_total             211611977309082873757696.0
loss_critic            264514967070829487063040.0
loss_actor                   -932136517894.144043
memory_size                              4245.745 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<14:56,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:24<00:00,  2.31it/s]
episodes                                      104
episode_length                          19.230769
returns                                 93.705218
return_std                              19.594353
average_reward                           4.872671
round_time                 0 days 00:14:24.453908
episodes_test                               513.0
episode_length_test                     19.473684
returns_test                            94.509099
return_std_test                         21.319152
average_reward_test                      4.853159
round_time_test            0 days 00:00:12.492802
round_time_total           0 days 00:14:24.454984
loss_total             214500377219616821215232.0
loss_critic            268125466791237788893184.0
loss_actor                   -939180548194.303955
memory_size                             4259.4175 

=== epoch 8/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:27<00:00,  2.30it/s]
episodes                                      205
episode_length                          19.439024
returns                                 94.756323
return_std                              22.118392
average_reward                           4.874492
round_time                 0 days 00:14:28.469672
episodes_test                               502.0
episode_length_test                     19.906375
returns_test                            96.731801
return_std_test                         23.806228
average_reward_test                      4.859335
round_time_test            0 days 00:00:12.251544
round_time_total           0 days 00:14:28.470738
loss_total             215831824998317921665024.0
loss_critic            269789776622700581617664.0
loss_actor                   -930373920456.703979
memory_size                               4270.04 

=== epoch 8/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:45,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:17<00:00,  2.33it/s]
episodes                                      302
episode_length                          19.864238
returns                                 97.075772
return_std                              27.604364
average_reward                           4.886965
round_time                 0 days 00:14:17.692437
episodes_test                               492.0
episode_length_test                     20.313008
returns_test                            99.279726
return_std_test                          29.82171
average_reward_test                      4.887497
round_time_test            0 days 00:00:12.395942
round_time_total           0 days 00:14:17.693712
loss_total             216413852064652780896256.0
loss_critic            270517310201165797916672.0
loss_actor                   -937721598246.911987
memory_size                             4293.0665 

=== epoch 8/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                      391
episode_length                          20.445013
returns                                 99.960092
return_std                              29.308604
average_reward                           4.889202
round_time                 0 days 00:14:13.355416
episodes_test                               487.0
episode_length_test                     20.525667
returns_test                           100.187337
return_std_test                         25.296598
average_reward_test                      4.881079
round_time_test            0 days 00:00:12.371160
round_time_total           0 days 00:14:13.356493
loss_total             217182158126660477517824.0
loss_critic            271477692981337359646720.0
loss_actor                   -943524784472.063965
memory_size                             4345.3485 

=== epoch 8/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:31,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      494
episode_length                          20.200405
returns                                 98.720878
return_std                              27.703682
average_reward                           4.887504
round_time                 0 days 00:14:09.793866
episodes_test                               488.0
episode_length_test                     20.444672
returns_test                            99.579224
return_std_test                         24.326685
average_reward_test                       4.87066
round_time_test            0 days 00:00:12.256868
round_time_total           0 days 00:14:09.794927
loss_total             212181225186295101259776.0
loss_critic            265226526714682796933120.0
loss_actor                   -947055961341.952026
memory_size                             4360.8185 

=== epoch 8/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      496
episode_length                          20.143145
returns                                 98.594105
return_std                              27.508894
average_reward                           4.894604
round_time                 0 days 00:14:08.513218
episodes_test                               497.0
episode_length_test                     20.118712
returns_test                            98.728438
return_std_test                         24.192591
average_reward_test                      4.907294
round_time_test            0 days 00:00:12.269682
round_time_total           0 days 00:14:08.514386
loss_total             218740461450971452538880.0
loss_critic            273425572368661518221312.0
loss_actor                   -957575101382.656006
memory_size                                4362.0 

=== epoch 8/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:36,  1.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      497
episode_length                          20.072435
returns                                 98.243299
return_std                              26.477414
average_reward                           4.894385
round_time                 0 days 00:14:08.184793
episodes_test                               530.0
episode_length_test                     18.866038
returns_test                            91.912769
return_std_test                         18.460609
average_reward_test                      4.871869
round_time_test            0 days 00:00:12.333007
round_time_total           0 days 00:14:08.185857
loss_total             228406854744898503966720.0
loss_critic            285508563585249941913600.0
loss_actor                   -979714699984.895996
memory_size                                4362.0 

=== epoch 8/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      508
episode_length                          19.627953
returns                                 95.914576
return_std                              22.168537
average_reward                           4.886598
round_time                 0 days 00:14:05.446952
episodes_test                               514.0
episode_length_test                     19.433852
returns_test                            95.396436
return_std_test                         20.453888
average_reward_test                      4.908705
round_time_test            0 days 00:00:12.499274
round_time_total           0 days 00:14:05.448035
loss_total             235672603500850951749632.0
loss_critic            294590749259974521651200.0
loss_actor                   -995581721903.104004
memory_size                                4362.0 

=== epoch 8/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:13,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      528
episode_length                          18.890152
returns                                 92.232575
return_std                              17.133165
average_reward                           4.883035
round_time                 0 days 00:14:06.126951
episodes_test                               546.0
episode_length_test                     18.304029
returns_test                            89.151471
return_std_test                         14.367722
average_reward_test                      4.870601
round_time_test            0 days 00:00:12.614189
round_time_total           0 days 00:14:06.128235
loss_total             241586702361498205814784.0
loss_critic            301983372496887726735360.0
loss_actor                  -1005493658451.968018
memory_size                                4362.0 

=== epoch 8/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                      533
episode_length                          18.733583
returns                                 91.368148
return_std                              17.609003
average_reward                           4.877717
round_time                 0 days 00:14:03.259559
episodes_test                               552.0
episode_length_test                     18.112319
returns_test                             87.87046
return_std_test                         16.690052
average_reward_test                      4.851439
round_time_test            0 days 00:00:12.262838
round_time_total           0 days 00:14:03.260662
loss_total             243289342995922302271488.0
loss_critic            304111673759418061684736.0
loss_actor                  -1011043214622.719971
memory_size                             4378.9005 

=== epoch 8/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:05,  2.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      540
episode_length                          18.455556
returns                                 89.786645
return_std                              16.632876
average_reward                           4.865596
round_time                 0 days 00:14:06.220590
episodes_test                               548.0
episode_length_test                     18.228102
returns_test                            88.400924
return_std_test                         16.326194
average_reward_test                      4.849694
round_time_test            0 days 00:00:12.292632
round_time_total           0 days 00:14:06.221644
loss_total             242314379951568697950208.0
loss_critic            302892969499112503246848.0
loss_actor                  -1019401887514.624023
memory_size                                4381.0 

=== epoch 8/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:04,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      553
episode_length                            18.0434
returns                                 87.597618
return_std                               15.13346
average_reward                           4.854668
round_time                 0 days 00:14:06.828207
episodes_test                               571.0
episode_length_test                     17.502627
returns_test                            84.644199
return_std_test                         11.733285
average_reward_test                      4.836131
round_time_test            0 days 00:00:12.220841
round_time_total           0 days 00:14:06.829259
loss_total             242096344084332261933056.0
loss_critic            302620425164966531694592.0
loss_actor                  -1015058539118.592041
memory_size                                4381.0 

=== epoch 8/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:17,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      556
episode_length                          17.953237
returns                                 87.046232
return_std                              14.529357
average_reward                           4.848385
round_time                 0 days 00:14:06.464447
episodes_test                               546.0
episode_length_test                     18.296703
returns_test                            88.619472
return_std_test                         16.626052
average_reward_test                      4.843424
round_time_test            0 days 00:00:12.366255
round_time_total           0 days 00:14:06.465732
loss_total             246512086927222060875776.0
loss_critic            308140103359416703647744.0
loss_actor                  -1022368856375.296021
memory_size                                4381.0 

=== epoch 8/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      561
episode_length                          17.800357
returns                                 86.162909
return_std                              13.853333
average_reward                           4.840454
round_time                 0 days 00:14:05.962438
episodes_test                               566.0
episode_length_test                      17.64311
returns_test                            85.367982
return_std_test                          12.69895
average_reward_test                      4.838563
round_time_test            0 days 00:00:12.375352
round_time_total           0 days 00:14:05.963533
loss_total             250973034917083970797568.0
loss_critic            313716288578679522459648.0
loss_actor                   -1030881745895.42395
memory_size                                4381.0 

=== epoch 8/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:59,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      566
episode_length                          17.644876
returns                                 85.359289
return_std                              12.010043
average_reward                           4.837559
round_time                 0 days 00:14:05.346402
episodes_test                               558.0
episode_length_test                     17.919355
returns_test                            86.638167
return_std_test                         13.604993
average_reward_test                      4.834903
round_time_test            0 days 00:00:12.148639
round_time_total           0 days 00:14:05.347454
loss_total             253836320058439492435968.0
loss_critic            317295394602301754703872.0
loss_actor                  -1041552817029.119995
memory_size                              4392.457 

=== epoch 8/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:59,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      565
episode_length                          17.688496
returns                                 85.565327
return_std                               12.33224
average_reward                           4.837416
round_time                 0 days 00:14:06.532588
episodes_test                               545.0
episode_length_test                     18.343119
returns_test                            88.912239
return_std_test                         15.758493
average_reward_test                      4.847198
round_time_test            0 days 00:00:12.371743
round_time_total           0 days 00:14:06.533654
loss_total             255583968001207679582208.0
loss_critic            319479954930456393154560.0
loss_actor                  -1047535485124.608032
memory_size                                4393.0 

=== epoch 8/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:52,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      557
episode_length                          17.935368
returns                                 86.871684
return_std                              13.446396
average_reward                           4.843725
round_time                 0 days 00:14:05.813330
episodes_test                               530.0
episode_length_test                     18.837736
returns_test                            91.608591
return_std_test                         18.500171
average_reward_test                      4.863235
round_time_test            0 days 00:00:12.342309
round_time_total           0 days 00:14:05.814401
loss_total             260272519381915585740800.0
loss_critic            325340643590013674586112.0
loss_actor                   -1056920408555.52002
memory_size                             4396.7815 

=== epoch 8/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:48,  2.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      551
episode_length                          18.130672
returns                                 87.974901
return_std                              15.692168
average_reward                           4.852172
round_time                 0 days 00:14:05.915431
episodes_test                               532.0
episode_length_test                     18.772556
returns_test                            91.344266
return_std_test                         18.609122
average_reward_test                      4.865842
round_time_test            0 days 00:00:12.333422
round_time_total           0 days 00:14:05.916489
loss_total             264395818866310262030336.0
loss_critic            330494767881330712641536.0
loss_actor                   -1064816296689.66394
memory_size                              4410.461 

=== epoch 8/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:52,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      548
episode_length                          18.222628
returns                                 88.483981
return_std                              16.015572
average_reward                           4.855546
round_time                 0 days 00:14:06.051585
episodes_test                               527.0
episode_length_test                     18.969639
returns_test                            92.175499
return_std_test                          21.54991
average_reward_test                      4.859125
round_time_test            0 days 00:00:12.482400
round_time_total           0 days 00:14:06.052644
loss_total             265443575847744444563456.0
loss_critic            331804464525832330674176.0
loss_actor                  -1069674054746.112061
memory_size                                4416.0 

=== epoch 8/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:25,  2.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      541
episode_length                          18.434381
returns                                 89.609652
return_std                              16.231954
average_reward                           4.860958
round_time                 0 days 00:14:08.425453
episodes_test                               539.0
episode_length_test                     18.528757
returns_test                            90.069131
return_std_test                         16.040706
average_reward_test                      4.860999
round_time_test            0 days 00:00:12.408118
round_time_total           0 days 00:14:08.426514
loss_total             270189027039210056450048.0
loss_critic            337736277899297047969792.0
loss_actor                  -1081647343108.095947
memory_size                                4416.0 

=== epoch 8/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:56,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      531
episode_length                          18.792844
returns                                 91.537469
return_std                              17.796409
average_reward                           4.870628
round_time                 0 days 00:14:08.584339
episodes_test                               516.0
episode_length_test                     19.372093
returns_test                            94.518663
return_std_test                         21.455927
average_reward_test                      4.879126
round_time_test            0 days 00:00:12.264528
round_time_total           0 days 00:14:08.585503
loss_total             275847202109143367286784.0
loss_critic            344808997011433275260928.0
loss_actor                  -1084116699152.384033
memory_size                             4421.2605 

=== epoch 8/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:28,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      520
episode_length                          19.169231
returns                                 93.531623
return_std                              20.619739
average_reward                           4.879491
round_time                 0 days 00:14:07.302657
episodes_test                               491.0
episode_length_test                     20.342159
returns_test                            99.667788
return_std_test                         26.361782
average_reward_test                      4.899454
round_time_test            0 days 00:00:12.571174
round_time_total           0 days 00:14:07.303716
loss_total             276281087296457521758208.0
loss_critic            345351353386363690942464.0
loss_actor                  -1095216685383.680054
memory_size                              4430.678 

=== epoch 8/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      521
episode_length                          19.122841
returns                                 93.332683
return_std                              19.436824
average_reward                           4.881108
round_time                 0 days 00:14:06.598136
episodes_test                               513.0
episode_length_test                     19.489279
returns_test                             95.43879
return_std_test                         22.171483
average_reward_test                      4.896988
round_time_test            0 days 00:00:12.450217
round_time_total           0 days 00:14:06.599186
loss_total             280551055509916772990976.0
loss_critic            350688813818695024377856.0
loss_actor                  -1101433438994.431885
memory_size                                4454.0 

=== epoch 8/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:45,  2.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      520
episode_length                          19.221154
returns                                 93.904556
return_std                              20.112685
average_reward                           4.885504
round_time                 0 days 00:14:06.468222
episodes_test                               538.0
episode_length_test                     18.583643
returns_test                            90.682413
return_std_test                         16.533325
average_reward_test                      4.879693
round_time_test            0 days 00:00:12.290562
round_time_total           0 days 00:14:06.469397
loss_total             285368078627236514627584.0
loss_critic            356710092293132236357632.0
loss_actor                  -1104626385256.447998
memory_size                             4458.0065 

=== epoch 8/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:56,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                      522
episode_length                          19.145594
returns                                 93.606833
return_std                              19.926662
average_reward                           4.889139
round_time                 0 days 00:14:15.036600
episodes_test                               538.0
episode_length_test                     18.579926
returns_test                            90.830453
return_std_test                          16.40722
average_reward_test                      4.888646
round_time_test            0 days 00:00:12.456043
round_time_total           0 days 00:14:15.037662
loss_total             291738263804568605294592.0
loss_critic            364672823387620930224128.0
loss_actor                  -1118040201986.048096
memory_size                                4460.0 

=== epoch 8/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:00,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      525
episode_length                          18.986667
returns                                 92.687844
return_std                              20.459016
average_reward                            4.88205
round_time                 0 days 00:14:09.412275
episodes_test                               528.0
episode_length_test                     18.926136
returns_test                            92.381297
return_std_test                         19.361278
average_reward_test                      4.881153
round_time_test            0 days 00:00:12.456628
round_time_total           0 days 00:14:09.413341
loss_total             294091332802468405837824.0
loss_critic            367614159666520866684928.0
loss_actor                  -1125810490015.743896
memory_size                              4465.728 

=== epoch 8/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:08,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:15<00:00,  2.34it/s]
episodes                                      534
episode_length                          18.696629
returns                                 91.183169
return_std                              18.674446
average_reward                           4.876974
round_time                 0 days 00:14:15.671971
episodes_test                               526.0
episode_length_test                     19.003802
returns_test                            92.902018
return_std_test                         19.298298
average_reward_test                      4.888593
round_time_test            0 days 00:00:12.385074
round_time_total           0 days 00:14:15.673035
loss_total             299161139458568347451392.0
loss_critic            373951417995652960878592.0
loss_actor                  -1134543344762.879883
memory_size                             4488.8755 

=== epoch 8/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:57,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                      537
episode_length                          18.608939
returns                                 90.638598
return_std                              18.506543
average_reward                           4.870847
round_time                 0 days 00:14:15.211219
episodes_test                               531.0
episode_length_test                     18.809793
returns_test                            91.820486
return_std_test                         18.694094
average_reward_test                      4.881459
round_time_test            0 days 00:00:12.436505
round_time_total           0 days 00:14:15.212388
loss_total             303795129247905529987072.0
loss_critic            379743904890050853208064.0
loss_actor                  -1140670721327.104004
memory_size                                4489.0 

=== epoch 8/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:31,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      530
episode_length                          18.807547
returns                                 91.597106
return_std                              20.021373
average_reward                            4.87061
round_time                 0 days 00:14:08.978664
episodes_test                               533.0
episode_length_test                     18.756098
returns_test                            91.368293
return_std_test                         18.698219
average_reward_test                      4.871403
round_time_test            0 days 00:00:12.296988
round_time_total           0 days 00:14:08.979732
loss_total             305864159284258360262656.0
loss_critic            382330192326279630422016.0
loss_actor                  -1142275803742.208008
memory_size                              4498.118 

=== epoch 8/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:44,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      531
episode_length                          18.789077
returns                                 91.366474
return_std                              20.344264
average_reward                           4.862878
round_time                 0 days 00:14:06.787155
episodes_test                               533.0
episode_length_test                     18.737336
returns_test                            91.164169
return_std_test                         22.737761
average_reward_test                      4.865303
round_time_test            0 days 00:00:12.245790
round_time_total           0 days 00:14:06.788210
loss_total             309325216017699282354176.0
loss_critic            386656513329775050227712.0
loss_actor                  -1150377182298.112061
memory_size                              4513.055 

=== epoch 8/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:06,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      533
episode_length                           18.73546
returns                                 91.119349
return_std                              19.000056
average_reward                           4.863411
round_time                 0 days 00:14:06.452481
episodes_test                               539.0
episode_length_test                     18.536178
returns_test                            89.992482
return_std_test                         18.634717
average_reward_test                       4.85494
round_time_test            0 days 00:00:12.242675
round_time_total           0 days 00:14:06.453533
loss_total             305424075076704686571520.0
loss_critic            381780087116377167495168.0
loss_actor                  -1146373196054.528076
memory_size                                4523.0 

=== epoch 8/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:58,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      533
episode_length                          18.748593
returns                                 91.171346
return_std                              18.266826
average_reward                           4.862736
round_time                 0 days 00:14:08.540465
episodes_test                               529.0
episode_length_test                     18.903592
returns_test                             92.09923
return_std_test                         16.766005
average_reward_test                      4.872049
round_time_test            0 days 00:00:12.299626
round_time_total           0 days 00:14:08.541681
loss_total             312028547581205005991936.0
loss_critic            390035677593880104009728.0
loss_actor                  -1151203240542.208008
memory_size                             4523.9155 

=== epoch 8/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      531
episode_length                          18.789077
returns                                 91.349937
return_std                              19.404484
average_reward                           4.861708
round_time                 0 days 00:14:09.318324
episodes_test                               535.0
episode_length_test                     18.659813
returns_test                            91.058512
return_std_test                         17.281363
average_reward_test                      4.879883
round_time_test            0 days 00:00:12.331679
round_time_total           0 days 00:14:09.319379
loss_total             312271089745281400438784.0
loss_critic            390338855552303052619776.0
loss_actor                  -1158738203049.983887
memory_size                               4539.13 

=== epoch 8/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      535
episode_length                          18.663551
returns                                 90.714942
return_std                              18.302867
average_reward                           4.860468
round_time                 0 days 00:14:07.387379
episodes_test                               536.0
episode_length_test                     18.654851
returns_test                            90.865079
return_std_test                         17.676565
average_reward_test                      4.870857
round_time_test            0 days 00:00:12.309997
round_time_total           0 days 00:14:07.388429
loss_total             319532466992183506894848.0
loss_critic            399415576802434224226304.0
loss_actor                  -1176646220775.424072
memory_size                              4545.337 

=== epoch 8/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:51,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:37<00:00,  2.13it/s]
episodes                                      529
episode_length                          18.873346
returns                                 91.947402
return_std                              19.759681
average_reward                           4.872004
round_time                 0 days 00:15:38.158811
episodes_test                               546.0
episode_length_test                     18.300366
returns_test                            89.160667
return_std_test                         17.761569
average_reward_test                      4.872036
round_time_test            0 days 00:00:12.160503
round_time_total           0 days 00:15:38.160065
loss_total             326386913762074091847680.0
loss_critic            407983634903383473651712.0
loss_actor                   -1187360788676.60791
memory_size                              4557.452 

=== epoch 8/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:58,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:25<00:00,  1.91it/s]
episodes                                      526
episode_length                          18.965779
returns                                 92.440891
return_std                              20.101816
average_reward                           4.874428
round_time                 0 days 00:17:25.915977
episodes_test                               508.0
episode_length_test                     19.683071
returns_test                            96.003955
return_std_test                         24.419002
average_reward_test                      4.877492
round_time_test            0 days 00:00:13.307278
round_time_total           0 days 00:17:25.917115
loss_total             328329537097352263761920.0
loss_critic            410411914402369942585344.0
loss_actor                  -1190336695926.783936
memory_size                                4577.0 

=== epoch 8/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:33,  1.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:10<00:00,  1.94it/s]
episodes                                      525
episode_length                          18.982857
returns                                   92.5596
return_std                              20.042229
average_reward                           4.875999
round_time                 0 days 00:17:10.784221
episodes_test                               511.0
episode_length_test                     19.567515
returns_test                            95.775107
return_std_test                         22.637416
average_reward_test                      4.894597
round_time_test            0 days 00:00:13.014293
round_time_total           0 days 00:17:10.785371
loss_total             334792507087028532281344.0
loss_critic            418490626725083894251520.0
loss_actor                  -1199236926898.176025
memory_size                                4577.0 

=== epoch 8/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:17,  1.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:58<00:00,  1.96it/s]
episodes                                      521
episode_length                          19.134357
returns                                 93.366745
return_std                              20.063706
average_reward                           4.879674
round_time                 0 days 00:16:59.547193
episodes_test                               529.0
episode_length_test                     18.897921
returns_test                            92.164149
return_std_test                         16.844539
average_reward_test                      4.876954
round_time_test            0 days 00:00:13.035168
round_time_total           0 days 00:16:59.548249
loss_total             334797589615380837433344.0
loss_critic            418496979403639044964352.0
loss_actor                  -1201811741081.600098
memory_size                             4577.3375 

=== epoch 8/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:30,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      522
episode_length                          19.139847
returns                                 93.422205
return_std                              19.446533
average_reward                           4.880954
round_time                 0 days 00:14:09.016213
episodes_test                               533.0
episode_length_test                      18.73546
returns_test                            91.142131
return_std_test                         16.420776
average_reward_test                       4.86462
round_time_test            0 days 00:00:12.165665
round_time_total           0 days 00:14:09.017284
loss_total             325085920580557529415680.0
loss_critic            406357393841944906432512.0
loss_actor                  -1190061006061.568115
memory_size                                4592.0 

=== epoch 8/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:14,  2.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      521
episode_length                          19.138196
returns                                 93.304458
return_std                              19.475241
average_reward                           4.875241
round_time                 0 days 00:14:08.839063
episodes_test                               526.0
episode_length_test                     18.980989
returns_test                            92.486336
return_std_test                         20.025023
average_reward_test                      4.872459
round_time_test            0 days 00:00:12.363940
round_time_total           0 days 00:14:08.840227
loss_total             327711926252692066795520.0
loss_critic            409639900544803480272896.0
loss_actor                  -1195787242569.728027
memory_size                             4606.1755 

=== epoch 8/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:49,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      527
episode_length                          18.944972
returns                                  92.35689
return_std                              18.872516
average_reward                           4.875115
round_time                 0 days 00:14:11.566907
episodes_test                               538.0
episode_length_test                     18.566914
returns_test                            90.493055
return_std_test                         17.660818
average_reward_test                       4.87384
round_time_test            0 days 00:00:12.274049
round_time_total           0 days 00:14:11.567962
loss_total             336189265328935433928704.0
loss_critic            420236573951006775181312.0
loss_actor                  -1202220589809.664062
memory_size                                4613.0 

=== epoch 8/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      528
episode_length                          18.901515
returns                                 92.214961
return_std                              19.550123
average_reward                            4.87881
round_time                 0 days 00:14:09.005378
episodes_test                               523.0
episode_length_test                     19.108987
returns_test                             93.23274
return_std_test                         19.947568
average_reward_test                      4.879002
round_time_test            0 days 00:00:12.158911
round_time_total           0 days 00:14:09.006449
loss_total             329170208070029285523456.0
loss_critic            411462752805215965020160.0
loss_actor                  -1195168543834.112061
memory_size                             4613.0455 

=== epoch 8/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      535
episode_length                          18.674766
returns                                 91.063822
return_std                              18.385675
average_reward                           4.876255
round_time                 0 days 00:14:09.591644
episodes_test                               546.0
episode_length_test                     18.305861
returns_test                            89.093325
return_std_test                         16.577707
average_reward_test                      4.866946
round_time_test            0 days 00:00:12.137635
round_time_total           0 days 00:14:09.592696
loss_total             328366246018979784032256.0
loss_critic            410457800047749331156992.0
loss_actor                  -1193088100859.904053
memory_size                                4614.0 

=== epoch 8/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:49,  1.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      533
episode_length                          18.679174
returns                                 91.109986
return_std                              18.207567
average_reward                           4.877637
round_time                 0 days 00:14:10.619783
episodes_test                               548.0
episode_length_test                     18.220803
returns_test                            88.721062
return_std_test                         19.744161
average_reward_test                       4.86915
round_time_test            0 days 00:00:12.371070
round_time_total           0 days 00:14:10.620863
loss_total             333034428334645442510848.0
loss_critic            416293028246324374405120.0
loss_actor                  -1206411569070.080078
memory_size                                4614.0 

=== epoch 8/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:08,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      534
episode_length                          18.691011
returns                                 91.149859
return_std                              17.783261
average_reward                           4.876866
round_time                 0 days 00:14:09.019004
episodes_test                               504.0
episode_length_test                      19.80754
returns_test                            96.728458
return_std_test                         19.684676
average_reward_test                      4.883698
round_time_test            0 days 00:00:12.433024
round_time_total           0 days 00:14:09.020201
loss_total             338847456984613322227712.0
loss_critic            423559313317942060384256.0
loss_actor                  -1216537994231.808105
memory_size                             4626.3695 

=== epoch 8/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      524
episode_length                          19.055344
returns                                 92.943955
return_std                              19.030801
average_reward                           4.877656
round_time                 0 days 00:14:12.174059
episodes_test                               520.0
episode_length_test                     19.228846
returns_test                            93.784749
return_std_test                         18.682146
average_reward_test                      4.877296
round_time_test            0 days 00:00:12.046868
round_time_total           0 days 00:14:12.175112
loss_total             345566753760608969031680.0
loss_critic            431958434326217259745280.0
loss_actor                  -1222510904934.399902
memory_size                                4627.0 

=== epoch 8/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:31,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                      521
episode_length                          19.159309
returns                                 93.413387
return_std                              19.414224
average_reward                           4.875528
round_time                 0 days 00:14:12.594652
episodes_test                               494.0
episode_length_test                     20.230769
returns_test                            98.979315
return_std_test                         20.624873
average_reward_test                      4.892507
round_time_test            0 days 00:00:12.311093
round_time_total           0 days 00:14:12.595703
loss_total             350117875880154261094400.0
loss_critic            437647336726825011249152.0
loss_actor                   -1233476897210.36792
memory_size                              4633.412 

=== epoch 8/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:43,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      522
episode_length                          19.124521
returns                                 93.186276
return_std                              19.686594
average_reward                           4.872451
round_time                 0 days 00:14:11.532005
episodes_test                               535.0
episode_length_test                      18.66729
returns_test                            90.945436
return_std_test                         20.082173
average_reward_test                      4.871824
round_time_test            0 days 00:00:12.204898
round_time_total           0 days 00:14:11.533075
loss_total             347674352151292687679488.0
loss_critic            434592932778442737319936.0
loss_actor                  -1234592379961.343994
memory_size                                4634.0 

=== epoch 8/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      526
episode_length                          18.990494
returns                                  92.48807
return_std                               20.18491
average_reward                           4.870216
round_time                 0 days 00:14:11.413080
episodes_test                               552.0
episode_length_test                     18.088768
returns_test                            87.877708
return_std_test                         17.592492
average_reward_test                      4.858168
round_time_test            0 days 00:00:12.147596
round_time_total           0 days 00:14:11.414122
loss_total             354360138822404215406592.0
loss_critic            442950165671475769507840.0
loss_actor                  -1244173740048.384033
memory_size                              4635.179 

=== epoch 8/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:46,  2.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      523
episode_length                          19.055449
returns                                 92.824833
return_std                              20.079869
average_reward                           4.871234
round_time                 0 days 00:14:10.680215
episodes_test                               524.0
episode_length_test                      19.04771
returns_test                            92.937326
return_std_test                         20.154117
average_reward_test                      4.878821
round_time_test            0 days 00:00:12.111974
round_time_total           0 days 00:14:10.681426
loss_total             364362591729332000915456.0
loss_critic            455453231597969817468928.0
loss_actor                   -1256248658165.76001
memory_size                              4637.893 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 4/2000 [00:01<15:07,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                      106
episode_length                          18.820755
returns                                 91.875693
return_std                              17.802486
average_reward                           4.881718
round_time                 0 days 00:14:03.303610
episodes_test                               534.0
episode_length_test                     18.705993
returns_test                            91.054433
return_std_test                         19.834826
average_reward_test                      4.867652
round_time_test            0 days 00:00:12.250930
round_time_total           0 days 00:14:03.304691
loss_total             368262452751816026226688.0
loss_critic            460328058078736873947136.0
loss_actor                  -1263203027976.191895
memory_size                                4638.0 

=== epoch 9/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:22,  2.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      213
episode_length                          18.713615
returns                                 91.276819
return_std                              16.548175
average_reward                           4.877323
round_time                 0 days 00:14:05.113702
episodes_test                               509.0
episode_length_test                     19.614931
returns_test                            95.506674
return_std_test                         25.644736
average_reward_test                      4.869126
round_time_test            0 days 00:00:12.446169
round_time_total           0 days 00:14:05.114906
loss_total             370466702407714004271104.0
loss_critic            463083369545126998179840.0
loss_actor                  -1266005984739.327881
memory_size                                4638.0 

=== epoch 9/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:09,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      317
episode_length                          18.917981
returns                                 92.284618
return_std                              16.994243
average_reward                           4.878148
round_time                 0 days 00:14:04.633100
episodes_test                               520.0
episode_length_test                     19.230769
returns_test                            93.765513
return_std_test                          19.55371
average_reward_test                      4.875807
round_time_test            0 days 00:00:12.192794
round_time_total           0 days 00:14:04.634157
loss_total             373463474233017603457024.0
loss_critic            466829334497893260722176.0
loss_actor                  -1280150716022.783936
memory_size                                4638.0 

=== epoch 9/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                      422
episode_length                          18.921801
returns                                 92.219426
return_std                              17.643601
average_reward                           4.873722
round_time                 0 days 00:14:04.012155
episodes_test                               520.0
episode_length_test                     19.211538
returns_test                            93.691297
return_std_test                         20.351155
average_reward_test                      4.876762
round_time_test            0 days 00:00:12.492818
round_time_total           0 days 00:14:04.013217
loss_total             375007925412616344698880.0
loss_critic            468759898377816087986176.0
loss_actor                  -1283011934355.456055
memory_size                              4650.015 

=== epoch 9/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:42,  2.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                      529
episode_length                           18.89603
returns                                 92.057589
return_std                              17.600574
average_reward                           4.871816
round_time                 0 days 00:14:04.161991
episodes_test                               524.0
episode_length_test                     19.078244
returns_test                            92.988698
return_std_test                         18.730786
average_reward_test                      4.874086
round_time_test            0 days 00:00:12.266153
round_time_total           0 days 00:14:04.163059
loss_total             384444766423490966323200.0
loss_critic            480555949450006445948928.0
loss_actor                  -1299498700177.407959
memory_size                                4653.0 

=== epoch 9/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:31,  2.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      530
episode_length                          18.818868
returns                                 91.526671
return_std                              18.507904
average_reward                            4.86371
round_time                 0 days 00:14:05.589000
episodes_test                               546.0
episode_length_test                     18.294872
returns_test                            89.087466
return_std_test                         14.981218
average_reward_test                      4.869501
round_time_test            0 days 00:00:12.226710
round_time_total           0 days 00:14:05.590071
loss_total             394246086031030148399104.0
loss_critic            492807599017977253462016.0
loss_actor                  -1309427058606.080078
memory_size                             4672.5875 

=== epoch 9/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:20,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                      537
episode_length                           18.60149
returns                                 90.389568
return_std                              18.117941
average_reward                           4.859226
round_time                 0 days 00:14:03.800016
episodes_test                               557.0
episode_length_test                     17.928187
returns_test                            87.171762
return_std_test                         15.333532
average_reward_test                      4.862194
round_time_test            0 days 00:00:12.377467
round_time_total           0 days 00:14:03.801442
loss_total             402268898792463673589760.0
loss_critic            502836114895459732946944.0
loss_actor                  -1321831384350.719971
memory_size                                4678.0 

=== epoch 9/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      549
episode_length                           18.15847
returns                                 88.066441
return_std                              16.888587
average_reward                           4.849981
round_time                 0 days 00:14:06.270453
episodes_test                               580.0
episode_length_test                     17.241379
returns_test                            83.652722
return_std_test                         10.175227
average_reward_test                      4.851858
round_time_test            0 days 00:00:12.157677
round_time_total           0 days 00:14:06.271514
loss_total             402571032681385016950784.0
loss_critic            503213781923345018650624.0
loss_actor                   -1324387824107.52002
memory_size                                4678.0 

=== epoch 9/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      554
episode_length                          18.034296
returns                                 87.502725
return_std                              18.344869
average_reward                           4.852081
round_time                 0 days 00:14:05.796590
episodes_test                               571.0
episode_length_test                     17.502627
returns_test                            84.836666
return_std_test                         12.342609
average_reward_test                        4.8471
round_time_test            0 days 00:00:12.124686
round_time_total           0 days 00:14:05.797651
loss_total             408574269456491776835584.0
loss_critic            510717827637775077212160.0
loss_actor                  -1337755794145.280029
memory_size                               4700.13 

=== epoch 9/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:47,  2.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      556
episode_length                          17.940647
returns                                 87.041792
return_std                              18.248503
average_reward                           4.852085
round_time                 0 days 00:14:06.166839
episodes_test                               549.0
episode_length_test                     18.209472
returns_test                            88.723857
return_std_test                         17.399457
average_reward_test                      4.872408
round_time_test            0 days 00:00:12.373000
round_time_total           0 days 00:14:06.167906
loss_total             406648049433517542604800.0
loss_critic            508310052820726426107904.0
loss_actor                  -1343249112956.927979
memory_size                                4715.0 

=== epoch 9/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:34,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      561
episode_length                          17.800357
returns                                 86.446987
return_std                              16.746244
average_reward                           4.856577
round_time                 0 days 00:14:06.560557
episodes_test                               561.0
episode_length_test                     17.819964
returns_test                            86.840383
return_std_test                         16.920351
average_reward_test                      4.873224
round_time_test            0 days 00:00:12.444293
round_time_total           0 days 00:14:06.561631
loss_total             410037170471851591204864.0
loss_critic            512546454152421018959872.0
loss_actor                   -1350063339274.23999
memory_size                                4715.0 

=== epoch 9/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      560
episode_length                          17.821429
returns                                 86.557118
return_std                              16.846728
average_reward                           4.856786
round_time                 0 days 00:14:05.259691
episodes_test                               548.0
episode_length_test                     18.233577
returns_test                            88.812494
return_std_test                         17.857279
average_reward_test                      4.870817
round_time_test            0 days 00:00:12.258054
round_time_total           0 days 00:14:05.260754
loss_total             421101503274846204723200.0
loss_critic            526376870149408855949312.0
loss_actor                  -1362559862177.791992
memory_size                                4715.0 

=== epoch 9/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:54,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      558
episode_length                          17.908602
returns                                 87.092061
return_std                              17.171953
average_reward                            4.86317
round_time                 0 days 00:14:05.318411
episodes_test                               563.0
episode_length_test                     17.753108
returns_test                             86.11053
return_std_test                         13.199255
average_reward_test                      4.850469
round_time_test            0 days 00:00:12.239922
round_time_total           0 days 00:14:05.319604
loss_total             418667357801316282793984.0
loss_critic            523334188683547079868416.0
loss_actor                  -1370078718459.904053
memory_size                                4715.0 

=== epoch 9/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      562
episode_length                          17.715302
returns                                 86.126236
return_std                              14.576835
average_reward                           4.861668
round_time                 0 days 00:14:06.380017
episodes_test                               569.0
episode_length_test                      17.54833
returns_test                            85.229488
return_std_test                         14.888776
average_reward_test                      4.856939
round_time_test            0 days 00:00:12.324891
round_time_total           0 days 00:14:06.381084
loss_total             429000280957790403428352.0
loss_critic            536250342421974097068032.0
loss_actor                  -1379666435833.855957
memory_size                                4715.0 

=== epoch 9/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:18,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      564
episode_length                          17.689716
returns                                 86.000319
return_std                               15.21474
average_reward                           4.861444
round_time                 0 days 00:14:10.226722
episodes_test                               563.0
episode_length_test                     17.749556
returns_test                            86.401341
return_std_test                         15.754335
average_reward_test                      4.867792
round_time_test            0 days 00:00:12.273165
round_time_total           0 days 00:14:10.227816
loss_total             422213802599044526112768.0
loss_critic            527767244475793571577856.0
loss_actor                  -1374249632071.679932
memory_size                              4727.961 

=== epoch 9/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                      562
episode_length                          17.756228
returns                                 86.334011
return_std                              14.797327
average_reward                           4.862026
round_time                 0 days 00:14:12.608328
episodes_test                               568.0
episode_length_test                     17.591549
returns_test                            85.526854
return_std_test                         13.762208
average_reward_test                      4.861794
round_time_test            0 days 00:00:12.150311
round_time_total           0 days 00:14:12.609376
loss_total             421409652270509028540416.0
loss_critic            526762056324181651357696.0
loss_actor                  -1381445711364.095947
memory_size                                4728.0 

=== epoch 9/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:08,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      559
episode_length                          17.856887
returns                                   86.9011
return_std                              15.381754
average_reward                           4.866426
round_time                 0 days 00:14:10.934454
episodes_test                               538.0
episode_length_test                      18.54461
returns_test                            90.514101
return_std_test                         17.647348
average_reward_test                      4.880721
round_time_test            0 days 00:00:12.391827
round_time_total           0 days 00:14:10.935578
loss_total             424485620192497511694336.0
loss_critic            530607015990228248690688.0
loss_actor                  -1386300876587.008057
memory_size                                4728.0 

=== epoch 9/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:17,  1.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      560
episode_length                          17.817857
returns                                 86.580305
return_std                              15.794097
average_reward                           4.859135
round_time                 0 days 00:14:11.299336
episodes_test                               548.0
episode_length_test                     18.229927
returns_test                            88.454932
return_std_test                         19.973608
average_reward_test                      4.852155
round_time_test            0 days 00:00:12.386006
round_time_total           0 days 00:14:11.300398
loss_total             427597285266428396568576.0
loss_critic            534496597231310859141120.0
loss_actor                  -1388376961908.736084
memory_size                             4739.4905 

=== epoch 9/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:58,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      559
episode_length                          17.840787
returns                                 86.719342
return_std                              14.860357
average_reward                           4.860983
round_time                 0 days 00:14:06.609255
episodes_test                               576.0
episode_length_test                     17.357639
returns_test                            83.845209
return_std_test                         13.054083
average_reward_test                      4.830468
round_time_test            0 days 00:00:12.376231
round_time_total           0 days 00:14:06.610332
loss_total             434901499023603846348800.0
loss_critic            543626864749787509424128.0
loss_actor                  -1399141666521.087891
memory_size                             4742.6285 

=== epoch 9/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      569
episode_length                           17.56239
returns                                 85.218953
return_std                              13.778767
average_reward                           4.852364
round_time                 0 days 00:14:04.899350
episodes_test                               578.0
episode_length_test                     17.292388
returns_test                            83.618005
return_std_test                         11.361779
average_reward_test                      4.835588
round_time_test            0 days 00:00:12.363874
round_time_total           0 days 00:14:04.900430
loss_total             444326003260968837054464.0
loss_critic            555407494839328214876160.0
loss_actor                  -1408792609095.679932
memory_size                                4748.0 

=== epoch 9/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:51,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      572
episode_length                           17.45979
returns                                 84.587749
return_std                              15.128681
average_reward                           4.844671
round_time                 0 days 00:14:07.420231
episodes_test                               585.0
episode_length_test                     17.088889
returns_test                            82.321127
return_std_test                         13.062667
average_reward_test                      4.817254
round_time_test            0 days 00:00:12.316719
round_time_total           0 days 00:14:07.421285
loss_total             446722229461464232493056.0
loss_critic            558402777594451017596928.0
loss_actor                  -1409581944733.696045
memory_size                             4754.2515 

=== epoch 9/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:17,  2.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      578
episode_length                          17.276817
returns                                 83.595393
return_std                              13.854077
average_reward                           4.838765
round_time                 0 days 00:14:07.839334
episodes_test                               573.0
episode_length_test                     17.443281
returns_test                            84.126112
return_std_test                         16.955181
average_reward_test                      4.822883
round_time_test            0 days 00:00:12.259248
round_time_total           0 days 00:14:07.840392
loss_total             447184510056098099625984.0
loss_critic            558980627815325893656576.0
loss_actor                  -1413125235933.184082
memory_size                                4764.0 

=== epoch 9/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:23,  2.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      582
episode_length                           17.16323
returns                                 83.032888
return_std                              12.611749
average_reward                           4.838191
round_time                 0 days 00:14:09.151052
episodes_test                               587.0
episode_length_test                     17.027257
returns_test                            82.030291
return_std_test                         11.136879
average_reward_test                      4.817641
round_time_test            0 days 00:00:12.298208
round_time_total           0 days 00:14:09.152115
loss_total             455803866354559808962560.0
loss_critic            569754823409079322411008.0
loss_actor                  -1425810031378.431885
memory_size                                4764.0 

=== epoch 9/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:27,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      587
episode_length                          17.025554
returns                                 82.206913
return_std                              10.735705
average_reward                           4.828413
round_time                 0 days 00:14:09.594876
episodes_test                               592.0
episode_length_test                     16.878378
returns_test                            81.312074
return_std_test                          9.676428
average_reward_test                      4.817532
round_time_test            0 days 00:00:12.447601
round_time_total           0 days 00:14:09.595920
loss_total             455332031391330776121344.0
loss_critic            569165029560927903547392.0
loss_actor                  -1430062290763.775879
memory_size                                4764.0 

=== epoch 9/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      583
episode_length                          17.114923
returns                                 82.651884
return_std                              11.507455
average_reward                           4.829171
round_time                 0 days 00:14:08.290587
episodes_test                               583.0
episode_length_test                     17.132075
returns_test                            82.614022
return_std_test                         12.420112
average_reward_test                      4.822197
round_time_test            0 days 00:00:12.536058
round_time_total           0 days 00:14:08.291644
loss_total             458944793513026295693312.0
loss_critic            573680981868521937960960.0
loss_actor                  -1427876240883.711914
memory_size                              4764.503 

=== epoch 9/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:15,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      584
episode_length                          17.094178
returns                                 82.507085
return_std                              10.193156
average_reward                           4.826987
round_time                 0 days 00:14:10.515470
episodes_test                               580.0
episode_length_test                      17.22931
returns_test                            82.962847
return_std_test                         10.753154
average_reward_test                      4.815243
round_time_test            0 days 00:00:12.521031
round_time_total           0 days 00:14:10.516544
loss_total             457664262657093117411328.0
loss_critic            572080318199526206734336.0
loss_actor                  -1426619406483.456055
memory_size                             4766.1435 

=== epoch 9/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:59,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:15<00:00,  2.34it/s]
episodes                                      584
episode_length                          17.083904
returns                                 82.422455
return_std                              10.457551
average_reward                           4.824497
round_time                 0 days 00:14:15.781795
episodes_test                               581.0
episode_length_test                     17.194492
returns_test                            83.121548
return_std_test                          9.506981
average_reward_test                      4.834169
round_time_test            0 days 00:00:12.216156
round_time_total           0 days 00:14:15.782864
loss_total             452171830925136528146432.0
loss_critic            565214779095278669529088.0
loss_actor                   -1412911478538.23999
memory_size                                4767.0 

=== epoch 9/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      582
episode_length                          17.140893
returns                                 82.721571
return_std                              10.674739
average_reward                           4.825879
round_time                 0 days 00:14:08.697586
episodes_test                               583.0
episode_length_test                     17.144082
returns_test                            82.702246
return_std_test                         14.099941
average_reward_test                      4.824002
round_time_test            0 days 00:00:12.398734
round_time_total           0 days 00:14:08.698774
loss_total             448568819123655918223360.0
loss_critic            560711014167787467702272.0
loss_actor                  -1420414446272.511963
memory_size                                4767.0 

=== epoch 9/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:22,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      581
episode_length                          17.173838
returns                                 82.859829
return_std                              11.052281
average_reward                            4.82485
round_time                 0 days 00:14:11.252706
episodes_test                               586.0
episode_length_test                     17.042662
returns_test                            82.491966
return_std_test                          10.57024
average_reward_test                      4.840426
round_time_test            0 days 00:00:12.257855
round_time_total           0 days 00:14:11.253775
loss_total             457358827638652128985088.0
loss_critic            571698525324943122497536.0
loss_actor                  -1432140228001.791992
memory_size                                4767.0 

=== epoch 9/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:53,  1.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:16<00:00,  2.33it/s]
episodes                                      577
episode_length                          17.306759
returns                                 83.481642
return_std                              12.170312
average_reward                           4.823747
round_time                 0 days 00:14:17.167072
episodes_test                               566.0
episode_length_test                     17.646643
returns_test                            85.455305
return_std_test                         15.831995
average_reward_test                      4.842531
round_time_test            0 days 00:00:12.429368
round_time_total           0 days 00:14:17.168278
loss_total             461949865265277538664448.0
loss_critic            577437322286167329079296.0
loss_actor                  -1437758140514.303955
memory_size                             4769.7585 

=== epoch 9/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:43,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                      579
episode_length                          17.243523
returns                                 83.186417
return_std                              11.412952
average_reward                           4.824173
round_time                 0 days 00:14:15.257922
episodes_test                               587.0
episode_length_test                     17.010221
returns_test                            82.144545
return_std_test                          9.654723
average_reward_test                      4.829046
round_time_test            0 days 00:00:12.292747
round_time_total           0 days 00:14:15.258976
loss_total             467527052517443012395008.0
loss_critic            584408806752194511503360.0
loss_actor                  -1437942635036.672119
memory_size                                4776.0 

=== epoch 9/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:56,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:55<00:00,  1.97it/s]
episodes                                      577
episode_length                          17.310225
returns                                 83.408921
return_std                              14.602951
average_reward                            4.81848
round_time                 0 days 00:16:56.398403
episodes_test                               580.0
episode_length_test                     17.237931
returns_test                            83.130729
return_std_test                         11.880859
average_reward_test                      4.822563
round_time_test            0 days 00:00:12.272292
round_time_total           0 days 00:16:56.399532
loss_total             474733502278022715146240.0
loss_critic            593416867995904209059840.0
loss_actor                  -1446267144044.543945
memory_size                              4808.976 

=== epoch 9/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<18:32,  1.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:26<00:00,  1.91it/s]
episodes                                      580
episode_length                          17.234483
returns                                 82.986263
return_std                              14.390515
average_reward                           4.815096
round_time                 0 days 00:17:27.599737
episodes_test                               597.0
episode_length_test                     16.737018
returns_test                            80.518723
return_std_test                           8.63038
average_reward_test                      4.810853
round_time_test            0 days 00:00:12.903899
round_time_total           0 days 00:17:27.601261
loss_total             480259200344022765797376.0
loss_critic            600323990690994306678784.0
loss_actor                  -1460568707366.912109
memory_size                                4817.0 

=== epoch 9/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:57,  1.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:14<00:00,  1.93it/s]
episodes                                      581
episode_length                          17.170396
returns                                 82.685167
return_std                              13.952332
average_reward                           4.815546
round_time                 0 days 00:17:14.697776
episodes_test                               579.0
episode_length_test                      17.26943
returns_test                            83.087276
return_std_test                         13.289113
average_reward_test                      4.811246
round_time_test            0 days 00:00:13.058440
round_time_total           0 days 00:17:14.698834
loss_total             484490711703270776111104.0
loss_critic            605613379876543426199552.0
loss_actor                  -1465810677989.375977
memory_size                                4817.0 

=== epoch 9/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:40,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      586
episode_length                          17.027304
returns                                   82.0275
return_std                              12.708364
average_reward                           4.817338
round_time                 0 days 00:14:09.871516
episodes_test                               599.0
episode_length_test                     16.676127
returns_test                            80.329524
return_std_test                          5.908416
average_reward_test                      4.817045
round_time_test            0 days 00:00:12.240186
round_time_total           0 days 00:14:09.872734
loss_total             483524088775987901235200.0
loss_critic            604405100827878497452032.0
loss_actor                  -1474886617202.687988
memory_size                                4817.0 

=== epoch 9/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:49,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      587
episode_length                          16.994889
returns                                 81.886679
return_std                              12.111587
average_reward                           4.818249
round_time                 0 days 00:14:08.010060
episodes_test                               592.0
episode_length_test                     16.883446
returns_test                            81.434326
return_std_test                          11.89087
average_reward_test                      4.823363
round_time_test            0 days 00:00:12.240631
round_time_total           0 days 00:14:08.011123
loss_total             486051812951900676947968.0
loss_critic            607564756124330715250688.0
loss_actor                  -1474467278225.407959
memory_size                                4817.0 

=== epoch 9/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      591
episode_length                          16.888325
returns                                 81.446802
return_std                               7.567743
average_reward                           4.822614
round_time                 0 days 00:14:07.725643
episodes_test                               588.0
episode_length_test                     17.001701
returns_test                            82.174695
return_std_test                         11.271852
average_reward_test                       4.83334
round_time_test            0 days 00:00:12.442034
round_time_total           0 days 00:14:07.726706
loss_total             487717772195201766391808.0
loss_critic            609647205874263220289536.0
loss_actor                   -1480287013306.36792
memory_size                                4817.0 

=== epoch 9/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:01,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:52<00:00,  1.97it/s]
episodes                                      589
episode_length                          16.976231
returns                                  81.93322
return_std                               7.929782
average_reward                           4.826358
round_time                 0 days 00:16:53.170366
episodes_test                               585.0
episode_length_test                     17.068376
returns_test                            82.211459
return_std_test                          8.988683
average_reward_test                      4.816577
round_time_test            0 days 00:00:12.304469
round_time_total           0 days 00:16:53.171608
loss_total             493231694564148190904320.0
loss_critic            616539608202690512814080.0
loss_actor                  -1484317888479.231934
memory_size                                4817.0 

=== epoch 9/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:53,  1.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:25<00:00,  1.91it/s]
episodes                                      576
episode_length                          17.333333
returns                                 83.632293
return_std                              13.345321
average_reward                           4.825076
round_time                 0 days 00:17:26.531928
episodes_test                               557.0
episode_length_test                     17.949731
returns_test                            86.615901
return_std_test                         15.356275
average_reward_test                       4.82549
round_time_test            0 days 00:00:12.789531
round_time_total           0 days 00:17:26.533377
loss_total             487935326033162895622144.0
loss_critic            609919147811426686468096.0
loss_actor                  -1478442392879.104004
memory_size                              4830.063 

=== epoch 9/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:33,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:54<00:00,  1.97it/s]
episodes                                      563
episode_length                          17.730018
returns                                 85.414059
return_std                              17.399967
average_reward                           4.817405
round_time                 0 days 00:16:54.960905
episodes_test                               514.0
episode_length_test                     19.435798
returns_test                            93.144721
return_std_test                         27.028776
average_reward_test                      4.792451
round_time_test            0 days 00:00:13.410858
round_time_total           0 days 00:16:54.961960
loss_total             498527390563593921495040.0
loss_critic            623159227843961434406912.0
loss_actor                  -1479595830673.407959
memory_size                              4864.144 

=== epoch 9/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [19:59<00:00,  1.67it/s]
episodes                                      550
episode_length                          18.145455
returns                                 87.396932
return_std                              20.047876
average_reward                           4.816474
round_time                 0 days 00:20:00.451569
episodes_test                               526.0
episode_length_test                     19.003802
returns_test                            91.390179
return_std_test                         23.878812
average_reward_test                      4.809094
round_time_test            0 days 00:00:12.450647
round_time_total           0 days 00:20:00.452948
loss_total             492585875963653868486656.0
loss_critic            615732334276532689174528.0
loss_actor                   -1472691656458.23999
memory_size                             4897.4825 

=== epoch 9/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:43,  1.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [28:15<00:00,  1.18it/s]
episodes                                      547
episode_length                          18.272395
returns                                 87.955153
return_std                              20.616012
average_reward                           4.813538
round_time                 0 days 00:28:16.409065
episodes_test                               565.0
episode_length_test                     17.692035
returns_test                            85.262572
return_std_test                         12.665386
average_reward_test                      4.819289
round_time_test            0 days 00:00:14.137087
round_time_total           0 days 00:28:16.410523
loss_total             490252961519842476687360.0
loss_critic            612816192106725740904448.0
loss_actor                  -1472809327329.280029
memory_size                             4912.4285 

=== epoch 9/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:34,  1.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [31:25<00:00,  1.06it/s]
episodes                                      542
episode_length                          18.392989
returns                                 88.395622
return_std                              21.047782
average_reward                           4.805901
round_time                 0 days 00:31:26.492246
episodes_test                               570.0
episode_length_test                     17.542105
returns_test                            84.603378
return_std_test                         13.770789
average_reward_test                      4.822884
round_time_test            0 days 00:00:15.940133
round_time_total           0 days 00:31:26.493688
loss_total             491702635526147092250624.0
loss_critic            614628284204778919034880.0
loss_actor                  -1471891522060.288086
memory_size                                4920.0 

=== epoch 9/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<30:09,  1.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [29:04<00:00,  1.15it/s]
episodes                                      547
episode_length                          18.268739
returns                                 87.836323
return_std                              19.454318
average_reward                           4.807967
round_time                 0 days 00:29:04.922351
episodes_test                               560.0
episode_length_test                     17.833929
returns_test                            86.075908
return_std_test                         14.393279
average_reward_test                      4.826497
round_time_test            0 days 00:00:15.287687
round_time_total           0 days 00:29:04.923706
loss_total             489581703670320198582272.0
loss_critic            611977119893902067761152.0
loss_actor                  -1473632681197.568115
memory_size                              4923.456 

=== epoch 9/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:14,  1.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [25:00<00:00,  1.33it/s]
episodes                                      548
episode_length                          18.228102
returns                                 87.809707
return_std                              18.050325
average_reward                           4.817276
round_time                 0 days 00:25:00.719724
episodes_test                               508.0
episode_length_test                     19.677165
returns_test                            94.856343
return_std_test                         26.341205
average_reward_test                      4.820657
round_time_test            0 days 00:00:14.400743
round_time_total           0 days 00:25:00.720881
loss_total             484063527093079377969152.0
loss_critic            605079399237653268267008.0
loss_actor                  -1456543596150.783936
memory_size                             4939.2495 

=== epoch 9/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<25:27,  1.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [29:55<00:00,  1.11it/s]
episodes                                      545
episode_length                          18.324771
returns                                 88.279844
return_std                              18.374326
average_reward                           4.817527
round_time                 0 days 00:29:56.106028
episodes_test                               522.0
episode_length_test                     19.149425
returns_test                            92.399203
return_std_test                         23.318695
average_reward_test                      4.825206
round_time_test            0 days 00:00:14.380447
round_time_total           0 days 00:29:56.107381
loss_total             487325817020117126479872.0
loss_critic            609157261666716609413120.0
loss_actor                  -1454087811235.840088
memory_size                              4958.785 

=== epoch 9/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<29:54,  1.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [21:36<00:00,  1.54it/s]
episodes                                      538
episode_length                          18.526022
returns                                  89.42008
return_std                              19.156599
average_reward                           4.826649
round_time                 0 days 00:21:37.343895
episodes_test                               522.0
episode_length_test                     19.126437
returns_test                            92.474841
return_std_test                         21.439565
average_reward_test                      4.835077
round_time_test            0 days 00:00:15.610169
round_time_total           0 days 00:21:37.345077
loss_total             486165959386616131747840.0
loss_critic            607707439023609855606784.0
loss_actor                  -1452456611938.303955
memory_size                              4971.778 

=== epoch 9/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:16,  1.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                      530
episode_length                          18.826415
returns                                 90.976266
return_std                              20.934312
average_reward                           4.832485
round_time                 0 days 00:15:09.815279
episodes_test                               530.0
episode_length_test                     18.839623
returns_test                            91.103689
return_std_test                         20.640731
average_reward_test                      4.835832
round_time_test            0 days 00:00:12.909337
round_time_total           0 days 00:15:09.816340
loss_total             487233639955031882989568.0
loss_critic            609042040078654809047040.0
loss_actor                   -1447908397744.12793
memory_size                              4992.376 

=== epoch 9/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:28,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:16<00:00,  2.33it/s]
episodes                                      525
episode_length                           18.99619
returns                                 91.816842
return_std                              24.389709
average_reward                           4.833336
round_time                 0 days 00:14:17.167123
episodes_test                               510.0
episode_length_test                     19.584314
returns_test                            94.502524
return_std_test                         27.986753
average_reward_test                      4.825485
round_time_test            0 days 00:00:12.475911
round_time_total           0 days 00:14:17.168185
loss_total             473997756313579382898688.0
loss_critic            592497185186817477967872.0
loss_actor                  -1439386549157.887939
memory_size                             5046.5385 

=== epoch 9/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:24,  2.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:16<00:00,  2.34it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                      532
episode_length                          18.774436
returns                                 90.664774
return_std                               23.55657
average_reward                           4.829165
round_time                 0 days 00:14:16.918462
episodes_test                               552.0
episode_length_test                     18.115942
returns_test                            87.456435
return_std_test                         15.874877
average_reward_test                      4.827595
round_time_test            0 days 00:00:12.519213
round_time_total           0 days 00:14:16.919510
loss_total             473900892802721924513792.0
loss_critic            592376105847785233842176.0
loss_actor                  -1426459918270.464111
memory_size                              5057.716 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  0%|          | 4/2000 [00:01<15:28,  2.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      112
episode_length                          17.803571
returns                                 86.036055
return_std                              22.601127
average_reward                           4.832676
round_time                 0 days 00:14:09.185475
episodes_test                               559.0
episode_length_test                     17.860465
returns_test                             85.92157
return_std_test                         16.443811
average_reward_test                      4.810664
round_time_test            0 days 00:00:12.698665
round_time_total           0 days 00:14:09.186788
loss_total             454081387044021561458688.0
loss_critic            567601725088309891301376.0
loss_actor                  -1387806242963.456055
memory_size                              5071.717 

=== epoch 10/10 ==== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:17,  2.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                      217
episode_length                          18.253456
returns                                 88.134533
return_std                              22.120927
average_reward                           4.825511
round_time                 0 days 00:14:05.741809
episodes_test                               527.0
episode_length_test                     18.956357
returns_test                            91.268544
return_std_test                         20.160505
average_reward_test                      4.814696
round_time_test            0 days 00:00:12.750559
round_time_total           0 days 00:14:05.742855
loss_total             453072266078999019520000.0
loss_critic            566340323578038693396480.0
loss_actor                  -1386969824231.424072
memory_size                             5103.5685 

=== epoch 10/10 ==== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:55,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      323
episode_length                          18.551084
returns                                 89.635531
return_std                              22.539884
average_reward                            4.83185
round_time                 0 days 00:14:06.264182
episodes_test                               491.0
episode_length_test                     20.323829
returns_test                             97.94896
return_std_test                         30.672989
average_reward_test                      4.819744
round_time_test            0 days 00:00:12.591696
round_time_total           0 days 00:14:06.265251
loss_total             467248624820355420127232.0
loss_critic            584060771176071877885952.0
loss_actor                  -1392429091520.511963
memory_size                             5120.9785 

=== epoch 10/10 ==== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:56,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      429
episode_length                          18.641026
returns                                 89.944918
return_std                              22.833251
average_reward                           4.825146
round_time                 0 days 00:14:04.627939
episodes_test                               523.0
episode_length_test                      19.10325
returns_test                            91.926066
return_std_test                         23.831636
average_reward_test                      4.812074
round_time_test            0 days 00:00:12.563747
round_time_total           0 days 00:14:04.629033
loss_total             469240364264550941327360.0
loss_critic            586550445519596953796608.0
loss_actor                  -1390307325837.312012
memory_size                             5137.5295 

=== epoch 10/10 ==== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:02,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      537
episode_length                          18.608939
returns                                 89.766884
return_std                              22.711621
average_reward                           4.823869
round_time                 0 days 00:14:04.628803
episodes_test                               532.0
episode_length_test                     18.785714
returns_test                            90.349754
return_std_test                         26.353459
average_reward_test                      4.809542
round_time_test            0 days 00:00:12.626550
round_time_total           0 days 00:14:04.629871
loss_total             472637382499939645390848.0
loss_critic            590796717985069948469248.0
loss_actor                  -1389262774140.927979
memory_size                             5166.3315 

=== epoch 10/10 ==== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:59,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                      533
episode_length                          18.733583
returns                                 90.306004
return_std                              22.330022
average_reward                            4.82075
round_time                 0 days 00:14:06.178667
episodes_test                               541.0
episode_length_test                     18.475046
returns_test                            88.635329
return_std_test                         18.752993
average_reward_test                      4.797619
round_time_test            0 days 00:00:12.444839
round_time_total           0 days 00:14:06.179723
loss_total             467085639432747322703872.0
loss_critic            583857039668993528430592.0
loss_actor                  -1382882750758.912109
memory_size                             5203.9585 

=== epoch 10/10 ==== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      543
episode_length                          18.390424
returns                                 88.595837
return_std                              20.378636
average_reward                            4.81734
round_time                 0 days 00:14:04.708970
episodes_test                               559.0
episode_length_test                     17.874776
returns_test                            85.854982
return_std_test                         15.988585
average_reward_test                       4.80317
round_time_test            0 days 00:00:12.399139
round_time_total           0 days 00:14:04.710035
loss_total             462275294572710650183680.0
loss_critic            577844108778595227271168.0
loss_actor                  -1377716607320.063965
memory_size                                5214.0 

=== epoch 10/10 ==== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:19,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                      553
episode_length                          18.041591
returns                                 86.827245
return_std                              18.567458
average_reward                           4.812625
round_time                 0 days 00:14:04.574973
episodes_test                               588.0
episode_length_test                     16.993197
returns_test                            81.807719
return_std_test                          8.968822
average_reward_test                      4.814159
round_time_test            0 days 00:00:12.244507
round_time_total           0 days 00:14:04.576165
loss_total             467192612317803497652224.0
loss_critic            583990756117587391676416.0
loss_actor                  -1373619318226.944092
memory_size                                5214.0 

=== epoch 10/10 ==== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:26,  2.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:57<00:00,  2.39it/s]
episodes                                      565
episode_length                          17.656637
returns                                 85.019325
return_std                              15.480249
average_reward                           4.815365
round_time                 0 days 00:13:58.113672
episodes_test                               596.0
episode_length_test                     16.753356
returns_test                            80.690731
return_std_test                          6.864654
average_reward_test                      4.816299
round_time_test            0 days 00:00:12.140662
round_time_total           0 days 00:13:58.114749
loss_total             458477942746555342651392.0
loss_critic            573097419029678124957696.0
loss_actor                  -1366994882920.447998
memory_size                                5214.0 

=== epoch 10/10 ==== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:33,  2.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                      570
episode_length                          17.522807
returns                                 84.416792
return_std                              12.965213
average_reward                           4.817849
round_time                 0 days 00:13:59.710538
episodes_test                               578.0
episode_length_test                     17.283737
returns_test                            83.501493
return_std_test                          11.43185
average_reward_test                      4.831219
round_time_test            0 days 00:00:12.520798
round_time_total           0 days 00:13:59.711606
loss_total             452369400508125571710976.0
loss_critic            565461740907381768847360.0
loss_actor                  -1360883916144.639893
memory_size                                5214.0 

=== epoch 10/10 ==== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:58<00:00,  2.38it/s]
episodes                                      575
episode_length                              17.36
returns                                 83.666475
return_std                              10.612136
average_reward                           4.819458
round_time                 0 days 00:13:59.313066
episodes_test                               565.0
episode_length_test                     17.684956
returns_test                             85.03905
return_std_test                         12.290455
average_reward_test                      4.808593
round_time_test            0 days 00:00:12.399427
round_time_total           0 days 00:13:59.314201
loss_total             451595106973463674355712.0
loss_critic            564493874311061835874304.0
loss_actor                  -1356257033781.248047
memory_size                              5216.855 

=== epoch 10/10 ==== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:40,  1.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:45<00:00,  1.99it/s]
episodes                                      577
episode_length                          17.320624
returns                                 83.508168
return_std                                10.1539
average_reward                           4.821291
round_time                 0 days 00:16:46.450189
episodes_test                               572.0
episode_length_test                     17.465035
returns_test                            84.203294
return_std_test                         11.546126
average_reward_test                       4.82125
round_time_test            0 days 00:00:12.279216
round_time_total           0 days 00:16:46.451298
loss_total             459286581812839453818880.0
loss_critic            574108217711662706196480.0
loss_actor                  -1360163455041.535889
memory_size                                5218.0 

=== epoch 10/10 ==== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:10,  1.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:11<00:00,  1.94it/s]
episodes                                      572
episode_length                          17.452797
returns                                 84.210928
return_std                              12.634195
average_reward                           4.825158
round_time                 0 days 00:17:12.272306
episodes_test                               579.0
episode_length_test                      17.26943
returns_test                            83.209801
return_std_test                          8.689866
average_reward_test                      4.818342
round_time_test            0 days 00:00:12.726309
round_time_total           0 days 00:17:12.273462
loss_total             455753001781633979777024.0
loss_critic            569691242310116030021632.0
loss_actor                   -1353496729354.23999
memory_size                             5220.4885 

=== epoch 10/10 ==== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:06,  1.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                      571
episode_length                          17.481611
returns                                 84.410495
return_std                               12.84235
average_reward                           4.828484
round_time                 0 days 00:16:24.909798
episodes_test                               567.0
episode_length_test                      17.62963
returns_test                            85.254994
return_std_test                         11.725325
average_reward_test                      4.835926
round_time_test            0 days 00:00:12.617940
round_time_total           0 days 00:16:24.910859
loss_total             446851388978803903561728.0
loss_critic            558564227299622202114048.0
loss_actor                  -1346314941825.023926
memory_size                                5239.0 

=== epoch 10/10 ==== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:00,  1.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      575
episode_length                          17.377391
returns                                 83.829192
return_std                               12.29034
average_reward                           4.824045
round_time                 0 days 00:14:11.165823
episodes_test                               561.0
episode_length_test                     17.821747
returns_test                              85.7401
return_std_test                         12.720366
average_reward_test                      4.810995
round_time_test            0 days 00:00:12.371588
round_time_total           0 days 00:14:11.166881
loss_total             451025955824564311162880.0
loss_critic            563782435498786520301568.0
loss_actor                  -1345627615789.055908
memory_size                                5239.0 

=== epoch 10/10 ==== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:06,  2.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      577
episode_length                          17.303293
returns                                 83.447358
return_std                              12.097089
average_reward                           4.822577
round_time                 0 days 00:14:09.467627
episodes_test                               585.0
episode_length_test                     17.066667
returns_test                            82.239511
return_std_test                          9.469743
average_reward_test                      4.818676
round_time_test            0 days 00:00:12.285487
round_time_total           0 days 00:14:09.468696
loss_total             450138246771392032800768.0
loss_critic            562672799558371832233984.0
loss_actor                  -1350924413304.832031
memory_size                             5245.5485 

=== epoch 10/10 ==== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:58,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      574
episode_length                          17.397213
returns                                 83.955385
return_std                              13.904723
average_reward                            4.82577
round_time                 0 days 00:14:10.182663
episodes_test                               574.0
episode_length_test                     17.405923
returns_test                            83.919613
return_std_test                         14.466965
average_reward_test                      4.821342
round_time_test            0 days 00:00:12.431482
round_time_total           0 days 00:14:10.183728
loss_total             445151156369560979898368.0
loss_critic            556438935923327210881024.0
loss_actor                   -1348960332021.76001
memory_size                             5251.7115 

=== epoch 10/10 ==== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:42,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      577
episode_length                           17.30156
returns                                 83.443172
return_std                              12.240689
average_reward                           4.822798
round_time                 0 days 00:14:11.673175
episodes_test                               573.0
episode_length_test                       17.4363
returns_test                            83.822761
return_std_test                         14.055491
average_reward_test                      4.807394
round_time_test            0 days 00:00:12.327746
round_time_total           0 days 00:14:11.674239
loss_total             442951124262656428474368.0
loss_critic            553688896586833677254656.0
loss_actor                  -1338033870012.416016
memory_size                              5268.775 

=== epoch 10/10 ==== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:50,  1.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                      580
episode_length                               17.2
returns                                 82.900714
return_std                              12.232487
average_reward                           4.819711
round_time                 0 days 00:14:15.309634
episodes_test                               593.0
episode_length_test                     16.863406
returns_test                            81.465256
return_std_test                          8.834571
average_reward_test                       4.83089
round_time_test            0 days 00:00:12.439994
round_time_total           0 days 00:14:15.310714
loss_total             444281763564159626117120.0
loss_critic            555352194355877419941888.0
loss_actor                  -1338632755871.743896
memory_size                                5270.0 

=== epoch 10/10 ==== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:55,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                      584
episode_length                           17.10274
returns                                 82.523545
return_std                              12.092262
average_reward                           4.825131
round_time                 0 days 00:14:14.650000
episodes_test                               597.0
episode_length_test                     16.742044
returns_test                            80.716724
return_std_test                         10.427187
average_reward_test                      4.821234
round_time_test            0 days 00:00:12.303853
round_time_total           0 days 00:14:14.651060
loss_total             440626233666128385671168.0
loss_critic            550782782951612187934720.0
loss_actor                  -1338752923533.312012
memory_size                                5270.0 

=== epoch 10/10 ==== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:02,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      587
episode_length                          16.994889
returns                                 82.012812
return_std                              11.583685
average_reward                           4.825644
round_time                 0 days 00:14:09.567569
episodes_test                               602.0
episode_length_test                     16.607973
returns_test                            80.092374
return_std_test                          9.032431
average_reward_test                      4.822549
round_time_test            0 days 00:00:12.308059
round_time_total           0 days 00:14:09.568606
loss_total             442602614457493046165504.0
loss_critic            553253258242760183382016.0
loss_actor                  -1337761029554.176025
memory_size                                5270.0 

=== epoch 10/10 ==== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:30,  2.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [15:17<00:00,  2.18it/s]
episodes                                      594
episode_length                          16.819865
returns                                 81.174402
return_std                               9.393117
average_reward                           4.826177
round_time                 0 days 00:15:18.420322
episodes_test                               604.0
episode_length_test                     16.529801
returns_test                            79.669218
return_std_test                          8.435711
average_reward_test                      4.819967
round_time_test            0 days 00:00:12.016839
round_time_total           0 days 00:15:18.421451
loss_total             437921839266621361750016.0
loss_critic            547402290044552236498944.0
loss_actor                  -1333815315267.583984
memory_size                                5270.0 

=== epoch 10/10 ==== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:23,  1.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:13<00:00,  1.94it/s]
episodes                                      599
episode_length                          16.664441
returns                                 80.450701
return_std                               8.001532
average_reward                           4.827438
round_time                 0 days 00:17:13.983760
episodes_test                               594.0
episode_length_test                     16.818182
returns_test                            81.237252
return_std_test                         11.833392
average_reward_test                      4.830328
round_time_test            0 days 00:00:12.625696
round_time_total           0 days 00:17:13.984900
loss_total             441909541441591773757440.0
loss_critic            552386917634912720781312.0
loss_actor                  -1331683707781.120117
memory_size                                5270.0 

=== epoch 10/10 ==== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:49,  1.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:02<00:00,  1.96it/s]
episodes                                      601
episode_length                          16.620632
returns                                 80.218611
return_std                               7.668494
average_reward                           4.826441
round_time                 0 days 00:17:02.935709
episodes_test                               602.0
episode_length_test                     16.593023
returns_test                            80.082295
return_std_test                         10.154434
average_reward_test                      4.826275
round_time_test            0 days 00:00:12.823218
round_time_total           0 days 00:17:02.937029
loss_total             448969125071328024461312.0
loss_critic            561211396674435239903232.0
loss_actor                  -1333299193315.327881
memory_size                                5270.0 

=== epoch 10/10 ==== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:17,  1.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:13<00:00,  2.34it/s]
episodes                                      601
episode_length                          16.607321
returns                                 80.128961
return_std                               7.532716
average_reward                           4.824885
round_time                 0 days 00:14:14.252364
episodes_test                               603.0
episode_length_test                     16.580431
returns_test                            80.047832
return_std_test                         12.586242
average_reward_test                      4.827863
round_time_test            0 days 00:00:12.632981
round_time_total           0 days 00:14:14.253458
loss_total             444280964580549750423552.0
loss_critic            555351195817768044724224.0
loss_actor                  -1340241605951.488037
memory_size                                5270.0 

=== epoch 10/10 ==== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:56,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                      598
episode_length                          16.687291
returns                                 80.532926
return_std                                9.40328
average_reward                           4.825955
round_time                 0 days 00:16:29.553365
episodes_test                               596.0
episode_length_test                     16.766779
returns_test                            80.833622
return_std_test                         11.374909
average_reward_test                      4.821098
round_time_test            0 days 00:00:12.220465
round_time_total           0 days 00:16:29.554514
loss_total             447532622950019125739520.0
loss_critic            559415768790863768977408.0
loss_actor                  -1340232951529.471924
memory_size                              5273.213 

=== epoch 10/10 ==== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:37,  1.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [17:13<00:00,  1.94it/s]
episodes                                      595
episode_length                          16.771429
returns                                 80.941005
return_std                               10.18002
average_reward                           4.826054
round_time                 0 days 00:17:13.932555
episodes_test                               583.0
episode_length_test                      17.13036
returns_test                            82.635143
return_std_test                         11.419589
average_reward_test                      4.823898
round_time_test            0 days 00:00:12.811049
round_time_total           0 days 00:17:13.933633
loss_total             446132199975973057200128.0
loss_critic            557665240449356742524928.0
loss_actor                  -1334733465780.224121
memory_size                                5282.0 

=== epoch 10/10 ==== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:20,  1.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                      590
episode_length                          16.915254
returns                                 81.527997
return_std                              11.182617
average_reward                           4.819701
round_time                 0 days 00:16:33.560637
episodes_test                               572.0
episode_length_test                     17.472028
returns_test                            84.566706
return_std_test                         17.036533
average_reward_test                       4.84014
round_time_test            0 days 00:00:12.868489
round_time_total           0 days 00:16:33.561699
loss_total             445835926882455635623936.0
loss_critic            557294898913574922485760.0
loss_actor                  -1328046749777.919922
memory_size                              5284.089 

=== epoch 10/10 ==== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:42,  1.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      586
episode_length                           17.02901
returns                                 82.119464
return_std                              11.713501
average_reward                           4.822184
round_time                 0 days 00:14:08.812678
episodes_test                               580.0
episode_length_test                     17.239655
returns_test                            83.218243
return_std_test                         14.218418
average_reward_test                      4.827149
round_time_test            0 days 00:00:12.266451
round_time_total           0 days 00:14:08.813870
loss_total             439969271340701635837952.0
loss_critic            549961579598972440281088.0
loss_actor                  -1315996427091.968018
memory_size                                5286.0 

=== epoch 10/10 ==== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      586
episode_length                          17.042662
returns                                 82.175452
return_std                               11.89275
average_reward                           4.821723
round_time                 0 days 00:14:08.164018
episodes_test                               590.0
episode_length_test                     16.910169
returns_test                            81.524404
return_std_test                          9.920233
average_reward_test                       4.82145
round_time_test            0 days 00:00:12.162507
round_time_total           0 days 00:14:08.165226
loss_total             433951039749570789638144.0
loss_critic            542438790533397216231424.0
loss_actor                  -1291050902749.184082
memory_size                                5286.0 

=== epoch 10/10 ==== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:20,  2.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      585
episode_length                          17.083761
returns                                 82.393349
return_std                              11.311696
average_reward                           4.822941
round_time                 0 days 00:14:08.945045
episodes_test                               584.0
episode_length_test                     17.109589
returns_test                            82.672684
return_std_test                         10.816832
average_reward_test                      4.831952
round_time_test            0 days 00:00:12.368977
round_time_total           0 days 00:14:08.946244
loss_total             431571683240645993955328.0
loss_critic            539464595036852874903552.0
loss_actor                  -1297140186742.783936
memory_size                             5286.0205 

=== epoch 10/10 ==== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:18,  2.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      576
episode_length                           17.34375
returns                                 83.745788
return_std                              13.958928
average_reward                           4.828584
round_time                 0 days 00:14:08.409210
episodes_test                               568.0
episode_length_test                     17.588028
returns_test                            85.293524
return_std_test                         12.582897
average_reward_test                      4.849496
round_time_test            0 days 00:00:12.351288
round_time_total           0 days 00:14:08.410266
loss_total             434485319517548272680960.0
loss_critic            543106639842548696219648.0
loss_actor                  -1300128433438.719971
memory_size                             5297.0585 

=== epoch 10/10 ==== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:15,  2.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      570
episode_length                          17.522807
returns                                 84.821734
return_std                              14.475451
average_reward                           4.840562
round_time                 0 days 00:14:09.782979
episodes_test                               541.0
episode_length_test                     18.469501
returns_test                            89.941712
return_std_test                         20.915365
average_reward_test                      4.869737
round_time_test            0 days 00:00:12.357969
round_time_total           0 days 00:14:09.784190
loss_total             425625979608505365037056.0
loss_critic            532032465575490023849984.0
loss_actor                  -1293283743039.488037
memory_size                                5310.0 

=== epoch 10/10 ==== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:11,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      558
episode_length                          17.912186
returns                                 86.912825
return_std                              15.852568
average_reward                           4.852113
round_time                 0 days 00:14:11.695129
episodes_test                               513.0
episode_length_test                     19.452242
returns_test                              94.8578
return_std_test                         19.144518
average_reward_test                      4.876379
round_time_test            0 days 00:00:12.474442
round_time_total           0 days 00:14:11.696181
loss_total             429353091061086756536320.0
loss_critic            536691354389065444098048.0
loss_actor                  -1301464974032.895996
memory_size                                5310.0 

=== epoch 10/10 ==== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      543
episode_length                          18.388582
returns                                 89.400101
return_std                              17.929594
average_reward                           4.861597
round_time                 0 days 00:14:07.169384
episodes_test                               536.0
episode_length_test                     18.647388
returns_test                            91.049798
return_std_test                         15.817291
average_reward_test                       4.88271
round_time_test            0 days 00:00:12.402673
round_time_total           0 days 00:14:07.170441
loss_total             428349321246119817117696.0
loss_critic            535436642201421609959424.0
loss_actor                  -1298580370620.416016
memory_size                              5314.293 

=== epoch 10/10 ==== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:51,  2.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      535
episode_length                          18.654206
returns                                 90.770589
return_std                              18.045017
average_reward                           4.865942
round_time                 0 days 00:14:08.075140
episodes_test                               545.0
episode_length_test                     18.344954
returns_test                            89.215712
return_std_test                         14.720361
average_reward_test                      4.863246
round_time_test            0 days 00:00:12.370168
round_time_total           0 days 00:14:08.076203
loss_total             437662630302641175920640.0
loss_critic            547078278659433039921152.0
loss_actor                  -1297568488554.496094
memory_size                                5322.0 

=== epoch 10/10 ==== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:10,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      535
episode_length                          18.657944
returns                                 90.838918
return_std                              16.533272
average_reward                           4.868546
round_time                 0 days 00:14:10.000422
episodes_test                               543.0
episode_length_test                      18.39779
returns_test                            89.768296
return_std_test                         18.882637
average_reward_test                      4.879229
round_time_test            0 days 00:00:12.359668
round_time_total           0 days 00:14:10.001535
loss_total             440079333235735100653568.0
loss_critic            550099157217714020286464.0
loss_actor                  -1306416636166.144043
memory_size                                5322.0 

=== epoch 10/10 ==== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                      536
episode_length                          18.626866
returns                                 90.655383
return_std                              18.673369
average_reward                           4.866782
round_time                 0 days 00:14:06.709402
episodes_test                               571.0
episode_length_test                     17.499124
returns_test                            84.864361
return_std_test                         17.884604
average_reward_test                      4.849651
round_time_test            0 days 00:00:12.273995
round_time_total           0 days 00:14:06.710471
loss_total             444127716731940755210240.0
loss_critic            555159637081115257208832.0
loss_actor                  -1318238047436.800049
memory_size                             5334.9645 

=== epoch 10/10 ==== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:05,  2.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                      540
episode_length                           18.47037
returns                                 89.819626
return_std                               18.42629
average_reward                            4.86272
round_time                 0 days 00:14:09.784234
episodes_test                               575.0
episode_length_test                     17.391304
returns_test                            84.218223
return_std_test                         13.841075
average_reward_test                      4.842548
round_time_test            0 days 00:00:12.286538
round_time_total           0 days 00:14:09.785283
loss_total             445971730308123999600640.0
loss_critic            557464653157379786932224.0
loss_actor                  -1316427578408.959961
memory_size                                5347.0 

=== epoch 10/10 ==== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      545
episode_length                          18.293578
returns                                 88.932932
return_std                              17.704771
average_reward                           4.861894
round_time                 0 days 00:14:08.443234
episodes_test                               550.0
episode_length_test                     18.176364
returns_test                            88.512423
return_std_test                         15.340267
average_reward_test                      4.869655
round_time_test            0 days 00:00:12.316956
round_time_total           0 days 00:14:08.444294
loss_total             443533777167209912598528.0
loss_critic            554417212260410166607872.0
loss_actor                  -1309271588732.927979
memory_size                             5352.3795 

=== epoch 10/10 ==== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:40,  2.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      544
episode_length                          18.351103
returns                                 89.338315
return_std                              18.224348
average_reward                           4.868065
round_time                 0 days 00:14:09.455417
episodes_test                               538.0
episode_length_test                     18.552045
returns_test                            90.484509
return_std_test                          18.67695
average_reward_test                      4.877377
round_time_test            0 days 00:00:12.297983
round_time_total           0 days 00:14:09.456467
loss_total             441861605784683578130432.0
loss_critic            552326998151597645627392.0
loss_actor                  -1302562940321.791992
memory_size                                5354.0 

=== epoch 10/10 ==== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:02,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      547
episode_length                          18.250457
returns                                 88.810825
return_std                               18.37678
average_reward                           4.866096
round_time                 0 days 00:14:08.349905
episodes_test                               546.0
episode_length_test                     18.309524
returns_test                            88.998394
return_std_test                         16.982661
average_reward_test                      4.860781
round_time_test            0 days 00:00:12.139954
round_time_total           0 days 00:14:08.350950
loss_total             440441814738166475653120.0
loss_critic            550552258845803422941184.0
loss_actor                   -1311929266667.52002
memory_size                                5354.0 

=== epoch 10/10 ==== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:57,  2.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                      543
episode_length                          18.377532
returns                                 89.465116
return_std                               18.24434
average_reward                           4.868301
round_time                 0 days 00:14:09.032301
episodes_test                               543.0
episode_length_test                     18.416206
returns_test                            89.345558
return_std_test                         21.974288
average_reward_test                      4.851464
round_time_test            0 days 00:00:12.440917
round_time_total           0 days 00:14:09.033351
loss_total             444234634087613532733440.0
loss_critic            555293282877238128148480.0
loss_actor                  -1311349298823.167969
memory_size                             5357.2785 

=== epoch 10/10 ==== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:37,  2.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                      541
episode_length                          18.476895
returns                                 89.907877
return_std                              18.320939
average_reward                           4.865912
round_time                 0 days 00:14:07.991745
episodes_test                               537.0
episode_length_test                     18.612663
returns_test                            90.568438
return_std_test                           18.3408
average_reward_test                       4.86597
round_time_test            0 days 00:00:12.316012
round_time_total           0 days 00:14:07.992778
loss_total             453922552903413163622400.0
loss_critic            567403181043455084199936.0
loss_actor                  -1312290058174.464111
memory_size                                5368.0 

=== epoch 10/10 ==== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      532
episode_length                          18.781955
returns                                 91.343745
return_std                              19.712942
average_reward                           4.863354
round_time                 0 days 00:14:11.994292
episodes_test                               512.0
episode_length_test                     19.519531
returns_test                            94.976634
return_std_test                         23.369723
average_reward_test                      4.865742
round_time_test            0 days 00:00:12.324220
round_time_total           0 days 00:14:11.995335
loss_total             449124577332875426267136.0
loss_critic            561405712343643052834816.0
loss_actor                  -1309798297370.624023
memory_size                             5369.1145 

=== epoch 10/10 ==== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:37,  2.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                      522
episode_length                          19.105364
returns                                 92.876537
return_std                              22.725354
average_reward                           4.861842
round_time                 0 days 00:14:11.574759
episodes_test                               507.0
episode_length_test                     19.714004
returns_test                            96.008812
return_std_test                         23.486696
average_reward_test                      4.870096
round_time_test            0 days 00:00:12.176198
round_time_total           0 days 00:14:11.575819
loss_total             452481862632609192869888.0
loss_critic            565602318175676730966016.0
loss_actor                  -1304974928248.832031
memory_size                             5380.1085 

=== epoch 10/10 ==== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:10,  2.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      509
episode_length                          19.614931
returns                                 95.300766
return_std                              24.389151
average_reward                           4.858634
round_time                 0 days 00:14:10.917351
episodes_test                               518.0
episode_length_test                     19.283784
returns_test                            93.837139
return_std_test                         21.124343
average_reward_test                      4.866067
round_time_test            0 days 00:00:12.335740
round_time_total           0 days 00:14:10.918501
loss_total             457366048881431411687424.0
loss_critic            571707551835633076928512.0
loss_actor                  -1306088802811.904053
memory_size                             5404.9165 

=== epoch 10/10 ==== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:21,  2.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                      506
episode_length                           19.73913
returns                                 95.936961
return_std                              23.438163
average_reward                           4.860173
round_time                 0 days 00:14:10.063466
episodes_test                               523.0
episode_length_test                     19.107075
returns_test                             92.89875
return_std_test                         19.520765
average_reward_test                      4.862011
round_time_test            0 days 00:00:12.274129
round_time_total           0 days 00:14:10.064513
loss_total             462597107183398124781568.0
loss_critic            578246374377573236867072.0
loss_actor                  -1314394434043.904053
memory_size                                5406.0 

=== epoch 10/10 ==== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:37,  2.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                      508
episode_length                          19.661417
returns                                 95.552918
return_std                              23.532631
average_reward                           4.859848
round_time                 0 days 00:14:11.424468
episodes_test                               523.0
episode_length_test                     19.097514
returns_test                            92.704671
return_std_test                         20.801119
average_reward_test                      4.854237
round_time_test            0 days 00:00:12.212111
round_time_total           0 days 00:14:11.425519
loss_total             464840662806684726460416.0
loss_critic            581050818602688518815744.0
loss_actor                    -1316050536497.1521
memory_size                                5406.0 

=== epoch 10/10 ==== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Humanoid-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<HumanoidEnv<Humanoid-v4>>>>>>>>>>
episodes                                      515
episode_length                          19.378641
returns                                 94.160378
return_std                               22.41723
average_reward                           4.859104
round_time                 0 days 00:14:13.402158
episodes_test                               559.0
episode_length_test                     17.865832
returns_test                            86.653475
return_std_test                         16.399165
average_reward_test                      4.850114
round_time_test            0 days 00:00:12.272933
round_time_total           0 days 00:14:13.403207
loss_total             463714077118712426528768.0
loss_critic            579642586999378105335808.0
loss_actor                  -1317066388340.736084
memory_size                                5406.0 


