/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: WARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Walker2d-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 4
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>

=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1026.74it/s]
episodes                                  118
episode_length                       16.79661
returns                             -1.467717
return_std                           6.101378
average_reward                      -0.085089
round_time             0 days 00:00:02.026327
episodes_test                           478.0
episode_length_test                 20.916318
returns_test                         2.723482
return_std_test                      9.421594
average_reward_test                  0.130359
round_time_test        0 days 00:00:09.665559
round_time_total       0 days 00:00:11.502629 

=== epoch 1/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1071.07it/s]
episodes                                  234
episode_length                       17.07265
returns                             -1.392781
return_std                           5.837673
average_reward                      -0.080758
round_time             0 days 00:00:02.369957
episodes_test                           473.0
episode_length_test                 21.135307
returns_test                         3.283082
return_std_test                      9.501433
average_reward_test                  0.155237
round_time_test        0 days 00:00:09.849332
round_time_total       0 days 00:00:11.593408 

=== epoch 1/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1048.46it/s]
episodes                                  352
episode_length                      17.017045
returns                             -1.531278
return_std                            5.84199
average_reward                      -0.089427
round_time             0 days 00:00:02.412958
episodes_test                           453.0
episode_length_test                 22.075055
returns_test                         4.076168
return_std_test                     10.189236
average_reward_test                   0.18465
round_time_test        0 days 00:00:09.685329
round_time_total       0 days 00:00:11.444246 

=== epoch 1/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1042.80it/s]
episodes                                  472
episode_length                      16.923729
returns                             -1.564193
return_std                           5.761629
average_reward                      -0.093239
round_time             0 days 00:00:02.428930
episodes_test                           477.0
episode_length_test                 20.893082
returns_test                         3.140007
return_std_test                     10.002438
average_reward_test                  0.152105
round_time_test        0 days 00:00:09.677155
round_time_total       0 days 00:00:11.517204 

=== epoch 1/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 988.76it/s] 
episodes                                  597
episode_length                      16.747069
returns                             -1.711799
return_std                           5.602994
average_reward                      -0.102038
round_time             0 days 00:00:02.572024
episodes_test                           474.0
episode_length_test                 21.056962
returns_test                         2.550979
return_std_test                      9.064656
average_reward_test                  0.122479
round_time_test        0 days 00:00:09.874453
round_time_total       0 days 00:00:11.666771 

=== epoch 1/10 ===== round 6/50 ======================================
/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
starting training
episodes                                  586
episode_length                      17.061433
returns                             -1.145105
return_std                           6.877151
average_reward                      -0.067027
round_time             0 days 00:12:32.422462
episodes_test                           470.0
episode_length_test                 21.251064
returns_test                         3.046382
return_std_test                      9.812926
average_reward_test                  0.142853
round_time_test        0 days 00:00:09.869491
round_time_total       0 days 00:12:32.423607
loss_total                         157.748954
loss_critic                        247.573983
loss_actor                        -201.551173
memory_size                           362.028 

=== epoch 1/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [12:38<00:00,  2.64it/s]
episodes                                  631
episode_length                       15.81775
returns                             -1.062077
return_std                            6.45283
average_reward                      -0.067166
round_time             0 days 00:12:39.207519
episodes_test                           780.0
episode_length_test                 12.802564
returns_test                         0.302751
return_std_test                      5.658769
average_reward_test                  0.024413
round_time_test        0 days 00:00:10.327088
round_time_total       0 days 00:12:39.208772
loss_total                       46951.714089
loss_critic                      59917.289646
loss_actor                       -4910.591572
memory_size                          468.4455 

=== epoch 1/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [15:24<00:00,  2.16it/s]
episodes                                  712
episode_length                      14.023876
returns                             -1.034487
return_std                            5.76839
average_reward                      -0.074564
round_time             0 days 00:15:25.022168
episodes_test                           792.0
episode_length_test                 12.618687
returns_test                        -1.041908
return_std_test                      4.221527
average_reward_test                 -0.082777
round_time_test        0 days 00:00:10.254504
round_time_total       0 days 00:15:25.023297
loss_total                  25728392645.36689
loss_critic                32160648464.223171
loss_actor                     -632674.541019
memory_size                             486.0 

=== epoch 1/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [15:23<00:00,  2.17it/s]
episodes                                  771
episode_length                      12.959792
returns                              -1.09447
return_std                           5.272797
average_reward                      -0.084554
round_time             0 days 00:15:23.632952
episodes_test                           990.0
episode_length_test                 10.094949
returns_test                         -1.20114
return_std_test                      2.567591
average_reward_test                 -0.118369
round_time_test        0 days 00:00:10.365427
round_time_total       0 days 00:15:23.634093
loss_total              13905344127295.488281
loss_critic             17381682432245.759766
loss_actor                   -10267732.835625
memory_size                          486.0065 

=== epoch 1/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [15:39<00:00,  2.13it/s]
episodes                                  736
episode_length                      13.506793
returns                             -0.426355
return_std                           7.664725
average_reward                      -0.028484
round_time             0 days 00:15:39.944269
episodes_test                           474.0
episode_length_test                  21.07173
returns_test                        -3.918081
return_std_test                      6.111479
average_reward_test                 -0.185556
round_time_test        0 days 00:00:09.825639
round_time_total       0 days 00:15:39.945381
loss_total              160711110670942.21875
loss_critic              200888889479528.4375
loss_actor                      -32126561.656
memory_size                           799.347 

=== epoch 1/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  764
episode_length                      13.015707
returns                             -0.089075
return_std                           8.657812
average_reward                      -0.005256
round_time             0 days 00:15:44.106857
episodes_test                           417.0
episode_length_test                 23.966427
returns_test                         9.226014
return_std_test                     20.345874
average_reward_test                  0.384796
round_time_test        0 days 00:00:09.958666
round_time_total       0 days 00:15:44.107994
loss_total                 344873735611744.25
loss_critic                 431092166916309.0
loss_actor                      -50026176.186
memory_size                          1229.757 

=== epoch 1/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                  722
episode_length                      13.847645
returns                              1.525944
return_std                          26.524178
average_reward                       0.110127
round_time             0 days 00:15:43.350348
episodes_test                           869.0
episode_length_test                 11.501726
returns_test                         0.532124
return_std_test                     11.610919
average_reward_test                  0.045893
round_time_test        0 days 00:00:10.623890
round_time_total       0 days 00:15:43.351449
loss_total               396401458083266.5625
loss_critic              495501815853350.9375
loss_actor                      -60262263.014
memory_size                         1815.9455 

=== epoch 1/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                  703
episode_length                      14.213371
returns                              1.397709
return_std                          26.919969
average_reward                       0.098025
round_time             0 days 00:15:45.535531
episodes_test                           938.0
episode_length_test                 10.653518
returns_test                        -1.452517
return_std_test                      3.296575
average_reward_test                 -0.135831
round_time_test        0 days 00:00:10.460298
round_time_total       0 days 00:15:45.537055
loss_total                 632157298495586.25
loss_critic                790196612017160.25
loss_actor                      -80799454.894
memory_size                          2051.101 

=== epoch 1/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  679
episode_length                       14.69514
returns                              1.323095
return_std                          27.422197
average_reward                       0.090682
round_time             0 days 00:15:43.820636
episodes_test                           788.0
episode_length_test                 12.678934
returns_test                        -2.163891
return_std_test                      3.487371
average_reward_test                 -0.170237
round_time_test        0 days 00:00:10.545278
round_time_total       0 days 00:15:43.821938
loss_total                 1304822806797615.0
loss_critic               1631028480017694.75
loss_actor                     -120099952.464
memory_size                         2205.1365 

=== epoch 1/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  762
episode_length                       13.11811
returns                              0.444359
return_std                          25.297657
average_reward                       0.034228
round_time             0 days 00:15:43.674427
episodes_test                           839.0
episode_length_test                   11.9118
returns_test                        -2.096463
return_std_test                      3.124849
average_reward_test                 -0.175604
round_time_test        0 days 00:00:10.492562
round_time_total       0 days 00:15:43.675551
loss_total                2229213037049413.75
loss_critic                2786516247313907.5
loss_actor                     -154842521.024
memory_size                          2303.626 

=== epoch 1/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                  812
episode_length                      12.305419
returns                             -0.611482
return_std                          23.913323
average_reward                       -0.04965
round_time             0 days 00:15:46.097468
episodes_test                           866.0
episode_length_test                  11.54388
returns_test                         -2.02148
return_std_test                       3.24936
average_reward_test                 -0.174883
round_time_test        0 days 00:00:10.371197
round_time_total       0 days 00:15:46.098754
loss_total                 3334499676073230.5
loss_critic                4168124525130547.0
loss_actor                     -189215491.296
memory_size                           2309.66 

=== epoch 1/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [15:41<00:00,  2.12it/s]
episodes                                  886
episode_length                      11.283296
returns                             -2.212345
return_std                            3.09338
average_reward                      -0.195708
round_time             0 days 00:15:42.293269
episodes_test                           958.0
episode_length_test                 10.430063
returns_test                        -2.138269
return_std_test                      2.869519
average_reward_test                 -0.204911
round_time_test        0 days 00:00:10.725577
round_time_total       0 days 00:15:42.294374
loss_total                 4902766333415916.0
loss_critic                6128457807684436.0
loss_actor                     -230200166.384
memory_size                            2310.0 

=== epoch 1/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  905
episode_length                      11.038674
returns                             -2.058992
return_std                           2.940868
average_reward                      -0.186702
round_time             0 days 00:15:44.071675
episodes_test                           955.0
episode_length_test                 10.463874
returns_test                        -2.121403
return_std_test                      2.822359
average_reward_test                 -0.202012
round_time_test        0 days 00:00:10.611252
round_time_total       0 days 00:15:44.072783
loss_total                 7481567209274212.0
loss_critic                9351958845833872.0
loss_actor                     -286406107.448
memory_size                            2310.0 

=== epoch 1/10 ===== round 19/50 =====================================
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                  943
episode_length                      10.562036
returns                              -1.81817
return_std                            2.85928
average_reward                      -0.173089
round_time             0 days 00:15:44.720249
episodes_test                           992.0
episode_length_test                 10.073589
returns_test                        -1.578799
return_std_test                      2.666066
average_reward_test                 -0.156627
round_time_test        0 days 00:00:10.575764
round_time_total       0 days 00:15:44.721358
loss_total                11415401654351036.0
loss_critic               14269251816146338.0
loss_actor                      -354217935.28
memory_size                         2312.0525 

=== epoch 1/10 ===== round 20/50 =====================================
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                  968
episode_length                      10.319215
returns                             -1.770797
return_std                           2.853002
average_reward                      -0.171721
round_time             0 days 00:15:46.631560
episodes_test                           989.0
episode_length_test                 10.111223
returns_test                        -1.585522
return_std_test                      2.643209
average_reward_test                 -0.156808
round_time_test        0 days 00:00:10.683813
round_time_total       0 days 00:15:46.632678
loss_total                17286181204613136.0
loss_critic               21607726137540084.0
loss_actor                      -434192929.52
memory_size                            2328.0 

=== epoch 1/10 ===== round 21/50 =====================================
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                  976
episode_length                      10.229508
returns                              -1.58602
return_std                           2.907329
average_reward                       -0.15486
round_time             0 days 00:15:46.896476
episodes_test                           980.0
episode_length_test                 10.194898
returns_test                        -1.632315
return_std_test                      2.775246
average_reward_test                 -0.159866
round_time_test        0 days 00:00:10.695663
round_time_total       0 days 00:15:46.897669
loss_total                25389722276154312.0
loss_critic               31737152291947412.0
loss_actor                     -528667764.976
memory_size                            2328.0 

=== epoch 1/10 ===== round 22/50 =====================================
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                  939
episode_length                      10.609159
returns                             -1.272663
return_std                           3.006984
average_reward                      -0.119185
round_time             0 days 00:15:46.481168
episodes_test                           949.0
episode_length_test                 10.537408
returns_test                        -1.244565
return_std_test                      3.009615
average_reward_test                 -0.118109
round_time_test        0 days 00:00:10.448036
round_time_total       0 days 00:15:46.482285
loss_total                36556166375067880.0
loss_critic               45695207152656840.0
loss_actor                     -644550773.088
memory_size                         2370.8375 

=== epoch 1/10 ===== round 23/50 =====================================
100%|██████████| 2000/2000 [15:49<00:00,  2.11it/s]
episodes                                  814
episode_length                      12.269042
returns                             -0.360244
return_std                          12.198763
average_reward                      -0.029883
round_time             0 days 00:15:49.686813
episodes_test                           287.0
episode_length_test                 34.770035
returns_test                         5.929807
return_std_test                     42.061039
average_reward_test                  0.171356
round_time_test        0 days 00:00:09.943185
round_time_total       0 days 00:15:49.687927
loss_total                43447384026105512.0
loss_critic               54309229129212360.0
loss_actor                     -741827293.088
memory_size                          3096.404 

=== epoch 1/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:55<00:00,  2.09it/s]
episodes                                  809
episode_length                       12.33869
returns                              -0.28759
return_std                          12.214995
average_reward                      -0.023908
round_time             0 days 00:15:55.740634
episodes_test                           915.0
episode_length_test                 10.923497
returns_test                        -0.725802
return_std_test                      2.921267
average_reward_test                 -0.066004
round_time_test        0 days 00:00:10.688530
round_time_total       0 days 00:15:55.741799
loss_total                39301247206546208.0
loss_critic               49126558098186568.0
loss_actor                     -701862715.072
memory_size                          3469.692 

=== epoch 1/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:53<00:00,  2.10it/s]
episodes                                  799
episode_length                      12.499374
returns                             -0.145806
return_std                          12.284239
average_reward                      -0.011574
round_time             0 days 00:15:54.359628
episodes_test                           895.0
episode_length_test                 11.168715
returns_test                        -0.732123
return_std_test                      2.880739
average_reward_test                 -0.065268
round_time_test        0 days 00:00:10.467971
round_time_total       0 days 00:15:54.360771
loss_total                33894062709725464.0
loss_critic               42367577669091984.0
loss_actor                     -630233247.232
memory_size                            3481.0 

=== epoch 1/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:52<00:00,  2.10it/s]
episodes                                  803
episode_length                      12.448319
returns                             -0.170512
return_std                          12.246403
average_reward                      -0.013453
round_time             0 days 00:15:53.462844
episodes_test                           920.0
episode_length_test                 10.868478
returns_test                         -0.70223
return_std_test                      3.436753
average_reward_test                 -0.064524
round_time_test        0 days 00:00:10.426526
round_time_total       0 days 00:15:53.464011
loss_total                28573779995064796.0
loss_critic               35717224395891016.0
loss_actor                     -569064072.448
memory_size                            3481.0 

=== epoch 1/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:54<00:00,  2.10it/s]
episodes                                  834
episode_length                      11.872902
returns                             -0.182985
return_std                          11.967858
average_reward                      -0.018361
round_time             0 days 00:15:54.947088
episodes_test                           951.0
episode_length_test                 10.512093
returns_test                        -1.141762
return_std_test                      2.830411
average_reward_test                 -0.108251
round_time_test        0 days 00:00:10.472520
round_time_total       0 days 00:15:54.948222
loss_total                29768809173821160.0
loss_critic               37211010824910408.0
loss_actor                     -628309842.496
memory_size                            3481.0 

=== epoch 1/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:53<00:00,  2.10it/s]
episodes                                  948
episode_length                      10.535865
returns                             -0.981193
return_std                           2.926611
average_reward                       -0.09338
round_time             0 days 00:15:54.102821
episodes_test                           928.0
episode_length_test                 10.769397
returns_test                        -0.792158
return_std_test                      3.082322
average_reward_test                 -0.073321
round_time_test        0 days 00:00:10.567402
round_time_total       0 days 00:15:54.104038
loss_total                37755318349839992.0
loss_critic               47194147125282736.0
loss_actor                     -748352205.728
memory_size                            3481.0 

=== epoch 1/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:54<00:00,  2.10it/s]
episodes                                  941
episode_length                      10.616366
returns                             -0.891141
return_std                           3.064054
average_reward                      -0.084102
round_time             0 days 00:15:54.714990
episodes_test                           928.0
episode_length_test                 10.762931
returns_test                        -0.975691
return_std_test                      3.270592
average_reward_test                 -0.090179
round_time_test        0 days 00:00:10.434184
round_time_total       0 days 00:15:54.716280
loss_total                47881767092257552.0
loss_critic               59852207811041688.0
loss_actor                     -857558247.808
memory_size                            3481.0 

=== epoch 1/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:53<00:00,  2.10it/s]
episodes                                  917
episode_length                      10.896401
returns                             -0.846035
return_std                           3.271787
average_reward                      -0.077731
round_time             0 days 00:15:53.731853
episodes_test                           902.0
episode_length_test                 11.080931
returns_test                        -0.786822
return_std_test                      3.229153
average_reward_test                 -0.071086
round_time_test        0 days 00:00:10.619488
round_time_total       0 days 00:15:53.732981
loss_total                56581151251226424.0
loss_critic               70726437845872936.0
loss_actor                     -945568337.344
memory_size                          3493.168 

=== epoch 1/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:56<00:00,  2.09it/s]
episodes                                  825
episode_length                       12.09697
returns                              0.062545
return_std                          19.804731
average_reward                       0.004971
round_time             0 days 00:15:56.751626
episodes_test                           710.0
episode_length_test                 14.077465
returns_test                         0.049396
return_std_test                      4.949113
average_reward_test                  0.003709
round_time_test        0 days 00:00:10.324846
round_time_total       0 days 00:15:56.752752
loss_total                60789065363174192.0
loss_critic               75986330430375728.0
loss_actor                    -1002296547.776
memory_size                         3785.5145 

=== epoch 1/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:53<00:00,  2.10it/s]
episodes                                  815
episode_length                      12.250307
returns                              0.099835
return_std                          19.937735
average_reward                       0.007945
round_time             0 days 00:15:54.230614
episodes_test                           757.0
episode_length_test                 13.206077
returns_test                         0.277067
return_std_test                      3.875432
average_reward_test                  0.021196
round_time_test        0 days 00:00:10.296383
round_time_total       0 days 00:15:54.231716
loss_total                55979752088072616.0
loss_critic               69974688934075040.0
loss_actor                     -961860622.144
memory_size                            4131.0 

=== epoch 1/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:55<00:00,  2.09it/s]
episodes                                  804
episode_length                      12.426617
returns                              0.080946
return_std                          20.076632
average_reward                       0.006936
round_time             0 days 00:15:55.983143
episodes_test                           897.0
episode_length_test                 11.140468
returns_test                        -0.738101
return_std_test                      3.189091
average_reward_test                 -0.066214
round_time_test        0 days 00:00:10.413478
round_time_total       0 days 00:15:55.984276
loss_total                50462789431367568.0
loss_critic               63078485695737624.0
loss_actor                     -896508991.808
memory_size                          4141.203 

=== epoch 1/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:52<00:00,  2.10it/s]
episodes                                  803
episode_length                      12.445828
returns                              0.050192
return_std                          20.084485
average_reward                       0.004376
round_time             0 days 00:15:53.457449
episodes_test                           823.0
episode_length_test                 12.149453
returns_test                        -0.232814
return_std_test                      3.891202
average_reward_test                 -0.019052
round_time_test        0 days 00:00:10.225444
round_time_total       0 days 00:15:53.458666
loss_total                46067955033962448.0
loss_critic               57584942785417448.0
loss_actor                     -835804386.208
memory_size                            4150.0 

=== epoch 1/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:54<00:00,  2.10it/s]
episodes                                  825
episode_length                      12.113939
returns                             -0.159177
return_std                          19.800961
average_reward                      -0.013512
round_time             0 days 00:15:55.201124
episodes_test                           880.0
episode_length_test                   11.3625
returns_test                        -0.633841
return_std_test                      3.206263
average_reward_test                 -0.055686
round_time_test        0 days 00:00:10.423745
round_time_total       0 days 00:15:55.202345
loss_total                42522950160261056.0
loss_critic               53153686777579440.0
loss_actor                      -762778469.44
memory_size                            4150.0 

=== epoch 1/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:55<00:00,  2.09it/s]
episodes                                  874
episode_length                      11.434783
returns                             -1.013336
return_std                           3.549937
average_reward                      -0.088429
round_time             0 days 00:15:56.499048
episodes_test                           925.0
episode_length_test                 10.806486
returns_test                        -1.454271
return_std_test                      3.066161
average_reward_test                 -0.134845
round_time_test        0 days 00:00:10.775980
round_time_total       0 days 00:15:56.500175
loss_total                37849661517712064.0
loss_critic               47312076088880920.0
loss_actor                     -708135434.432
memory_size                         4204.5575 

=== epoch 1/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:56<00:00,  2.09it/s]
episodes                                  854
episode_length                      11.686183
returns                             -1.308806
return_std                           3.847642
average_reward                      -0.112142
round_time             0 days 00:15:57.007444
episodes_test                           916.0
episode_length_test                 10.909389
returns_test                        -1.215921
return_std_test                      3.242753
average_reward_test                 -0.110764
round_time_test        0 days 00:00:10.572395
round_time_total       0 days 00:15:57.008563
loss_total                30117177264902768.0
loss_critic               37646470965203304.0
loss_actor                      -613558405.44
memory_size                          4421.035 

=== epoch 1/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:57<00:00,  2.09it/s]
episodes                                  800
episode_length                         12.485
returns                             -1.601986
return_std                           4.309064
average_reward                      -0.128391
round_time             0 days 00:15:57.771448
episodes_test                           471.0
episode_length_test                 21.161359
returns_test                        -4.232892
return_std_test                     10.042003
average_reward_test                 -0.199161
round_time_test        0 days 00:00:09.881536
round_time_total       0 days 00:15:57.772700
loss_total                28310431418604848.0
loss_critic               35388038632701952.0
loss_actor                     -585439730.848
memory_size                         4852.0475 

=== epoch 1/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:58<00:00,  2.09it/s]
episodes                                  801
episode_length                       12.45568
returns                             -1.826519
return_std                           4.185961
average_reward                      -0.145904
round_time             0 days 00:15:58.684816
episodes_test                           940.0
episode_length_test                  10.63617
returns_test                        -2.008493
return_std_test                      2.759269
average_reward_test                 -0.188615
round_time_test        0 days 00:00:10.464807
round_time_total       0 days 00:15:58.685965
loss_total                28036761690524616.0
loss_critic               35045951528637564.0
loss_actor                      -585862409.76
memory_size                            5015.0 

=== epoch 1/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:57<00:00,  2.09it/s]
episodes                                  720
episode_length                      13.868056
returns                             -2.232973
return_std                           4.573087
average_reward                      -0.160237
round_time             0 days 00:15:58.303242
episodes_test                           648.0
episode_length_test                 15.419753
returns_test                        -3.178251
return_std_test                      4.391616
average_reward_test                 -0.205276
round_time_test        0 days 00:00:10.219738
round_time_total       0 days 00:15:58.304351
loss_total                30454066362517028.0
loss_critic               38067582317155584.0
loss_actor                     -664664205.312
memory_size                         5067.7545 

=== epoch 1/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:59<00:00,  2.08it/s]
episodes                                  667
episode_length                      14.941529
returns                              -2.67925
return_std                           4.554603
average_reward                      -0.179013
round_time             0 days 00:16:00.015037
episodes_test                           476.0
episode_length_test                 21.006303
returns_test                        -3.469505
return_std_test                      4.374248
average_reward_test                 -0.165066
round_time_test        0 days 00:00:09.860824
round_time_total       0 days 00:16:00.016140
loss_total                35210828726866544.0
loss_critic               44013535122268616.0
loss_actor                     -712742532.096
memory_size                          5216.737 

=== epoch 1/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:01<00:00,  2.08it/s]
episodes                                  600
episode_length                         16.545
returns                             -2.756263
return_std                           4.567998
average_reward                      -0.165828
round_time             0 days 00:16:02.405107
episodes_test                           462.0
episode_length_test                 21.571429
returns_test                        -3.423115
return_std_test                      4.386477
average_reward_test                 -0.158282
round_time_test        0 days 00:00:09.872084
round_time_total       0 days 00:16:02.406619
loss_total                38366918450090080.0
loss_critic               47958647201203224.0
loss_actor                       -709229212.8
memory_size                         5415.3175 

=== epoch 1/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                  570
episode_length                      17.529825
returns                             -2.398897
return_std                           4.158845
average_reward                      -0.137055
round_time             0 days 00:16:06.784380
episodes_test                           456.0
episode_length_test                 21.888158
returns_test                        -1.768082
return_std_test                      4.290909
average_reward_test                 -0.080625
round_time_test        0 days 00:00:09.847896
round_time_total       0 days 00:16:06.785495
loss_total                35848103621652644.0
loss_critic               44810128777594008.0
loss_actor                     -650111979.808
memory_size                         5656.7455 

=== epoch 1/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:07<00:00,  2.07it/s]
episodes                                  555
episode_length                      17.969369
returns                              -2.50379
return_std                           4.284795
average_reward                      -0.141174
round_time             0 days 00:16:07.880282
episodes_test                           556.0
episode_length_test                 17.985612
returns_test                        -0.944303
return_std_test                      3.945615
average_reward_test                 -0.052503
round_time_test        0 days 00:00:09.877707
round_time_total       0 days 00:16:07.881582
loss_total                27573241389646872.0
loss_critic               34466551113885680.0
loss_actor                     -528649086.704
memory_size                         5793.4505 

=== epoch 1/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                  661
episode_length                      15.119516
returns                             -2.365452
return_std                           3.679399
average_reward                      -0.156434
round_time             0 days 00:16:09.414757
episodes_test                          1025.0
episode_length_test                  9.749268
returns_test                        -2.732578
return_std_test                      2.740111
average_reward_test                 -0.279982
round_time_test        0 days 00:00:10.463819
round_time_total       0 days 00:16:09.415979
loss_total                19479990261956740.0
loss_critic               24349987404056100.0
loss_actor                     -425996428.752
memory_size                            5799.0 

=== epoch 1/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                  766
episode_length                      13.031332
returns                             -2.354598
return_std                           3.285109
average_reward                      -0.180058
round_time             0 days 00:16:09.800073
episodes_test                          1046.0
episode_length_test                  9.560229
returns_test                        -2.742888
return_std_test                      2.653291
average_reward_test                 -0.286906
round_time_test        0 days 00:00:10.814709
round_time_total       0 days 00:16:09.801191
loss_total                15572929406857904.0
loss_critic               19466161417793568.0
loss_actor                     -368858294.288
memory_size                            5799.0 

=== epoch 1/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:07<00:00,  2.07it/s]
episodes                                  885
episode_length                      11.283616
returns                             -2.407092
return_std                           2.911534
average_reward                      -0.213075
round_time             0 days 00:16:08.390852
episodes_test                          1049.0
episode_length_test                  9.526215
returns_test                        -3.171487
return_std_test                      2.759044
average_reward_test                 -0.332276
round_time_test        0 days 00:00:10.857755
round_time_total       0 days 00:16:08.392032
loss_total                14280116442459276.0
loss_critic               17850145242494272.0
loss_actor                     -372187708.848
memory_size                            5799.0 

=== epoch 1/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:08<00:00,  2.06it/s]
episodes                                  996
episode_length                      10.026104
returns                             -2.662324
return_std                           2.798201
average_reward                      -0.265167
round_time             0 days 00:16:09.177014
episodes_test                          1032.0
episode_length_test                  9.688953
returns_test                        -2.791116
return_std_test                       2.85874
average_reward_test                 -0.287965
round_time_test        0 days 00:00:10.428394
round_time_total       0 days 00:16:09.178180
loss_total                14927975727277016.0
loss_critic               18659969332242548.0
loss_actor                     -397294499.248
memory_size                            5799.0 

=== epoch 1/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:11<00:00,  2.06it/s]
episodes                                 1034
episode_length                       9.668279
returns                             -2.728651
return_std                           2.649953
average_reward                      -0.281935
round_time             0 days 00:16:12.036810
episodes_test                          1031.0
episode_length_test                  9.695441
returns_test                        -2.626033
return_std_test                       2.91636
average_reward_test                 -0.270619
round_time_test        0 days 00:00:10.766216
round_time_total       0 days 00:16:12.037944
loss_total                15719323956492108.0
loss_critic               19649154616815256.0
loss_actor                     -424491216.384
memory_size                            5799.0 

=== epoch 1/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1028
episode_length                       9.714981
returns                             -2.724602
return_std                           2.701119
average_reward                      -0.280513
round_time             0 days 00:16:10.624528
episodes_test                          1032.0
episode_length_test                  9.684109
returns_test                        -2.655976
return_std_test                      2.888883
average_reward_test                 -0.274424
round_time_test        0 days 00:00:11.043785
round_time_total       0 days 00:16:10.625654
loss_total                15451832456054834.0
loss_critic               19314790221370884.0
loss_actor                     -431316326.368
memory_size                            5799.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                  205
episode_length                       9.746341
returns                             -2.393395
return_std                           2.987835
average_reward                      -0.244489
round_time             0 days 00:15:43.011325
episodes_test                          1013.0
episode_length_test                  9.868707
returns_test                        -2.657769
return_std_test                        2.8852
average_reward_test                 -0.268964
round_time_test        0 days 00:00:10.873755
round_time_total       0 days 00:15:43.012450
loss_total                16022718316111986.0
loss_critic               20028397545502800.0
loss_actor                     -449930301.872
memory_size                            5799.0 

=== epoch 2/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  407
episode_length                       9.815725
returns                              -2.50993
return_std                           2.898648
average_reward                      -0.254929
round_time             0 days 00:15:43.616972
episodes_test                          1012.0
episode_length_test                  9.880435
returns_test                        -2.695057
return_std_test                      3.015116
average_reward_test                 -0.272656
round_time_test        0 days 00:00:10.668731
round_time_total       0 days 00:15:43.618087
loss_total                16737769180631138.0
loss_critic               20922211123568052.0
loss_actor                     -465216426.384
memory_size                            5799.0 

=== epoch 2/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  612
episode_length                       9.800654
returns                              -2.55205
return_std                           2.854125
average_reward                      -0.260094
round_time             0 days 00:15:44.490844
episodes_test                          1010.0
episode_length_test                  9.894059
returns_test                         -2.46364
return_std_test                      2.927035
average_reward_test                 -0.248933
round_time_test        0 days 00:00:10.615976
round_time_total       0 days 00:15:44.491946
loss_total                17591041556159136.0
loss_critic               21988801555095092.0
loss_actor                     -480138372.784
memory_size                            5799.0 

=== epoch 2/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                  823
episode_length                        9.71932
returns                             -2.603353
return_std                           2.814622
average_reward                      -0.267707
round_time             0 days 00:15:43.216411
episodes_test                          1027.0
episode_length_test                  9.733204
returns_test                        -2.787179
return_std_test                      2.788069
average_reward_test                 -0.285908
round_time_test        0 days 00:00:10.793629
round_time_total       0 days 00:15:43.217521
loss_total                16237500173201178.0
loss_critic               20296874869380876.0
loss_actor                      -463221048.56
memory_size                            5799.0 

=== epoch 2/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1031
episode_length                       9.695441
returns                             -2.563424
return_std                           2.821659
average_reward                      -0.263884
round_time             0 days 00:15:44.112082
episodes_test                          1036.0
episode_length_test                  9.645753
returns_test                        -2.638785
return_std_test                      2.784051
average_reward_test                   -0.2731
round_time_test        0 days 00:00:10.776414
round_time_total       0 days 00:15:44.113484
loss_total                16895321785120064.0
loss_critic               21119151871595904.0
loss_actor                     -463660866.976
memory_size                            5799.0 

=== epoch 2/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1034
episode_length                       9.656673
returns                             -2.627625
return_std                           2.776168
average_reward                      -0.271985
round_time             0 days 00:15:45.056886
episodes_test                          1040.0
episode_length_test                  9.607692
returns_test                        -2.551243
return_std_test                      2.786941
average_reward_test                 -0.265066
round_time_test        0 days 00:00:10.630327
round_time_total       0 days 00:15:45.058187
loss_total                18014104575878890.0
loss_critic               22517630317732300.0
loss_actor                     -481014654.512
memory_size                            5799.0 

=== epoch 2/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1037
episode_length                       9.632594
returns                             -2.542047
return_std                           2.764462
average_reward                      -0.264526
round_time             0 days 00:15:46.891082
episodes_test                          1040.0
episode_length_test                    9.6125
returns_test                        -2.553392
return_std_test                      2.692886
average_reward_test                 -0.265279
round_time_test        0 days 00:00:10.532143
round_time_total       0 days 00:15:46.892185
loss_total                19500888498137728.0
loss_critic               24376110201933136.0
loss_actor                     -499608495.776
memory_size                            5799.0 

=== epoch 2/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1037
episode_length                       9.629701
returns                             -2.576455
return_std                           2.765247
average_reward                      -0.267775
round_time             0 days 00:15:43.482841
episodes_test                          1025.0
episode_length_test                  9.755122
returns_test                         -2.47162
return_std_test                      2.845422
average_reward_test                 -0.253255
round_time_test        0 days 00:00:10.652000
round_time_total       0 days 00:15:43.484025
loss_total                21324016201850944.0
loss_critic               26655019794295684.0
loss_actor                     -527118724.496
memory_size                            5799.0 

=== epoch 2/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1030
episode_length                       9.694175
returns                             -2.503442
return_std                           2.784335
average_reward                      -0.257787
round_time             0 days 00:15:45.395583
episodes_test                          1009.0
episode_length_test                  9.900892
returns_test                        -2.602798
return_std_test                      2.926597
average_reward_test                 -0.262494
round_time_test        0 days 00:00:10.710110
round_time_total       0 days 00:15:45.396683
loss_total                23465494042462128.0
loss_critic               29331867052915296.0
loss_actor                     -554445389.504
memory_size                            5799.0 

=== epoch 2/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1026
episode_length                       9.732943
returns                             -2.537222
return_std                           2.829855
average_reward                       -0.26071
round_time             0 days 00:15:46.197607
episodes_test                          1012.0
episode_length_test                  9.873518
returns_test                        -2.530375
return_std_test                      2.949152
average_reward_test                 -0.255834
round_time_test        0 days 00:00:10.585169
round_time_total       0 days 00:15:46.198728
loss_total                25667472947673040.0
loss_critic               32084340625842896.0
loss_actor                     -582903368.528
memory_size                            5799.0 

=== epoch 2/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1020
episode_length                       9.789216
returns                             -2.575131
return_std                           2.834777
average_reward                      -0.262699
round_time             0 days 00:15:44.866593
episodes_test                          1014.0
episode_length_test                  9.857988
returns_test                         -2.52689
return_std_test                      2.954886
average_reward_test                 -0.256003
round_time_test        0 days 00:00:10.984380
round_time_total       0 days 00:15:44.867908
loss_total                26246243790549416.0
loss_critic               32807804199705248.0
loss_actor                     -589579711.456
memory_size                            5799.0 

=== epoch 2/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1023
episode_length                       9.763441
returns                             -2.728277
return_std                           2.865601
average_reward                       -0.27933
round_time             0 days 00:15:46.568769
episodes_test                          1044.0
episode_length_test                  9.577586
returns_test                        -2.909588
return_std_test                      2.707576
average_reward_test                 -0.303686
round_time_test        0 days 00:00:10.593133
round_time_total       0 days 00:15:46.569878
loss_total                24260346504087076.0
loss_critic               30325432624309340.0
loss_actor                     -560940671.824
memory_size                            5799.0 

=== epoch 2/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1031
episode_length                       9.696411
returns                             -2.722166
return_std                           2.828802
average_reward                      -0.280657
round_time             0 days 00:15:43.937378
episodes_test                          1042.0
episode_length_test                  9.586372
returns_test                        -2.889844
return_std_test                      2.675887
average_reward_test                 -0.301416
round_time_test        0 days 00:00:10.834463
round_time_total       0 days 00:15:43.938477
loss_total                24238486013098328.0
loss_critic               30298107018827792.0
loss_actor                     -551154020.352
memory_size                            5799.0 

=== epoch 2/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1030
episode_length                       9.694175
returns                             -2.819488
return_std                           2.803461
average_reward                       -0.29115
round_time             0 days 00:15:45.511703
episodes_test                          1029.0
episode_length_test                   9.71137
returns_test                        -2.782864
return_std_test                       2.86936
average_reward_test                  -0.28594
round_time_test        0 days 00:00:10.574038
round_time_total       0 days 00:15:45.512922
loss_total                26955118877704456.0
loss_critic               33693898019927228.0
loss_actor                       -589966788.8
memory_size                            5799.0 

=== epoch 2/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1032
episode_length                       9.686047
returns                              -2.86375
return_std                           2.730973
average_reward                      -0.295935
round_time             0 days 00:15:45.513798
episodes_test                          1041.0
episode_length_test                  9.605187
returns_test                        -3.011401
return_std_test                      2.809935
average_reward_test                 -0.313402
round_time_test        0 days 00:00:10.558117
round_time_total       0 days 00:15:45.515020
loss_total                29169721497641548.0
loss_critic               36462151227001536.0
loss_actor                     -628161365.248
memory_size                            5799.0 

=== epoch 2/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1034
episode_length                       9.661509
returns                             -2.861016
return_std                           2.743062
average_reward                      -0.296309
round_time             0 days 00:15:46.365872
episodes_test                          1030.0
episode_length_test                  9.701942
returns_test                        -2.773711
return_std_test                      2.891989
average_reward_test                 -0.285503
round_time_test        0 days 00:00:10.695727
round_time_total       0 days 00:15:46.366968
loss_total                31358703021676560.0
loss_critic               39198378115200976.0
loss_actor                     -648659724.256
memory_size                            5799.0 

=== epoch 2/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1033
episode_length                       9.677638
returns                             -2.811321
return_std                           2.727955
average_reward                      -0.290249
round_time             0 days 00:15:43.421524
episodes_test                          1025.0
episode_length_test                  9.749268
returns_test                        -2.614638
return_std_test                      2.801482
average_reward_test                 -0.267843
round_time_test        0 days 00:00:10.541903
round_time_total       0 days 00:15:43.422669
loss_total                32417747556710220.0
loss_critic               40522183753122968.0
loss_actor                     -664625184.672
memory_size                            5799.0 

=== epoch 2/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1025
episode_length                       9.742439
returns                             -2.791363
return_std                           2.744154
average_reward                      -0.286628
round_time             0 days 00:15:45.811067
episodes_test                          1035.0
episode_length_test                  9.652174
returns_test                        -2.842923
return_std_test                      2.770662
average_reward_test                 -0.294313
round_time_test        0 days 00:00:10.703539
round_time_total       0 days 00:15:45.812190
loss_total                34162218272907328.0
loss_critic               42702772140920272.0
loss_actor                     -670400549.888
memory_size                            5799.0 

=== epoch 2/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                 1027
episode_length                       9.735151
returns                             -2.784873
return_std                           2.753282
average_reward                      -0.286178
round_time             0 days 00:16:19.769892
episodes_test                          1014.0
episode_length_test                  9.852071
returns_test                        -2.794457
return_std_test                      2.822503
average_reward_test                 -0.283364
round_time_test        0 days 00:00:10.688182
round_time_total       0 days 00:16:19.771041
loss_total                35021058339766272.0
loss_critic               43776322189798672.0
loss_actor                     -684327079.936
memory_size                            5799.0 

=== epoch 2/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1025
episode_length                       9.747317
returns                             -2.758739
return_std                           2.788104
average_reward                      -0.282834
round_time             0 days 00:15:46.573262
episodes_test                          1019.0
episode_length_test                  9.813543
returns_test                        -2.851168
return_std_test                      2.832887
average_reward_test                 -0.290534
round_time_test        0 days 00:00:10.532330
round_time_total       0 days 00:15:46.574410
loss_total                35049629497574816.0
loss_critic               43812036090821344.0
loss_actor                      -684672715.04
memory_size                            5799.0 

=== epoch 2/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1031
episode_length                       9.697381
returns                             -2.699735
return_std                           2.738949
average_reward                      -0.278686
round_time             0 days 00:15:43.706743
episodes_test                          1040.0
episode_length_test                  9.611538
returns_test                         -2.41047
return_std_test                      2.619582
average_reward_test                 -0.250605
round_time_test        0 days 00:00:10.700942
round_time_total       0 days 00:15:43.707843
loss_total                28167015916671860.0
loss_critic               35208769267767968.0
loss_actor                     -564365527.888
memory_size                            5799.0 

=== epoch 2/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1038
episode_length                       9.618497
returns                             -2.753869
return_std                           2.715871
average_reward                      -0.286551
round_time             0 days 00:15:44.798365
episodes_test                          1051.0
episode_length_test                  9.506185
returns_test                        -2.790524
return_std_test                       2.59813
average_reward_test                 -0.293743
round_time_test        0 days 00:00:10.960141
round_time_total       0 days 00:15:44.799489
loss_total                20902881895177520.0
loss_critic               26128601945783404.0
loss_actor                     -493829184.112
memory_size                            5799.0 

=== epoch 2/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1043
episode_length                       9.575264
returns                             -2.857855
return_std                             2.7487
average_reward                      -0.298768
round_time             0 days 00:15:45.236100
episodes_test                          1047.0
episode_length_test                  9.548233
returns_test                        -3.185951
return_std_test                      2.619931
average_reward_test                 -0.333354
round_time_test        0 days 00:00:10.687482
round_time_total       0 days 00:15:45.237205
loss_total                19726969583106524.0
loss_critic               24658711545225676.0
loss_actor                       -492692980.8
memory_size                            5799.0 

=== epoch 2/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:41<00:00,  2.12it/s]
episodes                                 1049
episode_length                       9.531935
returns                               -2.8837
return_std                           2.709501
average_reward                      -0.302394
round_time             0 days 00:15:42.430229
episodes_test                          1036.0
episode_length_test                  9.643822
returns_test                         -3.12508
return_std_test                      2.768837
average_reward_test                 -0.324362
round_time_test        0 days 00:00:10.736630
round_time_total       0 days 00:15:42.431478
loss_total                19768308695943872.0
loss_critic               24710385456405020.0
loss_actor                     -512098233.632
memory_size                            5799.0 

=== epoch 2/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1054
episode_length                        9.48482
returns                             -2.996935
return_std                           2.634389
average_reward                      -0.316032
round_time             0 days 00:15:45.322339
episodes_test                          1042.0
episode_length_test                  9.592131
returns_test                        -3.052298
return_std_test                      2.725501
average_reward_test                 -0.318326
round_time_test        0 days 00:00:10.647877
round_time_total       0 days 00:15:45.323591
loss_total                20182353099312072.0
loss_critic               25227940929275428.0
loss_actor                     -528129783.088
memory_size                            5799.0 

=== epoch 2/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1055
episode_length                       9.475829
returns                              -3.10813
return_std                            2.60012
average_reward                      -0.327876
round_time             0 days 00:15:47.289749
episodes_test                          1046.0
episode_length_test                  9.554493
returns_test                        -3.249057
return_std_test                      2.608175
average_reward_test                 -0.340071
round_time_test        0 days 00:00:10.657980
round_time_total       0 days 00:15:47.290849
loss_total                20480921189820136.0
loss_critic               25601151043282928.0
loss_actor                     -545629900.096
memory_size                            5799.0 

=== epoch 2/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1049
episode_length                       9.527169
returns                             -3.131211
return_std                           2.581431
average_reward                      -0.328453
round_time             0 days 00:15:45.242100
episodes_test                          1046.0
episode_length_test                  9.553537
returns_test                        -3.167545
return_std_test                      2.530575
average_reward_test                 -0.330901
round_time_test        0 days 00:00:10.730310
round_time_total       0 days 00:15:45.243212
loss_total                19671159472660676.0
loss_critic               24588948940924124.0
loss_actor                     -534848475.728
memory_size                            5799.0 

=== epoch 2/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1048
episode_length                       9.541031
returns                             -3.144802
return_std                           2.521705
average_reward                      -0.329667
round_time             0 days 00:15:44.903502
episodes_test                          1049.0
episode_length_test                  9.529075
returns_test                        -3.218989
return_std_test                      2.542494
average_reward_test                 -0.337778
round_time_test        0 days 00:00:10.729478
round_time_total       0 days 00:15:44.904774
loss_total                18955974463025316.0
loss_critic               23694967659821008.0
loss_actor                     -508739417.088
memory_size                            5799.0 

=== epoch 2/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1050
episode_length                       9.514286
returns                             -3.182238
return_std                           2.546171
average_reward                      -0.334362
round_time             0 days 00:15:46.566698
episodes_test                          1048.0
episode_length_test                  9.533397
returns_test                        -3.060308
return_std_test                      2.586615
average_reward_test                  -0.32051
round_time_test        0 days 00:00:10.780134
round_time_total       0 days 00:15:46.567784
loss_total                18268380956538700.0
loss_critic               22835475792080668.0
loss_actor                     -514379556.688
memory_size                            5799.0 

=== epoch 2/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1049
episode_length                       9.523356
returns                             -3.224311
return_std                           2.589642
average_reward                      -0.338299
round_time             0 days 00:15:47.274033
episodes_test                          1026.0
episode_length_test                  9.746589
returns_test                        -3.088803
return_std_test                      2.638021
average_reward_test                 -0.316911
round_time_test        0 days 00:00:10.568656
round_time_total       0 days 00:15:47.275256
loss_total                19215427467237916.0
loss_critic               24019283903309152.0
loss_actor                     -531163438.144
memory_size                            5799.0 

=== epoch 2/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1036
episode_length                       9.646718
returns                             -3.162328
return_std                           2.646316
average_reward                      -0.328627
round_time             0 days 00:15:45.296492
episodes_test                          1019.0
episode_length_test                  9.805692
returns_test                        -3.194124
return_std_test                      2.774019
average_reward_test                 -0.325542
round_time_test        0 days 00:00:10.588666
round_time_total       0 days 00:15:45.297924
loss_total                20210859731142576.0
loss_critic               25263574228525908.0
loss_actor                     -545563829.456
memory_size                            5799.0 

=== epoch 2/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1028
episode_length                       9.715953
returns                             -3.144109
return_std                           2.668044
average_reward                      -0.324074
round_time             0 days 00:15:45.658594
episodes_test                          1008.0
episode_length_test                  9.915675
returns_test                        -3.086473
return_std_test                      2.774221
average_reward_test                 -0.311021
round_time_test        0 days 00:00:10.891897
round_time_total       0 days 00:15:45.659699
loss_total                21628169830084380.0
loss_critic               27035211838546508.0
loss_actor                     -558210919.056
memory_size                            5799.0 

=== epoch 2/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1023
episode_length                       9.771261
returns                             -3.089376
return_std                            2.71882
average_reward                      -0.315699
round_time             0 days 00:15:48.422474
episodes_test                          1005.0
episode_length_test                  9.948259
returns_test                        -3.005991
return_std_test                      2.851294
average_reward_test                 -0.301927
round_time_test        0 days 00:00:10.560263
round_time_total       0 days 00:15:48.423635
loss_total                21442714710711468.0
loss_critic               26803392947853196.0
loss_actor                     -555183061.408
memory_size                            5799.0 

=== epoch 2/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1008
episode_length                       9.919643
returns                             -3.090045
return_std                           2.791067
average_reward                      -0.311386
round_time             0 days 00:15:45.558615
episodes_test                           993.0
episode_length_test                 10.066465
returns_test                        -3.071426
return_std_test                      2.815731
average_reward_test                 -0.305037
round_time_test        0 days 00:00:10.609590
round_time_total       0 days 00:15:45.559741
loss_total                20848191079247248.0
loss_critic               26060238404261512.0
loss_actor                     -550102861.456
memory_size                            5799.0 

=== epoch 2/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1000
episode_length                          9.984
returns                             -3.057716
return_std                           2.800593
average_reward                      -0.306212
round_time             0 days 00:15:44.095461
episodes_test                          1005.0
episode_length_test                  9.949254
returns_test                        -3.043366
return_std_test                      2.798083
average_reward_test                 -0.305789
round_time_test        0 days 00:00:10.662697
round_time_total       0 days 00:15:44.096922
loss_total                20671939886750628.0
loss_critic               25839924410117520.0
loss_actor                     -550073146.448
memory_size                            5799.0 

=== epoch 2/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1005
episode_length                       9.946269
returns                             -3.097436
return_std                           2.724989
average_reward                       -0.31088
round_time             0 days 00:15:43.534468
episodes_test                          1010.0
episode_length_test                   9.89802
returns_test                        -3.244024
return_std_test                      2.775245
average_reward_test                 -0.327511
round_time_test        0 days 00:00:10.486162
round_time_total       0 days 00:15:43.535608
loss_total                21176180662100556.0
loss_critic               26470225370547224.0
loss_actor                      -560231158.64
memory_size                            5799.0 

=== epoch 2/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1003
episode_length                       9.969093
returns                             -3.080124
return_std                           2.798563
average_reward                      -0.309099
round_time             0 days 00:15:44.231523
episodes_test                          1020.0
episode_length_test                  9.801961
returns_test                        -3.209709
return_std_test                      2.849097
average_reward_test                 -0.327256
round_time_test        0 days 00:00:10.797162
round_time_total       0 days 00:15:44.232953
loss_total                21979699300429464.0
loss_critic               27474623679464212.0
loss_actor                     -575256603.344
memory_size                            5799.0 

=== epoch 2/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:41<00:00,  2.12it/s]
episodes                                 1006
episode_length                       9.931412
returns                             -3.178099
return_std                           2.785883
average_reward                      -0.319724
round_time             0 days 00:15:42.013381
episodes_test                          1009.0
episode_length_test                  9.910803
returns_test                        -3.163733
return_std_test                      2.749712
average_reward_test                 -0.319221
round_time_test        0 days 00:00:10.592178
round_time_total       0 days 00:15:42.014478
loss_total                23120260704396904.0
loss_critic               28900325421471500.0
loss_actor                     -585312433.488
memory_size                            5799.0 

=== epoch 2/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1013
episode_length                       9.860809
returns                              -3.24334
return_std                           2.737073
average_reward                      -0.328876
round_time             0 days 00:15:44.330533
episodes_test                          1029.0
episode_length_test                  9.718173
returns_test                        -3.365519
return_std_test                      2.753199
average_reward_test                 -0.346312
round_time_test        0 days 00:00:10.696578
round_time_total       0 days 00:15:44.331791
loss_total                24376959509838432.0
loss_critic               30471198854185224.0
loss_actor                     -600956241.344
memory_size                            5799.0 

=== epoch 2/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1017
episode_length                       9.828909
returns                             -3.175577
return_std                           2.737941
average_reward                       -0.32327
round_time             0 days 00:15:45.263555
episodes_test                          1025.0
episode_length_test                  9.747317
returns_test                        -3.276311
return_std_test                      2.865325
average_reward_test                 -0.335935
round_time_test        0 days 00:00:10.495574
round_time_total       0 days 00:15:45.264676
loss_total                23040748952756420.0
loss_critic               28800935676958736.0
loss_actor                     -583861283.168
memory_size                            5799.0 

=== epoch 2/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1023
episode_length                       9.767351
returns                             -3.167101
return_std                           2.755092
average_reward                      -0.324431
round_time             0 days 00:15:45.459056
episodes_test                          1018.0
episode_length_test                  9.818271
returns_test                        -2.984434
return_std_test                      2.631985
average_reward_test                 -0.303703
round_time_test        0 days 00:00:10.659350
round_time_total       0 days 00:15:45.460204
loss_total                19897766582541092.0
loss_critic               24872207813242256.0
loss_actor                     -535454944.048
memory_size                            5799.0 

=== epoch 2/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1033
episode_length                        9.67667
returns                             -3.164889
return_std                           2.635897
average_reward                      -0.326582
round_time             0 days 00:15:44.493551
episodes_test                          1030.0
episode_length_test                       9.7
returns_test                        -2.658525
return_std_test                      2.630799
average_reward_test                 -0.273868
round_time_test        0 days 00:00:10.567373
round_time_total       0 days 00:15:44.494662
loss_total                17328051511489462.0
loss_critic               21660064025430460.0
loss_actor                     -488825810.784
memory_size                            5799.0 

=== epoch 2/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1035
episode_length                       9.654106
returns                             -3.084243
return_std                           2.584145
average_reward                      -0.319754
round_time             0 days 00:15:45.580289
episodes_test                          1044.0
episode_length_test                  9.571839
returns_test                        -3.024077
return_std_test                      2.470614
average_reward_test                 -0.315569
round_time_test        0 days 00:00:10.765968
round_time_total       0 days 00:15:45.581385
loss_total                15010391313151426.0
loss_critic               18762988828779084.0
loss_actor                     -467759388.592
memory_size                            5799.0 

=== epoch 2/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1038
episode_length                       9.624277
returns                             -2.972935
return_std                           2.534406
average_reward                      -0.309029
round_time             0 days 00:15:46.990904
episodes_test                          1033.0
episode_length_test                   9.67667
returns_test                        -2.933979
return_std_test                      2.570862
average_reward_test                 -0.302823
round_time_test        0 days 00:00:10.525251
round_time_total       0 days 00:15:46.992002
loss_total                14304133633961624.0
loss_critic               17880166729556952.0
loss_actor                     -461819551.664
memory_size                            5799.0 

=== epoch 2/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1044
episode_length                       9.562261
returns                             -2.996724
return_std                           2.491512
average_reward                      -0.312731
round_time             0 days 00:15:44.148098
episodes_test                          1046.0
episode_length_test                  9.556405
returns_test                        -2.815388
return_std_test                      2.539491
average_reward_test                 -0.294278
round_time_test        0 days 00:00:10.683609
round_time_total       0 days 00:15:44.149188
loss_total                14334042719314248.0
loss_critic               17917553091649994.0
loss_actor                     -470224755.664
memory_size                            5799.0 

=== epoch 2/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1043
episode_length                       9.573346
returns                             -2.942042
return_std                           2.470817
average_reward                      -0.307074
round_time             0 days 00:15:45.208074
episodes_test                          1042.0
episode_length_test                   9.59405
returns_test                        -3.030224
return_std_test                      2.449495
average_reward_test                 -0.315442
round_time_test        0 days 00:00:10.726599
round_time_total       0 days 00:15:45.209292
loss_total                15228539987990414.0
loss_critic               19035674649108152.0
loss_actor                     -490169973.872
memory_size                            5799.0 

=== epoch 2/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1042
episode_length                       9.586372
returns                             -3.045186
return_std                           2.568989
average_reward                      -0.317889
round_time             0 days 00:15:45.861685
episodes_test                          1042.0
episode_length_test                   9.59405
returns_test                        -3.151176
return_std_test                      2.541353
average_reward_test                 -0.328255
round_time_test        0 days 00:00:10.609245
round_time_total       0 days 00:15:45.862789
loss_total                16363302870025502.0
loss_critic               20454128232223996.0
loss_actor                     -498825801.552
memory_size                            5799.0 

=== epoch 2/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1042
episode_length                       9.592131
returns                             -3.022879
return_std                           2.599708
average_reward                      -0.314741
round_time             0 days 00:15:43.946539
episodes_test                          1037.0
episode_length_test                  9.639344
returns_test                        -3.072398
return_std_test                      2.604676
average_reward_test                 -0.318245
round_time_test        0 days 00:00:10.731930
round_time_total       0 days 00:15:43.947618
loss_total                16234413796012064.0
loss_critic               20293016896216760.0
loss_actor                     -504458940.912
memory_size                            5799.0 

=== epoch 2/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1039
episode_length                       9.611165
returns                             -3.041398
return_std                           2.620996
average_reward                      -0.316304
round_time             0 days 00:15:45.579243
episodes_test                          1029.0
episode_length_test                  9.712342
returns_test                        -2.946882
return_std_test                      2.674746
average_reward_test                 -0.303192
round_time_test        0 days 00:00:10.725366
round_time_total       0 days 00:15:45.580340
loss_total                16561963633001628.0
loss_critic               20702454167623436.0
loss_actor                     -513151981.408
memory_size                            5799.0 

=== epoch 2/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1036
episode_length                       9.643822
returns                             -3.069876
return_std                           2.702514
average_reward                       -0.31873
round_time             0 days 00:15:42.851098
episodes_test                          1030.0
episode_length_test                  9.707767
returns_test                        -3.114742
return_std_test                      2.768898
average_reward_test                 -0.320736
round_time_test        0 days 00:00:10.553552
round_time_total       0 days 00:15:42.852268
loss_total                17343205998382284.0
loss_critic               21679007134113596.0
loss_actor                     -520601309.552
memory_size                            5799.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                  203
episode_length                       9.832512
returns                             -3.077076
return_std                           2.695514
average_reward                      -0.312054
round_time             0 days 00:15:45.063690
episodes_test                          1030.0
episode_length_test                  9.707767
returns_test                        -3.179389
return_std_test                      2.760159
average_reward_test                 -0.327412
round_time_test        0 days 00:00:10.590036
round_time_total       0 days 00:15:45.064890
loss_total                18071547947228596.0
loss_critic               22589434529100856.0
loss_actor                     -535977620.928
memory_size                            5799.0 

=== epoch 3/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                  409
episode_length                       9.765281
returns                              -3.20235
return_std                           2.715544
average_reward                      -0.327739
round_time             0 days 00:15:42.755739
episodes_test                          1023.0
episode_length_test                  9.766373
returns_test                        -2.875449
return_std_test                      2.764808
average_reward_test                 -0.293849
round_time_test        0 days 00:00:10.530797
round_time_total       0 days 00:15:42.756856
loss_total                19283375824874504.0
loss_critic               24104219352267488.0
loss_actor                      -555410742.48
memory_size                            5799.0 

=== epoch 3/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                  614
episode_length                       9.763844
returns                             -3.222544
return_std                           2.639237
average_reward                      -0.329373
round_time             0 days 00:15:44.957754
episodes_test                          1029.0
episode_length_test                  9.717201
returns_test                        -3.058502
return_std_test                      2.796321
average_reward_test                 -0.314635
round_time_test        0 days 00:00:10.662589
round_time_total       0 days 00:15:44.959293
loss_total                20663177375406620.0
loss_critic               25828971292445900.0
loss_actor                     -570162745.808
memory_size                            5799.0 

=== epoch 3/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                  817
episode_length                       9.779682
returns                             -3.200464
return_std                           2.747864
average_reward                      -0.327115
round_time             0 days 00:15:44.602021
episodes_test                          1028.0
episode_length_test                  9.726654
returns_test                        -3.161615
return_std_test                      2.678757
average_reward_test                 -0.324922
round_time_test        0 days 00:00:10.717393
round_time_total       0 days 00:15:44.603308
loss_total                21980513242902952.0
loss_critic               27475641102455800.0
loss_actor                     -588744507.888
memory_size                            5799.0 

=== epoch 3/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1024
episode_length                       9.758789
returns                             -3.164651
return_std                           2.790077
average_reward                      -0.323588
round_time             0 days 00:15:46.081875
episodes_test                          1018.0
episode_length_test                  9.823183
returns_test                        -3.083765
return_std_test                      2.737388
average_reward_test                 -0.313927
round_time_test        0 days 00:00:10.573027
round_time_total       0 days 00:15:46.082999
loss_total                23011609949671260.0
loss_critic               28764511946959488.0
loss_actor                     -606797823.264
memory_size                            5799.0 

=== epoch 3/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1027
episode_length                       9.729309
returns                             -3.144372
return_std                           2.786964
average_reward                      -0.323346
round_time             0 days 00:15:46.471575
episodes_test                          1021.0
episode_length_test                  9.785504
returns_test                        -2.900037
return_std_test                      2.660674
average_reward_test                 -0.296484
round_time_test        0 days 00:00:10.690705
round_time_total       0 days 00:15:46.472764
loss_total                24546481756000224.0
loss_critic               30683101671081376.0
loss_actor                     -620835289.888
memory_size                            5799.0 

=== epoch 3/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1027
episode_length                       9.731256
returns                             -3.115678
return_std                           2.792148
average_reward                      -0.320316
round_time             0 days 00:15:43.254675
episodes_test                          1015.0
episode_length_test                  9.846305
returns_test                        -3.085955
return_std_test                       2.82955
average_reward_test                 -0.312505
round_time_test        0 days 00:00:10.481467
round_time_total       0 days 00:15:43.255978
loss_total                24946791628660540.0
loss_critic               31183489011302792.0
loss_actor                     -633697019.232
memory_size                            5799.0 

=== epoch 3/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1024
episode_length                       9.754883
returns                             -3.038898
return_std                           2.858068
average_reward                      -0.311565
round_time             0 days 00:15:46.494791
episodes_test                          1009.0
episode_length_test                  9.902874
returns_test                        -2.806841
return_std_test                      2.748094
average_reward_test                 -0.283578
round_time_test        0 days 00:00:10.485218
round_time_total       0 days 00:15:46.496240
loss_total                25408843178766760.0
loss_critic               31761053392497016.0
loss_actor                     -638328483.712
memory_size                            5799.0 

=== epoch 3/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1026
episode_length                       9.741715
returns                             -2.975086
return_std                           2.806891
average_reward                      -0.304856
round_time             0 days 00:15:45.650293
episodes_test                          1017.0
episode_length_test                  9.828909
returns_test                        -2.667282
return_std_test                      2.673409
average_reward_test                 -0.271076
round_time_test        0 days 00:00:10.509585
round_time_total       0 days 00:15:45.651408
loss_total                26028792593971676.0
loss_critic               32535990221498484.0
loss_actor                      -646139207.04
memory_size                            5799.0 

=== epoch 3/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1021
episode_length                       9.780607
returns                             -2.998064
return_std                           2.814145
average_reward                       -0.30628
round_time             0 days 00:15:43.820194
episodes_test                          1008.0
episode_length_test                   9.91369
returns_test                        -2.731607
return_std_test                      2.828226
average_reward_test                 -0.275357
round_time_test        0 days 00:00:10.446480
round_time_total       0 days 00:15:43.821467
loss_total                27663391583885068.0
loss_critic               34579238898022480.0
loss_actor                     -660165734.176
memory_size                            5799.0 

=== epoch 3/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1018
episode_length                       9.811395
returns                             -3.037526
return_std                           2.902111
average_reward                      -0.309263
round_time             0 days 00:15:44.983516
episodes_test                          1007.0
episode_length_test                  9.925521
returns_test                        -2.949492
return_std_test                      2.981977
average_reward_test                 -0.297209
round_time_test        0 days 00:00:10.814307
round_time_total       0 days 00:15:44.984773
loss_total                29394753894806980.0
loss_critic               36743441727887504.0
loss_actor                     -681643568.768
memory_size                            5799.0 

=== epoch 3/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1016
episode_length                       9.830709
returns                             -2.974187
return_std                           2.943658
average_reward                      -0.302155
round_time             0 days 00:15:45.897591
episodes_test                          1019.0
episode_length_test                  9.806673
returns_test                        -3.184862
return_std_test                      3.051171
average_reward_test                 -0.324693
round_time_test        0 days 00:00:10.667406
round_time_total       0 days 00:15:45.898758
loss_total                30623481376767212.0
loss_critic               38279351072016304.0
loss_actor                     -702687743.008
memory_size                            5799.0 

=== epoch 3/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1021
episode_length                       9.782566
returns                              -3.04802
return_std                           2.913591
average_reward                       -0.31163
round_time             0 days 00:15:44.632096
episodes_test                          1018.0
episode_length_test                  9.823183
returns_test                        -2.785258
return_std_test                      2.803556
average_reward_test                 -0.283539
round_time_test        0 days 00:00:10.746799
round_time_total       0 days 00:15:44.633199
loss_total                32606672848865460.0
loss_critic               40758340383483624.0
loss_actor                      -727975471.52
memory_size                            5799.0 

=== epoch 3/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1020
episode_length                       9.795098
returns                             -3.094897
return_std                           2.884026
average_reward                      -0.315801
round_time             0 days 00:15:46.219787
episodes_test                          1024.0
episode_length_test                  9.764648
returns_test                        -3.127232
return_std_test                      2.952257
average_reward_test                 -0.320127
round_time_test        0 days 00:00:10.590687
round_time_total       0 days 00:15:46.220986
loss_total                33344727237575836.0
loss_critic               41680908376216696.0
loss_actor                     -734014122.304
memory_size                            5799.0 

=== epoch 3/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1026
episode_length                       9.737817
returns                             -3.075809
return_std                           2.808878
average_reward                      -0.315074
round_time             0 days 00:15:45.434361
episodes_test                          1030.0
episode_length_test                  9.706796
returns_test                        -3.022551
return_std_test                      2.812155
average_reward_test                 -0.311092
round_time_test        0 days 00:00:10.586705
round_time_total       0 days 00:15:45.435544
loss_total                34813474092085148.0
loss_critic               43516841875902304.0
loss_actor                      -750348064.16
memory_size                            5799.0 

=== epoch 3/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1028
episode_length                        9.72179
returns                             -2.999708
return_std                           2.777095
average_reward                      -0.308647
round_time             0 days 00:15:47.563218
episodes_test                          1017.0
episode_length_test                  9.824975
returns_test                        -2.970162
return_std_test                      2.788256
average_reward_test                 -0.302165
round_time_test        0 days 00:00:10.551391
round_time_total       0 days 00:15:47.564322
loss_total                35845940385675216.0
loss_critic               44807424705376024.0
loss_actor                     -764948965.088
memory_size                            5799.0 

=== epoch 3/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1028
episode_length                       9.718872
returns                             -3.107757
return_std                            2.74238
average_reward                      -0.319775
round_time             0 days 00:15:46.415997
episodes_test                          1025.0
episode_length_test                  9.754146
returns_test                        -2.976308
return_std_test                      2.808518
average_reward_test                  -0.30492
round_time_test        0 days 00:00:10.635973
round_time_total       0 days 00:15:46.417249
loss_total                36470318320452632.0
loss_critic               45587897132504840.0
loss_actor                     -771523365.024
memory_size                            5799.0 

=== epoch 3/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1025
episode_length                       9.754146
returns                             -3.018596
return_std                            2.77404
average_reward                       -0.30956
round_time             0 days 00:15:47.615974
episodes_test                          1023.0
episode_length_test                  9.769306
returns_test                        -3.119412
return_std_test                      2.893158
average_reward_test                 -0.319584
round_time_test        0 days 00:00:10.552763
round_time_total       0 days 00:15:47.617241
loss_total                39210362628198104.0
loss_critic               49012952435783632.0
loss_actor                     -790812147.424
memory_size                            5799.0 

=== epoch 3/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:06,  1.95it/s]
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1025
episode_length                       9.747317
returns                             -3.001506
return_std                           2.778357
average_reward                       -0.30791
round_time             0 days 00:15:43.112854
episodes_test                          1009.0
episode_length_test                  9.908821
returns_test                        -2.799308
return_std_test                      2.877635
average_reward_test                 -0.282226
round_time_test        0 days 00:00:10.480001
round_time_total       0 days 00:15:43.114099
loss_total                40804287519488216.0
loss_critic               51005358532850616.0
loss_actor                     -812853367.744
memory_size                            5799.0 

=== epoch 3/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:27,  1.91it/s]
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1017
episode_length                       9.820059
returns                             -2.933681
return_std                            2.83121
average_reward                      -0.299083
round_time             0 days 00:15:47.023870
episodes_test                          1020.0
episode_length_test                  9.801961
returns_test                        -2.954948
return_std_test                      2.934309
average_reward_test                 -0.301302
round_time_test        0 days 00:00:10.500233
round_time_total       0 days 00:15:47.025019
loss_total                42968086825633056.0
loss_critic               53710107574867592.0
loss_actor                     -835465705.408
memory_size                            5799.0 

=== epoch 3/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:03,  1.84it/s]
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1014
episode_length                       9.857002
returns                             -2.956299
return_std                           2.874062
average_reward                      -0.299413
round_time             0 days 00:15:44.257998
episodes_test                          1015.0
episode_length_test                  9.842365
returns_test                        -3.074078
return_std_test                      2.973253
average_reward_test                 -0.311682
round_time_test        0 days 00:00:10.628490
round_time_total       0 days 00:15:44.259182
loss_total                44291390335170376.0
loss_critic               55364236979640208.0
loss_actor                     -852430636.064
memory_size                            5799.0 

=== epoch 3/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:17,  2.04it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1016
episode_length                       9.841535
returns                             -2.832909
return_std                           2.877693
average_reward                      -0.287732
round_time             0 days 00:15:45.914211
episodes_test                          1016.0
episode_length_test                   9.84252
returns_test                        -3.167849
return_std_test                      2.939735
average_reward_test                 -0.321854
round_time_test        0 days 00:00:10.721471
round_time_total       0 days 00:15:45.915409
loss_total                45578829860347840.0
loss_critic               56973536352758920.0
loss_actor                     -856934464.192
memory_size                            5799.0 

=== epoch 3/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:32,  1.90it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1017
episode_length                       9.824975
returns                             -2.878862
return_std                           2.872671
average_reward                      -0.293081
round_time             0 days 00:15:45.354068
episodes_test                          1011.0
episode_length_test                  9.881306
returns_test                        -2.883535
return_std_test                      2.910099
average_reward_test                 -0.291791
round_time_test        0 days 00:00:11.054341
round_time_total       0 days 00:15:45.355197
loss_total                45173779088701128.0
loss_critic               56467222935847832.0
loss_actor                      -843708626.88
memory_size                            5799.0 

=== epoch 3/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:48,  2.11it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1016
episode_length                       9.836614
returns                             -2.817038
return_std                           2.868163
average_reward                      -0.285628
round_time             0 days 00:15:44.990489
episodes_test                          1021.0
episode_length_test                  9.788443
returns_test                        -2.947794
return_std_test                      2.841208
average_reward_test                  -0.30058
round_time_test        0 days 00:00:10.405787
round_time_total       0 days 00:15:44.991604
loss_total                45915511518289464.0
loss_critic               57394388388007640.0
loss_actor                     -865438898.688
memory_size                            5799.0 

=== epoch 3/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:04,  1.95it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1020
episode_length                        9.80098
returns                             -2.890684
return_std                           2.885859
average_reward                      -0.294933
round_time             0 days 00:15:45.434033
episodes_test                          1010.0
episode_length_test                   9.90099
returns_test                        -2.862759
return_std_test                      3.021029
average_reward_test                 -0.289139
round_time_test        0 days 00:00:10.579003
round_time_total       0 days 00:15:45.435116
loss_total                49236975945131232.0
loss_critic               61546218853511464.0
loss_actor                      -899618836.48
memory_size                            5799.0 

=== epoch 3/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:38,  2.00it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1019
episode_length                       9.804711
returns                             -2.878263
return_std                           2.847997
average_reward                      -0.293581
round_time             0 days 00:15:46.208149
episodes_test                          1024.0
episode_length_test                  9.765625
returns_test                         -2.89351
return_std_test                      2.842642
average_reward_test                 -0.296295
round_time_test        0 days 00:00:10.694388
round_time_total       0 days 00:15:46.209252
loss_total                52292552328185120.0
loss_critic               65365689296358472.0
loss_actor                     -930712439.456
memory_size                            5799.0 

=== epoch 3/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:58,  1.85it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1013
episode_length                       9.855874
returns                             -2.880508
return_std                           2.861089
average_reward                      -0.291791
round_time             0 days 00:15:44.936978
episodes_test                          1015.0
episode_length_test                  9.849261
returns_test                        -2.799576
return_std_test                      2.836145
average_reward_test                 -0.284051
round_time_test        0 days 00:00:10.605380
round_time_total       0 days 00:15:44.938109
loss_total                56101423760260728.0
loss_critic               70126778421499400.0
loss_actor                     -955178496.576
memory_size                            5799.0 

=== epoch 3/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:09,  1.94it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1011
episode_length                       9.889219
returns                             -2.831473
return_std                           2.880425
average_reward                      -0.286377
round_time             0 days 00:15:45.309256
episodes_test                          1018.0
episode_length_test                  9.818271
returns_test                        -2.988678
return_std_test                      2.848301
average_reward_test                 -0.303955
round_time_test        0 days 00:00:11.029594
round_time_total       0 days 00:15:45.310386
loss_total                58432127385582176.0
loss_critic               73040157999053664.0
loss_actor                     -976862168.768
memory_size                            5799.0 

=== epoch 3/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:57,  1.96it/s]
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1011
episode_length                       9.880317
returns                             -2.837955
return_std                           2.891156
average_reward                      -0.287087
round_time             0 days 00:15:46.961243
episodes_test                          1011.0
episode_length_test                  9.883284
returns_test                        -2.729224
return_std_test                      2.823521
average_reward_test                 -0.276179
round_time_test        0 days 00:00:10.759699
round_time_total       0 days 00:15:46.962615
loss_total                59892647760664464.0
loss_critic               74865808384691536.0
loss_actor                     -978787050.272
memory_size                            5799.0 

=== epoch 3/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:37,  1.89it/s]
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1005
episode_length                       9.940299
returns                             -2.741937
return_std                           2.839536
average_reward                      -0.275812
round_time             0 days 00:15:47.734748
episodes_test                          1012.0
episode_length_test                   9.87747
returns_test                        -2.809091
return_std_test                      2.785151
average_reward_test                 -0.284184
round_time_test        0 days 00:00:10.659845
round_time_total       0 days 00:15:47.735850
loss_total                59666329097029552.0
loss_critic               74582910095547440.0
loss_actor                    -1001149702.976
memory_size                            5799.0 

=== epoch 3/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:17,  2.04it/s]
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1007
episode_length                       9.914598
returns                             -2.738551
return_std                            2.81515
average_reward                      -0.276682
round_time             0 days 00:15:43.598676
episodes_test                          1016.0
episode_length_test                  9.837598
returns_test                        -2.776479
return_std_test                      2.871976
average_reward_test                 -0.281639
round_time_test        0 days 00:00:10.728990
round_time_total       0 days 00:15:43.599818
loss_total                63729529894360584.0
loss_critic               79661910971750816.0
loss_actor                    -1028729256.992
memory_size                            5799.0 

=== epoch 3/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:38,  2.13it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1013
episode_length                       9.866732
returns                             -2.749934
return_std                           2.797032
average_reward                      -0.278654
round_time             0 days 00:15:46.014204
episodes_test                          1015.0
episode_length_test                   9.84532
returns_test                        -3.044388
return_std_test                      2.994378
average_reward_test                  -0.30863
round_time_test        0 days 00:00:10.534397
round_time_total       0 days 00:15:46.015343
loss_total                67316488709494280.0
loss_critic               84145609387655824.0
loss_actor                    -1055469198.784
memory_size                            5799.0 

=== epoch 3/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:05,  2.07it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1017
episode_length                       9.826942
returns                             -2.858808
return_std                           2.795212
average_reward                      -0.290537
round_time             0 days 00:15:45.396348
episodes_test                          1019.0
episode_length_test                  9.813543
returns_test                        -2.874271
return_std_test                      2.798158
average_reward_test                 -0.292888
round_time_test        0 days 00:00:10.662769
round_time_total       0 days 00:15:45.397581
loss_total                70844076152928072.0
loss_critic               88555093633966016.0
loss_actor                    -1076457981.056
memory_size                            5799.0 

=== epoch 3/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1016
episode_length                       9.827756
returns                             -2.874539
return_std                           2.793328
average_reward                      -0.292774
round_time             0 days 00:15:44.758351
episodes_test                          1009.0
episode_length_test                  9.909812
returns_test                        -2.810756
return_std_test                      2.862216
average_reward_test                 -0.283518
round_time_test        0 days 00:00:10.563934
round_time_total       0 days 00:15:44.759459
loss_total                70362430017393128.0
loss_critic               87953036034608992.0
loss_actor                    -1055949794.656
memory_size                            5799.0 

=== epoch 3/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:12,  1.93it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1025
episode_length                       9.755122
returns                             -3.008863
return_std                           2.792789
average_reward                      -0.308308
round_time             0 days 00:15:45.482190
episodes_test                          1013.0
episode_length_test                  9.862784
returns_test                        -2.802919
return_std_test                      2.820311
average_reward_test                 -0.284006
round_time_test        0 days 00:00:10.398707
round_time_total       0 days 00:15:45.483319
loss_total                68675431735784112.0
loss_critic               85844288157633216.0
loss_actor                    -1038537346.912
memory_size                            5799.0 

=== epoch 3/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:28,  2.02it/s]
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1023
episode_length                       9.771261
returns                             -2.999309
return_std                           2.777373
average_reward                      -0.306512
round_time             0 days 00:15:46.656075
episodes_test                          1019.0
episode_length_test                  9.808636
returns_test                        -2.811803
return_std_test                      2.828361
average_reward_test                 -0.286722
round_time_test        0 days 00:00:10.516294
round_time_total       0 days 00:15:46.657184
loss_total                65829635191988024.0
loss_critic               82287042518690304.0
loss_actor                    -1026737016.288
memory_size                            5799.0 

=== epoch 3/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:10,  1.94it/s]
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1023
episode_length                       9.761486
returns                             -3.050639
return_std                           2.800186
average_reward                      -0.312284
round_time             0 days 00:15:44.888131
episodes_test                          1012.0
episode_length_test                  9.881423
returns_test                        -2.759275
return_std_test                      2.938864
average_reward_test                 -0.279239
round_time_test        0 days 00:00:10.802098
round_time_total       0 days 00:15:44.889249
loss_total                67575654748822440.0
loss_critic               84469566939768816.0
loss_actor                    -1056173373.568
memory_size                            5799.0 

=== epoch 3/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:21,  2.03it/s]
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1019
episode_length                       9.801766
returns                              -2.98783
return_std                           2.819881
average_reward                      -0.305313
round_time             0 days 00:15:48.078024
episodes_test                          1011.0
episode_length_test                   9.88724
returns_test                        -2.799455
return_std_test                        2.8964
average_reward_test                 -0.282731
round_time_test        0 days 00:00:10.548950
round_time_total       0 days 00:15:48.079131
loss_total                71013417685924968.0
loss_critic               88766770597322560.0
loss_actor                     -1086034656.64
memory_size                            5799.0 

=== epoch 3/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:22,  2.16it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1017
episode_length                       9.830875
returns                              -3.04274
return_std                           2.884233
average_reward                      -0.309506
round_time             0 days 00:15:46.157218
episodes_test                          1016.0
episode_length_test                  9.837598
returns_test                        -2.627232
return_std_test                      2.828422
average_reward_test                 -0.266717
round_time_test        0 days 00:00:10.471644
round_time_total       0 days 00:15:46.158381
loss_total                74102977267027152.0
loss_critic               92628719957333504.0
loss_actor                    -1116707238.912
memory_size                            5799.0 

=== epoch 3/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:35,  2.01it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1014
episode_length                       9.845168
returns                             -2.980669
return_std                           2.896948
average_reward                      -0.302376
round_time             0 days 00:15:45.644353
episodes_test                          1013.0
episode_length_test                  9.870681
returns_test                        -2.900199
return_std_test                      2.939976
average_reward_test                  -0.29372
round_time_test        0 days 00:00:10.472611
round_time_total       0 days 00:15:45.645450
loss_total                80065566251614736.0
loss_critic              100081956117603680.0
loss_actor                    -1152995738.944
memory_size                            5799.0 

=== epoch 3/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:18,  1.82it/s]
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1014
episode_length                        9.84714
returns                             -3.038182
return_std                           2.948675
average_reward                      -0.308692
round_time             0 days 00:15:44.043252
episodes_test                          1009.0
episode_length_test                  9.903865
returns_test                        -3.033061
return_std_test                      2.962911
average_reward_test                  -0.30639
round_time_test        0 days 00:00:10.624278
round_time_total       0 days 00:15:44.044354
loss_total                84536882050782720.0
loss_critic              105671100742546496.0
loss_actor                    -1188687939.584
memory_size                            5799.0 

=== epoch 3/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:51,  1.86it/s]
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1010
episode_length                        9.89604
returns                             -2.925704
return_std                           2.976944
average_reward                      -0.295836
round_time             0 days 00:15:46.155433
episodes_test                          1013.0
episode_length_test                  9.868707
returns_test                        -3.097927
return_std_test                      3.048801
average_reward_test                 -0.313814
round_time_test        0 days 00:00:10.536593
round_time_total       0 days 00:15:46.156689
loss_total                89781739863358304.0
loss_critic              112227172928943296.0
loss_actor                    -1226943773.312
memory_size                            5799.0 

=== epoch 3/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1012
episode_length                       9.873518
returns                             -2.935696
return_std                           2.966644
average_reward                      -0.297419
round_time             0 days 00:15:43.777829
episodes_test                          1009.0
episode_length_test                  9.899901
returns_test                        -2.744962
return_std_test                      2.955629
average_reward_test                 -0.276799
round_time_test        0 days 00:00:10.700727
round_time_total       0 days 00:15:43.778923
loss_total                96701556968254336.0
loss_critic              120876944193562352.0
loss_actor                    -1263227564.608
memory_size                            5799.0 

=== epoch 3/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1011
episode_length                       9.872404
returns                             -2.885447
return_std                           2.947477
average_reward                       -0.29155
round_time             0 days 00:15:43.516308
episodes_test                          1007.0
episode_length_test                    9.9285
returns_test                        -2.831855
return_std_test                      2.913581
average_reward_test                 -0.284996
round_time_test        0 days 00:00:10.610879
round_time_total       0 days 00:15:43.517390
loss_total                99148919691037120.0
loss_critic              123936147553017392.0
loss_actor                    -1284901327.104
memory_size                            5799.0 

=== epoch 3/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:41<00:00,  2.12it/s]
episodes                                 1007
episode_length                        9.91857
returns                             -2.891917
return_std                           2.950179
average_reward                      -0.291137
round_time             0 days 00:15:41.971799
episodes_test                          1000.0
episode_length_test                     9.994
returns_test                        -2.745155
return_std_test                      3.005367
average_reward_test                 -0.274968
round_time_test        0 days 00:00:10.666582
round_time_total       0 days 00:15:41.973080
loss_total               105125304887968528.0
loss_critic              131406628838057184.0
loss_actor                    -1319239545.152
memory_size                            5799.0 

=== epoch 3/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1008
episode_length                       9.912698
returns                             -2.881248
return_std                           2.903008
average_reward                      -0.291043
round_time             0 days 00:15:47.311506
episodes_test                          1006.0
episode_length_test                  9.935388
returns_test                        -2.747807
return_std_test                      2.959278
average_reward_test                 -0.276067
round_time_test        0 days 00:00:10.488815
round_time_total       0 days 00:15:47.312607
loss_total               108840916092742272.0
loss_critic              136051142712088336.0
loss_actor                    -1352339175.424
memory_size                            5799.0 

=== epoch 3/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1010
episode_length                        9.89505
returns                             -2.914108
return_std                           2.834814
average_reward                      -0.294092
round_time             0 days 00:15:47.079518
episodes_test                          1015.0
episode_length_test                  9.842365
returns_test                        -2.631541
return_std_test                      2.935643
average_reward_test                  -0.26723
round_time_test        0 days 00:00:10.570264
round_time_total       0 days 00:15:47.080634
loss_total               102605049797546080.0
loss_critic              128256310088174672.0
loss_actor                     -1282770296.32
memory_size                            5799.0 

=== epoch 3/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1008
episode_length                       9.914683
returns                             -2.888122
return_std                           2.915408
average_reward                      -0.291657
round_time             0 days 00:15:44.525982
episodes_test                          1003.0
episode_length_test                  9.961117
returns_test                        -2.930017
return_std_test                      2.980408
average_reward_test                 -0.293814
round_time_test        0 days 00:00:10.563925
round_time_total       0 days 00:15:44.527073
loss_total               100882876300718512.0
loss_critic              126103593285591232.0
loss_actor                    -1245521594.496
memory_size                            5799.0 

=== epoch 3/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1012
episode_length                        9.87253
returns                             -2.956635
return_std                           2.938253
average_reward                      -0.299627
round_time             0 days 00:15:44.727300
episodes_test                          1021.0
episode_length_test                   9.79334
returns_test                        -2.900673
return_std_test                      2.876533
average_reward_test                 -0.296079
round_time_test        0 days 00:00:10.618789
round_time_total       0 days 00:15:44.728397
loss_total               107351222924322272.0
loss_critic              134189026494497424.0
loss_actor                    -1304363320.256
memory_size                            5799.0 

=== epoch 3/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1017
episode_length                       9.829892
returns                             -2.980455
return_std                           2.922569
average_reward                      -0.303293
round_time             0 days 00:15:46.342017
episodes_test                          1021.0
episode_length_test                   9.79236
returns_test                        -3.245649
return_std_test                      3.010127
average_reward_test                 -0.331204
round_time_test        0 days 00:00:10.762809
round_time_total       0 days 00:15:46.343128
loss_total               115015659725875312.0
loss_critic              143769572186395776.0
loss_actor                     -1374073408.64
memory_size                            5799.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                  207
episode_length                       9.657005
returns                             -2.886914
return_std                           2.845888
average_reward                      -0.298245
round_time             0 days 00:15:43.544932
episodes_test                          1018.0
episode_length_test                  9.819253
returns_test                        -2.983752
return_std_test                      2.939646
average_reward_test                 -0.303573
round_time_test        0 days 00:00:10.684998
round_time_total       0 days 00:15:43.546205
loss_total               122813248285347680.0
loss_critic              153516557643875872.0
loss_actor                    -1431756115.584
memory_size                            5799.0 

=== epoch 4/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                  409
episode_length                       9.772616
returns                             -2.949418
return_std                           2.956795
average_reward                      -0.301167
round_time             0 days 00:15:45.853731
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.916079
return_std_test                      3.063706
average_reward_test                 -0.295342
round_time_test        0 days 00:00:10.469655
round_time_total       0 days 00:15:45.854854
loss_total               122428106005136016.0
loss_critic              153035129840855936.0
loss_actor                     -1459310289.92
memory_size                            5799.0 

=== epoch 4/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                  608
episode_length                       9.855263
returns                             -2.796506
return_std                           2.968688
average_reward                      -0.282715
round_time             0 days 00:15:46.873887
episodes_test                          1013.0
episode_length_test                  9.865745
returns_test                        -2.827726
return_std_test                      2.840046
average_reward_test                 -0.286565
round_time_test        0 days 00:00:10.623264
round_time_total       0 days 00:15:46.874978
loss_total               123678464498250816.0
loss_critic              154598078016573664.0
loss_actor                    -1427041049.344
memory_size                            5799.0 

=== epoch 4/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                  812
episode_length                       9.843596
returns                             -2.806406
return_std                           2.950125
average_reward                      -0.284925
round_time             0 days 00:15:46.801044
episodes_test                          1008.0
episode_length_test                  9.918651
returns_test                         -2.81932
return_std_test                      2.777961
average_reward_test                 -0.283962
round_time_test        0 days 00:00:10.625272
round_time_total       0 days 00:15:46.802188
loss_total               121729458633393040.0
loss_critic              152161820712345024.0
loss_actor                    -1430687902.976
memory_size                            5799.0 

=== epoch 4/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1019
episode_length                       9.806673
returns                             -2.860044
return_std                           2.917529
average_reward                      -0.291219
round_time             0 days 00:15:46.091733
episodes_test                          1014.0
episode_length_test                  9.861933
returns_test                         -3.03335
return_std_test                        2.9672
average_reward_test                 -0.307582
round_time_test        0 days 00:00:10.468414
round_time_total       0 days 00:15:46.092847
loss_total               131488991817427328.0
loss_critic              164361236905698784.0
loss_actor                    -1490540545.472
memory_size                            5799.0 

=== epoch 4/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:48<00:00,  2.11it/s]
episodes                                 1015
episode_length                       9.842365
returns                             -2.848515
return_std                           2.942915
average_reward                      -0.288995
round_time             0 days 00:15:48.874913
episodes_test                          1020.0
episode_length_test                  9.803922
returns_test                        -2.908859
return_std_test                      2.879603
average_reward_test                 -0.296704
round_time_test        0 days 00:00:10.714060
round_time_total       0 days 00:15:48.876135
loss_total               128997582201421824.0
loss_critic              161246974954948256.0
loss_actor                    -1483715088.064
memory_size                            5799.0 

=== epoch 4/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1015
episode_length                       9.841379
returns                             -2.791134
return_std                           2.901659
average_reward                      -0.283509
round_time             0 days 00:15:45.384111
episodes_test                          1014.0
episode_length_test                  9.858974
returns_test                        -2.895596
return_std_test                      2.855584
average_reward_test                 -0.293306
round_time_test        0 days 00:00:10.598030
round_time_total       0 days 00:15:45.385327
loss_total               130138050905767936.0
loss_critic              162672560684520192.0
loss_actor                    -1514555582.784
memory_size                            5799.0 

=== epoch 4/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1020
episode_length                       9.793137
returns                             -2.838932
return_std                           2.856719
average_reward                       -0.28971
round_time             0 days 00:15:45.738003
episodes_test                          1013.0
episode_length_test                  9.868707
returns_test                        -2.877635
return_std_test                      2.752575
average_reward_test                 -0.291305
round_time_test        0 days 00:00:10.719526
round_time_total       0 days 00:15:45.739137
loss_total               138785133422075440.0
loss_critic              173481414005998208.0
loss_actor                      -1517454516.8
memory_size                            5799.0 

=== epoch 4/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1018
episode_length                       9.811395
returns                             -2.811307
return_std                           2.839895
average_reward                      -0.286406
round_time             0 days 00:15:47.965603
episodes_test                          1022.0
episode_length_test                  9.783757
returns_test                        -2.943499
return_std_test                      2.761089
average_reward_test                  -0.30074
round_time_test        0 days 00:00:10.945900
round_time_total       0 days 00:15:47.966722
loss_total               142693173746882048.0
loss_critic              178366464002642400.0
loss_actor                    -1562767854.912
memory_size                            5799.0 

=== epoch 4/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1014
episode_length                       9.858974
returns                             -2.778797
return_std                           2.899356
average_reward                      -0.282032
round_time             0 days 00:15:43.650059
episodes_test                          1017.0
episode_length_test                  9.829892
returns_test                        -2.930813
return_std_test                      2.911405
average_reward_test                 -0.297752
round_time_test        0 days 00:00:10.625689
round_time_total       0 days 00:15:43.651189
loss_total               152594760188945248.0
loss_critic              190743447052537024.0
loss_actor                     -1612212209.28
memory_size                            5799.0 

=== epoch 4/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                 1008
episode_length                       9.914683
returns                             -2.778986
return_std                           2.902388
average_reward                      -0.279737
round_time             0 days 00:15:48.360378
episodes_test                          1009.0
episode_length_test                  9.905847
returns_test                        -2.851028
return_std_test                      2.897959
average_reward_test                 -0.287793
round_time_test        0 days 00:00:10.650958
round_time_total       0 days 00:15:48.361479
loss_total               166052829018829344.0
loss_critic              207566032574496128.0
loss_actor                    -1682471810.816
memory_size                            5799.0 

=== epoch 4/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1007
episode_length                       9.917577
returns                             -2.816797
return_std                            2.92222
average_reward                      -0.284259
round_time             0 days 00:15:44.214574
episodes_test                          1008.0
episode_length_test                  9.915675
returns_test                        -2.933141
return_std_test                      2.924702
average_reward_test                 -0.295875
round_time_test        0 days 00:00:10.490193
round_time_total       0 days 00:15:44.215683
loss_total               175620606009179904.0
loss_critic              219525753609497088.0
loss_actor                    -1747450604.928
memory_size                            5799.0 

=== epoch 4/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1009
episode_length                       9.900892
returns                             -2.947548
return_std                           2.963512
average_reward                      -0.297867
round_time             0 days 00:15:44.200130
episodes_test                          1012.0
episode_length_test                  9.873518
returns_test                        -2.830552
return_std_test                      2.825184
average_reward_test                 -0.286183
round_time_test        0 days 00:00:10.574871
round_time_total       0 days 00:15:44.201242
loss_total               179444622208368896.0
loss_critic              224305773992701056.0
loss_actor                    -1772989753.792
memory_size                            5799.0 

=== epoch 4/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1009
episode_length                        9.89891
returns                             -2.973799
return_std                           3.012349
average_reward                      -0.299776
round_time             0 days 00:15:45.336619
episodes_test                          1009.0
episode_length_test                  9.906838
returns_test                        -2.872977
return_std_test                      2.782327
average_reward_test                 -0.289773
round_time_test        0 days 00:00:10.646139
round_time_total       0 days 00:15:45.337739
loss_total               188550653734020960.0
loss_critic              235688313178306912.0
loss_actor                    -1820339672.384
memory_size                            5799.0 

=== epoch 4/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1005
episode_length                       9.940299
returns                             -2.903757
return_std                           3.006519
average_reward                      -0.291849
round_time             0 days 00:15:46.412293
episodes_test                          1013.0
episode_length_test                   9.86772
returns_test                         -2.83025
return_std_test                      2.833413
average_reward_test                 -0.286453
round_time_test        0 days 00:00:10.663632
round_time_total       0 days 00:15:46.413432
loss_total               202652324002694944.0
loss_critic              253315400638071296.0
loss_actor                    -1866781157.632
memory_size                            5799.0 

=== epoch 4/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1005
episode_length                       9.940299
returns                             -2.845503
return_std                           2.977964
average_reward                      -0.286204
round_time             0 days 00:15:47.140886
episodes_test                          1013.0
episode_length_test                  9.864758
returns_test                         -2.87581
return_std_test                      2.940618
average_reward_test                 -0.290749
round_time_test        0 days 00:00:10.707204
round_time_total       0 days 00:15:47.142162
loss_total               214607407673459616.0
loss_critic              268259254953796704.0
loss_actor                    -1904313487.936
memory_size                            5799.0 

=== epoch 4/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1006
episode_length                       9.931412
returns                             -2.846209
return_std                           2.923128
average_reward                      -0.286752
round_time             0 days 00:15:45.775149
episodes_test                          1013.0
episode_length_test                  9.865745
returns_test                        -2.860596
return_std_test                      2.800656
average_reward_test                 -0.289148
round_time_test        0 days 00:00:10.810780
round_time_total       0 days 00:15:45.776301
loss_total               225504191379352320.0
loss_critic              281880234433691264.0
loss_actor                    -1955900750.656
memory_size                            5799.0 

=== epoch 4/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1007
episode_length                       9.926514
returns                             -2.807145
return_std                           2.862022
average_reward                      -0.282717
round_time             0 days 00:15:45.251942
episodes_test                          1015.0
episode_length_test                  9.851232
returns_test                         -2.82663
return_std_test                      2.825833
average_reward_test                 -0.286822
round_time_test        0 days 00:00:10.649789
round_time_total       0 days 00:15:45.253056
loss_total               227088689818691968.0
loss_critic              283860857318046432.0
loss_actor                    -1997398355.136
memory_size                            5799.0 

=== epoch 4/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.12it/s]
episodes                                 1008
episode_length                       9.916667
returns                             -2.830668
return_std                           2.813297
average_reward                       -0.28548
round_time             0 days 00:15:45.942340
episodes_test                          1018.0
episode_length_test                    9.8222
returns_test                        -2.903201
return_std_test                      2.937605
average_reward_test                 -0.295462
round_time_test        0 days 00:00:10.453777
round_time_total       0 days 00:15:45.943447
loss_total               237457657816688896.0
loss_critic              296822067125490304.0
loss_actor                    -2032139587.456
memory_size                            5799.0 

=== epoch 4/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:45<00:00,  2.11it/s]
episodes                                 1014
episode_length                       9.857002
returns                             -2.941023
return_std                           2.746186
average_reward                      -0.298104
round_time             0 days 00:15:46.143690
episodes_test                          1014.0
episode_length_test                  9.854043
returns_test                        -2.979777
return_std_test                      2.977858
average_reward_test                 -0.302188
round_time_test        0 days 00:00:10.664392
round_time_total       0 days 00:15:46.144863
loss_total               247605575921113760.0
loss_critic              309506964489733376.0
loss_actor                    -2082198780.096
memory_size                            5799.0 

=== epoch 4/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1019
episode_length                       9.806673
returns                             -3.032297
return_std                           2.716052
average_reward                      -0.309403
round_time             0 days 00:15:45.179192
episodes_test                          1014.0
episode_length_test                  9.858974
returns_test                        -2.890786
return_std_test                      2.876793
average_reward_test                 -0.293132
round_time_test        0 days 00:00:10.750236
round_time_total       0 days 00:15:45.180375
loss_total               259845702943382688.0
loss_critic              324807122940078336.0
loss_actor                    -2140030620.032
memory_size                            5799.0 

=== epoch 4/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1017
episode_length                       9.824975
returns                             -3.023963
return_std                           2.752513
average_reward                      -0.307788
round_time             0 days 00:15:47.054903
episodes_test                          1011.0
episode_length_test                  9.888229
returns_test                        -2.824552
return_std_test                      2.882838
average_reward_test                  -0.28554
round_time_test        0 days 00:00:10.726481
round_time_total       0 days 00:15:47.056000
loss_total               278574470809018944.0
loss_critic              348218082631463488.0
loss_actor                    -2195551334.784
memory_size                            5799.0 

=== epoch 4/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1013
episode_length                       9.856861
returns                             -2.948118
return_std                           2.784863
average_reward                      -0.298839
round_time             0 days 00:15:43.328072
episodes_test                          1011.0
episode_length_test                  9.883284
returns_test                        -2.937857
return_std_test                      2.859367
average_reward_test                 -0.297666
round_time_test        0 days 00:00:10.574739
round_time_total       0 days 00:15:43.329172
loss_total               282547608657124640.0
loss_critic              353184504741342720.0
loss_actor                    -2238874203.648
memory_size                            5799.0 

=== epoch 4/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                 1011
episode_length                       9.877349
returns                             -2.856312
return_std                             2.7898
average_reward                      -0.289341
round_time             0 days 00:15:47.257178
episodes_test                          1013.0
episode_length_test                  9.861797
returns_test                        -2.855057
return_std_test                      2.904613
average_reward_test                  -0.28961
round_time_test        0 days 00:00:10.518500
round_time_total       0 days 00:15:47.258406
loss_total               296647207481325376.0
loss_critic              370809003196968576.0
loss_actor                    -2275505652.864
memory_size                            5799.0 

=== epoch 4/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1009
episode_length                       9.900892
returns                             -2.793038
return_std                           2.839274
average_reward                      -0.282368
round_time             0 days 00:15:45.441126
episodes_test                          1010.0
episode_length_test                   9.89703
returns_test                        -2.685097
return_std_test                      2.824649
average_reward_test                 -0.271094
round_time_test        0 days 00:00:10.640436
round_time_total       0 days 00:15:45.442247
loss_total               302630657235349376.0
loss_critic              378288314917589120.0
loss_actor                     -2297720955.84
memory_size                            5799.0 

=== epoch 4/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1007
episode_length                       9.917577
returns                             -2.722835
return_std                           2.913893
average_reward                      -0.274204
round_time             0 days 00:15:44.152406
episodes_test                          1008.0
episode_length_test                  9.914683
returns_test                        -2.762158
return_std_test                      2.936789
average_reward_test                 -0.278101
round_time_test        0 days 00:00:10.525411
round_time_total       0 days 00:15:44.153677
loss_total               313885454451370624.0
loss_critic              392356811188507520.0
loss_actor                    -2354407195.904
memory_size                            5799.0 

=== epoch 4/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:43<00:00,  2.12it/s]
episodes                                 1008
episode_length                       9.909722
returns                             -2.688464
return_std                           2.870471
average_reward                      -0.271002
round_time             0 days 00:15:44.357783
episodes_test                          1004.0
episode_length_test                  9.951195
returns_test                        -2.776989
return_std_test                      2.943763
average_reward_test                 -0.279117
round_time_test        0 days 00:00:10.542034
round_time_total       0 days 00:15:44.358919
loss_total               325069180367496256.0
loss_critic              406336468500449600.0
loss_actor                    -2402394570.752
memory_size                            5799.0 

=== epoch 4/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                 1008
episode_length                       9.915675
returns                             -2.756626
return_std                           2.954551
average_reward                      -0.278172
round_time             0 days 00:15:45.198538
episodes_test                          1011.0
episode_length_test                  9.886251
returns_test                        -2.801182
return_std_test                      2.856751
average_reward_test                  -0.28303
round_time_test        0 days 00:00:10.499878
round_time_total       0 days 00:15:45.199692
loss_total               343788475427210688.0
loss_critic              429735587194632960.0
loss_actor                    -2446021718.144
memory_size                            5799.0 

=== epoch 4/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1011
episode_length                       9.884273
returns                             -2.851727
return_std                           2.929085
average_reward                      -0.288561
round_time             0 days 00:15:42.915104
episodes_test                          1008.0
episode_length_test                  9.916667
returns_test                        -2.828534
return_std_test                      2.933835
average_reward_test                 -0.285108
round_time_test        0 days 00:00:10.645674
round_time_total       0 days 00:15:42.916331
loss_total               363150576187460288.0
loss_critic              453938212231727552.0
loss_actor                    -2537478733.824
memory_size                            5799.0 

=== epoch 4/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:42<00:00,  2.12it/s]
episodes                                 1014
episode_length                        9.85503
returns                             -2.845261
return_std                           2.936997
average_reward                       -0.28898
round_time             0 days 00:15:42.995609
episodes_test                          1003.0
episode_length_test                  9.962114
returns_test                        -2.688921
return_std_test                      2.925944
average_reward_test                 -0.270042
round_time_test        0 days 00:00:10.509338
round_time_total       0 days 00:15:42.996811
loss_total               383307728696891136.0
loss_critic              479134652785838016.0
loss_actor                    -2602962736.128
memory_size                            5799.0 

=== epoch 4/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:23<00:00,  2.17it/s]
episodes                                 1014
episode_length                       9.846154
returns                             -2.903032
return_std                           2.889061
average_reward                      -0.294599
round_time             0 days 00:15:24.187011
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.885561
return_std_test                      2.816004
average_reward_test                 -0.292207
round_time_test        0 days 00:00:10.516187
round_time_total       0 days 00:15:24.188103
loss_total               402988112054470080.0
loss_critic              503735131773968832.0
loss_actor                     -2652523084.16
memory_size                            5799.0 

=== epoch 4/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                 1013
episode_length                       9.863771
returns                             -2.948997
return_std                           2.950588
average_reward                      -0.298995
round_time             0 days 00:14:53.649983
episodes_test                          1007.0
episode_length_test                  9.923535
returns_test                        -2.830669
return_std_test                      2.886968
average_reward_test                 -0.285041
round_time_test        0 days 00:00:10.253834
round_time_total       0 days 00:14:53.651076
loss_total               417436463223833344.0
loss_critic              521795569996401728.0
loss_actor                    -2702039423.744
memory_size                            5799.0 

=== epoch 4/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:55<00:00,  2.23it/s]
episodes                                 1011
episode_length                        9.87636
returns                             -2.883056
return_std                           2.873776
average_reward                      -0.291769
round_time             0 days 00:14:55.781659
episodes_test                          1012.0
episode_length_test                  9.878458
returns_test                        -2.816133
return_std_test                      2.897254
average_reward_test                 -0.284884
round_time_test        0 days 00:00:10.290725
round_time_total       0 days 00:14:55.782782
loss_total               441456675680549888.0
loss_critic              551820835984982912.0
loss_actor                    -2782992752.768
memory_size                            5799.0 

=== epoch 4/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:54<00:00,  2.23it/s]
episodes                                 1012
episode_length                        9.87253
returns                             -2.842136
return_std                           2.882107
average_reward                      -0.287712
round_time             0 days 00:14:55.381138
episodes_test                          1007.0
episode_length_test                  9.927507
returns_test                        -2.717015
return_std_test                      2.867865
average_reward_test                 -0.273434
round_time_test        0 days 00:00:10.013231
round_time_total       0 days 00:14:55.382230
loss_total               460608618109696192.0
loss_critic              575760763103366592.0
loss_actor                    -2843400065.536
memory_size                            5799.0 

=== epoch 4/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                 1011
episode_length                       9.878338
returns                             -2.805328
return_std                           2.831428
average_reward                      -0.283663
round_time             0 days 00:14:54.460739
episodes_test                          1019.0
episode_length_test                  9.810599
returns_test                        -2.853788
return_std_test                      2.812685
average_reward_test                 -0.290605
round_time_test        0 days 00:00:10.175596
round_time_total       0 days 00:14:54.461837
loss_total               465646304457991936.0
loss_critic              582057870167931648.0
loss_actor                    -2858196405.888
memory_size                            5799.0 

=== epoch 4/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                 1015
episode_length                       9.844335
returns                             -2.786639
return_std                           2.801798
average_reward                      -0.282614
round_time             0 days 00:14:53.870332
episodes_test                          1019.0
episode_length_test                  9.804711
returns_test                        -2.905514
return_std_test                      2.820231
average_reward_test                  -0.29591
round_time_test        0 days 00:00:10.182241
round_time_total       0 days 00:14:53.871421
loss_total               478851583796439936.0
loss_critic              598564469478430592.0
loss_actor                    -2915678032.768
memory_size                            5799.0 

=== epoch 4/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:51<00:00,  2.24it/s]
episodes                                 1023
episode_length                       9.763441
returns                             -2.824562
return_std                           2.772853
average_reward                      -0.289179
round_time             0 days 00:14:51.883227
episodes_test                          1014.0
episode_length_test                  9.857988
returns_test                        -2.939355
return_std_test                      2.886807
average_reward_test                 -0.297665
round_time_test        0 days 00:00:10.196051
round_time_total       0 days 00:14:51.884357
loss_total               513982800979174592.0
loss_critic              642478490418904320.0
loss_actor                    -2981387382.784
memory_size                            5799.0 

=== epoch 4/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                 1028
episode_length                       9.723735
returns                             -2.844123
return_std                           2.748572
average_reward                      -0.292592
round_time             0 days 00:14:53.764994
episodes_test                          1013.0
episode_length_test                   9.86772
returns_test                        -2.951048
return_std_test                      2.893707
average_reward_test                 -0.298855
round_time_test        0 days 00:00:10.209127
round_time_total       0 days 00:14:53.766095
loss_total               518601287326695936.0
loss_critic              648251597514713600.0
loss_actor                    -3036095913.088
memory_size                            5799.0 

=== epoch 4/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                 1026
episode_length                       9.733918
returns                             -2.829419
return_std                           2.738792
average_reward                      -0.290106
round_time             0 days 00:14:54.522618
episodes_test                          1025.0
episode_length_test                  9.752195
returns_test                        -2.880879
return_std_test                      2.749037
average_reward_test                 -0.295027
round_time_test        0 days 00:00:10.150820
round_time_total       0 days 00:14:54.523699
loss_total               527859358447284800.0
loss_critic              659824186125539328.0
loss_actor                    -3078298694.784
memory_size                            5799.0 

=== epoch 4/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:52<00:00,  2.24it/s]
episodes                                 1025
episode_length                       9.746341
returns                             -2.849301
return_std                           2.772386
average_reward                      -0.292188
round_time             0 days 00:14:53.186203
episodes_test                          1017.0
episode_length_test                  9.826942
returns_test                        -2.866627
return_std_test                      2.862031
average_reward_test                 -0.291546
round_time_test        0 days 00:00:10.286574
round_time_total       0 days 00:14:53.187321
loss_total               543510924864021056.0
loss_critic              679388644731649024.0
loss_actor                    -3129348993.536
memory_size                            5799.0 

=== epoch 4/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:52<00:00,  2.24it/s]
episodes                                 1024
episode_length                       9.751953
returns                             -2.871202
return_std                           2.770991
average_reward                       -0.29496
round_time             0 days 00:14:53.040732
episodes_test                          1016.0
episode_length_test                  9.838583
returns_test                        -2.902489
return_std_test                      2.734201
average_reward_test                 -0.294604
round_time_test        0 days 00:00:10.359604
round_time_total       0 days 00:14:53.041866
loss_total               576485725888982272.0
loss_critic              720607144854283136.0
loss_actor                      -3198415433.6
memory_size                            5799.0 

=== epoch 4/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                 1015
episode_length                        9.84335
returns                             -2.785224
return_std                           2.776801
average_reward                       -0.28294
round_time             0 days 00:14:11.771581
episodes_test                          1003.0
episode_length_test                   9.96012
returns_test                        -2.807302
return_std_test                      2.965749
average_reward_test                 -0.281283
round_time_test        0 days 00:00:10.069868
round_time_total       0 days 00:14:11.772678
loss_total               593035639738503040.0
loss_critic              741294537503338880.0
loss_actor                    -3237617276.928
memory_size                            5799.0 

=== epoch 4/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1013
episode_length                       9.871668
returns                             -2.756952
return_std                           2.853272
average_reward                      -0.279279
round_time             0 days 00:14:03.016072
episodes_test                          1005.0
episode_length_test                  9.948259
returns_test                        -2.793174
return_std_test                      2.832286
average_reward_test                 -0.280582
round_time_test        0 days 00:00:10.103271
round_time_total       0 days 00:14:03.017195
loss_total               609529771287222656.0
loss_critic              761912201229495168.0
loss_actor                    -3293493391.872
memory_size                            5799.0 

=== epoch 4/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1013
episode_length                       9.861797
returns                             -2.856797
return_std                           2.892102
average_reward                      -0.289866
round_time             0 days 00:14:03.928603
episodes_test                          1013.0
episode_length_test                  9.866732
returns_test                        -2.810723
return_std_test                      2.767399
average_reward_test                 -0.284741
round_time_test        0 days 00:00:09.972857
round_time_total       0 days 00:14:03.929687
loss_total               630802947911530368.0
loss_critic              788503671127264000.0
loss_actor                     -3360455221.76
memory_size                            5799.0 

=== epoch 4/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1009
episode_length                       9.896928
returns                             -2.878587
return_std                           2.901612
average_reward                      -0.290804
round_time             0 days 00:14:02.175764
episodes_test                          1018.0
episode_length_test                  9.821218
returns_test                        -2.837068
return_std_test                        2.7695
average_reward_test                 -0.288746
round_time_test        0 days 00:00:10.216876
round_time_total       0 days 00:14:02.176836
loss_total               645028555722518912.0
loss_critic              806285680776109312.0
loss_actor                    -3378671547.136
memory_size                            5799.0 

=== epoch 4/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1008
episode_length                       9.910714
returns                             -2.857732
return_std                           2.939259
average_reward                      -0.289023
round_time             0 days 00:14:02.309699
episodes_test                          1017.0
episode_length_test                  9.823009
returns_test                        -2.713365
return_std_test                      2.733839
average_reward_test                 -0.275855
round_time_test        0 days 00:00:10.016225
round_time_total       0 days 00:14:02.310782
loss_total               654949923480424960.0
loss_critic              818687390334979200.0
loss_actor                    -3429876974.464
memory_size                            5799.0 

=== epoch 4/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.877349
returns                             -2.901228
return_std                           2.968861
average_reward                      -0.294085
round_time             0 days 00:14:02.565274
episodes_test                          1011.0
episode_length_test                   9.88724
returns_test                        -2.638061
return_std_test                      2.794697
average_reward_test                  -0.26651
round_time_test        0 days 00:00:10.098072
round_time_total       0 days 00:14:02.566344
loss_total               669686380955072256.0
loss_critic              837107962316800896.0
loss_actor                     -3446749920.64
memory_size                            5799.0 

=== epoch 4/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1008
episode_length                       9.910714
returns                             -2.824502
return_std                           2.898435
average_reward                      -0.284633
round_time             0 days 00:14:03.943725
episodes_test                          1021.0
episode_length_test                   9.79334
returns_test                        -2.796519
return_std_test                      2.764507
average_reward_test                 -0.285466
round_time_test        0 days 00:00:09.946555
round_time_total       0 days 00:14:03.944798
loss_total               689619746036085888.0
loss_critic              862024667519164288.0
loss_actor                    -3512133476.864
memory_size                            5799.0 

=== epoch 4/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1008
episode_length                       9.912698
returns                             -2.742399
return_std                           2.852048
average_reward                      -0.276552
round_time             0 days 00:14:02.504507
episodes_test                          1014.0
episode_length_test                  9.852071
returns_test                        -2.768673
return_std_test                      2.780005
average_reward_test                 -0.281064
round_time_test        0 days 00:00:10.116414
round_time_total       0 days 00:14:02.505568
loss_total               715246050316161920.0
loss_critic              894057547104755072.0
loss_actor                    -3586305603.328
memory_size                            5799.0 

=== epoch 4/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1012
episode_length                       9.876482
returns                             -2.679603
return_std                           2.794077
average_reward                      -0.270885
round_time             0 days 00:14:02.500527
episodes_test                          1027.0
episode_length_test                  9.733204
returns_test                        -2.973839
return_std_test                      2.698467
average_reward_test                 -0.305272
round_time_test        0 days 00:00:10.084642
round_time_total       0 days 00:14:02.501597
loss_total               758104957384403200.0
loss_critic              947631180756446848.0
loss_actor                    -3688089521.152
memory_size                            5799.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  199
episode_length                      10.050251
returns                             -3.161438
return_std                           3.234905
average_reward                      -0.314563
round_time             0 days 00:14:01.168318
episodes_test                          1019.0
episode_length_test                  9.809617
returns_test                        -2.749242
return_std_test                      2.812368
average_reward_test                 -0.279985
round_time_test        0 days 00:00:10.229998
round_time_total       0 days 00:14:01.169418
loss_total               781761119960615040.0
loss_critic              977201381937676032.0
loss_actor                    -3752062701.184
memory_size                         5800.6065 

=== epoch 5/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                  400
episode_length                         9.9925
returns                             -2.914835
return_std                            3.01811
average_reward                       -0.29116
round_time             0 days 00:14:03.134355
episodes_test                          1015.0
episode_length_test                  9.846305
returns_test                        -2.708375
return_std_test                      2.696393
average_reward_test                 -0.274516
round_time_test        0 days 00:00:10.259851
round_time_total       0 days 00:14:03.135436
loss_total               817402700285997952.0
loss_critic             1021753357619282560.0
loss_actor                    -3816259669.504
memory_size                            5820.0 

=== epoch 5/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                  602
episode_length                       9.963455
returns                             -2.887716
return_std                           2.995408
average_reward                      -0.289382
round_time             0 days 00:14:02.529204
episodes_test                          1001.0
episode_length_test                  9.989011
returns_test                        -2.659095
return_std_test                      2.885058
average_reward_test                 -0.266087
round_time_test        0 days 00:00:10.183821
round_time_total       0 days 00:14:02.530270
loss_total               851582013760862208.0
loss_critic             1064477498908812032.0
loss_actor                    -3875768661.888
memory_size                            5820.0 

=== epoch 5/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  805
episode_length                       9.932919
returns                             -2.895653
return_std                           2.965253
average_reward                      -0.290992
round_time             0 days 00:14:01.475233
episodes_test                          1003.0
episode_length_test                  9.964108
returns_test                        -2.805849
return_std_test                      2.909893
average_reward_test                 -0.280985
round_time_test        0 days 00:00:10.041721
round_time_total       0 days 00:14:01.476324
loss_total               903798147192480384.0
loss_critic             1129747665006844928.0
loss_actor                    -3991456171.776
memory_size                            5820.0 

=== epoch 5/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1010
episode_length                        9.89901
returns                             -2.951871
return_std                            2.90913
average_reward                      -0.297929
round_time             0 days 00:14:03.637633
episodes_test                          1013.0
episode_length_test                  9.862784
returns_test                        -2.888262
return_std_test                      2.873752
average_reward_test                 -0.292635
round_time_test        0 days 00:00:10.125192
round_time_total       0 days 00:14:03.638696
loss_total               940511210403813888.0
loss_critic             1175638992197798400.0
loss_actor                    -4102627375.616
memory_size                            5820.0 

=== epoch 5/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.888229
returns                             -2.929527
return_std                           2.832086
average_reward                      -0.295892
round_time             0 days 00:14:02.404942
episodes_test                          1010.0
episode_length_test                   9.89901
returns_test                        -2.745459
return_std_test                      3.014494
average_reward_test                 -0.277123
round_time_test        0 days 00:00:10.087226
round_time_total       0 days 00:14:02.406009
loss_total               982259241493958016.0
loss_critic             1227824031036856064.0
loss_actor                    -4184305753.856
memory_size                            5820.0 

=== epoch 5/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1012
episode_length                       9.870553
returns                             -2.996734
return_std                           2.815274
average_reward                      -0.303913
round_time             0 days 00:14:03.256309
episodes_test                          1012.0
episode_length_test                   9.87253
returns_test                        -3.080494
return_std_test                      2.956469
average_reward_test                 -0.312158
round_time_test        0 days 00:00:10.135639
round_time_total       0 days 00:14:03.257395
loss_total              1025947678352475136.0
loss_critic             1282434574505104896.0
loss_actor                    -4257349684.608
memory_size                            5820.0 

=== epoch 5/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1010
episode_length                       9.885149
returns                             -3.022529
return_std                           2.810541
average_reward                      -0.305893
round_time             0 days 00:14:02.917013
episodes_test                          1005.0
episode_length_test                  9.943284
returns_test                         -2.92174
return_std_test                      2.962993
average_reward_test                 -0.293519
round_time_test        0 days 00:00:10.149957
round_time_total       0 days 00:14:02.918102
loss_total              1054849420324222592.0
loss_critic             1318561753101512960.0
loss_actor                     -4312433012.48
memory_size                            5820.0 

=== epoch 5/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1010
episode_length                       9.891089
returns                             -3.056742
return_std                           2.831741
average_reward                       -0.30906
round_time             0 days 00:14:03.745806
episodes_test                          1015.0
episode_length_test                  9.848276
returns_test                        -2.917735
return_std_test                      2.954299
average_reward_test                 -0.296067
round_time_test        0 days 00:00:10.112950
round_time_total       0 days 00:14:03.746893
loss_total              1086446439627818496.0
loss_critic             1358058025893125632.0
loss_actor                    -4358026368.256
memory_size                            5820.0 

=== epoch 5/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1008
episode_length                       9.905754
returns                             -3.071423
return_std                           2.862431
average_reward                      -0.309871
round_time             0 days 00:14:02.638032
episodes_test                          1011.0
episode_length_test                  9.889219
returns_test                        -2.760457
return_std_test                      2.852277
average_reward_test                 -0.278943
round_time_test        0 days 00:00:09.943985
round_time_total       0 days 00:14:02.639105
loss_total              1111980345941448192.0
loss_critic             1389975407932611840.0
loss_actor                     -4407568578.56
memory_size                            5820.0 

=== epoch 5/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.875371
returns                             -3.015318
return_std                           2.877262
average_reward                      -0.305609
round_time             0 days 00:14:02.509137
episodes_test                          1005.0
episode_length_test                  9.949254
returns_test                        -2.875494
return_std_test                       3.02514
average_reward_test                 -0.288898
round_time_test        0 days 00:00:10.040442
round_time_total       0 days 00:14:02.510227
loss_total              1143548283600576128.0
loss_critic             1429435330354414080.0
loss_actor                    -4493169337.984
memory_size                            5820.0 

=== epoch 5/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1009
episode_length                       9.900892
returns                             -2.956052
return_std                           2.887377
average_reward                      -0.298997
round_time             0 days 00:14:03.022565
episodes_test                          1012.0
episode_length_test                  9.879447
returns_test                        -2.839574
return_std_test                      2.867684
average_reward_test                 -0.287232
round_time_test        0 days 00:00:09.944302
round_time_total       0 days 00:14:03.023641
loss_total              1179127282902381824.0
loss_critic             1473909078609792768.0
loss_actor                    -4580275802.112
memory_size                            5820.0 

=== epoch 5/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.878458
returns                             -2.918282
return_std                           2.854517
average_reward                      -0.295648
round_time             0 days 00:14:02.328060
episodes_test                          1017.0
episode_length_test                  9.832842
returns_test                         -2.83908
return_std_test                      2.835587
average_reward_test                 -0.288734
round_time_test        0 days 00:00:09.963653
round_time_total       0 days 00:14:02.329126
loss_total              1222417702744987136.0
loss_critic             1528022100891903488.0
loss_actor                    -4643701207.552
memory_size                            5820.0 

=== epoch 5/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1004
episode_length                       9.948207
returns                             -2.871123
return_std                           2.828501
average_reward                      -0.288832
round_time             0 days 00:14:02.842126
episodes_test                           998.0
episode_length_test                 10.012024
returns_test                         -2.85946
return_std_test                      2.956687
average_reward_test                 -0.285724
round_time_test        0 days 00:00:09.939388
round_time_total       0 days 00:14:02.843196
loss_total              1249353812088581120.0
loss_critic             1561692239027389952.0
loss_actor                     -4750196020.48
memory_size                            5820.0 

=== epoch 5/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  999
episode_length                       9.996997
returns                             -2.755215
return_std                           2.850005
average_reward                      -0.275222
round_time             0 days 00:14:01.960604
episodes_test                          1010.0
episode_length_test                  9.893069
returns_test                        -2.917492
return_std_test                      2.932076
average_reward_test                 -0.294714
round_time_test        0 days 00:00:10.059814
round_time_total       0 days 00:14:01.961709
loss_total              1298046553465244672.0
loss_critic             1622558164524153600.0
loss_actor                    -4834687585.536
memory_size                            5820.0 

=== epoch 5/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                  998
episode_length                      10.017034
returns                             -2.696527
return_std                           2.821798
average_reward                      -0.269235
round_time             0 days 00:14:03.441270
episodes_test                          1014.0
episode_length_test                  9.856016
returns_test                        -2.690059
return_std_test                      2.735429
average_reward_test                 -0.272519
round_time_test        0 days 00:00:10.115203
round_time_total       0 days 00:14:03.442358
loss_total              1362836906596020224.0
loss_critic             1703546104445122048.0
loss_actor                    -4953247717.376
memory_size                            5820.0 

=== epoch 5/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1003
episode_length                       9.959123
returns                             -2.743286
return_std                           2.821316
average_reward                      -0.275707
round_time             0 days 00:14:01.324725
episodes_test                          1004.0
episode_length_test                  9.950199
returns_test                        -2.762875
return_std_test                      2.810446
average_reward_test                 -0.277376
round_time_test        0 days 00:00:10.008842
round_time_total       0 days 00:14:01.325817
loss_total              1431887587578977792.0
loss_critic             1789859454718188800.0
loss_actor                    -5069128199.936
memory_size                            5820.0 

=== epoch 5/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1006
episode_length                       9.931412
returns                             -2.789594
return_std                           2.818832
average_reward                      -0.280653
round_time             0 days 00:14:02.877036
episodes_test                          1011.0
episode_length_test                  9.886251
returns_test                        -2.888961
return_std_test                      2.905401
average_reward_test                 -0.291927
round_time_test        0 days 00:00:10.170996
round_time_total       0 days 00:14:02.878116
loss_total              1487959127068462080.0
loss_critic             1859948877054967040.0
loss_actor                     -5141501863.68
memory_size                            5820.0 

=== epoch 5/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1015
episode_length                        9.84335
returns                             -2.824957
return_std                           2.798004
average_reward                      -0.287308
round_time             0 days 00:14:02.787783
episodes_test                          1017.0
episode_length_test                  9.826942
returns_test                        -2.938945
return_std_test                      2.840844
average_reward_test                 -0.298706
round_time_test        0 days 00:00:10.033462
round_time_total       0 days 00:14:02.788870
loss_total              1485656514244022016.0
loss_critic             1857070611885558016.0
loss_actor                    -5167188784.128
memory_size                            5820.0 

=== epoch 5/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1018
episode_length                        9.81336
returns                             -2.909733
return_std                           2.774953
average_reward                      -0.296696
round_time             0 days 00:14:02.257859
episodes_test                          1007.0
episode_length_test                  9.922542
returns_test                        -2.809356
return_std_test                      2.939251
average_reward_test                 -0.283165
round_time_test        0 days 00:00:10.113442
round_time_total       0 days 00:14:02.258975
loss_total              1529277395278472704.0
loss_critic             1911596710721900032.0
loss_actor                    -5254542753.792
memory_size                            5820.0 

=== epoch 5/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1018
episode_length                       9.814342
returns                             -3.027054
return_std                            2.80388
average_reward                      -0.308315
round_time             0 days 00:14:03.206307
episodes_test                          1015.0
episode_length_test                   9.84335
returns_test                        -2.999181
return_std_test                      2.996386
average_reward_test                 -0.304647
round_time_test        0 days 00:00:09.958088
round_time_total       0 days 00:14:03.207407
loss_total              1597383869371177216.0
loss_critic             1996729801490944768.0
loss_actor                    -5370884810.496
memory_size                            5820.0 

=== epoch 5/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1013
episode_length                       9.863771
returns                             -2.951778
return_std                           2.813377
average_reward                      -0.298959
round_time             0 days 00:14:02.183110
episodes_test                          1009.0
episode_length_test                  9.901883
returns_test                        -2.879729
return_std_test                      2.822829
average_reward_test                 -0.290159
round_time_test        0 days 00:00:10.151995
round_time_total       0 days 00:14:02.184180
loss_total              1661620075917228288.0
loss_critic             2077025058002766336.0
loss_actor                    -5515311827.456
memory_size                            5820.0 

=== epoch 5/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1008
episode_length                       9.909722
returns                             -2.944219
return_std                           2.862101
average_reward                      -0.297314
round_time             0 days 00:14:01.257462
episodes_test                          1016.0
episode_length_test                   9.83563
returns_test                        -2.817052
return_std_test                       2.76795
average_reward_test                 -0.285684
round_time_test        0 days 00:00:10.069310
round_time_total       0 days 00:14:01.258532
loss_total              1712111953788239104.0
loss_critic             2140139906638609920.0
loss_actor                    -5593586128.128
memory_size                            5820.0 

=== epoch 5/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1005
episode_length                       9.941294
returns                             -2.846338
return_std                           2.882774
average_reward                      -0.286113
round_time             0 days 00:14:01.088072
episodes_test                          1021.0
episode_length_test                  9.786484
returns_test                        -2.892426
return_std_test                      2.846782
average_reward_test                 -0.295101
round_time_test        0 days 00:00:10.050408
round_time_total       0 days 00:14:01.089143
loss_total              1772572908871657472.0
loss_critic             2215716097744103680.0
loss_actor                    -5715813126.656
memory_size                            5820.0 

=== epoch 5/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1006
episode_length                        9.93837
returns                             -2.763423
return_std                           2.884107
average_reward                      -0.278093
round_time             0 days 00:14:01.755767
episodes_test                          1012.0
episode_length_test                  9.874506
returns_test                        -2.753126
return_std_test                      2.976437
average_reward_test                  -0.27819
round_time_test        0 days 00:00:10.100147
round_time_total       0 days 00:14:01.756833
loss_total              1828986763551055104.0
loss_critic             2286233414581522176.0
loss_actor                    -5774602289.408
memory_size                            5820.0 

=== epoch 5/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1009
episode_length                       9.891972
returns                             -2.731192
return_std                           2.822579
average_reward                      -0.275407
round_time             0 days 00:14:02.891848
episodes_test                          1013.0
episode_length_test                  9.862784
returns_test                        -2.664688
return_std_test                      2.817498
average_reward_test                 -0.270206
round_time_test        0 days 00:00:10.001775
round_time_total       0 days 00:14:02.892904
loss_total              1864767088207910144.0
loss_critic             2330958819659561984.0
loss_actor                    -5806527837.696
memory_size                            5820.0 

=== epoch 5/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1009
episode_length                       9.896928
returns                              -2.69218
return_std                           2.855406
average_reward                      -0.271601
round_time             0 days 00:14:02.140412
episodes_test                          1026.0
episode_length_test                  9.745614
returns_test                        -2.683575
return_std_test                       2.64509
average_reward_test                 -0.275246
round_time_test        0 days 00:00:10.091150
round_time_total       0 days 00:14:02.141492
loss_total              1878953235925262336.0
loss_critic             2348691504357791744.0
loss_actor                     -5785595823.36
memory_size                            5820.0 

=== epoch 5/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1014
episode_length                       9.858974
returns                             -2.616988
return_std                           2.778989
average_reward                      -0.265354
round_time             0 days 00:14:02.800266
episodes_test                          1010.0
episode_length_test                       9.9
returns_test                        -2.457139
return_std_test                      2.812635
average_reward_test                 -0.248092
round_time_test        0 days 00:00:10.085785
round_time_total       0 days 00:14:02.801352
loss_total              1820800575985326080.0
loss_critic             2276000680528087808.0
loss_actor                    -5644845734.912
memory_size                            5820.0 

=== epoch 5/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.844335
returns                             -2.598793
return_std                           2.762764
average_reward                      -0.264017
round_time             0 days 00:14:01.942967
episodes_test                          1010.0
episode_length_test                  9.893069
returns_test                        -2.528443
return_std_test                      2.788006
average_reward_test                  -0.25586
round_time_test        0 days 00:00:09.983449
round_time_total       0 days 00:14:01.944038
loss_total              1825432024522671872.0
loss_critic             2281789991912734720.0
loss_actor                    -5674046041.344
memory_size                            5820.0 

=== epoch 5/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1016
episode_length                       9.837598
returns                             -2.586754
return_std                           2.714516
average_reward                      -0.263047
round_time             0 days 00:14:02.938588
episodes_test                          1017.0
episode_length_test                  9.832842
returns_test                        -2.427635
return_std_test                       2.70488
average_reward_test                 -0.246891
round_time_test        0 days 00:00:10.050413
round_time_total       0 days 00:14:02.939667
loss_total              1842170993271965184.0
loss_critic             2302713701522206720.0
loss_actor                    -5725915365.376
memory_size                            5820.0 

=== epoch 5/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1013
episode_length                       9.863771
returns                             -2.440281
return_std                           2.713873
average_reward                      -0.247664
round_time             0 days 00:14:02.881872
episodes_test                          1020.0
episode_length_test                   9.80098
returns_test                        -2.449394
return_std_test                      2.693212
average_reward_test                 -0.249792
round_time_test        0 days 00:00:10.120855
round_time_total       0 days 00:14:02.882945
loss_total              1844619377552719360.0
loss_critic             2305774183531006720.0
loss_actor                    -5732334043.648
memory_size                            5820.0 

=== epoch 5/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.853057
returns                             -2.442318
return_std                           2.687814
average_reward                      -0.247659
round_time             0 days 00:14:02.275004
episodes_test                          1026.0
episode_length_test                  9.738791
returns_test                        -2.445816
return_std_test                       2.78597
average_reward_test                 -0.251193
round_time_test        0 days 00:00:10.210879
round_time_total       0 days 00:14:02.276074
loss_total              1813641064256800512.0
loss_critic             2267051290828776448.0
loss_actor                     -5719030676.48
memory_size                            5820.0 

=== epoch 5/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                        9.87636
returns                             -2.405489
return_std                           2.712021
average_reward                      -0.244161
round_time             0 days 00:14:02.354231
episodes_test                          1024.0
episode_length_test                  9.763672
returns_test                        -2.432228
return_std_test                      2.649155
average_reward_test                 -0.248829
round_time_test        0 days 00:00:10.090289
round_time_total       0 days 00:14:02.355314
loss_total              1861403951553251840.0
loss_critic             2326754898626490880.0
loss_actor                    -5818388921.088
memory_size                            5820.0 

=== epoch 5/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1013
episode_length                       9.868707
returns                             -2.391745
return_std                           2.693382
average_reward                      -0.241986
round_time             0 days 00:14:01.462084
episodes_test                          1021.0
episode_length_test                  9.785504
returns_test                          -2.7906
return_std_test                      2.841697
average_reward_test                  -0.28491
round_time_test        0 days 00:00:10.079509
round_time_total       0 days 00:14:01.463183
loss_total              1938353081233254144.0
loss_critic             2422941310846752768.0
loss_actor                      -6004522662.4
memory_size                            5820.0 

=== epoch 5/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.826942
returns                             -2.475948
return_std                           2.681682
average_reward                      -0.251952
round_time             0 days 00:14:01.599594
episodes_test                          1008.0
episode_length_test                  9.919643
returns_test                        -2.632972
return_std_test                      2.803949
average_reward_test                 -0.265319
round_time_test        0 days 00:00:10.029700
round_time_total       0 days 00:14:01.600695
loss_total              2048521863526931712.0
loss_critic             2560652282842629120.0
loss_actor                      -6175627206.4
memory_size                            5820.0 

=== epoch 5/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.823992
returns                             -2.570636
return_std                           2.730586
average_reward                      -0.261631
round_time             0 days 00:14:00.936671
episodes_test                          1016.0
episode_length_test                   9.83563
returns_test                        -2.730133
return_std_test                      2.799286
average_reward_test                 -0.276913
round_time_test        0 days 00:00:09.965461
round_time_total       0 days 00:14:00.937748
loss_total              2143778520141991424.0
loss_critic             2679723104504806912.0
loss_actor                    -6307729089.792
memory_size                            5820.0 

=== epoch 5/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.850099
returns                             -2.570962
return_std                           2.709719
average_reward                      -0.261317
round_time             0 days 00:14:01.390516
episodes_test                          1012.0
episode_length_test                   9.87253
returns_test                        -2.784389
return_std_test                      2.793004
average_reward_test                 -0.281733
round_time_test        0 days 00:00:10.143860
round_time_total       0 days 00:14:01.391598
loss_total              2211571757717133568.0
loss_critic             2764464651263281152.0
loss_actor                      -6394534860.8
memory_size                            5820.0 

=== epoch 5/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1012
episode_length                        9.87747
returns                              -2.62332
return_std                           2.742275
average_reward                      -0.265284
round_time             0 days 00:14:00.686507
episodes_test                          1011.0
episode_length_test                  9.885262
returns_test                         -2.88111
return_std_test                      2.880645
average_reward_test                 -0.290765
round_time_test        0 days 00:00:09.917808
round_time_total       0 days 00:14:00.687578
loss_total              2312860418891214336.0
loss_critic             2891075470807395328.0
loss_actor                    -6498329500.416
memory_size                            5820.0 

=== epoch 5/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1004
episode_length                       9.943227
returns                              -2.63603
return_std                            2.84606
average_reward                      -0.264971
round_time             0 days 00:14:01.457645
episodes_test                          1001.0
episode_length_test                   9.98002
returns_test                        -2.729493
return_std_test                      2.919754
average_reward_test                 -0.273121
round_time_test        0 days 00:00:09.876639
round_time_total       0 days 00:14:01.458723
loss_total              2366637117402660352.0
loss_critic             2958296346386243584.0
loss_actor                    -6635338333.184
memory_size                            5820.0 

=== epoch 5/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  999
episode_length                      10.001001
returns                              -2.53677
return_std                           2.916426
average_reward                      -0.253936
round_time             0 days 00:14:01.961799
episodes_test                          1011.0
episode_length_test                  9.890208
returns_test                        -2.755699
return_std_test                      2.753742
average_reward_test                 -0.278514
round_time_test        0 days 00:00:10.087247
round_time_total       0 days 00:14:01.962865
loss_total              2444066953669405184.0
loss_critic             3055083641384667648.0
loss_actor                    -6752392895.488
memory_size                            5820.0 

=== epoch 5/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                  996
episode_length                      10.026104
returns                              -2.52485
return_std                           2.938088
average_reward                      -0.252237
round_time             0 days 00:14:00.695656
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.812762
return_std_test                      2.864302
average_reward_test                 -0.285142
round_time_test        0 days 00:00:09.969972
round_time_total       0 days 00:14:00.696733
loss_total              2549844978097379840.0
loss_critic             3187306168475072000.0
loss_actor                    -6808192112.128
memory_size                            5820.0 

=== epoch 5/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1002
episode_length                       9.978044
returns                             -2.653761
return_std                            2.96599
average_reward                      -0.266175
round_time             0 days 00:14:02.483310
episodes_test                          1017.0
episode_length_test                  9.825959
returns_test                        -2.941775
return_std_test                      2.802237
average_reward_test                 -0.298931
round_time_test        0 days 00:00:09.972318
round_time_total       0 days 00:14:02.484366
loss_total              2551104949926672384.0
loss_critic             3188881132707637248.0
loss_actor                    -6852485951.744
memory_size                            5820.0 

=== epoch 5/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1004
episode_length                       9.949203
returns                             -2.700368
return_std                           2.979707
average_reward                      -0.271077
round_time             0 days 00:14:01.807323
episodes_test                          1023.0
episode_length_test                  9.765396
returns_test                        -2.778349
return_std_test                      2.811297
average_reward_test                 -0.284483
round_time_test        0 days 00:00:10.195785
round_time_total       0 days 00:14:01.808440
loss_total              2585564925907873792.0
loss_critic             3231956098350516736.0
loss_actor                    -6960042166.272
memory_size                            5820.0 

=== epoch 5/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.879447
returns                             -2.774025
return_std                           2.947073
average_reward                      -0.280828
round_time             0 days 00:14:01.999137
episodes_test                          1014.0
episode_length_test                  9.860947
returns_test                        -2.615341
return_std_test                      2.825061
average_reward_test                 -0.265102
round_time_test        0 days 00:00:09.989233
round_time_total       0 days 00:14:02.000220
loss_total              2705369485540788736.0
loss_critic             3381711798565970432.0
loss_actor                    -7107932207.872
memory_size                            5820.0 

=== epoch 5/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1013
episode_length                       9.863771
returns                             -2.859131
return_std                           2.936779
average_reward                        -0.2901
round_time             0 days 00:14:01.733639
episodes_test                          1014.0
episode_length_test                   9.85503
returns_test                        -3.030587
return_std_test                      2.829348
average_reward_test                 -0.307113
round_time_test        0 days 00:00:10.114590
round_time_total       0 days 00:14:01.734722
loss_total              2784045699515578880.0
loss_critic             3480057064947831296.0
loss_actor                    -7234315085.056
memory_size                            5820.0 

=== epoch 5/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1016
episode_length                        9.83563
returns                             -2.914171
return_std                           2.900792
average_reward                      -0.296222
round_time             0 days 00:14:01.098885
episodes_test                          1010.0
episode_length_test                   9.89703
returns_test                        -2.719456
return_std_test                      2.771299
average_reward_test                 -0.274425
round_time_test        0 days 00:00:09.940006
round_time_total       0 days 00:14:01.099948
loss_total              2900390128044799488.0
loss_critic             3625487593836193280.0
loss_actor                    -7375233221.632
memory_size                            5820.0 

=== epoch 5/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.874506
returns                             -2.822833
return_std                           2.875779
average_reward                      -0.285581
round_time             0 days 00:14:02.137688
episodes_test                          1015.0
episode_length_test                   9.84532
returns_test                        -2.748269
return_std_test                      2.857308
average_reward_test                 -0.279143
round_time_test        0 days 00:00:10.149386
round_time_total       0 days 00:14:02.138773
loss_total              2938418213287576576.0
loss_critic             3673022706303834624.0
loss_actor                    -7467358872.064
memory_size                            5820.0 

=== epoch 5/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.846305
returns                             -2.886286
return_std                            2.86659
average_reward                       -0.29328
round_time             0 days 00:14:02.485320
episodes_test                          1024.0
episode_length_test                  9.765625
returns_test                        -2.764982
return_std_test                      2.710187
average_reward_test                 -0.283134
round_time_test        0 days 00:00:10.002648
round_time_total       0 days 00:14:02.486424
loss_total              3012556842194438144.0
loss_critic             3765695989224776192.0
loss_actor                      -7482309734.4
memory_size                            5820.0 

=== epoch 5/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1020
episode_length                       9.803922
returns                             -2.950006
return_std                           2.857484
average_reward                      -0.300901
round_time             0 days 00:14:02.621336
episodes_test                          1015.0
episode_length_test                  9.847291
returns_test                        -2.768074
return_std_test                      2.811267
average_reward_test                 -0.281253
round_time_test        0 days 00:00:10.351942
round_time_total       0 days 00:14:02.622387
loss_total              3064298346967793152.0
loss_critic             3830372865926567424.0
loss_actor                     -7609178702.08
memory_size                            5820.0 

=== epoch 5/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1019
episode_length                       9.797841
returns                             -2.917416
return_std                           2.832734
average_reward                      -0.297251
round_time             0 days 00:14:01.312223
episodes_test                          1010.0
episode_length_test                   9.90099
returns_test                        -2.766558
return_std_test                      2.877163
average_reward_test                 -0.279422
round_time_test        0 days 00:00:10.203750
round_time_total       0 days 00:14:01.313278
loss_total              3168801094089738240.0
loss_critic             3961001304944304640.0
loss_actor                    -7745961223.424
memory_size                            5820.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  204
episode_length                       9.779412
returns                             -2.924899
return_std                             2.9682
average_reward                      -0.298947
round_time             0 days 00:14:01.677201
episodes_test                          1008.0
episode_length_test                  9.918651
returns_test                        -2.659194
return_std_test                      2.742267
average_reward_test                 -0.267904
round_time_test        0 days 00:00:09.997374
round_time_total       0 days 00:14:01.678305
loss_total              3254041462679296512.0
loss_critic             4067551753367581696.0
loss_actor                    -7891299127.552
memory_size                            5820.0 

=== epoch 6/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                  412
episode_length                       9.706311
returns                             -2.948806
return_std                            2.91626
average_reward                      -0.303551
round_time             0 days 00:14:00.661852
episodes_test                          1019.0
episode_length_test                  9.812561
returns_test                        -2.836244
return_std_test                      2.760045
average_reward_test                  -0.28895
round_time_test        0 days 00:00:10.129385
round_time_total       0 days 00:14:00.662943
loss_total              3448986021108968960.0
loss_critic             4311232453148428800.0
loss_actor                    -8058993276.416
memory_size                            5820.0 

=== epoch 6/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                  614
episode_length                       9.762215
returns                             -2.894126
return_std                           2.839896
average_reward                      -0.295772
round_time             0 days 00:14:02.507112
episodes_test                          1012.0
episode_length_test                  9.875494
returns_test                        -2.619285
return_std_test                      2.918133
average_reward_test                 -0.264635
round_time_test        0 days 00:00:10.121139
round_time_total       0 days 00:14:02.508245
loss_total              3564510605132309504.0
loss_critic             4455638178547630080.0
loss_actor                    -8194071147.776
memory_size                            5820.0 

=== epoch 6/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                  816
episode_length                       9.791667
returns                             -2.832249
return_std                           2.768546
average_reward                      -0.288749
round_time             0 days 00:14:01.358280
episodes_test                          1009.0
episode_length_test                  9.902874
returns_test                        -2.655198
return_std_test                      2.859671
average_reward_test                 -0.267721
round_time_test        0 days 00:00:09.978541
round_time_total       0 days 00:14:01.359346
loss_total              3665570163370613760.0
loss_critic             4581962633655544320.0
loss_actor                    -8316175609.856
memory_size                            5820.0 

=== epoch 6/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1020
episode_length                       9.795098
returns                             -2.865631
return_std                           2.762776
average_reward                      -0.292974
round_time             0 days 00:14:01.781817
episodes_test                          1018.0
episode_length_test                   9.81336
returns_test                        -2.792308
return_std_test                      2.866353
average_reward_test                 -0.283924
round_time_test        0 days 00:00:10.079854
round_time_total       0 days 00:14:01.782904
loss_total              3739939487443528704.0
loss_critic             4674924283635677184.0
loss_actor                    -8424653252.352
memory_size                            5820.0 

=== epoch 6/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.825959
returns                             -2.746813
return_std                           2.712034
average_reward                      -0.279649
round_time             0 days 00:14:01.423272
episodes_test                          1019.0
episode_length_test                  9.810599
returns_test                        -2.766358
return_std_test                      2.605557
average_reward_test                 -0.281692
round_time_test        0 days 00:00:10.237998
round_time_total       0 days 00:14:01.424350
loss_total              3885277511463708160.0
loss_critic             4856596803164001280.0
loss_actor                    -8563712274.176
memory_size                            5820.0 

=== epoch 6/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1013
episode_length                       9.859822
returns                             -2.643022
return_std                           2.647433
average_reward                      -0.268902
round_time             0 days 00:14:01.762328
episodes_test                          1021.0
episode_length_test                  9.790402
returns_test                        -2.531594
return_std_test                        2.7874
average_reward_test                 -0.258254
round_time_test        0 days 00:00:10.042666
round_time_total       0 days 00:14:01.763414
loss_total              3979086009319643136.0
loss_critic             4973857426368683008.0
loss_actor                    -8604951726.336
memory_size                            5820.0 

=== epoch 6/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.840394
returns                             -2.686626
return_std                           2.692552
average_reward                      -0.273131
round_time             0 days 00:14:01.486234
episodes_test                          1025.0
episode_length_test                  9.755122
returns_test                         -2.84527
return_std_test                      2.801869
average_reward_test                 -0.291557
round_time_test        0 days 00:00:10.122798
round_time_total       0 days 00:14:01.487308
loss_total              4070831333976116736.0
loss_critic             5088539083580845056.0
loss_actor                    -8744512674.816
memory_size                            5820.0 

=== epoch 6/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.825959
returns                             -2.747501
return_std                           2.694125
average_reward                      -0.279325
round_time             0 days 00:14:01.619304
episodes_test                          1012.0
episode_length_test                  9.881423
returns_test                        -2.888894
return_std_test                      2.828165
average_reward_test                 -0.292356
round_time_test        0 days 00:00:10.016946
round_time_total       0 days 00:14:01.620379
loss_total              4148074784708213248.0
loss_critic             5185093391653025792.0
loss_actor                    -8901012481.024
memory_size                            5820.0 

=== epoch 6/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.853057
returns                             -2.702418
return_std                           2.733442
average_reward                      -0.273879
round_time             0 days 00:14:01.059328
episodes_test                          1005.0
episode_length_test                  9.944279
returns_test                        -2.700649
return_std_test                      2.954248
average_reward_test                 -0.271447
round_time_test        0 days 00:00:10.024816
round_time_total       0 days 00:14:01.060414
loss_total              4369011394841340416.0
loss_critic             5461264145600651264.0
loss_actor                 -9097492147.200001
memory_size                            5820.0 

=== epoch 6/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.821042
returns                               -2.8456
return_std                           2.797976
average_reward                      -0.289445
round_time             0 days 00:14:02.176675
episodes_test                          1014.0
episode_length_test                  9.860947
returns_test                        -2.758169
return_std_test                      2.862706
average_reward_test                 -0.279579
round_time_test        0 days 00:00:09.937151
round_time_total       0 days 00:14:02.177761
loss_total              4469600418418567168.0
loss_critic             5587000424178831360.0
loss_actor                    -9298366470.656
memory_size                            5820.0 

=== epoch 6/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1018
episode_length                       9.817289
returns                             -2.978995
return_std                           2.829148
average_reward                      -0.303914
round_time             0 days 00:14:01.272347
episodes_test                          1019.0
episode_length_test                  9.810599
returns_test                        -2.895066
return_std_test                      2.868069
average_reward_test                 -0.294808
round_time_test        0 days 00:00:10.088058
round_time_total       0 days 00:14:01.273433
loss_total              4637893635364062208.0
loss_critic             5797366937896047616.0
loss_actor                 -9474896797.695999
memory_size                            5820.0 

=== epoch 6/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.827925
returns                             -2.961934
return_std                           2.857724
average_reward                      -0.301205
round_time             0 days 00:14:02.433636
episodes_test                          1016.0
episode_length_test                  9.838583
returns_test                        -2.805509
return_std_test                      2.962055
average_reward_test                 -0.284758
round_time_test        0 days 00:00:10.171913
round_time_total       0 days 00:14:02.434760
loss_total              4817847741707072512.0
loss_critic             6022309566108936192.0
loss_actor                 -9583376099.327999
memory_size                            5820.0 

=== epoch 6/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.874506
returns                             -2.871025
return_std                           2.934454
average_reward                      -0.290874
round_time             0 days 00:14:00.112179
episodes_test                          1013.0
episode_length_test                  9.863771
returns_test                        -2.813433
return_std_test                      2.819405
average_reward_test                 -0.284854
round_time_test        0 days 00:00:09.955506
round_time_total       0 days 00:14:00.113258
loss_total              4966497257530480640.0
loss_critic             6208121467468085248.0
loss_actor                     -9746659338.24
memory_size                            5820.0 

=== epoch 6/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                 1013
episode_length                        9.86772
returns                             -2.804948
return_std                           2.913936
average_reward                      -0.284279
round_time             0 days 00:14:00.448940
episodes_test                          1010.0
episode_length_test                   9.89802
returns_test                        -2.632412
return_std_test                      2.774688
average_reward_test                 -0.265608
round_time_test        0 days 00:00:09.865028
round_time_total       0 days 00:14:00.450016
loss_total              5151683632602734592.0
loss_critic             6439604428105016320.0
loss_actor                 -9879708566.016001
memory_size                            5820.0 

=== epoch 6/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.869565
returns                             -2.718278
return_std                           2.838091
average_reward                      -0.276069
round_time             0 days 00:14:01.153552
episodes_test                          1016.0
episode_length_test                  9.838583
returns_test                        -2.754596
return_std_test                      2.851744
average_reward_test                 -0.279656
round_time_test        0 days 00:00:10.039591
round_time_total       0 days 00:14:01.154633
loss_total              5270439712773629952.0
loss_critic             6588049529598535680.0
loss_actor                -10023644947.455999
memory_size                            5820.0 

=== epoch 6/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.877349
returns                             -2.620833
return_std                           2.850964
average_reward                      -0.265613
round_time             0 days 00:14:02.279941
episodes_test                          1023.0
episode_length_test                  9.775171
returns_test                        -2.763244
return_std_test                      2.890404
average_reward_test                  -0.28268
round_time_test        0 days 00:00:10.091701
round_time_total       0 days 00:14:02.281019
loss_total              5396152049795352576.0
loss_critic             6745189945558519808.0
loss_actor                    -10145698247.68
memory_size                            5820.0 

=== epoch 6/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1010
episode_length                       9.889109
returns                             -2.672447
return_std                           2.821352
average_reward                      -0.269621
round_time             0 days 00:14:02.673071
episodes_test                          1016.0
episode_length_test                  9.838583
returns_test                        -2.779429
return_std_test                      2.830676
average_reward_test                 -0.282016
round_time_test        0 days 00:00:10.040834
round_time_total       0 days 00:14:02.674147
loss_total              5450203736336704512.0
loss_critic             6812754556475398144.0
loss_actor                -10207273441.280001
memory_size                            5820.0 

=== epoch 6/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.869565
returns                              -2.73834
return_std                           2.800058
average_reward                       -0.27771
round_time             0 days 00:14:02.096717
episodes_test                          1014.0
episode_length_test                  9.856016
returns_test                        -2.698029
return_std_test                      2.865869
average_reward_test                 -0.273458
round_time_test        0 days 00:00:10.032326
round_time_total       0 days 00:14:02.097791
loss_total              5556499879024830464.0
loss_critic             6945624725257778176.0
loss_actor                   -10289783763.968
memory_size                            5820.0 

=== epoch 6/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.870553
returns                             -2.807616
return_std                           2.782824
average_reward                      -0.284241
round_time             0 days 00:14:02.336867
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.904343
return_std_test                       2.84595
average_reward_test                   -0.2943
round_time_test        0 days 00:00:10.145912
round_time_total       0 days 00:14:02.337938
loss_total              5730304443015810048.0
loss_critic             7162880433553628160.0
loss_actor                   -10382653169.664
memory_size                            5820.0 

=== epoch 6/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.882295
returns                             -2.809949
return_std                           2.817584
average_reward                      -0.284566
round_time             0 days 00:14:01.822855
episodes_test                          1028.0
episode_length_test                  9.726654
returns_test                         -3.00839
return_std_test                      2.783634
average_reward_test                 -0.309159
round_time_test        0 days 00:00:10.130119
round_time_total       0 days 00:14:01.823925
loss_total              5809098138382099456.0
loss_critic             7261372552598281216.0
loss_actor                -10494886468.096001
memory_size                            5820.0 

=== epoch 6/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.871542
returns                              -2.84364
return_std                           2.807996
average_reward                      -0.288127
round_time             0 days 00:14:02.330615
episodes_test                          1019.0
episode_length_test                   9.81158
returns_test                        -2.705468
return_std_test                       2.83447
average_reward_test                 -0.275558
round_time_test        0 days 00:00:10.083334
round_time_total       0 days 00:14:02.331719
loss_total              6036669979616063488.0
loss_critic             7545837347234427904.0
loss_actor                -10649180323.327999
memory_size                            5820.0 

=== epoch 6/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.867589
returns                             -2.779665
return_std                           2.851672
average_reward                      -0.281729
round_time             0 days 00:14:00.641011
episodes_test                          1016.0
episode_length_test                  9.837598
returns_test                        -2.695272
return_std_test                      2.822213
average_reward_test                 -0.273463
round_time_test        0 days 00:00:09.983826
round_time_total       0 days 00:14:00.642127
loss_total              6278973991745448960.0
loss_critic             7848717352294397952.0
loss_actor                   -10828559208.448
memory_size                            5820.0 

=== epoch 6/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.854043
returns                             -2.720535
return_std                           2.850388
average_reward                      -0.275987
round_time             0 days 00:14:02.586057
episodes_test                          1006.0
episode_length_test                   9.93837
returns_test                        -2.639583
return_std_test                      2.803395
average_reward_test                 -0.265405
round_time_test        0 days 00:00:10.007259
round_time_total       0 days 00:14:02.587126
loss_total              6442956355960183808.0
loss_critic             8053695298990061568.0
loss_actor                -11027061314.559999
memory_size                            5820.0 

=== epoch 6/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.880317
returns                             -2.570744
return_std                           2.916825
average_reward                      -0.260132
round_time             0 days 00:14:00.367776
episodes_test                          1008.0
episode_length_test                  9.916667
returns_test                        -2.506017
return_std_test                      2.924615
average_reward_test                 -0.252835
round_time_test        0 days 00:00:10.054314
round_time_total       0 days 00:14:00.368853
loss_total              6494911212907550720.0
loss_critic             8118638873713321984.0
loss_actor                   -11174036080.128
memory_size                            5820.0 

=== epoch 6/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1009
episode_length                       9.901883
returns                             -2.465452
return_std                           2.896818
average_reward                      -0.249115
round_time             0 days 00:14:01.534922
episodes_test                          1006.0
episode_length_test                  9.934394
returns_test                        -2.498474
return_std_test                      2.747792
average_reward_test                 -0.250934
round_time_test        0 days 00:00:10.077069
round_time_total       0 days 00:14:01.535997
loss_total              6678697802028585984.0
loss_critic             8348372108929206272.0
loss_actor                   -11294739323.392
memory_size                            5820.0 

=== epoch 6/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1003
episode_length                       9.957129
returns                             -2.421188
return_std                            2.91079
average_reward                      -0.243451
round_time             0 days 00:14:01.078023
episodes_test                          1005.0
episode_length_test                  9.950249
returns_test                        -2.764316
return_std_test                      2.911704
average_reward_test                 -0.277814
round_time_test        0 days 00:00:10.161351
round_time_total       0 days 00:14:01.079115
loss_total              6970157015021541376.0
loss_critic             8712696115137355776.0
loss_actor                   -11416710424.576
memory_size                            5820.0 

=== epoch 6/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1003
episode_length                       9.961117
returns                              -2.45554
return_std                            2.85818
average_reward                      -0.246821
round_time             0 days 00:14:01.378849
episodes_test                          1021.0
episode_length_test                  9.786484
returns_test                        -2.816875
return_std_test                      2.774566
average_reward_test                 -0.287097
round_time_test        0 days 00:00:10.080707
round_time_total       0 days 00:14:01.379914
loss_total              6963295548854849536.0
loss_critic             8704119286861398016.0
loss_actor                -11596457423.360001
memory_size                            5820.0 

=== epoch 6/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                  997
episode_length                      10.016048
returns                             -2.446029
return_std                           2.863542
average_reward                      -0.243982
round_time             0 days 00:14:00.210699
episodes_test                          1004.0
episode_length_test                  9.954183
returns_test                        -2.563301
return_std_test                      2.752217
average_reward_test                 -0.256849
round_time_test        0 days 00:00:10.112360
round_time_total       0 days 00:14:00.211773
loss_total              7335086717927457792.0
loss_critic             9168858246707509248.0
loss_actor                -11786032228.351999
memory_size                            5820.0 

=== epoch 6/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  998
episode_length                       10.01002
returns                             -2.521048
return_std                           2.787104
average_reward                      -0.251839
round_time             0 days 00:14:02.398937
episodes_test                          1019.0
episode_length_test                  9.804711
returns_test                        -2.910261
return_std_test                      2.820018
average_reward_test                 -0.296094
round_time_test        0 days 00:00:09.954729
round_time_total       0 days 00:14:02.400013
loss_total              7447496543942063104.0
loss_critic             9309370518041671680.0
loss_actor                -11952989597.695999
memory_size                            5820.0 

=== epoch 6/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  998
episode_length                      10.014028
returns                             -2.597452
return_std                           2.787397
average_reward                      -0.259153
round_time             0 days 00:14:02.070039
episodes_test                          1008.0
episode_length_test                  9.919643
returns_test                        -2.636068
return_std_test                       2.87505
average_reward_test                   -0.2656
round_time_test        0 days 00:00:10.032224
round_time_total       0 days 00:14:02.071105
loss_total              7709203012213414912.0
loss_critic             9636503601508255744.0
loss_actor                   -12076323954.688
memory_size                            5820.0 

=== epoch 6/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                  999
episode_length                      10.003003
returns                             -2.656901
return_std                           2.827684
average_reward                      -0.265402
round_time             0 days 00:14:02.981990
episodes_test                          1009.0
episode_length_test                  9.909812
returns_test                        -2.824242
return_std_test                      2.858344
average_reward_test                 -0.284867
round_time_test        0 days 00:00:10.051778
round_time_total       0 days 00:14:02.983078
loss_total              7853273940410767360.0
loss_critic             9816592258765649920.0
loss_actor                -12196488686.591999
memory_size                            5820.0 

=== epoch 6/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                  997
episode_length                      10.023069
returns                             -2.632095
return_std                           2.883014
average_reward                      -0.262887
round_time             0 days 00:14:02.993552
episodes_test                          1020.0
episode_length_test                   9.80098
returns_test                        -2.863386
return_std_test                      2.899236
average_reward_test                 -0.291812
round_time_test        0 days 00:00:09.886897
round_time_total       0 days 00:14:02.994632
loss_total              8153948396407516160.0
loss_critic            10192435324037120000.0
loss_actor                   -12310572386.816
memory_size                            5820.0 

=== epoch 6/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1003
episode_length                       9.967099
returns                             -2.644558
return_std                           2.891634
average_reward                      -0.265414
round_time             0 days 00:14:02.716357
episodes_test                          1022.0
episode_length_test                  9.774951
returns_test                        -2.804276
return_std_test                      2.738379
average_reward_test                 -0.286228
round_time_test        0 days 00:00:10.161699
round_time_total       0 days 00:14:02.717431
loss_total              8376706325458742272.0
loss_critic            10470882726555060224.0
loss_actor                    -12573711580.16
memory_size                            5820.0 

=== epoch 6/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1006
episode_length                       9.923459
returns                             -2.688778
return_std                           2.916227
average_reward                      -0.270839
round_time             0 days 00:14:02.912870
episodes_test                          1008.0
episode_length_test                  9.918651
returns_test                        -2.705475
return_std_test                      2.730156
average_reward_test                 -0.272603
round_time_test        0 days 00:00:09.983583
round_time_total       0 days 00:14:02.913942
loss_total              8373242625099785216.0
loss_critic            10466553098941700096.0
loss_actor                -12776125427.200001
memory_size                            5820.0 

=== epoch 6/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1007
episode_length                       9.917577
returns                             -2.738442
return_std                           2.942332
average_reward                      -0.276028
round_time             0 days 00:14:00.767655
episodes_test                          1014.0
episode_length_test                  9.854043
returns_test                        -2.768627
return_std_test                      2.776882
average_reward_test                 -0.281022
round_time_test        0 days 00:00:10.080815
round_time_total       0 days 00:14:00.768732
loss_total              8608180641529097216.0
loss_critic            10760225615166193664.0
loss_actor                     -12913732953.6
memory_size                            5820.0 

=== epoch 6/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1010
episode_length                       9.892079
returns                             -2.649848
return_std                           2.851223
average_reward                      -0.267912
round_time             0 days 00:14:01.114523
episodes_test                          1014.0
episode_length_test                  9.860947
returns_test                        -2.967548
return_std_test                      2.830863
average_reward_test                 -0.300806
round_time_test        0 days 00:00:10.047238
round_time_total       0 days 00:14:01.115594
loss_total              8846790689176390656.0
loss_critic            11058488163901992960.0
loss_actor                -13093936443.903999
memory_size                            5820.0 

=== epoch 6/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1015
episode_length                       9.839409
returns                             -2.633959
return_std                           2.809772
average_reward                      -0.268414
round_time             0 days 00:14:02.793692
episodes_test                          1019.0
episode_length_test                  9.809617
returns_test                        -2.732257
return_std_test                      2.811795
average_reward_test                 -0.277952
round_time_test        0 days 00:00:09.975990
round_time_total       0 days 00:14:02.794764
loss_total              9218172762545335296.0
loss_critic            11522715756901664768.0
loss_actor                   -13282781830.656
memory_size                            5820.0 

=== epoch 6/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.828909
returns                             -2.651473
return_std                           2.844834
average_reward                      -0.269281
round_time             0 days 00:14:02.342425
episodes_test                          1009.0
episode_length_test                  9.909812
returns_test                        -2.645199
return_std_test                      2.808422
average_reward_test                 -0.266816
round_time_test        0 days 00:00:10.181480
round_time_total       0 days 00:14:02.343508
loss_total              9574225806126835712.0
loss_critic            11967782051293956096.0
loss_actor                   -13432727210.496
memory_size                            5820.0 

=== epoch 6/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.830875
returns                             -2.693565
return_std                           2.852433
average_reward                       -0.27383
round_time             0 days 00:14:01.559368
episodes_test                          1021.0
episode_length_test                  9.785504
returns_test                        -2.680482
return_std_test                       2.81526
average_reward_test                 -0.273341
round_time_test        0 days 00:00:09.988032
round_time_total       0 days 00:14:01.560426
loss_total              9658954154502348800.0
loss_critic            12073692479564982272.0
loss_actor                -13601501847.040001
memory_size                            5820.0 

=== epoch 6/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1019
episode_length                       9.805692
returns                             -2.673083
return_std                            2.83535
average_reward                      -0.272412
round_time             0 days 00:14:01.007195
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.684629
return_std_test                       2.84241
average_reward_test                 -0.271949
round_time_test        0 days 00:00:09.990256
round_time_total       0 days 00:14:01.008259
loss_total              9946814758159130624.0
loss_critic            12433518230253309952.0
loss_actor                   -13759661252.608
memory_size                            5820.0 

=== epoch 6/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1013
episode_length                       9.852912
returns                             -2.655501
return_std                           2.835687
average_reward                       -0.26913
round_time             0 days 00:14:01.988972
episodes_test                          1018.0
episode_length_test                  9.823183
returns_test                         -2.69011
return_std_test                      2.720255
average_reward_test                 -0.273853
round_time_test        0 days 00:00:09.924612
round_time_total       0 days 00:14:01.990030
loss_total             10096907890830927872.0
loss_critic            12621134644598405120.0
loss_actor                -13952697370.624001
memory_size                            5820.0 

=== epoch 6/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.854043
returns                             -2.707299
return_std                           2.893722
average_reward                      -0.274307
round_time             0 days 00:14:01.274179
episodes_test                          1015.0
episode_length_test                  9.852217
returns_test                        -2.857383
return_std_test                      2.674794
average_reward_test                 -0.290024
round_time_test        0 days 00:00:10.046568
round_time_total       0 days 00:14:01.275240
loss_total             10457040791329423360.0
loss_critic            13071300774155716608.0
loss_actor                   -14140252568.576
memory_size                            5820.0 

=== epoch 6/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1014
episode_length                        9.85503
returns                             -2.775804
return_std                           2.838613
average_reward                      -0.281764
round_time             0 days 00:14:01.674760
episodes_test                          1011.0
episode_length_test                  9.884273
returns_test                        -2.643221
return_std_test                       2.76984
average_reward_test                 -0.266997
round_time_test        0 days 00:00:10.041493
round_time_total       0 days 00:14:01.675824
loss_total             10610047230417227776.0
loss_critic            13262558821520824320.0
loss_actor                   -14308510466.048
memory_size                            5820.0 

=== epoch 6/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.873393
returns                             -2.671085
return_std                           2.855415
average_reward                      -0.270188
round_time             0 days 00:14:00.937965
episodes_test                          1014.0
episode_length_test                   9.85503
returns_test                        -2.698233
return_std_test                      2.896277
average_reward_test                 -0.273477
round_time_test        0 days 00:00:10.029204
round_time_total       0 days 00:14:00.939055
loss_total             10828650054380834816.0
loss_critic            13535812344758001664.0
loss_actor                   -14473786352.128
memory_size                            5820.0 

=== epoch 6/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1012
episode_length                       9.868577
returns                             -2.687949
return_std                           2.871233
average_reward                      -0.272551
round_time             0 days 00:14:01.112612
episodes_test                          1015.0
episode_length_test                  9.852217
returns_test                        -2.538773
return_std_test                      2.757407
average_reward_test                 -0.257685
round_time_test        0 days 00:00:10.113895
round_time_total       0 days 00:14:01.113679
loss_total             11186836235732371456.0
loss_critic            13983545055384246272.0
loss_actor                   -14606606485.504
memory_size                            5820.0 

=== epoch 6/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1018
episode_length                       9.821218
returns                             -2.645214
return_std                           2.895595
average_reward                      -0.269297
round_time             0 days 00:14:01.319450
episodes_test                          1015.0
episode_length_test                  9.844335
returns_test                        -2.829724
return_std_test                      2.770104
average_reward_test                 -0.286743
round_time_test        0 days 00:00:10.073690
round_time_total       0 days 00:14:01.320506
loss_total             11507651443540858880.0
loss_critic            14384564070023938048.0
loss_actor                -14726801325.568001
memory_size                            5820.0 

=== epoch 6/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1016
episode_length                       9.832677
returns                             -2.603188
return_std                           2.844348
average_reward                      -0.264758
round_time             0 days 00:14:02.417523
episodes_test                          1016.0
episode_length_test                  9.836614
returns_test                        -2.760302
return_std_test                      2.766037
average_reward_test                 -0.279874
round_time_test        0 days 00:00:10.105070
round_time_total       0 days 00:14:02.418586
loss_total             11654327352966117376.0
loss_critic            14567908938526132224.0
loss_actor                   -14895973768.704
memory_size                            5820.0 

=== epoch 6/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                 1019
episode_length                       9.803729
returns                             -2.570569
return_std                           2.839134
average_reward                       -0.26194
round_time             0 days 00:14:00.336352
episodes_test                          1023.0
episode_length_test                  9.767351
returns_test                        -2.612241
return_std_test                      2.874438
average_reward_test                 -0.266887
round_time_test        0 days 00:00:10.227960
round_time_total       0 days 00:14:00.337402
loss_total             11697284083966134272.0
loss_critic            14621604855918284800.0
loss_actor                -15022632167.936001
memory_size                            5820.0 

=== epoch 6/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1020
episode_length                        9.80098
returns                             -2.588631
return_std                           2.785118
average_reward                      -0.264089
round_time             0 days 00:14:00.952243
episodes_test                          1016.0
episode_length_test                  9.841535
returns_test                         -2.53633
return_std_test                      2.726798
average_reward_test                 -0.257608
round_time_test        0 days 00:00:09.885466
round_time_total       0 days 00:14:00.953294
loss_total             12099108232625379328.0
loss_critic            15123885021607532544.0
loss_actor                   -15058647831.552
memory_size                            5820.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                  203
episode_length                       9.807882
returns                             -2.849405
return_std                           2.808767
average_reward                      -0.291314
round_time             0 days 00:14:00.882110
episodes_test                          1023.0
episode_length_test                  9.767351
returns_test                        -2.656436
return_std_test                      2.827053
average_reward_test                 -0.272229
round_time_test        0 days 00:00:09.985457
round_time_total       0 days 00:14:00.883218
loss_total             12203082700814319616.0
loss_critic            15253853122872528896.0
loss_actor                -15187538636.799999
memory_size                            5820.0 

=== epoch 7/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                  409
episode_length                       9.765281
returns                             -2.924998
return_std                           2.867017
average_reward                      -0.297577
round_time             0 days 00:14:02.609974
episodes_test                          1021.0
episode_length_test                   9.79236
returns_test                        -2.791112
return_std_test                      2.763815
average_reward_test                 -0.284857
round_time_test        0 days 00:00:10.095574
round_time_total       0 days 00:14:02.611051
loss_total             12291280799955120128.0
loss_critic            15364100736799842304.0
loss_actor                -15401825775.615999
memory_size                            5820.0 

=== epoch 7/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  612
episode_length                       9.802288
returns                             -2.793291
return_std                           2.814353
average_reward                      -0.284746
round_time             0 days 00:14:01.574486
episodes_test                          1022.0
episode_length_test                  9.782779
returns_test                         -2.70011
return_std_test                      2.736087
average_reward_test                 -0.275896
round_time_test        0 days 00:00:09.994001
round_time_total       0 days 00:14:01.575562
loss_total             12735422309685817344.0
loss_critic            15919277618826434560.0
loss_actor                   -15510861253.632
memory_size                            5820.0 

=== epoch 7/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                  818
episode_length                       9.771394
returns                             -2.857793
return_std                           2.811435
average_reward                      -0.292325
round_time             0 days 00:14:02.488077
episodes_test                          1024.0
episode_length_test                  9.763672
returns_test                        -2.631983
return_std_test                      2.667367
average_reward_test                 -0.269351
round_time_test        0 days 00:00:10.146213
round_time_total       0 days 00:14:02.489169
loss_total             12918773386855297024.0
loss_critic            16148466440480552960.0
loss_actor                    -15661798794.24
memory_size                            5820.0 

=== epoch 7/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1021
episode_length                       9.789422
returns                             -2.857648
return_std                           2.818781
average_reward                      -0.291498
round_time             0 days 00:14:03.588839
episodes_test                          1020.0
episode_length_test                  9.801961
returns_test                        -2.746815
return_std_test                      2.866843
average_reward_test                  -0.28005
round_time_test        0 days 00:00:10.128038
round_time_total       0 days 00:14:03.590137
loss_total             13514655618982150144.0
loss_critic            16893319239022897152.0
loss_actor                   -15883650760.704
memory_size                            5820.0 

=== epoch 7/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1022
episode_length                       9.777886
returns                             -2.781218
return_std                           2.808668
average_reward                      -0.284537
round_time             0 days 00:14:01.432231
episodes_test                          1023.0
episode_length_test                  9.773216
returns_test                           -2.762
return_std_test                      2.678938
average_reward_test                 -0.282396
round_time_test        0 days 00:00:10.053387
round_time_total       0 days 00:14:01.433334
loss_total             13759633360678361088.0
loss_critic            17199541422568429568.0
loss_actor                -15954333176.832001
memory_size                            5820.0 

=== epoch 7/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1016
episode_length                       9.832677
returns                             -2.628201
return_std                           2.801776
average_reward                      -0.267191
round_time             0 days 00:14:01.884132
episodes_test                          1018.0
episode_length_test                  9.820236
returns_test                        -2.734387
return_std_test                      2.847174
average_reward_test                 -0.278232
round_time_test        0 days 00:00:10.200502
round_time_total       0 days 00:14:01.885247
loss_total             13730690234815520768.0
loss_critic            17163362496582541312.0
loss_actor                -16041922385.408001
memory_size                            5820.0 

=== epoch 7/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:05,  2.36it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1014
episode_length                       9.850099
returns                             -2.725391
return_std                            2.84843
average_reward                      -0.276149
round_time             0 days 00:14:02.219615
episodes_test                          1013.0
episode_length_test                  9.866732
returns_test                        -2.553172
return_std_test                      2.847551
average_reward_test                 -0.258513
round_time_test        0 days 00:00:10.060366
round_time_total       0 days 00:14:02.220715
loss_total             14156904499169511424.0
loss_critic            17696130305515833344.0
loss_actor                   -16364577486.336
memory_size                            5820.0 

=== epoch 7/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.882295
returns                             -2.592159
return_std                           2.808771
average_reward                      -0.262265
round_time             0 days 00:14:00.854278
episodes_test                          1014.0
episode_length_test                   9.85503
returns_test                        -2.574036
return_std_test                      2.833592
average_reward_test                 -0.261336
round_time_test        0 days 00:00:10.054625
round_time_total       0 days 00:14:00.855481
loss_total             14468277145786316800.0
loss_critic            18085346126328145920.0
loss_actor                   -16731947546.112
memory_size                            5820.0 

=== epoch 7/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:48,  2.25it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1012
episode_length                       9.875494
returns                             -2.561073
return_std                           2.772765
average_reward                      -0.259702
round_time             0 days 00:14:02.951806
episodes_test                          1004.0
episode_length_test                  9.953187
returns_test                        -2.662559
return_std_test                      2.821509
average_reward_test                 -0.266795
round_time_test        0 days 00:00:10.255175
round_time_total       0 days 00:14:02.952878
loss_total             14977211240424468480.0
loss_critic            18721513728785993728.0
loss_actor                   -17028193657.344
memory_size                            5820.0 

=== epoch 7/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:01,  2.37it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1012
episode_length                        9.87253
returns                             -2.638394
return_std                           2.773724
average_reward                      -0.267971
round_time             0 days 00:14:01.989026
episodes_test                          1013.0
episode_length_test                  9.865745
returns_test                        -2.510419
return_std_test                      2.841919
average_reward_test                  -0.25404
round_time_test        0 days 00:00:09.880365
round_time_total       0 days 00:14:01.990094
loss_total             15497624726151591936.0
loss_critic            19372030571513806848.0
loss_actor                -17263582935.040001
memory_size                            5820.0 

=== epoch 7/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:27,  2.30it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1019
episode_length                       9.800785
returns                             -2.750036
return_std                           2.731252
average_reward                      -0.280733
round_time             0 days 00:14:02.523680
episodes_test                          1014.0
episode_length_test                  9.861933
returns_test                        -2.677246
return_std_test                      2.807787
average_reward_test                 -0.271473
round_time_test        0 days 00:00:10.161099
round_time_total       0 days 00:14:02.524748
loss_total             16181895600722753536.0
loss_critic            20227369169263247360.0
loss_actor                -17472830782.976002
memory_size                            5820.0 

=== epoch 7/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1023
episode_length                       9.766373
returns                             -2.658759
return_std                           2.709919
average_reward                      -0.272394
round_time             0 days 00:14:02.594260
episodes_test                          1029.0
episode_length_test                  9.712342
returns_test                        -2.666406
return_std_test                      2.769221
average_reward_test                 -0.273938
round_time_test        0 days 00:00:10.143550
round_time_total       0 days 00:14:02.595361
loss_total             16411903006584725504.0
loss_critic            20514878397728534528.0
loss_actor                   -17721232753.152
memory_size                            5820.0 

=== epoch 7/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:54,  2.23it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1023
episode_length                       9.773216
returns                             -2.699032
return_std                           2.757409
average_reward                      -0.276255
round_time             0 days 00:14:01.791566
episodes_test                          1010.0
episode_length_test                   9.89901
returns_test                        -2.517695
return_std_test                      2.779581
average_reward_test                 -0.254138
round_time_test        0 days 00:00:09.988829
round_time_total       0 days 00:14:01.792638
loss_total             16351025596697600000.0
loss_critic            20438781634063953920.0
loss_actor                -17729003286.015999
memory_size                            5820.0 

=== epoch 7/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:50,  2.10it/s]
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1025
episode_length                       9.752195
returns                             -2.738792
return_std                           2.756463
average_reward                       -0.28086
round_time             0 days 00:14:01.070732
episodes_test                          1015.0
episode_length_test                  9.852217
returns_test                        -2.419231
return_std_test                      2.699191
average_reward_test                 -0.245552
round_time_test        0 days 00:00:10.057650
round_time_total       0 days 00:14:01.071821
loss_total             16750746447232874496.0
loss_critic            20938432706785054720.0
loss_actor                -17802144300.543999
memory_size                            5820.0 

=== epoch 7/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1022
episode_length                       9.782779
returns                             -2.676011
return_std                           2.739531
average_reward                      -0.273558
round_time             0 days 00:14:02.386020
episodes_test                          1016.0
episode_length_test                  9.834646
returns_test                        -2.643302
return_std_test                      2.732463
average_reward_test                 -0.268222
round_time_test        0 days 00:00:10.031915
round_time_total       0 days 00:14:02.387098
loss_total             17170068912780580864.0
loss_critic            21462585767210491904.0
loss_actor                -18020667802.624001
memory_size                            5820.0 

=== epoch 7/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:10,  2.19it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1019
episode_length                       9.804711
returns                             -2.642032
return_std                           2.708815
average_reward                      -0.269176
round_time             0 days 00:14:02.648216
episodes_test                          1002.0
episode_length_test                  9.979042
returns_test                        -2.411071
return_std_test                      2.773583
average_reward_test                  -0.24149
round_time_test        0 days 00:00:09.962066
round_time_total       0 days 00:14:02.649292
loss_total             17188523696345872384.0
loss_critic            21485654259070971904.0
loss_actor                -18189122329.088001
memory_size                            5820.0 

=== epoch 7/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1019
episode_length                       9.801766
returns                             -2.655805
return_std                           2.722801
average_reward                      -0.270694
round_time             0 days 00:14:01.974721
episodes_test                          1008.0
episode_length_test                  9.912698
returns_test                        -2.455705
return_std_test                      2.855255
average_reward_test                 -0.246981
round_time_test        0 days 00:00:09.965798
round_time_total       0 days 00:14:01.975800
loss_total             17221855041147449344.0
loss_critic            21527318429627584512.0
loss_actor                   -18108991410.688
memory_size                            5820.0 

=== epoch 7/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:13,  2.34it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1020
episode_length                       9.803922
returns                             -2.633035
return_std                           2.681738
average_reward                       -0.26857
round_time             0 days 00:14:01.866820
episodes_test                          1015.0
episode_length_test                  9.850246
returns_test                         -2.44088
return_std_test                      2.650645
average_reward_test                 -0.247579
round_time_test        0 days 00:00:10.105755
round_time_total       0 days 00:14:01.867869
loss_total             17315554748753086464.0
loss_critic            21644443061901246464.0
loss_actor                   -18164635019.264
memory_size                            5820.0 

=== epoch 7/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1016
episode_length                       9.834646
returns                             -2.550337
return_std                            2.69004
average_reward                      -0.259227
round_time             0 days 00:14:03.131105
episodes_test                          1014.0
episode_length_test                  9.852071
returns_test                        -2.637865
return_std_test                      2.748621
average_reward_test                 -0.267609
round_time_test        0 days 00:00:10.058174
round_time_total       0 days 00:14:03.132183
loss_total             17604960637250187264.0
loss_critic            22006200413245493248.0
loss_actor                -18496075760.127998
memory_size                            5820.0 

=== epoch 7/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:54,  2.39it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1015
episode_length                       9.835468
returns                             -2.526458
return_std                            2.70629
average_reward                      -0.257084
round_time             0 days 00:14:03.446058
episodes_test                          1003.0
episode_length_test                  9.963111
returns_test                        -2.509835
return_std_test                      2.788344
average_reward_test                 -0.251183
round_time_test        0 days 00:00:10.067992
round_time_total       0 days 00:14:03.447155
loss_total             18443087182560524288.0
loss_critic            23053858594093142016.0
loss_actor                -18827360028.672001
memory_size                            5820.0 

=== epoch 7/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:30,  2.15it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1015
episode_length                       9.844335
returns                             -2.558785
return_std                           2.740237
average_reward                      -0.259349
round_time             0 days 00:14:02.947568
episodes_test                          1006.0
episode_length_test                  9.937376
returns_test                        -2.507321
return_std_test                      2.880306
average_reward_test                 -0.252023
round_time_test        0 days 00:00:09.927439
round_time_total       0 days 00:14:02.948641
loss_total             18930737538863239168.0
loss_critic            23663421512430419968.0
loss_actor                -19132141386.751999
memory_size                            5820.0 

=== epoch 7/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:37,  2.44it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1016
episode_length                        9.83563
returns                             -2.558252
return_std                           2.697737
average_reward                      -0.260168
round_time             0 days 00:14:01.508439
episodes_test                          1010.0
episode_length_test                  9.892079
returns_test                        -2.526654
return_std_test                      2.717975
average_reward_test                 -0.255696
round_time_test        0 days 00:00:09.953118
round_time_total       0 days 00:14:01.509510
loss_total             19181257762406977536.0
loss_critic            23976571753377185792.0
loss_actor                -19313652837.375999
memory_size                            5820.0 

=== epoch 7/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1016
episode_length                       9.836614
returns                             -2.526645
return_std                           2.754028
average_reward                      -0.256966
round_time             0 days 00:14:02.113136
episodes_test                          1011.0
episode_length_test                  9.886251
returns_test                        -2.529067
return_std_test                      2.716309
average_reward_test                 -0.255807
round_time_test        0 days 00:00:10.091803
round_time_total       0 days 00:14:02.114252
loss_total             19448656522978009088.0
loss_critic            24310820221270843392.0
loss_actor                   -19329230850.048
memory_size                            5820.0 

=== epoch 7/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:55,  2.23it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.840394
returns                             -2.549565
return_std                           2.789616
average_reward                      -0.258955
round_time             0 days 00:14:01.966518
episodes_test                          1008.0
episode_length_test                  9.912698
returns_test                        -2.579763
return_std_test                      2.681056
average_reward_test                 -0.260168
round_time_test        0 days 00:00:10.153979
round_time_total       0 days 00:14:01.967614
loss_total             19906656089042550784.0
loss_critic            24883319684417798144.0
loss_actor                   -19614586100.736
memory_size                            5820.0 

=== epoch 7/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:49,  2.41it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1017
episode_length                       9.829892
returns                              -2.55512
return_std                           2.804281
average_reward                      -0.259867
round_time             0 days 00:14:02.874743
episodes_test                          1005.0
episode_length_test                  9.950249
returns_test                        -2.529198
return_std_test                       2.86476
average_reward_test                 -0.254184
round_time_test        0 days 00:00:10.106191
round_time_total       0 days 00:14:02.875826
loss_total             20222709868069146624.0
loss_critic            25278386902566043648.0
loss_actor                -19886313282.560001
memory_size                            5820.0 

=== epoch 7/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1014
episode_length                       9.852071
returns                             -2.509643
return_std                           2.842747
average_reward                      -0.254774
round_time             0 days 00:14:02.608273
episodes_test                          1012.0
episode_length_test                  9.880435
returns_test                        -2.516163
return_std_test                      2.812755
average_reward_test                 -0.254559
round_time_test        0 days 00:00:10.044559
round_time_total       0 days 00:14:02.609357
loss_total             20470127823953162240.0
loss_critic            25587659337113141248.0
loss_actor                -20047713599.487999
memory_size                            5820.0 

=== epoch 7/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:23,  2.31it/s]
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1011
episode_length                       9.873393
returns                             -2.461333
return_std                           2.833335
average_reward                      -0.249561
round_time             0 days 00:14:01.525419
episodes_test                          1007.0
episode_length_test                  9.924528
returns_test                        -2.361586
return_std_test                       2.78815
average_reward_test                 -0.238068
round_time_test        0 days 00:00:09.980623
round_time_total       0 days 00:14:01.526493
loss_total             21072099409310113792.0
loss_critic            26340123821008359424.0
loss_actor                -20242782696.448002
memory_size                            5820.0 

=== epoch 7/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:31,  2.29it/s]
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.838424
returns                              -2.48365
return_std                            2.76747
average_reward                       -0.25232
round_time             0 days 00:14:01.008707
episodes_test                          1012.0
episode_length_test                  9.876482
returns_test                        -2.392382
return_std_test                      2.800402
average_reward_test                 -0.242324
round_time_test        0 days 00:00:10.045783
round_time_total       0 days 00:14:01.009767
loss_total             21196819770136715264.0
loss_critic            26496024262730117120.0
loss_actor                -20440878703.616001
memory_size                            5820.0 

=== epoch 7/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]
100%|██████████| 2000/2000 [14:02<00:00,  2.38it/s]
episodes                                 1017
episode_length                       9.824975
returns                             -2.488423
return_std                           2.728879
average_reward                      -0.253222
round_time             0 days 00:14:02.660597
episodes_test                          1011.0
episode_length_test                  9.890208
returns_test                        -2.452551
return_std_test                      2.714599
average_reward_test                 -0.247858
round_time_test        0 days 00:00:09.986236
round_time_total       0 days 00:14:02.661678
loss_total             21920286042499100672.0
loss_critic            27400357095452160000.0
loss_actor                   -20646581669.888
memory_size                            5820.0 

=== epoch 7/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:43,  2.26it/s]
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                 1018
episode_length                       9.817289
returns                             -2.404993
return_std                           2.693557
average_reward                      -0.245263
round_time             0 days 00:14:03.715045
episodes_test                          1002.0
episode_length_test                  9.976048
returns_test                        -2.238919
return_std_test                      2.754853
average_reward_test                 -0.224242
round_time_test        0 days 00:00:10.051953
round_time_total       0 days 00:14:03.716120
loss_total             22517291938263584768.0
loss_critic            28146614415233064960.0
loss_actor                -21022836204.543999
memory_size                            5820.0 

=== epoch 7/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                 1018
episode_length                       9.819253
returns                             -2.378021
return_std                           2.665587
average_reward                      -0.242242
round_time             0 days 00:14:02.702827
episodes_test                          1006.0
episode_length_test                  9.936382
returns_test                        -2.455969
return_std_test                      2.883882
average_reward_test                 -0.246846
round_time_test        0 days 00:00:09.920138
round_time_total       0 days 00:14:02.703895
loss_total             23286516482410151936.0
loss_critic            29108145065488945152.0
loss_actor                   -21162615235.584
memory_size                            5820.0 

=== epoch 7/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1015
episode_length                       9.849261
returns                              -2.36872
return_std                           2.664422
average_reward                      -0.240288
round_time             0 days 00:14:01.865659
episodes_test                          1010.0
episode_length_test                   9.89802
returns_test                        -2.399687
return_std_test                      2.712324
average_reward_test                 -0.242138
round_time_test        0 days 00:00:09.923482
round_time_total       0 days 00:14:01.866734
loss_total             23404038860266385408.0
loss_critic            29255048083438968832.0
loss_actor                -21326683552.768002
memory_size                            5820.0 

=== epoch 7/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                 1010
episode_length                        9.89802
returns                             -2.392573
return_std                           2.725445
average_reward                      -0.241711
round_time             0 days 00:14:01.562264
episodes_test                          1002.0
episode_length_test                  9.976048
returns_test                        -2.550034
return_std_test                      2.842317
average_reward_test                 -0.255488
round_time_test        0 days 00:00:10.088706
round_time_total       0 days 00:14:01.563325
loss_total             24247960744037437440.0
loss_critic            30309950397470851072.0
loss_actor                   -21824035209.216
memory_size                            5820.0 

=== epoch 7/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                 1006
episode_length                       9.927435
returns                             -2.358824
return_std                           2.740695
average_reward                      -0.237989
round_time             0 days 00:14:00.094959
episodes_test                          1012.0
episode_length_test                  9.881423
returns_test                        -2.533443
return_std_test                      2.791484
average_reward_test                 -0.256384
round_time_test        0 days 00:00:10.138204
round_time_total       0 days 00:14:00.096041
loss_total             25084515920243355648.0
loss_critic            31355644336254726144.0
loss_actor                   -22117133477.888
memory_size                            5820.0 

=== epoch 7/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:04<00:00,  2.55it/s]
episodes                                 1005
episode_length                       9.942289
returns                             -2.449158
return_std                           2.781173
average_reward                      -0.246261
round_time             0 days 00:13:04.672678
episodes_test                          1011.0
episode_length_test                  9.888229
returns_test                        -2.295115
return_std_test                      2.736491
average_reward_test                 -0.231973
round_time_test        0 days 00:00:09.993982
round_time_total       0 days 00:13:04.673729
loss_total             25689314483311104000.0
loss_critic            32111642572868608000.0
loss_actor                -22377951809.535999
memory_size                            5820.0 

=== epoch 7/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                 1005
episode_length                       9.943284
returns                             -2.537949
return_std                           2.814103
average_reward                      -0.254739
round_time             0 days 00:12:28.532103
episodes_test                          1015.0
episode_length_test                  9.852217
returns_test                        -2.523354
return_std_test                      2.741811
average_reward_test                  -0.25612
round_time_test        0 days 00:00:09.986653
round_time_total       0 days 00:12:28.533178
loss_total             26447277105749901312.0
loss_critic            33059095809341820928.0
loss_actor                   -22623407721.472
memory_size                            5820.0 

=== epoch 7/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                 1002
episode_length                       9.966068
returns                             -2.531231
return_std                           2.808718
average_reward                      -0.254238
round_time             0 days 00:12:27.776813
episodes_test                          1005.0
episode_length_test                  9.946269
returns_test                        -2.352204
return_std_test                       2.77226
average_reward_test                 -0.236387
round_time_test        0 days 00:00:09.857605
round_time_total       0 days 00:12:27.777880
loss_total             26894180021135417344.0
loss_critic            33617724452817797120.0
loss_actor                   -22860960709.632
memory_size                            5820.0 

=== epoch 7/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:27<00:00,  2.68it/s]
episodes                                 1006
episode_length                       9.922465
returns                             -2.571161
return_std                           2.811264
average_reward                      -0.258697
round_time             0 days 00:12:27.904956
episodes_test                          1003.0
episode_length_test                  9.963111
returns_test                        -2.616051
return_std_test                      2.768639
average_reward_test                 -0.262497
round_time_test        0 days 00:00:10.036204
round_time_total       0 days 00:12:27.906016
loss_total             27510145378776244224.0
loss_critic            34387681141828653056.0
loss_actor                -23099224965.119999
memory_size                            5820.0 

=== epoch 7/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                 1012
episode_length                       9.879447
returns                             -2.591029
return_std                           2.760796
average_reward                      -0.262237
round_time             0 days 00:12:21.487542
episodes_test                          1011.0
episode_length_test                  9.888229
returns_test                        -2.428616
return_std_test                      2.798279
average_reward_test                 -0.245261
round_time_test        0 days 00:00:10.086468
round_time_total       0 days 00:12:21.488612
loss_total             27779487993716686848.0
loss_critic            34724359390094524416.0
loss_actor                -23227990486.015999
memory_size                            5820.0 

=== epoch 7/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1010
episode_length                        9.90099
returns                             -2.553803
return_std                           2.743092
average_reward                      -0.257934
round_time             0 days 00:12:20.075389
episodes_test                          1013.0
episode_length_test                  9.871668
returns_test                        -2.390265
return_std_test                       2.74584
average_reward_test                 -0.242134
round_time_test        0 days 00:00:09.781623
round_time_total       0 days 00:12:20.076443
loss_total             28128525613871243264.0
loss_critic            35160656412745097216.0
loss_actor                   -23376098737.152
memory_size                            5820.0 

=== epoch 7/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1007
episode_length                       9.924528
returns                             -2.466676
return_std                           2.708921
average_reward                      -0.248481
round_time             0 days 00:12:20.847331
episodes_test                          1012.0
episode_length_test                   9.87253
returns_test                        -2.607319
return_std_test                      2.813458
average_reward_test                 -0.264477
round_time_test        0 days 00:00:09.903007
round_time_total       0 days 00:12:20.848415
loss_total             28259120294102437888.0
loss_critic            35323899783993528320.0
loss_actor                -23657024750.591999
memory_size                            5820.0 

=== epoch 7/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:27<00:00,  2.67it/s]
episodes                                 1010
episode_length                        9.89802
returns                              -2.45095
return_std                           2.727314
average_reward                      -0.247449
round_time             0 days 00:12:28.303086
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -2.619501
return_std_test                      2.811712
average_reward_test                  -0.26527
round_time_test        0 days 00:00:09.840938
round_time_total       0 days 00:12:28.304141
loss_total             28932789918214189056.0
loss_critic            36165986773348212736.0
loss_actor                -23903370511.360001
memory_size                            5820.0 

=== epoch 7/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1009
episode_length                       9.905847
returns                             -2.455615
return_std                           2.723495
average_reward                      -0.247808
round_time             0 days 00:12:20.821495
episodes_test                          1012.0
episode_length_test                  9.880435
returns_test                        -2.610878
return_std_test                      2.911166
average_reward_test                  -0.26411
round_time_test        0 days 00:00:10.024156
round_time_total       0 days 00:12:20.822552
loss_total             29701814216554328064.0
loss_critic            37127267103014477824.0
loss_actor                -23971455389.695999
memory_size                            5820.0 

=== epoch 7/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                 1008
episode_length                       9.911706
returns                              -2.46699
return_std                           2.752569
average_reward                      -0.248731
round_time             0 days 00:12:26.411204
episodes_test                          1003.0
episode_length_test                  9.966102
returns_test                        -2.509129
return_std_test                      2.765647
average_reward_test                 -0.251351
round_time_test        0 days 00:00:09.901245
round_time_total       0 days 00:12:26.412252
loss_total             29856148707152654336.0
loss_critic            37320185226570301440.0
loss_actor                   -23976711969.792
memory_size                            5820.0 

=== epoch 7/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                 1011
episode_length                       9.885262
returns                             -2.465541
return_std                           2.731663
average_reward                       -0.24934
round_time             0 days 00:12:23.415721
episodes_test                          1008.0
episode_length_test                   9.91369
returns_test                        -2.487179
return_std_test                      2.823167
average_reward_test                 -0.250134
round_time_test        0 days 00:00:10.033775
round_time_total       0 days 00:12:23.416806
loss_total             29868983956744810496.0
loss_critic            37336229296119644160.0
loss_actor                   -23982292852.736
memory_size                            5820.0 

=== epoch 7/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                 1014
episode_length                       9.854043
returns                              -2.53159
return_std                           2.752429
average_reward                      -0.257319
round_time             0 days 00:12:24.863610
episodes_test                          1015.0
episode_length_test                  9.850246
returns_test                        -2.399274
return_std_test                      2.759706
average_reward_test                 -0.243339
round_time_test        0 days 00:00:09.905345
round_time_total       0 days 00:12:24.864689
loss_total             30302020381236629504.0
loss_critic            37877524801789239296.0
loss_actor                -24054090527.743999
memory_size                            5820.0 

=== epoch 7/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1015
episode_length                       9.836453
returns                             -2.583476
return_std                           2.770746
average_reward                      -0.262131
round_time             0 days 00:12:23.803036
episodes_test                          1013.0
episode_length_test                  9.871668
returns_test                        -2.434137
return_std_test                        2.6839
average_reward_test                 -0.246578
round_time_test        0 days 00:00:09.935684
round_time_total       0 days 00:12:23.804075
loss_total             30519456434865639424.0
loss_critic            38149319906827378688.0
loss_actor                -24175925196.799999
memory_size                            5820.0 

=== epoch 7/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                 1015
episode_length                       9.842365
returns                             -2.541907
return_std                           2.762896
average_reward                      -0.258355
round_time             0 days 00:14:11.974585
episodes_test                          1008.0
episode_length_test                  9.914683
returns_test                        -2.499935
return_std_test                      2.890724
average_reward_test                 -0.251955
round_time_test        0 days 00:00:10.116211
round_time_total       0 days 00:14:11.975672
loss_total             30637186949998133248.0
loss_critic            38296483007655878656.0
loss_actor                   -24328357454.848
memory_size                            5820.0 

=== epoch 7/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:29<00:00,  2.67it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                 1010
episode_length                        9.89802
returns                             -2.508088
return_std                           2.796117
average_reward                      -0.253177
round_time             0 days 00:12:29.540864
episodes_test                          1003.0
episode_length_test                  9.962114
returns_test                        -2.625723
return_std_test                      2.964514
average_reward_test                 -0.262825
round_time_test        0 days 00:00:09.967130
round_time_total       0 days 00:12:29.541925
loss_total             31227148569295134720.0
loss_critic            39033935054866874368.0
loss_actor                -24723549070.335999
memory_size                            5820.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [14:59<00:00,  2.22it/s]
episodes                                  201
episode_length                       9.935323
returns                             -2.282906
return_std                           2.871165
average_reward                      -0.228701
round_time             0 days 00:14:59.736574
episodes_test                          1009.0
episode_length_test                  9.909812
returns_test                        -2.562922
return_std_test                      2.874785
average_reward_test                 -0.258516
round_time_test        0 days 00:00:09.869804
round_time_total       0 days 00:14:59.737907
loss_total             32146106893484105728.0
loss_critic            40182632910075314176.0
loss_actor                -25121484068.863998
memory_size                            5820.0 

=== epoch 8/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:49<00:00,  2.11it/s]
episodes                                  404
episode_length                       9.891089
returns                             -2.293171
return_std                           2.837156
average_reward                      -0.231199
round_time             0 days 00:15:49.794296
episodes_test                          1014.0
episode_length_test                  9.854043
returns_test                        -2.430144
return_std_test                       2.85293
average_reward_test                 -0.246978
round_time_test        0 days 00:00:12.111194
round_time_total       0 days 00:15:49.795359
loss_total             33178775842153242624.0
loss_critic            41473469086703329280.0
loss_actor                -25504547293.183998
memory_size                            5820.0 

=== epoch 8/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                  605
episode_length                        9.91405
returns                             -2.314028
return_std                           2.838238
average_reward                      -0.233105
round_time             0 days 00:12:25.070478
episodes_test                          1015.0
episode_length_test                  9.846305
returns_test                        -2.611147
return_std_test                      2.752064
average_reward_test                 -0.265221
round_time_test        0 days 00:00:09.882516
round_time_total       0 days 00:12:25.071549
loss_total             34315982269781479424.0
loss_critic            42894977144946843648.0
loss_actor                -25829606943.743999
memory_size                            5820.0 

=== epoch 8/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  812
episode_length                       9.843596
returns                             -2.377521
return_std                           2.776221
average_reward                      -0.241505
round_time             0 days 00:12:21.239594
episodes_test                          1016.0
episode_length_test                  9.841535
returns_test                         -2.40853
return_std_test                      2.824329
average_reward_test                 -0.244624
round_time_test        0 days 00:00:10.067605
round_time_total       0 days 00:12:21.240666
loss_total             35275063857117413376.0
loss_critic            44093829069949288448.0
loss_actor                -26200651182.080002
memory_size                            5820.0 

=== epoch 8/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1014
episode_length                       9.854043
returns                             -2.366536
return_std                           2.774834
average_reward                      -0.239665
round_time             0 days 00:12:24.120297
episodes_test                          1012.0
episode_length_test                  9.879447
returns_test                        -2.392294
return_std_test                      2.738587
average_reward_test                 -0.241927
round_time_test        0 days 00:00:09.923461
round_time_total       0 days 00:12:24.121359
loss_total             35092659347309260800.0
loss_critic            43865823445677080576.0
loss_actor                -26300102948.863998
memory_size                            5820.0 

=== epoch 8/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:44,  2.61it/s]
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                 1016
episode_length                       9.834646
returns                             -2.396418
return_std                             2.7234
average_reward                      -0.243643
round_time             0 days 00:12:23.177044
episodes_test                          1006.0
episode_length_test                  9.932406
returns_test                        -2.454469
return_std_test                      2.834119
average_reward_test                 -0.247386
round_time_test        0 days 00:00:09.931716
round_time_total       0 days 00:12:23.178109
loss_total             36254272747794763776.0
loss_critic            45317840157732331520.0
loss_actor                   -26631927469.056
memory_size                            5820.0 

=== epoch 8/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:00,  2.56it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1015
episode_length                       9.841379
returns                             -2.400545
return_std                           2.667843
average_reward                      -0.244183
round_time             0 days 00:12:24.385023
episodes_test                          1014.0
episode_length_test                  9.856016
returns_test                        -2.585944
return_std_test                      2.825454
average_reward_test                 -0.262236
round_time_test        0 days 00:00:10.150140
round_time_total       0 days 00:12:24.386107
loss_total             38164536251653824512.0
loss_critic            47705669479900512256.0
loss_actor                    -27002872939.52
memory_size                            5820.0 

=== epoch 8/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:53,  2.58it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1018
episode_length                       9.814342
returns                             -2.477977
return_std                           2.651852
average_reward                      -0.252626
round_time             0 days 00:12:20.541625
episodes_test                          1009.0
episode_length_test                  9.904856
returns_test                        -2.540281
return_std_test                      2.794015
average_reward_test                 -0.256436
round_time_test        0 days 00:00:09.903650
round_time_total       0 days 00:12:20.542694
loss_total             38628097049400238080.0
loss_critic            48285120501135351808.0
loss_actor                -27306594335.743999
memory_size                            5820.0 

=== epoch 8/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:04,  2.54it/s]
100%|██████████| 2000/2000 [17:06<00:00,  1.95it/s]
episodes                                 1010
episode_length                        9.89604
returns                              -2.45123
return_std                           2.715829
average_reward                      -0.247755
round_time             0 days 00:17:06.571471
episodes_test                          1001.0
episode_length_test                  9.985015
returns_test                        -2.565224
return_std_test                      2.817197
average_reward_test                 -0.256602
round_time_test        0 days 00:00:10.119226
round_time_total       0 days 00:17:06.572733
loss_total             39000081691356372992.0
loss_critic            48750101283239550976.0
loss_actor                -27592022839.296001
memory_size                            5820.0 

=== epoch 8/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:10,  1.74it/s]
100%|██████████| 2000/2000 [19:16<00:00,  1.73it/s]
episodes                                 1008
episode_length                       9.916667
returns                             -2.460404
return_std                           2.687589
average_reward                      -0.248306
round_time             0 days 00:19:16.729546
episodes_test                          1004.0
episode_length_test                  9.956175
returns_test                          -2.3615
return_std_test                      2.823628
average_reward_test                 -0.236884
round_time_test        0 days 00:00:11.198315
round_time_total       0 days 00:19:16.730754
loss_total             39739683322007699456.0
loss_critic            49674603305473351680.0
loss_actor                   -27740681101.312
memory_size                            5820.0 

=== epoch 8/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:21,  1.72it/s]
100%|██████████| 2000/2000 [19:25<00:00,  1.72it/s]
episodes                                 1003
episode_length                       9.959123
returns                             -2.404733
return_std                           2.721069
average_reward                      -0.241544
round_time             0 days 00:19:25.607700
episodes_test                           997.0
episode_length_test                 10.024072
returns_test                        -2.295767
return_std_test                       2.82119
average_reward_test                 -0.228499
round_time_test        0 days 00:00:11.088974
round_time_total       0 days 00:19:25.608889
loss_total             40636191995626741760.0
loss_critic            50795239108533215232.0
loss_actor                   -27972764952.576
memory_size                            5820.0 

=== epoch 8/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:28,  1.71it/s]
100%|██████████| 2000/2000 [19:20<00:00,  1.72it/s]
episodes                                 1001
episode_length                       9.982018
returns                             -2.403423
return_std                           2.774392
average_reward                      -0.240917
round_time             0 days 00:19:21.406861
episodes_test                           985.0
episode_length_test                 10.150254
returns_test                        -2.128175
return_std_test                      2.881911
average_reward_test                 -0.209473
round_time_test        0 days 00:00:11.155536
round_time_total       0 days 00:19:21.408117
loss_total             40420689716869177344.0
loss_critic            50525861265789976576.0
loss_actor                -28223830499.327999
memory_size                            5820.0 

=== epoch 8/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:20,  1.72it/s]
100%|██████████| 2000/2000 [19:24<00:00,  1.72it/s]
episodes                                  999
episode_length                       9.996997
returns                             -2.327505
return_std                           2.764226
average_reward                      -0.232307
round_time             0 days 00:19:25.606016
episodes_test                          1001.0
episode_length_test                  9.987013
returns_test                        -2.465015
return_std_test                      2.862973
average_reward_test                 -0.246665
round_time_test        0 days 00:00:11.110031
round_time_total       0 days 00:19:25.607469
loss_total             41672914166368329728.0
loss_critic            52091141831443488768.0
loss_actor                -28370074487.807999
memory_size                            5820.0 

=== epoch 8/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:40,  1.69it/s]
100%|██████████| 2000/2000 [19:07<00:00,  1.74it/s]
episodes                                  999
episode_length                      10.002002
returns                             -2.311624
return_std                           2.819512
average_reward                      -0.231285
round_time             0 days 00:19:08.075171
episodes_test                           999.0
episode_length_test                 10.008008
returns_test                        -2.501806
return_std_test                      2.906766
average_reward_test                 -0.249785
round_time_test        0 days 00:00:11.060774
round_time_total       0 days 00:19:08.076344
loss_total             42274491075154542592.0
loss_critic            52843112937670721536.0
loss_actor                   -28610265050.112
memory_size                            5820.0 

=== epoch 8/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<21:45,  1.53it/s]
100%|██████████| 2000/2000 [19:04<00:00,  1.75it/s]
episodes                                 1000
episode_length                          9.986
returns                             -2.350209
return_std                           2.873723
average_reward                      -0.234836
round_time             0 days 00:19:05.541755
episodes_test                          1009.0
episode_length_test                  9.909812
returns_test                        -2.517755
return_std_test                      2.842027
average_reward_test                 -0.253944
round_time_test        0 days 00:00:11.189326
round_time_total       0 days 00:19:05.542909
loss_total             43167744495340863488.0
loss_critic            53959679722524344320.0
loss_actor                    -29046073876.48
memory_size                            5820.0 

=== epoch 8/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:21,  1.72it/s]
100%|██████████| 2000/2000 [19:03<00:00,  1.75it/s]
episodes                                 1006
episode_length                         9.9334
returns                             -2.439661
return_std                           2.862248
average_reward                      -0.245945
round_time             0 days 00:19:03.708760
episodes_test                          1006.0
episode_length_test                  9.932406
returns_test                        -2.353418
return_std_test                      2.825244
average_reward_test                 -0.236672
round_time_test        0 days 00:00:11.037551
round_time_total       0 days 00:19:03.709916
loss_total             45198985999165734912.0
loss_critic            56498731603198787584.0
loss_actor                -29499323257.855999
memory_size                            5820.0 

=== epoch 8/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:31,  1.71it/s]
100%|██████████| 2000/2000 [19:04<00:00,  1.75it/s]
episodes                                 1005
episode_length                       9.938308
returns                             -2.397159
return_std                            2.86637
average_reward                       -0.24077
round_time             0 days 00:19:05.087451
episodes_test                          1015.0
episode_length_test                  9.849261
returns_test                        -2.479717
return_std_test                      2.754765
average_reward_test                 -0.251472
round_time_test        0 days 00:00:10.974276
round_time_total       0 days 00:19:05.088662
loss_total             46169333316230520832.0
loss_critic            57711665582197850112.0
loss_actor                -29865649723.391998
memory_size                            5820.0 

=== epoch 8/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:22,  1.72it/s]
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                 1007
episode_length                       9.923535
returns                              -2.38669
return_std                           2.890929
average_reward                      -0.240956
round_time             0 days 00:14:15.435175
episodes_test                          1024.0
episode_length_test                  9.761719
returns_test                        -2.442599
return_std_test                      2.768985
average_reward_test                  -0.25019
round_time_test        0 days 00:00:11.306093
round_time_total       0 days 00:14:15.436229
loss_total             46766522923483021312.0
loss_critic            58458152644933378048.0
loss_actor                -30037747982.335999
memory_size                            5820.0 

=== epoch 8/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:33,  2.45it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1014
episode_length                       9.861933
returns                             -2.430531
return_std                           2.796606
average_reward                      -0.246456
round_time             0 days 00:12:24.283691
episodes_test                          1012.0
episode_length_test                  9.880435
returns_test                        -2.376867
return_std_test                      2.728284
average_reward_test                 -0.240438
round_time_test        0 days 00:00:09.872165
round_time_total       0 days 00:12:24.284782
loss_total             47282909007519006720.0
loss_critic            59103635303991869440.0
loss_actor                -30338757093.375999
memory_size                            5820.0 

=== epoch 8/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:54,  2.58it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1016
episode_length                       9.831693
returns                             -2.445308
return_std                           2.759334
average_reward                      -0.248591
round_time             0 days 00:12:24.219508
episodes_test                          1010.0
episode_length_test                  9.893069
returns_test                         -2.51534
return_std_test                      2.827068
average_reward_test                 -0.254869
round_time_test        0 days 00:00:09.981235
round_time_total       0 days 00:12:24.220572
loss_total             47427248776067514368.0
loss_critic            59284059954822840320.0
loss_actor                    -30433050296.32
memory_size                            5820.0 

=== epoch 8/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:48,  2.60it/s]
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                 1015
episode_length                       9.846305
returns                              -2.45701
return_std                           2.798223
average_reward                      -0.249531
round_time             0 days 00:12:25.734231
episodes_test                          1003.0
episode_length_test                  9.968096
returns_test                        -2.432449
return_std_test                       2.85074
average_reward_test                 -0.243801
round_time_test        0 days 00:00:09.884862
round_time_total       0 days 00:12:25.735296
loss_total             47500877784702156800.0
loss_critic            59376096174315741184.0
loss_actor                    -30473085383.68
memory_size                            5820.0 

=== epoch 8/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:07,  2.53it/s]
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                 1016
episode_length                        9.83563
returns                             -2.480955
return_std                            2.79397
average_reward                      -0.252291
round_time             0 days 00:12:24.912697
episodes_test                          1011.0
episode_length_test                  9.885262
returns_test                         -2.57553
return_std_test                      2.859403
average_reward_test                 -0.260292
round_time_test        0 days 00:00:09.890846
round_time_total       0 days 00:12:24.913761
loss_total             48342576145624686592.0
loss_critic            60428219220920254464.0
loss_actor                -30688342904.832001
memory_size                            5820.0 

=== epoch 8/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:56,  2.57it/s]
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                 1012
episode_length                       9.870553
returns                             -2.464324
return_std                           2.791791
average_reward                      -0.249408
round_time             0 days 00:12:23.127953
episodes_test                          1014.0
episode_length_test                  9.854043
returns_test                        -2.425541
return_std_test                       2.79808
average_reward_test                  -0.24547
round_time_test        0 days 00:00:09.950295
round_time_total       0 days 00:12:23.129028
loss_total             48207526462805680128.0
loss_critic            60259407071491891200.0
loss_actor                -30859745768.448002
memory_size                            5820.0 

=== epoch 8/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:03,  2.55it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1014
episode_length                       9.861933
returns                             -2.506025
return_std                           2.806286
average_reward                      -0.254111
round_time             0 days 00:12:24.397661
episodes_test                          1002.0
episode_length_test                  9.972056
returns_test                        -2.382442
return_std_test                      2.731824
average_reward_test                 -0.238619
round_time_test        0 days 00:00:09.820788
round_time_total       0 days 00:12:24.398710
loss_total             49451429237384413184.0
loss_critic            61814285536691642368.0
loss_actor                -31389569888.256001
memory_size                            5820.0 

=== epoch 8/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:55,  2.57it/s]
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                 1010
episode_length                        9.89505
returns                             -2.468442
return_std                            2.81107
average_reward                      -0.249423
round_time             0 days 00:12:26.351355
episodes_test                          1010.0
episode_length_test                   9.89901
returns_test                        -2.370656
return_std_test                      2.841267
average_reward_test                 -0.239224
round_time_test        0 days 00:00:10.125222
round_time_total       0 days 00:12:26.352408
loss_total             51255510871176773632.0
loss_critic            64069387538937364480.0
loss_actor                    -31777909652.48
memory_size                            5820.0 

=== epoch 8/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:49,  2.59it/s]
100%|██████████| 2000/2000 [12:34<00:00,  2.65it/s]
episodes                                 1009
episode_length                        9.89891
returns                              -2.36565
return_std                           2.781561
average_reward                      -0.238786
round_time             0 days 00:12:35.074683
episodes_test                           998.0
episode_length_test                 10.018036
returns_test                        -2.357313
return_std_test                      2.798672
average_reward_test                 -0.235116
round_time_test        0 days 00:00:09.888001
round_time_total       0 days 00:12:35.075742
loss_total             51967575017056411648.0
loss_critic            64959467692149858304.0
loss_actor                   -31739367938.048
memory_size                            5820.0 

=== epoch 8/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:11,  2.52it/s]
100%|██████████| 2000/2000 [12:41<00:00,  2.63it/s]
episodes                                 1011
episode_length                        9.87636
returns                             -2.385236
return_std                           2.750078
average_reward                      -0.241562
round_time             0 days 00:12:41.476632
episodes_test                          1006.0
episode_length_test                  9.930417
returns_test                        -2.329955
return_std_test                      2.701844
average_reward_test                 -0.234076
round_time_test        0 days 00:00:09.842806
round_time_total       0 days 00:12:41.477725
loss_total             51951228112043098112.0
loss_critic            64939034059646255104.0
loss_actor                    -32058435220.48
memory_size                            5820.0 

=== epoch 8/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:15,  2.51it/s]
100%|██████████| 2000/2000 [12:31<00:00,  2.66it/s]
episodes                                 1015
episode_length                        9.84335
returns                             -2.362771
return_std                           2.672004
average_reward                      -0.240591
round_time             0 days 00:12:31.717743
episodes_test                          1013.0
episode_length_test                  9.866732
returns_test                        -2.402476
return_std_test                      2.838882
average_reward_test                 -0.243069
round_time_test        0 days 00:00:09.918165
round_time_total       0 days 00:12:31.718821
loss_total             53409144729275957248.0
loss_critic            66761429765491515392.0
loss_actor                -32186286664.703999
memory_size                            5820.0 

=== epoch 8/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:17,  2.50it/s]
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                 1012
episode_length                       9.876482
returns                             -2.264114
return_std                           2.631929
average_reward                      -0.229236
round_time             0 days 00:12:28.988614
episodes_test                          1017.0
episode_length_test                  9.827925
returns_test                        -2.353135
return_std_test                       2.65328
average_reward_test                 -0.239573
round_time_test        0 days 00:00:09.902114
round_time_total       0 days 00:12:28.989672
loss_total             53473604994284339200.0
loss_critic            66842005174954754048.0
loss_actor                   -32484423356.416
memory_size                            5820.0 

=== epoch 8/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:07,  2.53it/s]
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                 1010
episode_length                       9.893069
returns                             -2.255203
return_std                           2.667913
average_reward                      -0.228057
round_time             0 days 00:12:25.877105
episodes_test                          1026.0
episode_length_test                  9.736842
returns_test                        -2.466374
return_std_test                      2.840012
average_reward_test                 -0.253179
round_time_test        0 days 00:00:10.100000
round_time_total       0 days 00:12:25.878160
loss_total             54455011026128216064.0
loss_critic            68068762601166307328.0
loss_actor                   -32640181562.368
memory_size                            5820.0 

=== epoch 8/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:41,  2.62it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1008
episode_length                       9.910714
returns                             -2.285872
return_std                           2.656746
average_reward                      -0.230501
round_time             0 days 00:12:23.620996
episodes_test                          1020.0
episode_length_test                  9.803922
returns_test                        -2.404836
return_std_test                      2.676947
average_reward_test                 -0.245293
round_time_test        0 days 00:00:10.022911
round_time_total       0 days 00:12:23.622064
loss_total             54453193888824631296.0
loss_critic            68066491151430565888.0
loss_actor                -32815614404.608002
memory_size                            5820.0 

=== epoch 8/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:08,  2.53it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1011
episode_length                       9.881306
returns                             -2.290051
return_std                           2.662249
average_reward                      -0.232194
round_time             0 days 00:12:23.772090
episodes_test                          1017.0
episode_length_test                  9.825959
returns_test                        -2.492561
return_std_test                      2.687861
average_reward_test                 -0.253155
round_time_test        0 days 00:00:09.886633
round_time_total       0 days 00:12:23.773145
loss_total             55618418923598495744.0
loss_critic            69523022501660172288.0
loss_actor                   -33046466537.472
memory_size                            5820.0 

=== epoch 8/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:55,  2.57it/s]
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                 1010
episode_length                       9.889109
returns                             -2.320807
return_std                           2.711137
average_reward                      -0.234485
round_time             0 days 00:12:23.508286
episodes_test                          1010.0
episode_length_test                   9.89505
returns_test                        -2.276498
return_std_test                      2.841609
average_reward_test                 -0.230127
round_time_test        0 days 00:00:10.029448
round_time_total       0 days 00:12:23.509342
loss_total             56260757406012055552.0
loss_critic            70325945520976805888.0
loss_actor                -33726351073.279999
memory_size                            5820.0 

=== epoch 8/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:48,  2.60it/s]
100%|██████████| 2000/2000 [12:22<00:00,  2.70it/s]
episodes                                 1010
episode_length                        9.89604
returns                             -2.327895
return_std                           2.765871
average_reward                      -0.235723
round_time             0 days 00:12:22.576360
episodes_test                          1019.0
episode_length_test                  9.807655
returns_test                        -2.402283
return_std_test                      2.678188
average_reward_test                 -0.244666
round_time_test        0 days 00:00:09.924793
round_time_total       0 days 00:12:22.577416
loss_total             58708765159284129792.0
loss_critic            73385955200059957248.0
loss_actor                -34223654564.863998
memory_size                            5820.0 

=== epoch 8/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:51,  2.59it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1015
episode_length                       9.852217
returns                             -2.362049
return_std                           2.745691
average_reward                      -0.239748
round_time             0 days 00:12:20.098102
episodes_test                          1012.0
episode_length_test                  9.874506
returns_test                        -2.496363
return_std_test                      2.834991
average_reward_test                 -0.252373
round_time_test        0 days 00:00:09.908213
round_time_total       0 days 00:12:20.099159
loss_total             59624449502883446784.0
loss_critic            74530560557266206720.0
loss_actor                -34327715827.712002
memory_size                            5820.0 

=== epoch 8/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:39,  2.63it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1014
episode_length                        9.84714
returns                             -2.351706
return_std                           2.763873
average_reward                      -0.238482
round_time             0 days 00:12:20.903264
episodes_test                          1013.0
episode_length_test                  9.868707
returns_test                        -2.302225
return_std_test                      2.786155
average_reward_test                 -0.232872
round_time_test        0 days 00:00:09.932263
round_time_total       0 days 00:12:20.904308
loss_total             61679524695027499008.0
loss_critic            77099404579606986752.0
loss_actor                -34997444673.536003
memory_size                            5820.0 

=== epoch 8/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:45,  2.61it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1010
episode_length                        9.89505
returns                             -2.384077
return_std                           2.786127
average_reward                       -0.24106
round_time             0 days 00:12:20.160168
episodes_test                          1021.0
episode_length_test                  9.787463
returns_test                        -2.646197
return_std_test                      2.758302
average_reward_test                 -0.269648
round_time_test        0 days 00:00:09.862217
round_time_total       0 days 00:12:20.161235
loss_total             63080103639046496256.0
loss_critic            78850128170295427072.0
loss_actor                -35500958959.615997
memory_size                            5820.0 

=== epoch 8/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:47,  2.60it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1005
episode_length                       9.934328
returns                             -2.397014
return_std                           2.774787
average_reward                      -0.240865
round_time             0 days 00:12:20.709501
episodes_test                          1023.0
episode_length_test                  9.772239
returns_test                        -2.486274
return_std_test                      2.671742
average_reward_test                 -0.254199
round_time_test        0 days 00:00:09.834972
round_time_total       0 days 00:12:20.710547
loss_total             65060908310572539904.0
loss_critic            81326134003655655424.0
loss_actor                -35981231483.903999
memory_size                            5820.0 

=== epoch 8/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:04,  2.55it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                 1007
episode_length                       9.923535
returns                              -2.33443
return_std                             2.7165
average_reward                       -0.23479
round_time             0 days 00:12:19.501942
episodes_test                          1009.0
episode_length_test                  9.908821
returns_test                        -2.489575
return_std_test                      2.707634
average_reward_test                 -0.251044
round_time_test        0 days 00:00:09.912423
round_time_total       0 days 00:12:19.503023
loss_total             67116358471366033408.0
loss_critic            83895446687330222080.0
loss_actor                -36315861813.248001
memory_size                            5820.0 

=== epoch 8/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:47,  2.60it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1005
episode_length                       9.946269
returns                             -2.333432
return_std                           2.738001
average_reward                      -0.234145
round_time             0 days 00:12:20.194494
episodes_test                          1016.0
episode_length_test                  9.837598
returns_test                        -2.555384
return_std_test                       2.77922
average_reward_test                 -0.259328
round_time_test        0 days 00:00:10.150635
round_time_total       0 days 00:12:20.195553
loss_total             67187043056855851008.0
loss_critic            83983802476367085568.0
loss_actor                -36777503157.248001
memory_size                            5820.0 

=== epoch 8/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:25,  2.48it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1003
episode_length                       9.957129
returns                             -2.302552
return_std                           2.704911
average_reward                      -0.231229
round_time             0 days 00:12:20.140166
episodes_test                          1022.0
episode_length_test                   9.77593
returns_test                        -2.547104
return_std_test                      2.672182
average_reward_test                 -0.260349
round_time_test        0 days 00:00:09.976547
round_time_total       0 days 00:12:20.141222
loss_total             70125154916980973568.0
loss_critic            87656442033792417792.0
loss_actor                -37092872617.984001
memory_size                            5820.0 

=== epoch 8/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:45,  2.61it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1004
episode_length                       9.952191
returns                             -2.214809
return_std                           2.682184
average_reward                      -0.223145
round_time             0 days 00:12:20.203845
episodes_test                          1012.0
episode_length_test                  9.879447
returns_test                        -2.299324
return_std_test                      2.655754
average_reward_test                 -0.232531
round_time_test        0 days 00:00:09.921875
round_time_total       0 days 00:12:20.204975
loss_total             70225297355768635392.0
loss_critic            87781620152233410560.0
loss_actor                -37384610624.512001
memory_size                            5820.0 

=== epoch 8/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:58,  2.56it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                 1011
episode_length                       9.882295
returns                             -2.240267
return_std                           2.688913
average_reward                      -0.226356
round_time             0 days 00:12:20.139337
episodes_test                          1014.0
episode_length_test                  9.858974
returns_test                        -2.418819
return_std_test                      2.778387
average_reward_test                 -0.245016
round_time_test        0 days 00:00:09.970877
round_time_total       0 days 00:12:20.140400
loss_total             72340963394558164992.0
loss_critic            90426202642721095680.0
loss_actor                -37694038040.575996
memory_size                            5820.0 

=== epoch 8/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:09,  2.53it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1012
episode_length                        9.87747
returns                             -2.298406
return_std                           2.683645
average_reward                      -0.232799
round_time             0 days 00:12:20.462740
episodes_test                          1011.0
episode_length_test                  9.886251
returns_test                        -2.324475
return_std_test                      2.737024
average_reward_test                  -0.23506
round_time_test        0 days 00:00:09.936101
round_time_total       0 days 00:12:20.463822
loss_total             72242705811720364032.0
loss_critic            90303380678192611328.0
loss_actor                -38063241336.832001
memory_size                            5820.0 

=== epoch 8/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:46,  2.61it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1015
episode_length                        9.84335
returns                             -2.279798
return_std                           2.625832
average_reward                      -0.231807
round_time             0 days 00:12:20.712056
episodes_test                          1011.0
episode_length_test                  9.890208
returns_test                        -2.276567
return_std_test                      2.650899
average_reward_test                 -0.230057
round_time_test        0 days 00:00:09.857554
round_time_total       0 days 00:12:20.713126
loss_total             74063995452298788864.0
loss_critic            92579992772208918528.0
loss_actor                -38418074515.456001
memory_size                            5820.0 

=== epoch 8/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:50,  2.59it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1018
episode_length                       9.818271
returns                             -2.319511
return_std                           2.633754
average_reward                      -0.236192
round_time             0 days 00:12:20.995346
episodes_test                          1016.0
episode_length_test                  9.832677
returns_test                        -2.367078
return_std_test                      2.711739
average_reward_test                 -0.240942
round_time_test        0 days 00:00:09.892955
round_time_total       0 days 00:12:20.996430
loss_total             75650414982492192768.0
loss_critic            94563017120766689280.0
loss_actor                    -38745147064.32
memory_size                            5820.0 

=== epoch 8/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:13,  2.52it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1019
episode_length                       9.804711
returns                             -2.336218
return_std                              2.614
average_reward                       -0.23824
round_time             0 days 00:12:20.654251
episodes_test                          1009.0
episode_length_test                  9.902874
returns_test                         -2.20096
return_std_test                        2.7848
average_reward_test                 -0.222479
round_time_test        0 days 00:00:09.939715
round_time_total       0 days 00:12:20.655314
loss_total             77307029152495534080.0
loss_critic            96633784759466147840.0
loss_actor                -39046830172.160004
memory_size                            5820.0 

=== epoch 8/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:45,  2.61it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1016
episode_length                        9.82874
returns                             -2.306832
return_std                            2.64479
average_reward                      -0.234453
round_time             0 days 00:12:20.892463
episodes_test                          1006.0
episode_length_test                    9.9334
returns_test                         -2.27855
return_std_test                      2.709904
average_reward_test                 -0.229142
round_time_test        0 days 00:00:09.951791
round_time_total       0 days 00:12:20.893513
loss_total             77622764774638501888.0
loss_critic            97028454275737403392.0
loss_actor                -39351064569.856003
memory_size                            5820.0 

=== epoch 8/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:10,  2.52it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                 1017
episode_length                       9.827925
returns                             -2.316946
return_std                           2.676506
average_reward                      -0.235233
round_time             0 days 00:12:20.605066
episodes_test                          1004.0
episode_length_test                  9.951195
returns_test                        -2.210109
return_std_test                      2.779177
average_reward_test                 -0.222047
round_time_test        0 days 00:00:09.857698
round_time_total       0 days 00:12:20.606124
loss_total             78890585449406857216.0
loss_critic            98613230057487761408.0
loss_actor                   -39755421116.416
memory_size                            5820.0 

=== epoch 8/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:22,  2.49it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  1011
episode_length                        9.877349
returns                              -2.323482
return_std                            2.683781
average_reward                       -0.235164
round_time              0 days 00:12:20.356323
episodes_test                           1015.0
episode_length_test                   9.844335
returns_test                         -2.438437
return_std_test                       2.714206
average_reward_test                  -0.247621
round_time_test         0 days 00:00:09.889750
round_time_total        0 days 00:12:20.357384
loss_total              81682509120375046144.0
loss_critic            102103134705434198016.0
loss_actor                 -40070230515.711998
memory_size                             5820.0 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 5/2000 [00:01<13:02,  2.55it/s]
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                   202
episode_length                        9.881188
returns                              -2.131417
return_std                            2.657404
average_reward                       -0.213812
round_time              0 days 00:12:20.432086
episodes_test                           1018.0
episode_length_test                   9.816306
returns_test                         -2.642379
return_std_test                       2.711605
average_reward_test                  -0.268713
round_time_test         0 days 00:00:09.915996
round_time_total        0 days 00:12:20.433186
loss_total              83431758528204161024.0
loss_critic            104289696288611532800.0
loss_actor                 -40532343379.968002
memory_size                             5820.0 

=== epoch 9/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<12:42,  2.62it/s]
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                   405
episode_length                        9.876543
returns                              -2.374874
return_std                            2.728317
average_reward                       -0.240456
round_time              0 days 00:12:20.394779
episodes_test                           1007.0
episode_length_test                   9.920556
returns_test                         -2.301673
return_std_test                       2.616848
average_reward_test                   -0.23138
round_time_test         0 days 00:00:09.922420
round_time_total        0 days 00:12:20.395841
loss_total              84945109133564411904.0
loss_critic            106181384635609235456.0
loss_actor                 -40932282308.608002
memory_size                             5820.0 

=== epoch 9/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                   605
episode_length                        9.915702
returns                              -2.339949
return_std                            2.713003
average_reward                       -0.235768
round_time              0 days 00:12:20.472654
episodes_test                           1009.0
episode_length_test                   9.909812
returns_test                         -2.521404
return_std_test                       2.775222
average_reward_test                  -0.254334
round_time_test         0 days 00:00:09.843758
round_time_total        0 days 00:12:20.473749
loss_total              87190473649178542080.0
loss_critic            108988090077679321088.0
loss_actor                 -41342931812.351997
memory_size                             5820.0 

=== epoch 9/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                   808
episode_length                        9.888614
returns                              -2.332909
return_std                            2.679542
average_reward                       -0.235607
round_time              0 days 00:12:23.795521
episodes_test                           1011.0
episode_length_test                   9.888229
returns_test                         -2.491721
return_std_test                         2.8495
average_reward_test                  -0.251702
round_time_test         0 days 00:00:09.999368
round_time_total        0 days 00:12:23.796586
loss_total              88777272709895143424.0
loss_critic            110971588940821020672.0
loss_actor                 -41641122840.575996
memory_size                             5820.0 

=== epoch 9/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1009
episode_length                        9.904856
returns                              -2.310982
return_std                            2.711278
average_reward                       -0.233113
round_time              0 days 00:12:20.291784
episodes_test                           1009.0
episode_length_test                   9.901883
returns_test                         -2.350719
return_std_test                       2.860569
average_reward_test                  -0.237338
round_time_test         0 days 00:00:09.903672
round_time_total        0 days 00:12:20.292852
loss_total              88122481057630306304.0
loss_critic            110153099544952225792.0
loss_actor                 -41805752875.008003
memory_size                             5820.0 

=== epoch 9/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                  1007
episode_length                        9.917577
returns                              -2.337045
return_std                            2.736893
average_reward                       -0.236106
round_time              0 days 00:12:25.151104
episodes_test                           1001.0
episode_length_test                   9.984016
returns_test                         -2.314834
return_std_test                       2.860326
average_reward_test                  -0.232154
round_time_test         0 days 00:00:09.818052
round_time_total        0 days 00:12:25.152176
loss_total              88648788891603288064.0
loss_critic            110810984085492826112.0
loss_actor                 -42093376970.751999
memory_size                             5820.0 

=== epoch 9/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                  1002
episode_length                        9.976048
returns                              -2.247828
return_std                            2.731736
average_reward                       -0.224768
round_time              0 days 00:12:25.383483
episodes_test                           1001.0
episode_length_test                    9.98002
returns_test                         -2.327174
return_std_test                        2.84329
average_reward_test                  -0.232965
round_time_test         0 days 00:00:09.959071
round_time_total        0 days 00:12:25.384552
loss_total              90742460386652209152.0
loss_critic            113428073591742939136.0
loss_actor                 -42307296155.648003
memory_size                             5820.0 

=== epoch 9/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1000
episode_length                           9.981
returns                              -2.259608
return_std                            2.765621
average_reward                        -0.22583
round_time              0 days 00:12:19.800943
episodes_test                           1005.0
episode_length_test                   9.940299
returns_test                         -2.348225
return_std_test                       2.846092
average_reward_test                  -0.235748
round_time_test         0 days 00:00:09.949686
round_time_total        0 days 00:12:19.802013
loss_total              90497226068007780352.0
loss_critic            113121530674608275456.0
loss_actor                 -42418962739.199997
memory_size                             5820.0 

=== epoch 9/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                   995
episode_length                       10.048241
returns                               -2.24435
return_std                            2.806684
average_reward                       -0.223331
round_time              0 days 00:12:19.786446
episodes_test                           1006.0
episode_length_test                   9.939364
returns_test                         -2.326914
return_std_test                       2.694361
average_reward_test                  -0.234006
round_time_test         0 days 00:00:09.911285
round_time_total        0 days 00:12:19.787526
loss_total              92039980149768568832.0
loss_critic            115049973256605728768.0
loss_actor                     -42640203847.68
memory_size                             5820.0 

=== epoch 9/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                   999
episode_length                            10.0
returns                              -2.349316
return_std                             2.78909
average_reward                       -0.234823
round_time              0 days 00:12:20.176545
episodes_test                           1014.0
episode_length_test                   9.851085
returns_test                         -2.519377
return_std_test                       2.738164
average_reward_test                  -0.255249
round_time_test         0 days 00:00:09.912580
round_time_total        0 days 00:12:20.177610
loss_total              93559414984084979712.0
loss_critic            116949266678142648320.0
loss_actor                    -42743675602.944
memory_size                             5820.0 

=== epoch 9/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1000
episode_length                           9.989
returns                              -2.345093
return_std                            2.778346
average_reward                       -0.234681
round_time              0 days 00:12:20.606418
episodes_test                            999.0
episode_length_test                  10.006006
returns_test                         -2.396629
return_std_test                       2.833045
average_reward_test                  -0.239062
round_time_test         0 days 00:00:09.915231
round_time_total        0 days 00:12:20.607485
loss_total              93669981916804841472.0
loss_critic            117087475461277908992.0
loss_actor                 -43142046210.047997
memory_size                             5820.0 

=== epoch 9/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1004
episode_length                        9.948207
returns                              -2.397479
return_std                            2.817123
average_reward                       -0.240589
round_time              0 days 00:12:20.126290
episodes_test                            995.0
episode_length_test                  10.044221
returns_test                          -2.21082
return_std_test                       2.712823
average_reward_test                  -0.219419
round_time_test         0 days 00:00:09.966300
round_time_total        0 days 00:12:20.127359
loss_total              95592862672559521792.0
loss_critic            119491076295607762944.0
loss_actor                 -43460826437.632004
memory_size                             5820.0 

=== epoch 9/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1007
episode_length                        9.924528
returns                              -2.407181
return_std                            2.791584
average_reward                       -0.242635
round_time              0 days 00:12:20.623977
episodes_test                           1006.0
episode_length_test                   9.935388
returns_test                          -2.34476
return_std_test                       2.834935
average_reward_test                  -0.235494
round_time_test         0 days 00:00:09.966133
round_time_total        0 days 00:12:20.625048
loss_total              96172939215030370304.0
loss_critic            120216171957753413632.0
loss_actor                    -43869937825.792
memory_size                             5820.0 

=== epoch 9/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1010
episode_length                        9.889109
returns                              -2.435179
return_std                            2.769222
average_reward                       -0.245891
round_time              0 days 00:12:21.168246
episodes_test                           1004.0
episode_length_test                   9.958167
returns_test                         -2.386647
return_std_test                       2.845541
average_reward_test                  -0.239456
round_time_test         0 days 00:00:09.995330
round_time_total        0 days 00:12:21.169322
loss_total              98072083020522536960.0
loss_critic            122590101637377933312.0
loss_actor                 -44480332400.639999
memory_size                             5820.0 

=== epoch 9/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:02<00:00,  2.56it/s]
episodes                                  1005
episode_length                        9.944279
returns                              -2.307315
return_std                            2.791355
average_reward                       -0.232031
round_time              0 days 00:13:02.898522
episodes_test                           1009.0
episode_length_test                   9.903865
returns_test                         -2.443413
return_std_test                       2.813149
average_reward_test                  -0.246768
round_time_test         0 days 00:00:09.958442
round_time_total        0 days 00:13:02.899671
loss_total             103388055702469066752.0
loss_critic            129235067258638778368.0
loss_actor                 -44996044056.575996
memory_size                             5820.0 

=== epoch 9/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:39<00:00,  2.27it/s]
episodes                                  1001
episode_length                        9.977023
returns                              -2.286285
return_std                            2.810651
average_reward                       -0.229321
round_time              0 days 00:14:40.159699
episodes_test                           1005.0
episode_length_test                   9.941294
returns_test                         -2.450416
return_std_test                       2.933273
average_reward_test                  -0.245907
round_time_test         0 days 00:00:10.288107
round_time_total        0 days 00:14:40.160836
loss_total             104375849754758447104.0
loss_critic            130469810030433812480.0
loss_actor                    -45389009686.528
memory_size                             5820.0 

=== epoch 9/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:37<00:00,  2.28it/s]
episodes                                  1002
episode_length                        9.972056
returns                              -2.303922
return_std                             2.76769
average_reward                       -0.231221
round_time              0 days 00:14:38.009448
episodes_test                           1006.0
episode_length_test                   9.937376
returns_test                         -2.174357
return_std_test                        2.70307
average_reward_test                  -0.218516
round_time_test         0 days 00:00:10.390029
round_time_total        0 days 00:14:38.010659
loss_total             107045015722704994304.0
loss_critic            133806267271014432768.0
loss_actor                 -45595516667.903999
memory_size                             5820.0 

=== epoch 9/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:31<00:00,  2.30it/s]
episodes                                  1002
episode_length                        9.976048
returns                               -2.28026
return_std                            2.807449
average_reward                       -0.228999
round_time              0 days 00:14:31.582064
episodes_test                           1013.0
episode_length_test                   9.870681
returns_test                         -2.372132
return_std_test                       2.708191
average_reward_test                  -0.240196
round_time_test         0 days 00:00:10.127321
round_time_total        0 days 00:14:31.583161
loss_total             107024134543339159552.0
loss_critic            133780165904009510912.0
loss_actor                    -45792119717.888
memory_size                             5820.0 

=== epoch 9/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:54<00:00,  2.40it/s]
episodes                                  1007
episode_length                        9.912612
returns                              -2.320778
return_std                            2.810584
average_reward                       -0.234054
round_time              0 days 00:13:54.802029
episodes_test                           1019.0
episode_length_test                   9.804711
returns_test                         -2.413189
return_std_test                        2.63983
average_reward_test                  -0.245985
round_time_test         0 days 00:00:10.217440
round_time_total        0 days 00:13:54.803095
loss_total             107671609701635391488.0
loss_critic            134589509717189623808.0
loss_actor                 -45854106073.087997
memory_size                             5820.0 

=== epoch 9/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1011
episode_length                        9.878338
returns                              -2.440629
return_std                            2.811716
average_reward                       -0.246504
round_time              0 days 00:12:19.558242
episodes_test                           1015.0
episode_length_test                   9.850246
returns_test                         -2.430817
return_std_test                       2.693563
average_reward_test                  -0.246578
round_time_test         0 days 00:00:10.058768
round_time_total        0 days 00:12:19.559305
loss_total             108056623334279315456.0
loss_critic            135070776956181495808.0
loss_actor                 -46306224324.608002
memory_size                             5820.0 

=== epoch 9/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1014
episode_length                        9.850099
returns                              -2.453497
return_std                            2.794269
average_reward                       -0.248813
round_time              0 days 00:12:20.501111
episodes_test                           1016.0
episode_length_test                    9.84252
returns_test                         -2.440347
return_std_test                       2.755675
average_reward_test                  -0.247939
round_time_test         0 days 00:00:09.893636
round_time_total        0 days 00:12:20.502174
loss_total             111574585955461054464.0
loss_critic            139468230130129223680.0
loss_actor                    -46857629741.056
memory_size                             5820.0 

=== epoch 9/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1018
episode_length                        9.821218
returns                              -2.418122
return_std                            2.803571
average_reward                       -0.246073
round_time              0 days 00:12:20.712556
episodes_test                           1007.0
episode_length_test                     9.9285
returns_test                         -2.332319
return_std_test                       2.724044
average_reward_test                  -0.234677
round_time_test         0 days 00:00:10.207257
round_time_total        0 days 00:12:20.713596
loss_total             113751581233322868736.0
loss_critic            142189474130974343168.0
loss_actor                 -47371953518.592003
memory_size                             5820.0 

=== epoch 9/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1021
episode_length                        9.794319
returns                              -2.434741
return_std                             2.74544
average_reward                       -0.248587
round_time              0 days 00:12:19.704239
episodes_test                           1012.0
episode_length_test                    9.87747
returns_test                         -2.357833
return_std_test                       2.735442
average_reward_test                  -0.238422
round_time_test         0 days 00:00:09.850815
round_time_total        0 days 00:12:19.705288
loss_total             114899284451346235392.0
loss_critic            143624103180991348736.0
loss_actor                 -47877454004.223999
memory_size                             5820.0 

=== epoch 9/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1012
episode_length                        9.875494
returns                              -2.305577
return_std                            2.743047
average_reward                       -0.233368
round_time              0 days 00:12:19.919018
episodes_test                           1017.0
episode_length_test                   9.830875
returns_test                         -2.446592
return_std_test                        2.72958
average_reward_test                  -0.248645
round_time_test         0 days 00:00:09.951289
round_time_total        0 days 00:12:19.920080
loss_total             116390810441938190336.0
loss_critic            145488510619203518464.0
loss_actor                 -48255938162.688004
memory_size                             5820.0 

=== epoch 9/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1009
episode_length                        9.900892
returns                              -2.211251
return_std                             2.71728
average_reward                       -0.223115
round_time              0 days 00:12:19.884858
episodes_test                           1012.0
episode_length_test                   9.881423
returns_test                         -2.376545
return_std_test                       2.812617
average_reward_test                  -0.240506
round_time_test         0 days 00:00:10.053363
round_time_total        0 days 00:12:19.885903
loss_total             119086970383518826496.0
loss_critic            148858710430180835328.0
loss_actor                 -48762302703.615997
memory_size                             5820.0 

=== epoch 9/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1009
episode_length                        9.901883
returns                              -2.239513
return_std                             2.72382
average_reward                       -0.225991
round_time              0 days 00:12:19.719436
episodes_test                           1014.0
episode_length_test                   9.860947
returns_test                          -2.43321
return_std_test                       2.776918
average_reward_test                  -0.246647
round_time_test         0 days 00:00:09.977083
round_time_total        0 days 00:12:19.720492
loss_total             122732728593584635904.0
loss_critic            153415907966263689216.0
loss_actor                 -49169473054.720001
memory_size                             5820.0 

=== epoch 9/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1005
episode_length                        9.937313
returns                              -2.194163
return_std                            2.731881
average_reward                       -0.220705
round_time              0 days 00:12:19.735539
episodes_test                           1021.0
episode_length_test                    9.79334
returns_test                         -2.615346
return_std_test                       2.845782
average_reward_test                  -0.266945
round_time_test         0 days 00:00:10.070261
round_time_total        0 days 00:12:19.736596
loss_total             125358033259291344896.0
loss_critic            156697538891855560704.0
loss_actor                 -49582344620.031998
memory_size                             5820.0 

=== epoch 9/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                  1001
episode_length                        9.981019
returns                              -2.180436
return_std                            2.773926
average_reward                       -0.218475
round_time              0 days 00:12:19.238096
episodes_test                           1002.0
episode_length_test                    9.97505
returns_test                         -2.241441
return_std_test                       2.800807
average_reward_test                  -0.224284
round_time_test         0 days 00:00:09.890330
round_time_total        0 days 00:12:19.239150
loss_total             126462447434593255424.0
loss_critic            158078056617305145344.0
loss_actor                 -50053278732.288002
memory_size                             5820.0 

=== epoch 9/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1005
episode_length                        9.943284
returns                              -2.305907
return_std                            2.774524
average_reward                       -0.231878
round_time              0 days 00:12:19.837034
episodes_test                           1015.0
episode_length_test                   9.848276
returns_test                         -2.429794
return_std_test                       2.761995
average_reward_test                  -0.246365
round_time_test         0 days 00:00:10.052505
round_time_total        0 days 00:12:19.838088
loss_total             129414027469932724224.0
loss_critic            161767531418487390208.0
loss_actor                    -50583953102.848
memory_size                             5820.0 

=== epoch 9/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1007
episode_length                         9.91857
returns                              -2.400038
return_std                            2.792491
average_reward                       -0.241555
round_time              0 days 00:12:20.399129
episodes_test                           1011.0
episode_length_test                   9.885262
returns_test                          -2.52357
return_std_test                       2.747875
average_reward_test                  -0.255549
round_time_test         0 days 00:00:09.838996
round_time_total        0 days 00:12:20.400185
loss_total             132894399965593829376.0
loss_critic            166117997045485502464.0
loss_actor                 -51177001250.816002
memory_size                             5820.0 

=== epoch 9/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                  1012
episode_length                        9.879447
returns                              -2.475158
return_std                            2.771288
average_reward                       -0.250484
round_time              0 days 00:12:19.469277
episodes_test                           1005.0
episode_length_test                   9.943284
returns_test                          -2.38946
return_std_test                       2.767299
average_reward_test                  -0.240435
round_time_test         0 days 00:00:09.780761
round_time_total        0 days 00:12:19.470332
loss_total             135451619024117465088.0
loss_critic            169314520640766246912.0
loss_actor                 -51701116311.552002
memory_size                             5820.0 

=== epoch 9/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1012
episode_length                        9.868577
returns                              -2.544884
return_std                            2.782004
average_reward                       -0.257281
round_time              0 days 00:12:20.855073
episodes_test                           1004.0
episode_length_test                   9.953187
returns_test                         -2.480363
return_std_test                       2.810066
average_reward_test                  -0.249081
round_time_test         0 days 00:00:09.816324
round_time_total        0 days 00:12:20.856123
loss_total             136570252644540694528.0
loss_critic            170712812779819859968.0
loss_actor                    -52209497249.792
memory_size                             5820.0 

=== epoch 9/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1013
episode_length                        9.865745
returns                              -2.533422
return_std                             2.76511
average_reward                       -0.256786
round_time              0 days 00:12:20.281222
episodes_test                           1008.0
episode_length_test                   9.917659
returns_test                         -2.477728
return_std_test                       2.920961
average_reward_test                  -0.249507
round_time_test         0 days 00:00:09.811559
round_time_total        0 days 00:12:20.282278
loss_total             139981060698675462144.0
loss_critic            174976322733688881152.0
loss_actor                 -52641270378.496002
memory_size                             5820.0 

=== epoch 9/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1015
episode_length                        9.837438
returns                              -2.550772
return_std                            2.772839
average_reward                       -0.259519
round_time              0 days 00:12:19.824079
episodes_test                           1011.0
episode_length_test                   9.891197
returns_test                         -2.638591
return_std_test                         2.7719
average_reward_test                  -0.266762
round_time_test         0 days 00:00:09.812550
round_time_total        0 days 00:12:19.825151
loss_total             142349712262890782720.0
loss_critic            177937137280217481216.0
loss_actor                 -52866378514.431999
memory_size                             5820.0 

=== epoch 9/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1013
episode_length                        9.857848
returns                              -2.520248
return_std                            2.821464
average_reward                       -0.255907
round_time              0 days 00:12:19.722863
episodes_test                           1018.0
episode_length_test                     9.8222
returns_test                         -2.483339
return_std_test                       2.712957
average_reward_test                   -0.25274
round_time_test         0 days 00:00:10.056003
round_time_total        0 days 00:12:19.723912
loss_total             144829827593527607296.0
loss_critic            181037281390462074880.0
loss_actor                 -52885162205.183998
memory_size                             5820.0 

=== epoch 9/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1009
episode_length                        9.905847
returns                              -2.502712
return_std                            2.868486
average_reward                       -0.252085
round_time              0 days 00:12:20.216966
episodes_test                           1012.0
episode_length_test                    9.87253
returns_test                         -2.568794
return_std_test                       2.788776
average_reward_test                  -0.259435
round_time_test         0 days 00:00:09.970424
round_time_total        0 days 00:12:20.218016
loss_total             143592367559064436736.0
loss_critic            179490456420225744896.0
loss_actor                 -52993793429.503998
memory_size                             5820.0 

=== epoch 9/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1010
episode_length                        9.892079
returns                              -2.554354
return_std                            2.790535
average_reward                       -0.258274
round_time              0 days 00:12:20.754857
episodes_test                           1006.0
episode_length_test                   9.936382
returns_test                         -2.609161
return_std_test                       2.765568
average_reward_test                  -0.262653
round_time_test         0 days 00:00:09.808786
round_time_total        0 days 00:12:20.755905
loss_total             143381421296677191680.0
loss_critic            179226773394329600000.0
loss_actor                 -53499725322.239998
memory_size                             5820.0 

=== epoch 9/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1015
episode_length                        9.847291
returns                              -2.638934
return_std                            2.814164
average_reward                       -0.268196
round_time              0 days 00:12:20.175697
episodes_test                           1010.0
episode_length_test                    9.89802
returns_test                         -2.727592
return_std_test                       2.912073
average_reward_test                  -0.275212
round_time_test         0 days 00:00:09.929581
round_time_total        0 days 00:12:20.176784
loss_total             147458781905563779072.0
loss_critic            184323474244773183488.0
loss_actor                    -54031378266.112
memory_size                             5820.0 

=== epoch 9/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                  1007
episode_length                         9.91857
returns                               -2.58184
return_std                            2.980867
average_reward                       -0.260391
round_time              0 days 00:12:19.276812
episodes_test                           1015.0
episode_length_test                   9.851232
returns_test                         -2.708239
return_std_test                       2.923959
average_reward_test                  -0.274804
round_time_test         0 days 00:00:09.849731
round_time_total        0 days 00:12:19.277855
loss_total             151123452256587055104.0
loss_critic            188904312036217716736.0
loss_actor                 -54635325073.407997
memory_size                             5820.0 

=== epoch 9/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1008
episode_length                         9.90873
returns                              -2.603108
return_std                            2.949054
average_reward                       -0.262571
round_time              0 days 00:12:20.014395
episodes_test                           1017.0
episode_length_test                   9.823992
returns_test                         -2.661844
return_std_test                       2.948435
average_reward_test                  -0.270759
round_time_test         0 days 00:00:09.870407
round_time_total        0 days 00:12:20.015445
loss_total             155884438957838991360.0
loss_critic            194855545165392510976.0
loss_actor                 -55020830951.424004
memory_size                             5820.0 

=== epoch 9/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1013
episode_length                        9.860809
returns                              -2.603112
return_std                            2.939549
average_reward                       -0.264134
round_time              0 days 00:12:19.968103
episodes_test                           1019.0
episode_length_test                   9.807655
returns_test                         -2.681813
return_std_test                        2.85974
average_reward_test                  -0.272607
round_time_test         0 days 00:00:09.925969
round_time_total        0 days 00:12:19.969147
loss_total             155651274730948362240.0
loss_critic            194564089964517457920.0
loss_actor                    -55504834793.472
memory_size                             5820.0 

=== epoch 9/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1013
episode_length                        9.858835
returns                              -2.604981
return_std                            2.969405
average_reward                       -0.264754
round_time              0 days 00:12:20.497784
episodes_test                           1028.0
episode_length_test                   9.726654
returns_test                         -2.759668
return_std_test                       2.655376
average_reward_test                  -0.283616
round_time_test         0 days 00:00:09.848854
round_time_total        0 days 00:12:20.498850
loss_total             159917725775067578368.0
loss_critic            199897153619033391104.0
loss_actor                 -56017754771.456001
memory_size                             5820.0 

=== epoch 9/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1017
episode_length                        9.820059
returns                              -2.556234
return_std                            2.929549
average_reward                       -0.259743
round_time              0 days 00:12:20.332825
episodes_test                           1018.0
episode_length_test                   9.823183
returns_test                         -2.439654
return_std_test                       2.870944
average_reward_test                  -0.248357
round_time_test         0 days 00:00:10.013595
round_time_total        0 days 00:12:20.333882
loss_total             165752174549377024000.0
loss_critic            207190214626502639616.0
loss_actor                 -56550151614.463997
memory_size                             5820.0 

=== epoch 9/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1023
episode_length                        9.774194
returns                              -2.603599
return_std                            2.799794
average_reward                         -0.2663
round_time              0 days 00:12:20.786844
episodes_test                           1016.0
episode_length_test                   9.840551
returns_test                         -2.413148
return_std_test                       2.796415
average_reward_test                  -0.245092
round_time_test         0 days 00:00:09.823940
round_time_total        0 days 00:12:20.787894
loss_total             163071341516213583872.0
loss_critic            203839173506572124160.0
loss_actor                 -57059038021.632004
memory_size                             5820.0 

=== epoch 9/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1024
episode_length                        9.761719
returns                              -2.572893
return_std                            2.797542
average_reward                        -0.26365
round_time              0 days 00:12:19.984311
episodes_test                           1022.0
episode_length_test                   9.782779
returns_test                         -2.546382
return_std_test                       2.828302
average_reward_test                  -0.260128
round_time_test         0 days 00:00:09.858887
round_time_total        0 days 00:12:19.985385
loss_total             167598180045315768320.0
loss_critic            209497721586036277248.0
loss_actor                 -57498619621.375999
memory_size                             5820.0 

=== epoch 9/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1021
episode_length                        9.789422
returns                              -2.499258
return_std                            2.779016
average_reward                        -0.25591
round_time              0 days 00:12:19.733013
episodes_test                           1009.0
episode_length_test                   9.905847
returns_test                         -2.306632
return_std_test                       2.760455
average_reward_test                  -0.232817
round_time_test         0 days 00:00:09.840605
round_time_total        0 days 00:12:19.734048
loss_total             170485541437175365632.0
loss_critic            213106923323661713408.0
loss_actor                 -57676074268.671997
memory_size                             5820.0 

=== epoch 9/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1025
episode_length                        9.745366
returns                              -2.468322
return_std                            2.801421
average_reward                       -0.253012
round_time              0 days 00:12:20.493894
episodes_test                           1011.0
episode_length_test                   9.885262
returns_test                         -2.449927
return_std_test                        2.76316
average_reward_test                   -0.24762
round_time_test         0 days 00:00:10.043375
round_time_total        0 days 00:12:20.494942
loss_total             172459452253258612736.0
loss_critic            215574311780818747392.0
loss_actor                 -58303659997.183998
memory_size                             5820.0 

=== epoch 9/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                  1026
episode_length                        9.740741
returns                               -2.54577
return_std                            2.846423
average_reward                       -0.261459
round_time              0 days 00:12:22.046237
episodes_test                           1021.0
episode_length_test                   9.786484
returns_test                         -2.771142
return_std_test                       2.828016
average_reward_test                   -0.28296
round_time_test         0 days 00:00:09.947495
round_time_total        0 days 00:12:22.047325
loss_total             173685714119134150656.0
loss_critic            217107139200574357504.0
loss_actor                 -58786522568.704002
memory_size                             5820.0 

=== epoch 9/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1027
episode_length                        9.737098
returns                              -2.593869
return_std                            2.842441
average_reward                        -0.26639
round_time              0 days 00:12:21.424713
episodes_test                           1025.0
episode_length_test                   9.754146
returns_test                         -2.547727
return_std_test                       2.787137
average_reward_test                  -0.260962
round_time_test         0 days 00:00:09.818735
round_time_total        0 days 00:12:21.425755
loss_total             178770165144506433536.0
loss_critic            223462702795097833472.0
loss_actor                 -59201747372.031998
memory_size                             5820.0 

=== epoch 9/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                  1028
episode_length                        9.714008
returns                              -2.617013
return_std                            2.802389
average_reward                       -0.269775
round_time              0 days 00:12:23.346556
episodes_test                            999.0
episode_length_test                  10.005005
returns_test                         -2.401552
return_std_test                       2.919047
average_reward_test                  -0.240036
round_time_test         0 days 00:00:09.829648
round_time_total        0 days 00:12:23.347609
loss_total             180055544151905927168.0
loss_critic            225069426224218865664.0
loss_actor                 -59660706023.424004
memory_size                             5820.0 


=== epoch 10/10 ==== round 1/50 ======================================
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                   202
episode_length                        9.871287
returns                              -2.517208
return_std                            2.597265
average_reward                       -0.256179
round_time              0 days 00:12:24.452888
episodes_test                           1006.0
episode_length_test                   9.940358
returns_test                         -2.491149
return_std_test                       2.815606
average_reward_test                   -0.25061
round_time_test         0 days 00:00:09.978720
round_time_total        0 days 00:12:24.453957
loss_total             181380976061129916416.0
loss_critic            226726215939499917312.0
loss_actor                 -59829304303.615997
memory_size                             5820.0 

=== epoch 10/10 ==== round 2/50 ======================================
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                   403
episode_length                        9.920596
returns                              -2.296549
return_std                            2.629896
average_reward                       -0.230886
round_time              0 days 00:12:24.646206
episodes_test                           1008.0
episode_length_test                   9.914683
returns_test                         -2.368666
return_std_test                       2.804746
average_reward_test                  -0.238662
round_time_test         0 days 00:00:09.928182
round_time_total        0 days 00:12:24.647281
loss_total             181411273818715848704.0
loss_critic            226764088712076656640.0
loss_actor                 -59913571383.295998
memory_size                             5820.0 

=== epoch 10/10 ==== round 3/50 ======================================
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                   605
episode_length                        9.905785
returns                              -2.339804
return_std                            2.692818
average_reward                       -0.235956
round_time              0 days 00:12:25.918174
episodes_test                           1003.0
episode_length_test                   9.969093
returns_test                         -2.413874
return_std_test                       2.892724
average_reward_test                  -0.242038
round_time_test         0 days 00:00:09.836373
round_time_total        0 days 00:12:25.919236
loss_total             183879767336735834112.0
loss_critic            229849705185190150144.0
loss_actor                     -60317479495.68
memory_size                             5820.0 

=== epoch 10/10 ==== round 4/50 ======================================
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                   805
episode_length                        9.935404
returns                              -2.409557
return_std                            2.772996
average_reward                       -0.242266
round_time              0 days 00:12:19.535216
episodes_test                           1008.0
episode_length_test                   9.918651
returns_test                          -2.41003
return_std_test                       2.863031
average_reward_test                  -0.242737
round_time_test         0 days 00:00:09.928653
round_time_total        0 days 00:12:19.536300
loss_total             187171683942916915200.0
loss_critic            233964600753800478720.0
loss_actor                 -60705666527.232002
memory_size                             5820.0 

=== epoch 10/10 ==== round 5/50 ======================================
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1009
episode_length                        9.909812
returns                              -2.455796
return_std                            2.770768
average_reward                       -0.247715
round_time              0 days 00:12:20.804890
episodes_test                           1009.0
episode_length_test                   9.909812
returns_test                         -2.611046
return_std_test                        2.79128
average_reward_test                  -0.263368
round_time_test         0 days 00:00:09.924100
round_time_total        0 days 00:12:20.805949
loss_total             187750841243089403904.0
loss_critic            234688547656093040640.0
loss_actor                 -61421772857.344002
memory_size                             5820.0 

=== epoch 10/10 ==== round 6/50 ======================================
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1005
episode_length                        9.941294
returns                              -2.426787
return_std                            2.840547
average_reward                       -0.243704
round_time              0 days 00:12:20.785424
episodes_test                           1013.0
episode_length_test                   9.870681
returns_test                         -2.676881
return_std_test                       2.951037
average_reward_test                  -0.271092
round_time_test         0 days 00:00:09.893474
round_time_total        0 days 00:12:20.786481
loss_total             191539315435718901760.0
loss_critic            239424140074173267968.0
loss_actor                 -61877108662.272003
memory_size                             5820.0 

=== epoch 10/10 ==== round 7/50 ======================================
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1008
episode_length                        9.905754
returns                              -2.584898
return_std                            2.882203
average_reward                       -0.260605
round_time              0 days 00:12:20.209262
episodes_test                           1015.0
episode_length_test                   9.846305
returns_test                          -2.81014
return_std_test                       2.844003
average_reward_test                  -0.285025
round_time_test         0 days 00:00:09.851218
round_time_total        0 days 00:12:20.210326
loss_total             195558207685310676992.0
loss_critic            244447755414750265344.0
loss_actor                 -62705079459.839996
memory_size                             5820.0 

=== epoch 10/10 ==== round 8/50 ======================================
100%|██████████| 2000/2000 [12:25<00:00,  2.68it/s]
episodes                                  1005
episode_length                        9.940299
returns                              -2.541088
return_std                             2.89724
average_reward                       -0.255369
round_time              0 days 00:12:25.665497
episodes_test                           1011.0
episode_length_test                   9.888229
returns_test                         -2.573995
return_std_test                       2.829018
average_reward_test                  -0.259977
round_time_test         0 days 00:00:09.738185
round_time_total        0 days 00:12:25.666560
loss_total             199318255844286988288.0
loss_critic            249147815632437215232.0
loss_actor                 -63219140155.391998
memory_size                             5820.0 

=== epoch 10/10 ==== round 9/50 ======================================
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1009
episode_length                        9.894945
returns                              -2.527065
return_std                            2.859599
average_reward                       -0.255532
round_time              0 days 00:12:20.834360
episodes_test                           1014.0
episode_length_test                   9.857002
returns_test                         -2.699612
return_std_test                       2.869183
average_reward_test                  -0.273382
round_time_test         0 days 00:00:09.892528
round_time_total        0 days 00:12:20.835448
loss_total             201462470438608928768.0
loss_critic            251828083760715563008.0
loss_actor                    -63879990038.528
memory_size                             5820.0 

=== epoch 10/10 ==== round 10/50 =====================================
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1007
episode_length                         9.91857
returns                              -2.516167
return_std                            2.846599
average_reward                       -0.253821
round_time              0 days 00:12:20.126314
episodes_test                           1011.0
episode_length_test                   9.886251
returns_test                         -2.532694
return_std_test                       2.796907
average_reward_test                  -0.256206
round_time_test         0 days 00:00:09.935217
round_time_total        0 days 00:12:20.127395
loss_total             204147009540640538624.0
loss_critic            255183757378220589056.0
loss_actor                 -64691656716.288002
memory_size                             5820.0 

=== epoch 10/10 ==== round 11/50 =====================================
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1007
episode_length                        9.921549
returns                              -2.454418
return_std                            2.851321
average_reward                       -0.247359
round_time              0 days 00:12:20.403716
episodes_test                           1009.0
episode_length_test                   9.901883
returns_test                         -2.589333
return_std_test                       2.879461
average_reward_test                  -0.261423
round_time_test         0 days 00:00:10.023297
round_time_total        0 days 00:12:20.404773
loss_total             215143173485856030720.0
loss_critic            268928962094235680768.0
loss_actor                 -65234861660.160004
memory_size                             5820.0 

=== epoch 10/10 ==== round 12/50 =====================================
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1006
episode_length                        9.925447
returns                              -2.410108
return_std                            2.908667
average_reward                        -0.24299
round_time              0 days 00:12:20.760203
episodes_test                            999.0
episode_length_test                  10.007007
returns_test                         -2.412112
return_std_test                       2.924546
average_reward_test                  -0.240874
round_time_test         0 days 00:00:09.870372
round_time_total        0 days 00:12:20.761265
loss_total             217786352967071072256.0
loss_critic            272232936433110089728.0
loss_actor                 -65914222755.839996
memory_size                             5820.0 

=== epoch 10/10 ==== round 13/50 =====================================
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                  1011
episode_length                         9.88724
returns                              -2.526508
return_std                             2.94769
average_reward                       -0.256071
round_time              0 days 00:12:21.979506
episodes_test                           1004.0
episode_length_test                   9.957171
returns_test                         -2.537178
return_std_test                       2.874456
average_reward_test                  -0.254541
round_time_test         0 days 00:00:09.966305
round_time_total        0 days 00:12:21.980576
loss_total             221205477033366519808.0
loss_critic            276506841501960601600.0
loss_actor                 -66542587629.568001
memory_size                             5820.0 

=== epoch 10/10 ==== round 14/50 =====================================
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                  1010
episode_length                         9.89703
returns                              -2.545177
return_std                              2.9509
average_reward                       -0.256811
round_time              0 days 00:12:21.604756
episodes_test                           1005.0
episode_length_test                   9.941294
returns_test                         -2.615427
return_std_test                        2.96113
average_reward_test                  -0.262873
round_time_test         0 days 00:00:10.044052
round_time_total        0 days 00:12:21.605809
loss_total             227942894399355617280.0
loss_critic            284928613201200644096.0
loss_actor                 -66924773177.344002
memory_size                             5820.0 

=== epoch 10/10 ==== round 15/50 =====================================
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                  1007
episode_length                        9.927507
returns                              -2.537539
return_std                            2.977761
average_reward                       -0.255797
round_time              0 days 00:12:21.384281
episodes_test                           1009.0
episode_length_test                   9.904856
returns_test                          -2.59956
return_std_test                       2.843584
average_reward_test                   -0.26247
round_time_test         0 days 00:00:09.842212
round_time_total        0 days 00:12:21.385332
loss_total             226289337376903069696.0
loss_critic            282861666860187942912.0
loss_actor                 -67388953335.807999
memory_size                             5820.0 

=== epoch 10/10 ==== round 16/50 =====================================
100%|██████████| 2000/2000 [12:24<00:00,  2.69it/s]
episodes                                  1007
episode_length                        9.921549
returns                                -2.5767
return_std                            2.918296
average_reward                       -0.259685
round_time              0 days 00:12:25.219761
episodes_test                           1006.0
episode_length_test                   9.935388
returns_test                         -2.489535
return_std_test                       2.800131
average_reward_test                  -0.250289
round_time_test         0 days 00:00:09.788313
round_time_total        0 days 00:12:25.220843
loss_total             234613413597734273024.0
loss_critic            293266762106540130304.0
loss_actor                 -67772062306.304001
memory_size                             5820.0 

=== epoch 10/10 ==== round 17/50 =====================================
100%|██████████| 2000/2000 [12:26<00:00,  2.68it/s]
episodes                                  1004
episode_length                        9.955179
returns                              -2.577058
return_std                            2.820512
average_reward                       -0.258672
round_time              0 days 00:12:27.455369
episodes_test                           1012.0
episode_length_test                    9.87747
returns_test                         -2.457804
return_std_test                       2.853723
average_reward_test                  -0.248481
round_time_test         0 days 00:00:10.028490
round_time_total        0 days 00:12:27.456427
loss_total             235073189701085790208.0
loss_critic            293841482261568028672.0
loss_actor                 -68644624830.463997
memory_size                             5820.0 

=== epoch 10/10 ==== round 18/50 =====================================
100%|██████████| 2000/2000 [12:28<00:00,  2.67it/s]
episodes                                  1000
episode_length                           9.992
returns                              -2.515747
return_std                            2.770255
average_reward                       -0.250879
round_time              0 days 00:12:28.905818
episodes_test                            992.0
episode_length_test                  10.071573
returns_test                         -2.257106
return_std_test                        2.77811
average_reward_test                  -0.224071
round_time_test         0 days 00:00:09.894659
round_time_total        0 days 00:12:28.906916
loss_total             237462618254945091584.0
loss_critic            296828267302431555584.0
loss_actor                    -69186298300.416
memory_size                             5820.0 

=== epoch 10/10 ==== round 19/50 =====================================
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   996
episode_length                       10.032129
returns                              -2.463234
return_std                            2.753383
average_reward                       -0.245675
round_time              0 days 00:14:10.074405
episodes_test                           1007.0
episode_length_test                   9.924528
returns_test                          -2.46767
return_std_test                       2.728274
average_reward_test                  -0.248284
round_time_test         0 days 00:00:09.822003
round_time_total        0 days 00:14:10.076163
loss_total             243061691863700865024.0
loss_critic            303827109496444944384.0
loss_actor                    -69282905276.416
memory_size                             5820.0 

=== epoch 10/10 ==== round 20/50 =====================================
100%|██████████| 2000/2000 [14:37<00:00,  2.28it/s]
episodes                                   999
episode_length                       10.005005
returns                              -2.471059
return_std                            2.777819
average_reward                       -0.246843
round_time              0 days 00:14:37.828592
episodes_test                           1005.0
episode_length_test                   9.949254
returns_test                         -2.487662
return_std_test                       2.855066
average_reward_test                  -0.249935
round_time_test         0 days 00:00:10.205528
round_time_total        0 days 00:14:37.829711
loss_total             246792032675347562496.0
loss_critic            308490035751796342784.0
loss_actor                 -69621510676.479996
memory_size                             5820.0 

=== epoch 10/10 ==== round 21/50 =====================================
100%|██████████| 2000/2000 [14:30<00:00,  2.30it/s]
episodes                                  1003
episode_length                        9.961117
returns                              -2.557381
return_std                            2.777549
average_reward                       -0.256419
round_time              0 days 00:14:31.155169
episodes_test                           1008.0
episode_length_test                    9.91369
returns_test                         -2.620177
return_std_test                       2.849378
average_reward_test                  -0.263804
round_time_test         0 days 00:00:10.294740
round_time_total        0 days 00:14:31.156343
loss_total             252639852155364605952.0
loss_critic            315799809843982172160.0
loss_actor                    -70409691994.112
memory_size                             5820.0 

=== epoch 10/10 ==== round 22/50 =====================================
100%|██████████| 2000/2000 [13:42<00:00,  2.43it/s]
episodes                                  1003
episode_length                        9.957129
returns                              -2.472076
return_std                            2.807301
average_reward                       -0.248375
round_time              0 days 00:13:42.727896
episodes_test                           1009.0
episode_length_test                   9.905847
returns_test                         -2.293428
return_std_test                       2.801059
average_reward_test                  -0.231412
round_time_test         0 days 00:00:10.193851
round_time_total        0 days 00:13:42.728947
loss_total             253799449120519094272.0
loss_critic            317249305964663406592.0
loss_actor                 -70871740751.871994
memory_size                             5820.0 

=== epoch 10/10 ==== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1006
episode_length                        9.936382
returns                              -2.454677
return_std                            2.767476
average_reward                       -0.247496
round_time              0 days 00:12:19.977709
episodes_test                            999.0
episode_length_test                  10.002002
returns_test                         -2.189713
return_std_test                       2.797559
average_reward_test                  -0.218306
round_time_test         0 days 00:00:09.895392
round_time_total        0 days 00:12:19.978756
loss_total             259939571728285827072.0
loss_critic            324924458803258851328.0
loss_actor                 -71672418230.272003
memory_size                             5820.0 

=== epoch 10/10 ==== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                  1006
episode_length                        9.939364
returns                              -2.362927
return_std                            2.789225
average_reward                       -0.237626
round_time              0 days 00:12:20.304296
episodes_test                           1005.0
episode_length_test                   9.948259
returns_test                         -2.342752
return_std_test                       2.838206
average_reward_test                  -0.235388
round_time_test         0 days 00:00:09.912465
round_time_total        0 days 00:12:20.305357
loss_total             265052977067464032256.0
loss_critic            331316215538804260864.0
loss_actor                 -72332851140.608002
memory_size                             5820.0 

=== epoch 10/10 ==== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1005
episode_length                        9.945274
returns                              -2.381175
return_std                            2.797861
average_reward                       -0.239609
round_time              0 days 00:12:19.600504
episodes_test                           1005.0
episode_length_test                   9.945274
returns_test                         -2.266197
return_std_test                       2.803054
average_reward_test                  -0.227835
round_time_test         0 days 00:00:10.015050
round_time_total        0 days 00:12:19.601561
loss_total             268089103800931713024.0
loss_critic            335111373952890109952.0
loss_actor                 -73164081510.399994
memory_size                             5820.0 

=== epoch 10/10 ==== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                  1000
episode_length                            9.99
returns                              -2.341748
return_std                            2.788845
average_reward                       -0.234531
round_time              0 days 00:14:10.114906
episodes_test                           1009.0
episode_length_test                   9.901883
returns_test                         -2.540863
return_std_test                        2.74371
average_reward_test                  -0.256395
round_time_test         0 days 00:00:09.889986
round_time_total        0 days 00:14:10.115989
loss_total             276330526231857987584.0
loss_critic            345413151585828143104.0
loss_actor                 -73668731043.839996
memory_size                             5820.0 

=== epoch 10/10 ==== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:43<00:00,  2.26it/s]
episodes                                   999
episode_length                        9.998999
returns                              -2.440359
return_std                            2.858386
average_reward                       -0.244745
round_time              0 days 00:14:44.017175
episodes_test                           1006.0
episode_length_test                   9.940358
returns_test                         -2.563572
return_std_test                       3.039404
average_reward_test                  -0.257895
round_time_test         0 days 00:00:10.152883
round_time_total        0 days 00:14:44.018346
loss_total             279442811157193064448.0
loss_critic            349303508034417262592.0
loss_actor                 -74334101716.992004
memory_size                             5820.0 

=== epoch 10/10 ==== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:29<00:00,  2.30it/s]
episodes                                   998
episode_length                        10.02004
returns                              -2.457041
return_std                            2.916458
average_reward                       -0.245213
round_time              0 days 00:14:29.992984
episodes_test                            999.0
episode_length_test                  10.009009
returns_test                         -2.309514
return_std_test                       2.813987
average_reward_test                  -0.230642
round_time_test         0 days 00:00:10.221377
round_time_total        0 days 00:14:29.994175
loss_total             284695498666738024448.0
loss_critic            355869367609365037056.0
loss_actor                 -75064875454.464005
memory_size                             5820.0 

=== epoch 10/10 ==== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:22<00:00,  2.49it/s]
episodes                                   998
episode_length                        10.01002
returns                              -2.592853
return_std                            2.932378
average_reward                       -0.258936
round_time              0 days 00:13:22.636891
episodes_test                           1006.0
episode_length_test                    9.93837
returns_test                         -2.400742
return_std_test                       2.735817
average_reward_test                  -0.241379
round_time_test         0 days 00:00:10.139327
round_time_total        0 days 00:13:22.637946
loss_total             296965127541152415744.0
loss_critic            371206403135584731136.0
loss_actor                 -75235862405.119995
memory_size                             5820.0 

=== epoch 10/10 ==== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:31<00:00,  1.80it/s]
episodes                                   995
episode_length                       10.034171
returns                              -2.457562
return_std                            2.888372
average_reward                       -0.244707
round_time              0 days 00:18:31.944415
episodes_test                            994.0
episode_length_test                   10.05332
returns_test                         -2.094499
return_std_test                       2.830513
average_reward_test                  -0.207625
round_time_test         0 days 00:00:09.920501
round_time_total        0 days 00:18:31.945675
loss_total             290218846723859742720.0
loss_critic            362773552367956131840.0
loss_actor                 -75919608881.151993
memory_size                             5820.0 

=== epoch 10/10 ==== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:18<00:00,  1.49it/s]
episodes                                   996
episode_length                       10.028112
returns                              -2.514601
return_std                            2.948742
average_reward                       -0.250436
round_time              0 days 00:22:19.549792
episodes_test                           1002.0
episode_length_test                   9.973054
returns_test                         -2.334958
return_std_test                       2.686867
average_reward_test                  -0.233683
round_time_test         0 days 00:00:11.252104
round_time_total        0 days 00:22:19.551103
loss_total             297833645225158901760.0
loss_critic            372292050247189921792.0
loss_actor                 -76734700785.664001
memory_size                             5820.0 

=== epoch 10/10 ==== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [24:02<00:00,  1.39it/s]
episodes                                  1000
episode_length                           9.995
returns                              -2.432017
return_std                            2.836556
average_reward                       -0.243341
round_time              0 days 00:24:02.772612
episodes_test                            996.0
episode_length_test                  10.033133
returns_test                         -2.279259
return_std_test                       2.882831
average_reward_test                  -0.226927
round_time_test         0 days 00:00:11.364154
round_time_total        0 days 00:24:02.773910
loss_total             298877241404608610304.0
loss_critic            373596545279087542272.0
loss_actor                 -77003818147.839996
memory_size                             5820.0 

=== epoch 10/10 ==== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [23:32<00:00,  1.42it/s]
episodes                                  1000
episode_length                           9.994
returns                              -2.445449
return_std                            2.835807
average_reward                       -0.244978
round_time              0 days 00:23:33.200206
episodes_test                           1000.0
episode_length_test                      9.996
returns_test                         -2.372183
return_std_test                       2.654545
average_reward_test                  -0.237035
round_time_test         0 days 00:00:12.415235
round_time_total        0 days 00:23:33.201857
loss_total             306345598639410708480.0
loss_critic            382931992013905133568.0
loss_actor                 -77605259927.552002
memory_size                             5820.0 

=== epoch 10/10 ==== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:59<00:00,  1.59it/s]
episodes                                   998
episode_length                       10.011022
returns                              -2.342395
return_std                            2.810938
average_reward                       -0.234382
round_time              0 days 00:21:00.139294
episodes_test                            991.0
episode_length_test                  10.088799
returns_test                          -2.39342
return_std_test                       2.797859
average_reward_test                  -0.237065
round_time_test         0 days 00:00:12.249298
round_time_total        0 days 00:21:00.140493
loss_total             310617129629560340480.0
loss_critic            388271405143562321920.0
loss_actor                 -78355407196.160004
memory_size                             5820.0 

=== epoch 10/10 ==== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [19:31<00:00,  1.71it/s]
episodes                                   995
episode_length                       10.040201
returns                              -2.356268
return_std                            2.808131
average_reward                       -0.234958
round_time              0 days 00:19:32.253917
episodes_test                            996.0
episode_length_test                  10.039157
returns_test                         -2.437369
return_std_test                       2.851568
average_reward_test                  -0.242655
round_time_test         0 days 00:00:11.149934
round_time_total        0 days 00:19:32.255138
loss_total             317445095749532844032.0
loss_critic            396806362795177148416.0
loss_actor                 -78952851619.839996
memory_size                             5820.0 

=== epoch 10/10 ==== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [22:59<00:00,  1.45it/s]
episodes                                   997
episode_length                       10.022066
returns                              -2.355125
return_std                            2.810914
average_reward                       -0.234992
round_time              0 days 00:22:59.928331
episodes_test                           1005.0
episode_length_test                   9.950249
returns_test                         -2.535635
return_std_test                       2.869683
average_reward_test                  -0.254831
round_time_test         0 days 00:00:11.176018
round_time_total        0 days 00:22:59.929550
loss_total             320210708438751117312.0
loss_critic            400263378674292162560.0
loss_actor                 -79839287881.727997
memory_size                             5820.0 

=== epoch 10/10 ==== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [20:43<00:00,  1.61it/s]
episodes                                   998
episode_length                       10.006012
returns                              -2.438722
return_std                            2.810065
average_reward                       -0.243602
round_time              0 days 00:20:43.806702
episodes_test                           1004.0
episode_length_test                   9.956175
returns_test                         -2.559787
return_std_test                       2.713699
average_reward_test                  -0.256809
round_time_test         0 days 00:00:12.257514
round_time_total        0 days 00:20:43.807956
loss_total             329876278088429797376.0
loss_critic            412345340524734578688.0
loss_actor                 -80345825288.192001
memory_size                             5820.0 

=== epoch 10/10 ==== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:15<00:00,  2.34it/s]
episodes                                   995
episode_length                       10.043216
returns                              -2.404381
return_std                            2.842665
average_reward                       -0.238942
round_time              0 days 00:14:16.541591
episodes_test                           1000.0
episode_length_test                       10.0
returns_test                         -2.430825
return_std_test                        2.75884
average_reward_test                  -0.243082
round_time_test         0 days 00:00:10.186283
round_time_total        0 days 00:14:16.542642
loss_total             332114605959973830656.0
loss_critic            415143249877080997888.0
loss_actor                 -81015324876.800003
memory_size                             5820.0 

=== epoch 10/10 ==== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                   994
episode_length                       10.047284
returns                               -2.39363
return_std                            2.835182
average_reward                       -0.238039
round_time              0 days 00:12:23.607257
episodes_test                            997.0
episode_length_test                  10.026078
returns_test                          -2.37387
return_std_test                       2.771925
average_reward_test                  -0.236417
round_time_test         0 days 00:00:09.857487
round_time_total        0 days 00:12:23.608313
loss_total             339150609054357651456.0
loss_critic            423938253564740829184.0
loss_actor                 -81548202024.960007
memory_size                             5820.0 

=== epoch 10/10 ==== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                   997
episode_length                       10.026078
returns                              -2.444018
return_std                             2.86367
average_reward                       -0.243725
round_time              0 days 00:12:23.717020
episodes_test                            986.0
episode_length_test                  10.134888
returns_test                         -2.280751
return_std_test                       2.931984
average_reward_test                  -0.224806
round_time_test         0 days 00:00:09.963903
round_time_total        0 days 00:12:23.718077
loss_total             345277410660939464704.0
loss_critic            431596755434429612032.0
loss_actor                 -82135279693.824005
memory_size                             5820.0 

=== epoch 10/10 ==== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                   993
episode_length                       10.068479
returns                              -2.356882
return_std                            2.927031
average_reward                       -0.233949
round_time              0 days 00:12:23.240836
episodes_test                            989.0
episode_length_test                   10.10819
returns_test                         -2.325516
return_std_test                       3.020766
average_reward_test                  -0.229882
round_time_test         0 days 00:00:09.819488
round_time_total        0 days 00:12:23.241888
loss_total             348037267846590955520.0
loss_critic            435046577481642999808.0
loss_actor                    -82839646531.584
memory_size                            5821.07 

=== epoch 10/10 ==== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:23<00:00,  2.69it/s]
episodes                                   984
episode_length                       10.158537
returns                              -2.329289
return_std                             3.00928
average_reward                       -0.229661
round_time              0 days 00:12:23.560471
episodes_test                            997.0
episode_length_test                  10.024072
returns_test                         -2.556936
return_std_test                       2.909074
average_reward_test                  -0.255136
round_time_test         0 days 00:00:09.937911
round_time_total        0 days 00:12:23.561515
loss_total             355346573621590097920.0
loss_critic            444183209446404718592.0
loss_actor                 -83005779632.128006
memory_size                             5825.0 

=== epoch 10/10 ==== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:21<00:00,  2.70it/s]
episodes                                   978
episode_length                       10.211656
returns                              -2.322088
return_std                            2.986126
average_reward                       -0.227534
round_time              0 days 00:12:21.543439
episodes_test                            994.0
episode_length_test                  10.057344
returns_test                         -2.413747
return_std_test                       3.008317
average_reward_test                  -0.239897
round_time_test         0 days 00:00:09.905629
round_time_total        0 days 00:12:21.544496
loss_total             361968985204921991168.0
loss_critic            452461223765040889856.0
loss_actor                 -83303042490.367996
memory_size                          5826.3495 

=== epoch 10/10 ==== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.70it/s]
episodes                                   978
episode_length                       10.211656
returns                              -2.420115
return_std                             2.99746
average_reward                       -0.237007
round_time              0 days 00:12:20.020646
episodes_test                            999.0
episode_length_test                  10.003003
returns_test                         -2.639223
return_std_test                       2.904907
average_reward_test                  -0.263776
round_time_test         0 days 00:00:09.989871
round_time_total        0 days 00:12:20.021698
loss_total             366632776582410403840.0
loss_critic            458290962970408714240.0
loss_actor                 -83790644846.591995
memory_size                             5827.0 

=== epoch 10/10 ==== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:20<00:00,  2.70it/s]
episodes                                   976
episode_length                       10.232582
returns                              -2.435647
return_std                            2.997291
average_reward                       -0.238263
round_time              0 days 00:12:21.251388
episodes_test                            996.0
episode_length_test                  10.038153
returns_test                         -2.645797
return_std_test                       2.877296
average_reward_test                  -0.263404
round_time_test         0 days 00:00:10.026882
round_time_total        0 days 00:12:21.252431
loss_total             367719662192709206016.0
loss_critic            459649569983282282496.0
loss_actor                 -84331222118.399994
memory_size                             5827.0 

=== epoch 10/10 ==== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                   980
episode_length                       10.193878
returns                              -2.479934
return_std                             2.94192
average_reward                       -0.243168
round_time              0 days 00:12:18.993952
episodes_test                            991.0
episode_length_test                  10.085772
returns_test                         -2.443793
return_std_test                       3.012255
average_reward_test                   -0.24211
round_time_test         0 days 00:00:09.847797
round_time_total        0 days 00:12:18.994995
loss_total             369967137396264992768.0
loss_critic            462458913970684559360.0
loss_actor                 -84907626008.576004
memory_size                             5827.0 

=== epoch 10/10 ==== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                   987
episode_length                       10.129686
returns                              -2.445341
return_std                            2.903115
average_reward                       -0.241151
round_time              0 days 00:12:19.366237
episodes_test                            986.0
episode_length_test                  10.141988
returns_test                         -2.481017
return_std_test                       2.966703
average_reward_test                  -0.244628
round_time_test         0 days 00:00:09.808928
round_time_total        0 days 00:12:19.367335
loss_total             376356546201695158272.0
loss_critic            470445674889511239680.0
loss_actor                 -85558215835.647995
memory_size                             5827.0 

=== epoch 10/10 ==== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                   993
episode_length                       10.060423
returns                              -2.545742
return_std                            2.983331
average_reward                       -0.253208
round_time              0 days 00:12:19.181546
episodes_test                            997.0
episode_length_test                  10.022066
returns_test                         -2.542087
return_std_test                       2.964054
average_reward_test                  -0.253037
round_time_test         0 days 00:00:09.802919
round_time_total        0 days 00:12:19.182599
loss_total             382621047183276965888.0
loss_critic            478276300389711413248.0
loss_actor                 -85885894524.927994
memory_size                             5827.0 

=== epoch 10/10 ==== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:18<00:00,  2.71it/s]
episodes                                   995
episode_length                       10.041206
returns                              -2.530658
return_std                            2.978716
average_reward                       -0.252324
round_time              0 days 00:12:18.845578
episodes_test                            997.0
episode_length_test                  10.022066
returns_test                          -2.45735
return_std_test                       2.897885
average_reward_test                  -0.244931
round_time_test         0 days 00:00:09.739560
round_time_total        0 days 00:12:18.846626
loss_total             381711887239422279680.0
loss_critic            477139850910692802560.0
loss_actor                    -86323362349.056
memory_size                             5827.0 

=== epoch 10/10 ==== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [12:19<00:00,  2.71it/s]
episodes                                  1002
episode_length                        9.967066
returns                              -2.575403
return_std                            2.986865
average_reward                       -0.258461
round_time              0 days 00:12:19.705322
episodes_test                           1016.0
episode_length_test                    9.84252
returns_test                         -2.700994
return_std_test                       2.837393
average_reward_test                  -0.274421
round_time_test         0 days 00:00:09.956549
round_time_total        0 days 00:12:19.706385
loss_total             390403589386328997888.0
loss_critic            488004478318348730368.0
loss_actor                 -86724938121.216003
memory_size                             5827.0 


