/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Walker2d-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 3
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>

<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1166.31it/s]
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  111
episode_length                      18.018018
returns                               0.61169
return_std                            4.92675
average_reward                       0.033949
round_time             0 days 00:00:01.787928
episodes_test                           510.0
episode_length_test                 19.562745
returns_test                         6.259944
return_std_test                      6.759093
average_reward_test                   0.32089
round_time_test        0 days 00:00:08.798977
round_time_total       0 days 00:00:10.615618 

=== epoch 1/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 93%|█████████▎| 1869/2000 [00:01<00:00, 1119.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1148.83it/s]
episodes                                  225
episode_length                      17.773333
returns                              0.502783
return_std                           4.841892
average_reward                       0.028511
round_time             0 days 00:00:02.245007
episodes_test                           489.0
episode_length_test                 20.435583
returns_test                         7.100032
return_std_test                      8.130025
average_reward_test                  0.347757
round_time_test        0 days 00:00:09.104448
round_time_total       0 days 00:00:10.835865 

=== epoch 1/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 88%|████████▊ | 1753/2000 [00:01<00:00, 1112.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1091.76it/s]
episodes                                  345
episode_length                      17.382609
returns                              0.484996
return_std                           4.765435
average_reward                       0.028305
round_time             0 days 00:00:02.347739
episodes_test                           514.0
episode_length_test                 19.445525
returns_test                         6.598041
return_std_test                      7.527922
average_reward_test                  0.339262
round_time_test        0 days 00:00:08.922972
round_time_total       0 days 00:00:10.726668 

=== epoch 1/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 92%|█████████▏| 1845/2000 [00:01<00:00, 1116.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1132.62it/s]
episodes                                  454
episode_length                      17.594714
returns                              0.280915
return_std                           4.884158
average_reward                       0.016618
round_time             0 days 00:00:02.305336
episodes_test                           507.0
episode_length_test                 19.706114
returns_test                         6.454861
return_std_test                      7.908287
average_reward_test                  0.327612
round_time_test        0 days 00:00:08.932265
round_time_total       0 days 00:00:10.726753 

=== epoch 1/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 94%|█████████▍| 1875/2000 [00:01<00:00, 1150.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1156.05it/s]
episodes                                  563
episode_length                      17.706927
returns                              0.319927
return_std                           5.437469
average_reward                       0.019523
round_time             0 days 00:00:02.301683
episodes_test                           483.0
episode_length_test                   20.6853
returns_test                         7.330656
return_std_test                      8.400756
average_reward_test                  0.353996
round_time_test        0 days 00:00:08.954318
round_time_total       0 days 00:00:10.791324 

=== epoch 1/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 4/2000 [00:01<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
starting training
episodes                                  577
episode_length                      17.317158
returns                              -0.83634
return_std                            6.18833
average_reward                       -0.04838
round_time             0 days 00:10:22.260938
episodes_test                           502.0
episode_length_test                 19.912351
returns_test                         6.713638
return_std_test                      7.620951
average_reward_test                  0.337239
round_time_test        0 days 00:00:09.201626
round_time_total       0 days 00:10:22.262075
loss_total                   391788067.884775
loss_critic                  489741392.064682
loss_actor                      -25258.273675
memory_size                          645.1865 

=== epoch 1/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:51,  2.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:25<00:00,  3.20it/s]
episodes                                  638
episode_length                       15.65047
returns                             -3.542011
return_std                           6.835973
average_reward                      -0.226519
round_time             0 days 00:10:25.910966
episodes_test                           796.0
episode_length_test                 12.561558
returns_test                       -10.143793
return_std_test                      2.323385
average_reward_test                  -0.80733
round_time_test        0 days 00:00:09.336400
round_time_total       0 days 00:10:25.912067
loss_total               1635528931272.191895
loss_critic              2044411506491.135986
loss_actor                     -1577941.60207
memory_size                             660.0 

=== epoch 1/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                  671
episode_length                      14.874814
returns                             -5.864899
return_std                            6.66078
average_reward                      -0.394201
round_time             0 days 00:10:29.377476
episodes_test                           866.0
episode_length_test                 11.546189
returns_test                       -10.367523
return_std_test                      1.652836
average_reward_test                 -0.897762
round_time_test        0 days 00:00:09.367629
round_time_total       0 days 00:10:29.378594
loss_total              36806164044840.960938
loss_critic             46007705875644.414062
loss_actor                       -8121282.546
memory_size                             660.0 

=== epoch 1/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  701
episode_length                      14.235378
returns                             -7.587849
return_std                           5.921107
average_reward                      -0.532327
round_time             0 days 00:10:34.805823
episodes_test                           727.0
episode_length_test                 13.755158
returns_test                       -10.585559
return_std_test                      1.634088
average_reward_test                  -0.76957
round_time_test        0 days 00:00:09.494564
round_time_total       0 days 00:10:34.806992
loss_total              235288635440627.71875
loss_critic                294110789193695.25
loss_actor                      -20598404.765
memory_size                             660.0 

=== epoch 1/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                  724
episode_length                      13.791436
returns                             -9.140388
return_std                            3.76588
average_reward                       -0.66202
round_time             0 days 00:10:31.123660
episodes_test                           654.0
episode_length_test                   15.2737
returns_test                        -9.738991
return_std_test                       2.10807
average_reward_test                 -0.637697
round_time_test        0 days 00:00:09.213370
round_time_total       0 days 00:10:31.124954
loss_total                 883750805276983.25
loss_critic              1104688487031898.125
loss_actor                      -39588396.209
memory_size                             660.0 

=== epoch 1/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  742
episode_length                      13.456873
returns                            -10.147286
return_std                           1.862149
average_reward                      -0.754285
round_time             0 days 00:10:33.764200
episodes_test                           648.0
episode_length_test                 15.421296
returns_test                        -9.473026
return_std_test                      3.442395
average_reward_test                 -0.613991
round_time_test        0 days 00:00:09.215257
round_time_total       0 days 00:10:33.765313
loss_total                 2431588853253931.0
loss_critic                3039486018223866.0
loss_actor                      -65501133.128
memory_size                             660.0 

=== epoch 1/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  708
episode_length                      14.108757
returns                            -10.267434
return_std                           1.704149
average_reward                      -0.727875
round_time             0 days 00:10:34.031782
episodes_test                           714.0
episode_length_test                 13.994398
returns_test                       -10.818385
return_std_test                      1.318444
average_reward_test                 -0.772981
round_time_test        0 days 00:00:09.306564
round_time_total       0 days 00:10:34.032894
loss_total                 5651855603425870.0
loss_critic                7064819380885914.0
loss_actor                      -99581036.616
memory_size                             660.0 

=== epoch 1/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  691
episode_length                      14.460203
returns                            -10.164896
return_std                           1.686356
average_reward                      -0.702722
round_time             0 days 00:10:32.400748
episodes_test                           686.0
episode_length_test                 14.575802
returns_test                       -10.243901
return_std_test                      2.212461
average_reward_test                 -0.702647
round_time_test        0 days 00:00:09.428759
round_time_total       0 days 00:10:32.401846
loss_total                11610045169327408.0
loss_critic               14512556201411084.0
loss_actor                     -141777618.836
memory_size                             660.0 

=== epoch 1/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  683
episode_length                      14.622255
returns                             -10.08559
return_std                           1.736093
average_reward                      -0.689827
round_time             0 days 00:10:33.227966
episodes_test                           678.0
episode_length_test                 14.740413
returns_test                       -10.079139
return_std_test                      2.205546
average_reward_test                 -0.683737
round_time_test        0 days 00:00:09.300303
round_time_total       0 days 00:10:33.229077
loss_total                21860784665727600.0
loss_critic               27325980373671740.0
loss_actor                     -194006528.368
memory_size                             660.0 

=== epoch 1/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  684
episode_length                      14.602339
returns                            -10.206676
return_std                           1.685997
average_reward                      -0.698871
round_time             0 days 00:10:34.277918
episodes_test                           651.0
episode_length_test                  15.35023
returns_test                        -9.346919
return_std_test                      2.241179
average_reward_test                 -0.608572
round_time_test        0 days 00:00:09.131998
round_time_total       0 days 00:10:34.279205
loss_total                38317464940366008.0
loss_critic               47896830301163224.0
loss_actor                     -255318211.688
memory_size                             660.0 

=== epoch 1/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:36,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  675
episode_length                      14.807407
returns                            -10.090792
return_std                           1.682495
average_reward                      -0.681289
round_time             0 days 00:10:32.024916
episodes_test                           672.0
episode_length_test                    14.875
returns_test                       -10.168692
return_std_test                      1.835722
average_reward_test                 -0.683288
round_time_test        0 days 00:00:09.225280
round_time_total       0 days 00:10:32.026020
loss_total                62966202198214576.0
loss_critic               78707751427871088.0
loss_actor                     -328136106.048
memory_size                             660.0 

=== epoch 1/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  659
episode_length                      15.135053
returns                             -9.844948
return_std                           1.826991
average_reward                      -0.651417
round_time             0 days 00:10:32.716629
episodes_test                           652.0
episode_length_test                 15.329755
returns_test                        -9.728824
return_std_test                      1.776921
average_reward_test                  -0.63423
round_time_test        0 days 00:00:09.196829
round_time_total       0 days 00:10:32.717918
loss_total                99996499758905552.0
loss_critic              124995622720799504.0
loss_actor                     -412174208.464
memory_size                           664.694 

=== epoch 1/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.15it/s]
episodes                                  651
episode_length                      15.317972
returns                             -9.666894
return_std                           1.864629
average_reward                        -0.6317
round_time             0 days 00:10:34.472750
episodes_test                           648.0
episode_length_test                 15.427469
returns_test                        -9.332047
return_std_test                      1.907712
average_reward_test                 -0.604663
round_time_test        0 days 00:00:09.502025
round_time_total       0 days 00:10:34.473870
loss_total               153961044111060384.0
loss_critic              192451301722178976.0
loss_actor                     -510818838.048
memory_size                             668.0 

=== epoch 1/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  645
episode_length                      15.483721
returns                             -9.358601
return_std                            2.16787
average_reward                      -0.604315
round_time             0 days 00:10:33.434953
episodes_test                           649.0
episode_length_test                 15.402157
returns_test                        -9.451868
return_std_test                      1.891766
average_reward_test                 -0.613161
round_time_test        0 days 00:00:09.126678
round_time_total       0 days 00:10:33.436080
loss_total               232026796255013248.0
loss_critic              290033490631883520.0
loss_actor                      -624286828.88
memory_size                           673.898 

=== epoch 1/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  616
episode_length                       16.12987
returns                             -8.795559
return_std                           2.942161
average_reward                      -0.539251
round_time             0 days 00:10:35.424576
episodes_test                           596.0
episode_length_test                 16.760067
returns_test                        -7.366726
return_std_test                      4.553091
average_reward_test                 -0.439632
round_time_test        0 days 00:00:09.127720
round_time_total       0 days 00:10:35.425689
loss_total               347655713061267840.0
loss_critic              434569633352977984.0
loss_actor                     -781151481.728
memory_size                           738.032 

=== epoch 1/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  619
episode_length                      16.134087
returns                             -8.869242
return_std                           3.586293
average_reward                      -0.550029
round_time             0 days 00:10:39.711224
episodes_test                           538.0
episode_length_test                 18.553903
returns_test                        -7.549856
return_std_test                      5.083227
average_reward_test                 -0.406713
round_time_test        0 days 00:00:08.995027
round_time_total       0 days 00:10:39.712513
loss_total               444264751437845632.0
loss_critic              555330930597850752.0
loss_actor                     -930336252.288
memory_size                          791.9985 

=== epoch 1/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  626
episode_length                      15.950479
returns                             -9.128417
return_std                           3.594638
average_reward                      -0.572594
round_time             0 days 00:10:38.214654
episodes_test                           699.0
episode_length_test                 14.296137
returns_test                       -10.929302
return_std_test                      1.348123
average_reward_test                 -0.763782
round_time_test        0 days 00:00:09.165071
round_time_total       0 days 00:10:38.215915
loss_total               567356905237866624.0
loss_critic              709196119396870784.0
loss_actor                    -1070020720.032
memory_size                             792.0 

=== epoch 1/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<13:24,  2.48it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  638
episode_length                      15.653605
returns                             -9.468812
return_std                           3.706129
average_reward                      -0.605072
round_time             0 days 00:10:39.072575
episodes_test                           624.0
episode_length_test                 16.014423
returns_test                       -10.017267
return_std_test                      4.753923
average_reward_test                 -0.624936
round_time_test        0 days 00:00:09.359929
round_time_total       0 days 00:10:39.074086
loss_total               744528562939953664.0
loss_critic              930660687388426112.0
loss_actor                     -1231406979.52
memory_size                          797.2785 

=== epoch 1/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:09,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  654
episode_length                      15.275229
returns                             -9.982678
return_std                           3.498474
average_reward                       -0.65286
round_time             0 days 00:10:39.093920
episodes_test                           678.0
episode_length_test                 14.744838
returns_test                       -10.513996
return_std_test                      2.934919
average_reward_test                 -0.712671
round_time_test        0 days 00:00:09.290169
round_time_total       0 days 00:10:39.095037
loss_total               945136551901391744.0
loss_critic             1181420671176452096.0
loss_actor                    -1397123542.528
memory_size                             798.0 

=== epoch 1/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  693
episode_length                      14.427128
returns                            -10.798154
return_std                           1.828099
average_reward                      -0.748102
round_time             0 days 00:10:38.755167
episodes_test                           707.0
episode_length_test                 14.144272
returns_test                       -11.028627
return_std_test                       2.07571
average_reward_test                 -0.779724
round_time_test        0 days 00:00:09.278755
round_time_total       0 days 00:10:38.756405
loss_total              1186958625887932672.0
loss_critic             1483698254743276032.0
loss_actor                     -1584204604.16
memory_size                             798.0 

=== epoch 1/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                  700
episode_length                      14.252857
returns                            -10.870923
return_std                           1.879042
average_reward                      -0.761986
round_time             0 days 00:10:36.111024
episodes_test                           730.0
episode_length_test                 13.690411
returns_test                       -11.315849
return_std_test                      1.421327
average_reward_test                 -0.825814
round_time_test        0 days 00:00:09.401400
round_time_total       0 days 00:10:36.112294
loss_total              1501052050345914624.0
loss_critic             1876315029672166400.0
loss_actor                     -1792630865.28
memory_size                             798.0 

=== epoch 1/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  712
episode_length                      14.039326
returns                            -11.025893
return_std                           1.844223
average_reward                      -0.784859
round_time             0 days 00:10:39.356452
episodes_test                           720.0
episode_length_test                 13.888889
returns_test                       -11.260682
return_std_test                      1.263856
average_reward_test                 -0.810769
round_time_test        0 days 00:00:09.171595
round_time_total       0 days 00:10:39.357586
loss_total              1858931810038856960.0
loss_critic             2323664726350586880.0
loss_actor                    -2002199851.072
memory_size                          798.4965 

=== epoch 1/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  715
episode_length                      13.967832
returns                            -11.122015
return_std                           1.560145
average_reward                      -0.796216
round_time             0 days 00:10:39.258388
episodes_test                           725.0
episode_length_test                  13.78069
returns_test                       -11.261019
return_std_test                       1.41955
average_reward_test                 -0.816768
round_time_test        0 days 00:00:09.271602
round_time_total       0 days 00:10:39.259632
loss_total              2270529346445904896.0
loss_critic             2838161634592970240.0
loss_actor                    -2219881109.312
memory_size                          799.5865 

=== epoch 1/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  723
episode_length                      13.817427
returns                            -11.234503
return_std                           1.475539
average_reward                      -0.813357
round_time             0 days 00:10:39.043810
episodes_test                           729.0
episode_length_test                 13.703704
returns_test                        -11.29695
return_std_test                      2.098964
average_reward_test                 -0.824323
round_time_test        0 days 00:00:09.360715
round_time_total       0 days 00:10:39.044903
loss_total              2812267301847995392.0
loss_critic             3515334062988563968.0
loss_actor                    -2476906284.032
memory_size                             802.0 

=== epoch 1/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  725
episode_length                      13.791724
returns                            -11.192136
return_std                           1.480774
average_reward                      -0.811332
round_time             0 days 00:10:40.095050
episodes_test                           735.0
episode_length_test                 13.593197
returns_test                       -11.338244
return_std_test                      1.242351
average_reward_test                 -0.833986
round_time_test        0 days 00:00:09.251437
round_time_total       0 days 00:10:40.096153
loss_total              3448580204561296384.0
loss_critic             4310725185968531456.0
loss_actor                    -2746220898.944
memory_size                             802.0 

=== epoch 1/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  727
episode_length                      13.724897
returns                            -11.237946
return_std                           1.415614
average_reward                      -0.817611
round_time             0 days 00:10:40.658018
episodes_test                           725.0
episode_length_test                 13.784828
returns_test                        -11.16666
return_std_test                      3.004077
average_reward_test                 -0.809641
round_time_test        0 days 00:00:09.388463
round_time_total       0 days 00:10:40.659118
loss_total              4165611677962136064.0
loss_critic             5207014515745212416.0
loss_actor                     -3037432132.48
memory_size                             802.0 

=== epoch 1/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  727
episode_length                      13.727648
returns                            -11.213246
return_std                           1.527396
average_reward                      -0.817369
round_time             0 days 00:10:38.619696
episodes_test                           732.0
episode_length_test                 13.647541
returns_test                       -11.255774
return_std_test                      1.397478
average_reward_test                 -0.824661
round_time_test        0 days 00:00:09.278042
round_time_total       0 days 00:10:38.620807
loss_total              5045534105014168576.0
loss_critic             6306917517021581312.0
loss_actor                    -3344059472.512
memory_size                             802.0 

=== epoch 1/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  730
episode_length                      13.684932
returns                            -11.222483
return_std                           1.510986
average_reward                      -0.819576
round_time             0 days 00:10:40.764401
episodes_test                           737.0
episode_length_test                 13.553596
returns_test                        -11.30561
return_std_test                      1.376579
average_reward_test                  -0.83411
round_time_test        0 days 00:00:09.300478
round_time_total       0 days 00:10:40.765553
loss_total              5995586654457969664.0
loss_critic             7494483182351496192.0
loss_actor                     -3651593016.96
memory_size                             802.0 

=== epoch 1/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  731
episode_length                      13.671683
returns                            -11.252119
return_std                           1.518551
average_reward                      -0.822676
round_time             0 days 00:10:40.686041
episodes_test                           729.0
episode_length_test                 13.705075
returns_test                       -11.362464
return_std_test                       1.27476
average_reward_test                 -0.828556
round_time_test        0 days 00:00:09.223346
round_time_total       0 days 00:10:40.687254
loss_total              7227070185991103488.0
loss_critic             9033837592438586368.0
loss_actor                    -4016390033.536
memory_size                             802.0 

=== epoch 1/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  729
episode_length                      13.689986
returns                            -11.290219
return_std                           1.483309
average_reward                      -0.823894
round_time             0 days 00:10:39.221312
episodes_test                           722.0
episode_length_test                 13.847645
returns_test                        -11.27121
return_std_test                      1.698085
average_reward_test                  -0.81364
round_time_test        0 days 00:00:09.167181
round_time_total       0 days 00:10:39.222812
loss_total              8594790379770275840.0
loss_critic            10743487793224708096.0
loss_actor                    -4371339945.344
memory_size                             802.0 

=== epoch 1/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  727
episode_length                      13.735901
returns                            -11.277308
return_std                           1.341604
average_reward                      -0.819664
round_time             0 days 00:10:40.485768
episodes_test                           720.0
episode_length_test                 13.873611
returns_test                       -11.216292
return_std_test                      1.326211
average_reward_test                  -0.80807
round_time_test        0 days 00:00:09.349567
round_time_total       0 days 00:10:40.487303
loss_total             10268971905546948608.0
loss_critic            12836214640934479872.0
loss_actor                    -4792617860.224
memory_size                             802.0 

=== epoch 1/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  726
episode_length                      13.758953
returns                            -11.248277
return_std                           1.426657
average_reward                      -0.817131
round_time             0 days 00:10:40.332594
episodes_test                           729.0
episode_length_test                 13.707819
returns_test                       -11.410199
return_std_test                      1.221705
average_reward_test                 -0.831857
round_time_test        0 days 00:00:09.520518
round_time_total       0 days 00:10:40.333823
loss_total             12313987023143933952.0
loss_critic            15392483513329139712.0
loss_actor                    -5229215307.776
memory_size                             802.0 

=== epoch 1/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  725
episode_length                      13.772414
returns                            -11.256509
return_std                           1.475369
average_reward                      -0.817446
round_time             0 days 00:10:40.475676
episodes_test                           728.0
episode_length_test                 13.729396
returns_test                       -11.397482
return_std_test                      1.293696
average_reward_test                 -0.829445
round_time_test        0 days 00:00:09.192528
round_time_total       0 days 00:10:40.477160
loss_total             14525464858945734656.0
loss_critic            18156830785610121216.0
loss_actor                    -5684973595.392
memory_size                             802.0 

=== epoch 1/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:38,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  721
episode_length                      13.851595
returns                             -11.23491
return_std                           1.452442
average_reward                      -0.810885
round_time             0 days 00:10:40.097547
episodes_test                           718.0
episode_length_test                 13.915042
returns_test                       -11.173442
return_std_test                      1.363977
average_reward_test                 -0.802706
round_time_test        0 days 00:00:09.333865
round_time_total       0 days 00:10:40.099049
loss_total             16892264943436423168.0
loss_critic            21115330847517896704.0
loss_actor                     -6148759842.56
memory_size                             802.0 

=== epoch 1/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:27,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  722
episode_length                      13.825485
returns                            -11.270988
return_std                           1.430495
average_reward                      -0.815714
round_time             0 days 00:10:39.010262
episodes_test                           717.0
episode_length_test                  13.92887
returns_test                        -11.26301
return_std_test                        1.4966
average_reward_test                 -0.808359
round_time_test        0 days 00:00:09.464383
round_time_total       0 days 00:10:39.011369
loss_total             20036760790333644800.0
loss_critic            25045950546050822144.0
loss_actor                     -6673996399.36
memory_size                             802.0 

=== epoch 1/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  718
episode_length                      13.905292
returns                            -11.290476
return_std                           1.433619
average_reward                      -0.812227
round_time             0 days 00:10:39.471351
episodes_test                           713.0
episode_length_test                 14.012623
returns_test                       -11.280352
return_std_test                      1.482478
average_reward_test                 -0.804744
round_time_test        0 days 00:00:09.307726
round_time_total       0 days 00:10:39.472452
loss_total             23047320290199330816.0
loss_critic            28809149867419176960.0
loss_actor                    -7190355879.424
memory_size                             802.0 

=== epoch 1/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  710
episode_length                      14.057746
returns                            -11.207768
return_std                            2.57148
average_reward                      -0.797722
round_time             0 days 00:10:37.915745
episodes_test                           712.0
episode_length_test                  14.04073
returns_test                       -11.244929
return_std_test                      1.625073
average_reward_test                 -0.800475
round_time_test        0 days 00:00:09.263379
round_time_total       0 days 00:10:37.916886
loss_total             26151926091635843072.0
loss_critic            32689907073722519552.0
loss_actor                    -7759276281.088
memory_size                           831.909 

=== epoch 1/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:54,  2.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  699
episode_length                      14.276109
returns                            -11.040427
return_std                           3.275173
average_reward                      -0.773352
round_time             0 days 00:10:39.300489
episodes_test                           691.0
episode_length_test                 14.470333
returns_test                       -10.936922
return_std_test                      1.624781
average_reward_test                 -0.755646
round_time_test        0 days 00:00:09.420905
round_time_total       0 days 00:10:39.301592
loss_total             28985698087751843840.0
loss_critic            36232122034095468544.0
loss_actor                    -8372128554.752
memory_size                          879.8065 

=== epoch 1/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  692
episode_length                      14.426301
returns                            -10.860128
return_std                           3.511582
average_reward                      -0.753413
round_time             0 days 00:10:39.597185
episodes_test                           680.0
episode_length_test                 14.702941
returns_test                       -10.607923
return_std_test                      2.561045
average_reward_test                 -0.721215
round_time_test        0 days 00:00:09.280281
round_time_total       0 days 00:10:39.598280
loss_total             32661005268193845248.0
loss_critic            40826255938179710976.0
loss_actor                    -8914164212.736
memory_size                           910.626 

=== epoch 1/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:28,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  683
episode_length                      14.620791
returns                            -10.646037
return_std                           3.772608
average_reward                      -0.727959
round_time             0 days 00:10:39.721733
episodes_test                           691.0
episode_length_test                 14.455861
returns_test                       -10.959269
return_std_test                      1.543119
average_reward_test                 -0.757529
round_time_test        0 days 00:00:09.179868
round_time_total       0 days 00:10:39.722906
loss_total             37031514198318891008.0
loss_critic            46289391934809767936.0
loss_actor                    -9581351407.104
memory_size                          928.7115 

=== epoch 1/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  677
episode_length                      14.751846
returns                            -10.429174
return_std                           3.880193
average_reward                      -0.707404
round_time             0 days 00:10:39.962456
episodes_test                           674.0
episode_length_test                 14.816024
returns_test                       -10.454908
return_std_test                      3.060591
average_reward_test                 -0.705659
round_time_test        0 days 00:00:09.121683
round_time_total       0 days 00:10:39.963567
loss_total             41120057380320182272.0
loss_critic            51400070722645622784.0
loss_actor                   -10203511624.448
memory_size                             937.0 

=== epoch 1/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  675
episode_length                      14.794074
returns                            -10.398001
return_std                            3.18603
average_reward                      -0.703097
round_time             0 days 00:10:38.919935
episodes_test                           673.0
episode_length_test                 14.858841
returns_test                        -10.50789
return_std_test                       1.91494
average_reward_test                 -0.707181
round_time_test        0 days 00:00:09.209313
round_time_total       0 days 00:10:38.921024
loss_total             45529849799321362432.0
loss_critic            56912311158161293312.0
loss_actor                -10807564253.695999
memory_size                             937.0 

=== epoch 1/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  675
episode_length                      14.807407
returns                            -10.336663
return_std                           2.522861
average_reward                      -0.697888
round_time             0 days 00:10:40.790478
episodes_test                           659.0
episode_length_test                  15.16692
returns_test                        -9.835342
return_std_test                      4.943203
average_reward_test                 -0.647905
round_time_test        0 days 00:00:09.340297
round_time_total       0 days 00:10:40.791613
loss_total             51723693286280396800.0
loss_critic            64654615479751565312.0
loss_actor                   -11466737743.872
memory_size                             937.0 

=== epoch 1/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  670
episode_length                       14.91791
returns                            -10.193386
return_std                           2.273019
average_reward                      -0.683438
round_time             0 days 00:10:40.836274
episodes_test                           660.0
episode_length_test                 15.137879
returns_test                        -9.694482
return_std_test                      3.312414
average_reward_test                 -0.640435
round_time_test        0 days 00:00:09.228621
round_time_total       0 days 00:10:40.837381
loss_total             58778822845025230848.0
loss_critic            73473527392448479232.0
loss_actor                    -12230801108.48
memory_size                             937.0 

=== epoch 1/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  662
episode_length                      15.093656
returns                            -10.031651
return_std                           2.193462
average_reward                      -0.664787
round_time             0 days 00:10:39.792612
episodes_test                           653.0
episode_length_test                 15.301685
returns_test                        -9.883174
return_std_test                       1.69708
average_reward_test                 -0.645316
round_time_test        0 days 00:00:09.049409
round_time_total       0 days 00:10:39.793698
loss_total             66126099815384489984.0
loss_critic            82657623501768589312.0
loss_actor                -12962532679.167999
memory_size                          939.7285 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  125
episode_length                         15.912
returns                             -9.448258
return_std                            1.68711
average_reward                      -0.593099
round_time             0 days 00:10:34.693250
episodes_test                           620.0
episode_length_test                 16.122581
returns_test                        -9.357826
return_std_test                        3.3883
average_reward_test                 -0.580094
round_time_test        0 days 00:00:09.340099
round_time_total       0 days 00:10:34.694435
loss_total             73504100920744787968.0
loss_critic            91880124591273738240.0
loss_actor                   -13693456034.816
memory_size                             954.0 

=== epoch 2/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   253
episode_length                       15.810277
returns                               -9.47315
return_std                            2.664063
average_reward                       -0.599177
round_time              0 days 00:10:33.414672
episodes_test                            630.0
episode_length_test                  15.857143
returns_test                         -9.280076
return_std_test                       2.522252
average_reward_test                  -0.584984
round_time_test         0 days 00:00:09.116514
round_time_total        0 days 00:10:33.415860
loss_total              82568418953526738944.0
loss_critic            103210521801847947264.0
loss_actor                    -14509915780.608
memory_size                            956.838 

=== epoch 2/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:16,  2.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                   379
episode_length                       15.815303
returns                              -9.554568
return_std                            2.783424
average_reward                       -0.603049
round_time              0 days 00:10:34.832194
episodes_test                            621.0
episode_length_test                  16.080515
returns_test                         -9.674749
return_std_test                        3.32412
average_reward_test                  -0.601718
round_time_test         0 days 00:00:09.077522
round_time_total        0 days 00:10:34.833298
loss_total              92670569250927722496.0
loss_critic            115838209448199274496.0
loss_actor                 -15435259339.775999
memory_size                            975.601 

=== epoch 2/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   508
episode_length                       15.746063
returns                              -9.675643
return_std                            2.517893
average_reward                       -0.614292
round_time              0 days 00:10:36.754571
episodes_test                            617.0
episode_length_test                  16.205835
returns_test                         -9.471784
return_std_test                       3.718711
average_reward_test                  -0.584323
round_time_test         0 days 00:00:09.412532
round_time_total        0 days 00:10:36.755687
loss_total             102385802097007624192.0
loss_critic            127982250453022605312.0
loss_actor                 -16294070433.280001
memory_size                              982.0 

=== epoch 2/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   632
episode_length                       15.814873
returns                              -9.769385
return_std                            2.374272
average_reward                       -0.617066
round_time              0 days 00:10:36.185814
episodes_test                            651.0
episode_length_test                  15.348694
returns_test                        -10.093743
return_std_test                       1.909225
average_reward_test                  -0.657256
round_time_test         0 days 00:00:09.034145
round_time_total        0 days 00:10:36.186925
loss_total             110838018892789743616.0
loss_critic            138547521477986828288.0
loss_actor                 -17238146743.296001
memory_size                           1005.114 

=== epoch 2/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   635
episode_length                       15.735433
returns                              -9.810011
return_std                            2.499205
average_reward                       -0.623099
round_time              0 days 00:10:36.505008
episodes_test                            649.0
episode_length_test                  15.402157
returns_test                         -9.685733
return_std_test                       2.374576
average_reward_test                   -0.62831
round_time_test         0 days 00:00:09.251286
round_time_total        0 days 00:10:36.506111
loss_total             122715326038111633408.0
loss_critic            153394155123766165504.0
loss_actor                 -18329272918.015999
memory_size                          1048.8245 

=== epoch 2/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   639
episode_length                       15.649452
returns                              -9.883683
return_std                            2.191055
average_reward                       -0.631567
round_time              0 days 00:10:33.190804
episodes_test                            634.0
episode_length_test                  15.763407
returns_test                         -9.682262
return_std_test                       2.487465
average_reward_test                  -0.613522
round_time_test         0 days 00:00:09.368931
round_time_total        0 days 00:10:33.192027
loss_total             137522826488358748160.0
loss_critic            171903530335281086464.0
loss_actor                 -19366833984.512001
memory_size                             1052.0 

=== epoch 2/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   635
episode_length                       15.710236
returns                              -9.784121
return_std                             2.10979
average_reward                       -0.623103
round_time              0 days 00:10:36.375741
episodes_test                            640.0
episode_length_test                  15.621875
returns_test                         -9.653811
return_std_test                       2.347713
average_reward_test                  -0.617752
round_time_test         0 days 00:00:09.264865
round_time_total        0 days 00:10:36.376879
loss_total             155542324770264875008.0
loss_critic            194427902578534318080.0
loss_actor                 -20490944288.768002
memory_size                            1052.63 

=== epoch 2/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   633
episode_length                       15.770932
returns                              -9.678661
return_std                            2.262735
average_reward                       -0.613364
round_time              0 days 00:10:35.525311
episodes_test                            629.0
episode_length_test                  15.896661
returns_test                         -9.485179
return_std_test                       2.961243
average_reward_test                  -0.596542
round_time_test         0 days 00:00:09.382559
round_time_total        0 days 00:10:35.526624
loss_total             172906010818369847296.0
loss_critic            216132509641686286336.0
loss_actor                    -21616376016.896
memory_size                          1055.9075 

=== epoch 2/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   622
episode_length                       16.054662
returns                              -9.370144
return_std                            2.964017
average_reward                       -0.584055
round_time              0 days 00:10:35.840331
episodes_test                            616.0
episode_length_test                  16.220779
returns_test                         -9.494911
return_std_test                       2.716481
average_reward_test                  -0.584843
round_time_test         0 days 00:00:09.058498
round_time_total        0 days 00:10:35.841442
loss_total             192610691851766759424.0
loss_critic            240763360632166219776.0
loss_actor                 -22800171071.487999
memory_size                           1062.371 

=== epoch 2/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                   609
episode_length                       16.385878
returns                              -9.182064
return_std                             3.12496
average_reward                        -0.56006
round_time              0 days 00:10:38.846408
episodes_test                            521.0
episode_length_test                  19.166987
returns_test                         -6.789559
return_std_test                       9.095552
average_reward_test                  -0.354871
round_time_test         0 days 00:00:09.087924
round_time_total        0 days 00:10:38.847620
loss_total             215054508353620738048.0
loss_critic            268818131091258408960.0
loss_actor                 -24271771512.832001
memory_size                          1128.7435 

=== epoch 2/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   591
episode_length                       16.905245
returns                              -8.882823
return_std                            3.633065
average_reward                       -0.524948
round_time              0 days 00:10:37.231311
episodes_test                            557.0
episode_length_test                  17.926391
returns_test                         -8.261514
return_std_test                        5.20432
average_reward_test                  -0.460429
round_time_test         0 days 00:00:09.106497
round_time_total        0 days 00:10:37.232518
loss_total             229639665423960539136.0
loss_critic            287049577365411463168.0
loss_actor                 -25622275113.984001
memory_size                           1170.926 

=== epoch 2/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                   568
episode_length                       17.580986
returns                              -8.436964
return_std                            4.351643
average_reward                       -0.479833
round_time              0 days 00:10:37.938168
episodes_test                            485.0
episode_length_test                  20.610309
returns_test                         -6.473003
return_std_test                       8.646947
average_reward_test                  -0.313609
round_time_test         0 days 00:00:08.974885
round_time_total        0 days 00:10:37.939279
loss_total             251318140363468832768.0
loss_critic            314147670440562982912.0
loss_actor                 -27198435240.959999
memory_size                           1252.232 

=== epoch 2/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                   530
episode_length                       18.724528
returns                              -7.633839
return_std                            8.237486
average_reward                       -0.402166
round_time              0 days 00:10:39.526745
episodes_test                            485.0
episode_length_test                  20.587629
returns_test                         -6.619986
return_std_test                       7.100766
average_reward_test                  -0.321552
round_time_test         0 days 00:00:09.088967
round_time_total        0 days 00:10:39.527857
loss_total             263270435195763556352.0
loss_critic            329088038305831256064.0
loss_actor                     -28454944988.16
memory_size                           1370.799 

=== epoch 2/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                   446
episode_length                       22.313901
returns                              -4.907142
return_std                           22.322398
average_reward                       -0.217142
round_time              0 days 00:10:37.105793
episodes_test                            230.0
episode_length_test                  43.478261
returns_test                           4.03917
return_std_test                      47.903198
average_reward_test                   0.092901
round_time_test         0 days 00:00:08.771808
round_time_total        0 days 00:10:37.107098
loss_total             194579609841623400448.0
loss_critic            243224508093098721280.0
loss_actor                 -30376652105.728001
memory_size                            2251.28 

=== epoch 2/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                   356
episode_length                       27.851124
returns                               0.559112
return_std                           31.369613
average_reward                         0.02652
round_time              0 days 00:10:34.736901
episodes_test                            110.0
episode_length_test                  90.054545
returns_test                         58.546388
return_std_test                      65.566457
average_reward_test                   0.656638
round_time_test         0 days 00:00:08.672241
round_time_total        0 days 00:10:34.738016
loss_total             139744579532176375808.0
loss_critic            174680721458083954688.0
loss_actor                 -30587754200.063999
memory_size                           3654.048 

=== epoch 2/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   267
episode_length                       37.340824
returns                              12.363287
return_std                            48.70873
average_reward                        0.329028
round_time              0 days 00:10:32.890966
episodes_test                            108.0
episode_length_test                  92.277778
returns_test                         93.183415
return_std_test                      99.003527
average_reward_test                   1.009941
round_time_test         0 days 00:00:08.763051
round_time_total        0 days 00:10:32.892096
loss_total             112991318691218866176.0
loss_critic            141239145912112660480.0
loss_actor                 -29301840357.375999
memory_size                           5037.249 

=== epoch 2/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   192
episode_length                       52.036458
returns                              32.582359
return_std                           62.896386
average_reward                        0.625499
round_time              0 days 00:10:35.715897
episodes_test                            112.0
episode_length_test                  88.696429
returns_test                         95.769422
return_std_test                      74.587217
average_reward_test                   1.080399
round_time_test         0 days 00:00:08.623507
round_time_total        0 days 00:10:35.717148
loss_total              94163124736246988800.0
loss_critic            117703903797701541888.0
loss_actor                 -27315561245.695999
memory_size                           6437.411 

=== epoch 2/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  124
episode_length                      77.258065
returns                             69.078927
return_std                          66.355228
average_reward                       0.866123
round_time             0 days 00:10:34.763173
episodes_test                           115.0
episode_length_test                 86.747826
returns_test                        97.284043
return_std_test                     54.897238
average_reward_test                  1.118876
round_time_test        0 days 00:00:08.780143
round_time_total       0 days 00:10:34.764279
loss_total             79959078466579628032.0
loss_critic            99948846427360018432.0
loss_actor                   -25136249332.736
memory_size                         7834.9375 

=== epoch 2/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                  115
episode_length                      85.730435
returns                             87.540419
return_std                          64.738717
average_reward                       1.022825
round_time             0 days 00:10:35.983639
episodes_test                           114.0
episode_length_test                 87.482456
returns_test                       103.863676
return_std_test                     62.320014
average_reward_test                  1.185779
round_time_test        0 days 00:00:08.822875
round_time_total       0 days 00:10:35.984867
loss_total             67122094376138031104.0
loss_critic            83902616490504765440.0
loss_actor                   -22794025050.112
memory_size                         9323.4255 

=== epoch 2/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  113
episode_length                      87.787611
returns                              96.16104
return_std                          63.244122
average_reward                       1.094984
round_time             0 days 00:10:38.860986
episodes_test                           115.0
episode_length_test                 86.495652
returns_test                        106.70199
return_std_test                     53.843968
average_reward_test                  1.233536
round_time_test        0 days 00:00:08.681312
round_time_total       0 days 00:10:38.862250
loss_total             57341547687235067904.0
loss_critic            71676933401505185792.0
loss_actor                -20669151272.959999
memory_size                        10801.1525 

=== epoch 2/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  112
episode_length                      87.830357
returns                             94.823034
return_std                          59.938019
average_reward                       1.082661
round_time             0 days 00:10:38.659547
episodes_test                           108.0
episode_length_test                 92.296296
returns_test                        91.175154
return_std_test                     75.655123
average_reward_test                  0.987812
round_time_test        0 days 00:00:08.759744
round_time_total       0 days 00:10:38.660659
loss_total             50191760667779162112.0
loss_critic            62739699750605488128.0
loss_actor                   -18623579403.264
memory_size                         12269.541 

=== epoch 2/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  118
episode_length                       83.59322
returns                               83.7099
return_std                          66.240612
average_reward                       1.005825
round_time             0 days 00:10:41.870809
episodes_test                           127.0
episode_length_test                 78.086614
returns_test                        69.602969
return_std_test                     73.970703
average_reward_test                  0.889031
round_time_test        0 days 00:00:08.739327
round_time_total       0 days 00:10:41.872071
loss_total             45484775879792107520.0
loss_critic            56855968848634789888.0
loss_actor                -16921049747.455999
memory_size                        13665.3515 

=== epoch 2/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  136
episode_length                      73.463235
returns                             65.921062
return_std                          74.076885
average_reward                       0.895716
round_time             0 days 00:10:42.979253
episodes_test                           135.0
episode_length_test                 73.807407
returns_test                        59.967639
return_std_test                     71.333634
average_reward_test                  0.813085
round_time_test        0 days 00:00:08.771086
round_time_total       0 days 00:10:42.980379
loss_total             41877910686839046144.0
loss_critic            52347387443480248320.0
loss_actor                   -15606038094.336
memory_size                        14941.6615 

=== epoch 2/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  193
episode_length                       51.61658
returns                             34.225398
return_std                          67.365551
average_reward                       0.667732
round_time             0 days 00:10:40.581286
episodes_test                           330.0
episode_length_test                 30.287879
returns_test                          3.34763
return_std_test                     48.748239
average_reward_test                  0.110848
round_time_test        0 days 00:00:08.958987
round_time_total       0 days 00:10:40.582446
loss_total             37753353191135707136.0
loss_critic            47191690691911147520.0
loss_actor                -14444142847.488001
memory_size                         15850.184 

=== epoch 2/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  272
episode_length                      36.191176
returns                             14.182389
return_std                          57.436262
average_reward                       0.388049
round_time             0 days 00:10:40.961825
episodes_test                           303.0
episode_length_test                 32.950495
returns_test                        -0.236696
return_std_test                     41.383862
average_reward_test                 -0.007353
round_time_test        0 days 00:00:09.090977
round_time_total       0 days 00:10:40.962916
loss_total             34985077180190240768.0
loss_critic            43731345689499303936.0
loss_actor                -13585991544.832001
memory_size                        16371.0065 

=== epoch 2/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  368
episode_length                      26.945652
returns                              2.021212
return_std                          43.601722
average_reward                       0.068102
round_time             0 days 00:10:41.004068
episodes_test                           559.0
episode_length_test                 17.872987
returns_test                        -9.443809
return_std_test                     11.584767
average_reward_test                 -0.528459
round_time_test        0 days 00:00:09.230228
round_time_total       0 days 00:10:41.005398
loss_total             30637380401922211840.0
loss_critic            38296724805106237440.0
loss_actor                   -12040479501.312
memory_size                        16570.6875 

=== epoch 2/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  464
episode_length                      21.534483
returns                             -4.946796
return_std                          31.972346
average_reward                      -0.229534
round_time             0 days 00:10:41.703743
episodes_test                           647.0
episode_length_test                 15.437403
returns_test                       -11.516562
return_std_test                      3.218532
average_reward_test                 -0.745744
round_time_test        0 days 00:00:09.102187
round_time_total       0 days 00:10:41.704847
loss_total             26957064946290569216.0
loss_critic            33696330578131816448.0
loss_actor                -11171227887.615999
memory_size                           16573.0 

=== epoch 2/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:45,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  549
episode_length                       18.17122
returns                             -8.365811
return_std                          22.993771
average_reward                      -0.460675
round_time             0 days 00:10:41.472287
episodes_test                           620.0
episode_length_test                 16.104839
returns_test                       -11.358918
return_std_test                      4.193675
average_reward_test                 -0.705662
round_time_test        0 days 00:00:09.364819
round_time_total       0 days 00:10:41.473439
loss_total             28002696929793245184.0
loss_critic            35003370547889434624.0
loss_actor                    -10544102471.68
memory_size                        16593.1135 

=== epoch 2/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  601
episode_length                      16.620632
returns                            -10.444707
return_std                          12.281452
average_reward                      -0.628499
round_time             0 days 00:10:41.476223
episodes_test                           636.0
episode_length_test                 15.721698
returns_test                       -11.790027
return_std_test                       2.58516
average_reward_test                 -0.749767
round_time_test        0 days 00:00:09.230549
round_time_total       0 days 00:10:41.477335
loss_total             25495683633503289344.0
loss_critic            31869604006485667840.0
loss_actor                   -10413475347.968
memory_size                           16619.0 

=== epoch 2/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  628
episode_length                      15.914013
returns                            -11.408273
return_std                           2.608051
average_reward                      -0.715977
round_time             0 days 00:10:43.128270
episodes_test                           642.0
episode_length_test                 15.557632
returns_test                        -11.84977
return_std_test                      1.894106
average_reward_test                 -0.761633
round_time_test        0 days 00:00:09.257754
round_time_total       0 days 00:10:43.129608
loss_total             25775726223914758144.0
loss_critic            32219657256251035648.0
loss_actor                   -10565425021.952
memory_size                           16619.0 

=== epoch 2/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  636
episode_length                      15.696541
returns                             -11.58677
return_std                           2.272376
average_reward                      -0.737938
round_time             0 days 00:10:41.750704
episodes_test                           630.0
episode_length_test                 15.855556
returns_test                       -11.646127
return_std_test                      3.176472
average_reward_test                 -0.734301
round_time_test        0 days 00:00:09.193850
round_time_total       0 days 00:10:41.751820
loss_total             25854125213882490880.0
loss_critic            32317655984364851200.0
loss_actor                -10838779510.271999
memory_size                           16619.0 

=== epoch 2/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                  633
episode_length                      15.775671
returns                            -11.666628
return_std                           2.226355
average_reward                      -0.739288
round_time             0 days 00:10:43.801922
episodes_test                           630.0
episode_length_test                 15.850794
returns_test                       -11.670015
return_std_test                      4.008323
average_reward_test                 -0.736455
round_time_test        0 days 00:00:09.289690
round_time_total       0 days 00:10:43.803017
loss_total             25300306294961774592.0
loss_critic            31625382342998224896.0
loss_actor                -11101998057.983999
memory_size                         16627.182 

=== epoch 2/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  638
episode_length                      15.655172
returns                            -11.798815
return_std                           1.703923
average_reward                      -0.753095
round_time             0 days 00:10:42.542784
episodes_test                           625.0
episode_length_test                   15.9872
returns_test                       -11.373727
return_std_test                      6.796044
average_reward_test                 -0.711527
round_time_test        0 days 00:00:09.193338
round_time_total       0 days 00:10:42.543915
loss_total             25873446161509502976.0
loss_critic            32341807133851684864.0
loss_actor                   -11715104561.664
memory_size                           16638.0 

=== epoch 2/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  638
episode_length                      15.639498
returns                            -11.870085
return_std                           1.632299
average_reward                      -0.758893
round_time             0 days 00:10:42.205848
episodes_test                           613.0
episode_length_test                 16.298532
returns_test                       -11.754645
return_std_test                      2.142289
average_reward_test                 -0.720277
round_time_test        0 days 00:00:09.251853
round_time_total       0 days 00:10:42.207022
loss_total             26153691868827144192.0
loss_critic            32692114250681425920.0
loss_actor                -12750085440.511999
memory_size                           16638.0 

=== epoch 2/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  639
episode_length                      15.624413
returns                            -11.938771
return_std                           1.677809
average_reward                      -0.764566
round_time             0 days 00:10:41.054627
episodes_test                           658.0
episode_length_test                 15.183891
returns_test                       -12.271654
return_std_test                      1.371428
average_reward_test                 -0.807512
round_time_test        0 days 00:00:09.397669
round_time_total       0 days 00:10:41.055741
loss_total             26900909455059623936.0
loss_critic            33626136238488551424.0
loss_actor                    -14015749532.16
memory_size                        16645.6455 

=== epoch 2/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  643
episode_length                       15.51944
returns                             -12.31222
return_std                            1.66512
average_reward                      -0.793745
round_time             0 days 00:10:41.256237
episodes_test                           660.0
episode_length_test                 15.136364
returns_test                       -12.642702
return_std_test                      1.622022
average_reward_test                 -0.835225
round_time_test        0 days 00:00:09.269897
round_time_total       0 days 00:10:41.257440
loss_total             28259772183276552192.0
loss_critic            35324714627113074688.0
loss_actor                    -15401204874.24
memory_size                           16647.0 

=== epoch 2/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  646
episode_length                      15.456656
returns                            -12.831794
return_std                           2.875938
average_reward                      -0.830194
round_time             0 days 00:10:41.224306
episodes_test                           656.0
episode_length_test                 15.222561
returns_test                       -14.071963
return_std_test                       1.80627
average_reward_test                 -0.924372
round_time_test        0 days 00:00:09.245654
round_time_total       0 days 00:10:41.225626
loss_total             28152347937363357696.0
loss_critic            35190434329204871168.0
loss_actor                -16735486601.728001
memory_size                        16675.4935 

=== epoch 2/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  646
episode_length                       15.44582
returns                            -13.590607
return_std                           3.127424
average_reward                      -0.879889
round_time             0 days 00:10:41.306859
episodes_test                           677.0
episode_length_test                 14.748892
returns_test                       -15.848537
return_std_test                      1.807264
average_reward_test                 -1.074549
round_time_test        0 days 00:00:09.444365
round_time_total       0 days 00:10:41.307973
loss_total             28049632022341652480.0
loss_critic            35062039442162245632.0
loss_actor                -17871840766.976002
memory_size                           16696.0 

=== epoch 2/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  644
episode_length                      15.498447
returns                            -14.413439
return_std                           3.178429
average_reward                      -0.929887
round_time             0 days 00:10:42.630770
episodes_test                           652.0
episode_length_test                 15.328221
returns_test                       -16.185513
return_std_test                      1.638344
average_reward_test                 -1.054803
round_time_test        0 days 00:00:09.432500
round_time_total       0 days 00:10:42.632039
loss_total             30477979309516296192.0
loss_critic            38097473466811752448.0
loss_actor                -18661090374.655998
memory_size                        16719.1055 

=== epoch 2/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  642
episode_length                      15.563863
returns                            -15.266086
return_std                           3.035423
average_reward                       -0.98162
round_time             0 days 00:10:41.939536
episodes_test                           632.0
episode_length_test                 15.803797
returns_test                       -16.021786
return_std_test                      1.674925
average_reward_test                 -1.013283
round_time_test        0 days 00:00:09.650178
round_time_total       0 days 00:10:41.940745
loss_total             30904503769190223872.0
loss_critic            38630629082292248576.0
loss_actor                -19502025953.279999
memory_size                           16722.0 

=== epoch 2/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.12it/s]
episodes                                  632
episode_length                      15.806962
returns                            -15.896533
return_std                           2.883976
average_reward                       -1.00564
round_time             0 days 00:10:42.581250
episodes_test                           622.0
episode_length_test                 16.064309
returns_test                       -16.374902
return_std_test                      1.545912
average_reward_test                 -1.018731
round_time_test        0 days 00:00:09.553939
round_time_total       0 days 00:10:42.582390
loss_total             31976250291297357824.0
loss_critic            39970312183936319488.0
loss_actor                -20383742678.015999
memory_size                        16725.5285 

=== epoch 2/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:43<00:00,  3.11it/s]
episodes                                  629
episode_length                      15.866455
returns                            -16.273157
return_std                           1.592352
average_reward                      -1.025822
round_time             0 days 00:10:44.105018
episodes_test                           627.0
episode_length_test                 15.931419
returns_test                        -16.13405
return_std_test                       1.74363
average_reward_test                 -1.012299
round_time_test        0 days 00:00:09.363549
round_time_total       0 days 00:10:44.106130
loss_total             33397508751330398208.0
loss_critic            41746885205582577664.0
loss_actor                -21038001326.080002
memory_size                           16732.0 

=== epoch 2/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:03,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  623
episode_length                      16.040128
returns                            -16.292835
return_std                           1.589086
average_reward                      -1.016131
round_time             0 days 00:10:42.177702
episodes_test                           626.0
episode_length_test                 15.966454
returns_test                        -16.20231
return_std_test                      1.692994
average_reward_test                 -1.013879
round_time_test        0 days 00:00:09.379331
round_time_total       0 days 00:10:42.178824
loss_total             34065907526018117632.0
loss_critic            42582383685693267968.0
loss_actor                   -21678634377.216
memory_size                        16732.4565 

=== epoch 2/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  608
episode_length                      16.414474
returns                            -16.136564
return_std                           1.724452
average_reward                      -0.983076
round_time             0 days 00:10:43.324168
episodes_test                           597.0
episode_length_test                 16.748744
returns_test                       -15.936486
return_std_test                      1.696277
average_reward_test                 -0.951319
round_time_test        0 days 00:00:09.362461
round_time_total       0 days 00:10:43.325414
loss_total             34562179867539345408.0
loss_critic            43202724125720215552.0
loss_actor                   -22247365351.424
memory_size                         16735.723 

=== epoch 2/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:45,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:44<00:00,  3.10it/s]
episodes                                  598
episode_length                      16.722408
returns                             -16.02034
return_std                           1.779522
average_reward                      -0.958016
round_time             0 days 00:10:45.337492
episodes_test                           559.0
episode_length_test                 17.858676
returns_test                       -15.593443
return_std_test                       2.10132
average_reward_test                 -0.872799
round_time_test        0 days 00:00:09.418474
round_time_total       0 days 00:10:45.338597
loss_total             35807427313502760960.0
loss_critic            44759283437984849920.0
loss_actor                -22654944499.712002
memory_size                        16738.7275 

=== epoch 2/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  579
episode_length                      17.243523
returns                              -15.5702
return_std                            2.14561
average_reward                      -0.902901
round_time             0 days 00:10:43.571443
episodes_test                           530.0
episode_length_test                 18.849057
returns_test                       -14.999643
return_std_test                      2.288364
average_reward_test                 -0.794431
round_time_test        0 days 00:00:09.134865
round_time_total       0 days 00:10:43.572552
loss_total             36904543122209390592.0
loss_critic            46130678102042394624.0
loss_actor                -23035434632.192001
memory_size                         16742.175 

=== epoch 2/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:42<00:00,  3.11it/s]
episodes                                  549
episode_length                      18.183971
returns                            -15.050429
return_std                            2.48273
average_reward                      -0.828331
round_time             0 days 00:10:43.429899
episodes_test                           467.0
episode_length_test                 21.376874
returns_test                       -13.746087
return_std_test                      2.694584
average_reward_test                 -0.643423
round_time_test        0 days 00:00:09.303031
round_time_total       0 days 00:10:43.431062
loss_total             36401295897560604672.0
loss_critic            45501619068620079104.0
loss_actor                   -23321423676.416
memory_size                         16763.657 

=== epoch 2/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:40,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:52<00:00,  3.06it/s]
episodes                                  517
episode_length                      19.290135
returns                            -14.393241
return_std                           2.930046
average_reward                      -0.746426
round_time             0 days 00:10:53.178825
episodes_test                           459.0
episode_length_test                 21.779956
returns_test                       -12.961808
return_std_test                      3.057398
average_reward_test                 -0.594951
round_time_test        0 days 00:00:09.350814
round_time_total       0 days 00:10:53.179955
loss_total             38106970069137375232.0
loss_critic            47633711742409105408.0
loss_actor                -23554262965.248001
memory_size                        16816.1115 

=== epoch 2/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  495
episode_length                      20.173737
returns                            -13.736109
return_std                           3.246529
average_reward                      -0.679885
round_time             0 days 00:11:06.328101
episodes_test                           431.0
episode_length_test                 23.167053
returns_test                       -11.705099
return_std_test                      10.33796
average_reward_test                 -0.505483
round_time_test        0 days 00:00:09.030634
round_time_total       0 days 00:11:06.329208
loss_total             40239948479806930944.0
loss_critic            50299934787219570688.0
loss_actor                -23692697670.655998
memory_size                        16895.8445 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                   88
episode_length                      22.556818
returns                            -11.772088
return_std                           3.183445
average_reward                      -0.519169
round_time             0 days 00:10:31.908426
episodes_test                           432.0
episode_length_test                 23.113426
returns_test                       -11.952179
return_std_test                      9.814518
average_reward_test                 -0.516931
round_time_test        0 days 00:00:09.101180
round_time_total       0 days 00:10:31.909709
loss_total             41283592077894631424.0
loss_critic            51604489178451443712.0
loss_actor                   -23705367457.792
memory_size                         16944.402 

=== epoch 3/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  175
episode_length                      22.765714
returns                            -12.147656
return_std                           3.163996
average_reward                      -0.531463
round_time             0 days 00:10:34.633442
episodes_test                           438.0
episode_length_test                 22.785388
returns_test                       -12.082315
return_std_test                      6.002688
average_reward_test                 -0.529974
round_time_test        0 days 00:00:09.539066
round_time_total       0 days 00:10:34.634538
loss_total             43088854438191398912.0
loss_critic            53861067128272650240.0
loss_actor                   -23679925665.792
memory_size                         16998.081 

=== epoch 3/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                  265
episode_length                      22.637736
returns                            -12.049403
return_std                           3.282418
average_reward                      -0.531999
round_time             0 days 00:10:36.629830
episodes_test                           432.0
episode_length_test                 23.127315
returns_test                       -12.131527
return_std_test                      3.502746
average_reward_test                 -0.523414
round_time_test        0 days 00:00:09.159058
round_time_total       0 days 00:10:36.630950
loss_total             44279527786688118784.0
loss_critic            55349408811694530560.0
loss_actor                -23460353633.279999
memory_size                         17061.974 

=== epoch 3/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  352
episode_length                      22.721591
returns                             -12.02855
return_std                           3.407305
average_reward                      -0.529178
round_time             0 days 00:10:34.796006
episodes_test                           432.0
episode_length_test                    23.125
returns_test                       -12.066487
return_std_test                      3.845422
average_reward_test                 -0.520844
round_time_test        0 days 00:00:09.217165
round_time_total       0 days 00:10:34.797112
loss_total             45232447340051701760.0
loss_critic            56540558177257816064.0
loss_actor                   -23241755309.056
memory_size                         17122.649 

=== epoch 3/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  441
episode_length                      22.664399
returns                            -11.928946
return_std                           3.445568
average_reward                      -0.525642
round_time             0 days 00:10:37.786562
episodes_test                           432.0
episode_length_test                 23.148148
returns_test                       -11.745014
return_std_test                      6.779928
average_reward_test                 -0.507385
round_time_test        0 days 00:00:09.266241
round_time_total       0 days 00:10:37.788099
loss_total             46187375204179968000.0
loss_critic            57734218071464706048.0
loss_actor                -22916462676.992001
memory_size                        17178.8485 

=== epoch 3/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:59,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                  440
episode_length                      22.686364
returns                            -11.999687
return_std                            3.46469
average_reward                      -0.528976
round_time             0 days 00:10:37.354340
episodes_test                           427.0
episode_length_test                 23.407494
returns_test                       -12.397328
return_std_test                      3.277692
average_reward_test                 -0.528958
round_time_test        0 days 00:00:09.149033
round_time_total       0 days 00:10:37.355445
loss_total             46769555215748849664.0
loss_critic            58461943015557070848.0
loss_actor                -22557340772.352001
memory_size                        17232.5185 

=== epoch 3/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.14it/s]
episodes                                  442
episode_length                      22.554299
returns                            -11.814114
return_std                           3.515207
average_reward                      -0.524949
round_time             0 days 00:10:36.600558
episodes_test                           425.0
episode_length_test                 23.482353
returns_test                       -11.147491
return_std_test                     10.762491
average_reward_test                 -0.474426
round_time_test        0 days 00:00:09.148380
round_time_total       0 days 00:10:36.601661
loss_total             44728634581047410688.0
loss_critic            55910792276056342528.0
loss_actor                -21478604526.591999
memory_size                        17286.9805 

=== epoch 3/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  439
episode_length                      22.683371
returns                            -11.824672
return_std                           3.655826
average_reward                       -0.52088
round_time             0 days 00:10:37.891496
episodes_test                           429.0
episode_length_test                 23.254079
returns_test                       -12.272635
return_std_test                      3.895716
average_reward_test                 -0.527706
round_time_test        0 days 00:00:09.120090
round_time_total       0 days 00:10:37.892599
loss_total             45179529426504556544.0
loss_critic            56474410863389220864.0
loss_actor                -20859245329.408001
memory_size                        17352.2485 

=== epoch 3/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  439
episode_length                       22.71754
returns                            -11.926037
return_std                           3.630528
average_reward                      -0.523899
round_time             0 days 00:10:39.281167
episodes_test                           441.0
episode_length_test                 22.673469
returns_test                       -11.595012
return_std_test                      4.699752
average_reward_test                 -0.511252
round_time_test        0 days 00:00:09.163118
round_time_total       0 days 00:10:39.282269
loss_total             43565849489569177600.0
loss_critic            54457310903462215680.0
loss_actor                -20279504519.167999
memory_size                        17413.7175 

=== epoch 3/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  438
episode_length                      22.757991
returns                             -11.97168
return_std                           3.638847
average_reward                      -0.526052
round_time             0 days 00:10:40.429358
episodes_test                           429.0
episode_length_test                 23.293706
returns_test                       -11.867083
return_std_test                      3.857286
average_reward_test                 -0.509298
round_time_test        0 days 00:00:09.103108
round_time_total       0 days 00:10:40.430495
loss_total             43285809758712578048.0
loss_critic            54107261290881310720.0
loss_actor                -19624107875.327999
memory_size                          17483.07 

=== epoch 3/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  442
episode_length                      22.599548
returns                            -12.190352
return_std                           3.592387
average_reward                      -0.539278
round_time             0 days 00:10:39.549046
episodes_test                           433.0
episode_length_test                 23.057737
returns_test                        -12.21639
return_std_test                      3.534124
average_reward_test                 -0.529007
round_time_test        0 days 00:00:09.297131
round_time_total       0 days 00:10:39.550146
loss_total             43254772807155875840.0
loss_critic            54068465092364468224.0
loss_actor                -19153861202.431999
memory_size                         17549.357 

=== epoch 3/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  446
episode_length                      22.399103
returns                            -12.640967
return_std                            3.58943
average_reward                       -0.56429
round_time             0 days 00:10:40.938786
episodes_test                           483.0
episode_length_test                 20.670807
returns_test                       -13.436823
return_std_test                      3.137699
average_reward_test                 -0.649077
round_time_test        0 days 00:00:09.311998
round_time_total       0 days 00:10:40.940245
loss_total             41268672722703720448.0
loss_critic            51585840010438770688.0
loss_actor                   -18627646002.688
memory_size                        17607.2355 

=== epoch 3/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  458
episode_length                      21.777293
returns                            -12.968413
return_std                           3.375904
average_reward                        -0.5956
round_time             0 days 00:10:41.881189
episodes_test                           517.0
episode_length_test                 19.313346
returns_test                       -14.638609
return_std_test                      2.477614
average_reward_test                 -0.757651
round_time_test        0 days 00:00:09.112534
round_time_total       0 days 00:10:41.882295
loss_total             39565233947326660608.0
loss_critic            49456541575027433472.0
loss_actor                -18020563199.487999
memory_size                          17656.67 

=== epoch 3/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  484
episode_length                      20.590909
returns                            -13.686301
return_std                           3.200764
average_reward                      -0.665841
round_time             0 days 00:10:39.310627
episodes_test                           579.0
episode_length_test                 17.255613
returns_test                       -15.751768
return_std_test                      1.701516
average_reward_test                 -0.912313
round_time_test        0 days 00:00:09.218977
round_time_total       0 days 00:10:39.311744
loss_total             37708299759906119680.0
loss_critic            47135373898201235456.0
loss_actor                -17368664343.551998
memory_size                           17680.0 

=== epoch 3/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:06,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:45<00:00,  3.10it/s]
episodes                                  520
episode_length                      19.207692
returns                            -14.517775
return_std                           2.743669
average_reward                      -0.756222
round_time             0 days 00:10:46.366124
episodes_test                           580.0
episode_length_test                 17.237931
returns_test                       -16.053762
return_std_test                      1.574994
average_reward_test                 -0.930911
round_time_test        0 days 00:00:09.254142
round_time_total       0 days 00:10:46.367448
loss_total             36623323096956448768.0
loss_critic            45779153043813064704.0
loss_actor                    -17025864174.08
memory_size                           17680.0 

=== epoch 3/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  550
episode_length                      18.132727
returns                            -15.033546
return_std                            2.54878
average_reward                      -0.828536
round_time             0 days 00:10:42.330510
episodes_test                           619.0
episode_length_test                 16.137318
returns_test                       -16.178634
return_std_test                      1.501581
average_reward_test                 -1.001831
round_time_test        0 days 00:00:09.317893
round_time_total       0 days 00:10:42.331849
loss_total             35548123589368610816.0
loss_critic            44435153687228375040.0
loss_actor                   -16722514042.368
memory_size                         17681.674 

=== epoch 3/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  584
episode_length                      17.101027
returns                            -15.478375
return_std                           2.247599
average_reward                      -0.904975
round_time             0 days 00:10:38.257114
episodes_test                           629.0
episode_length_test                 15.890302
returns_test                       -16.150381
return_std_test                       1.63331
average_reward_test                 -1.015808
round_time_test        0 days 00:00:09.450009
round_time_total       0 days 00:10:38.258645
loss_total             34396652007996534784.0
loss_critic            42995814266176053248.0
loss_actor                -16383052256.256001
memory_size                           17689.0 

=== epoch 3/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  616
episode_length                      16.207792
returns                            -15.968838
return_std                           1.736481
average_reward                        -0.9849
round_time             0 days 00:10:40.775228
episodes_test                           644.0
episode_length_test                 15.520186
returns_test                       -16.143049
return_std_test                      1.684585
average_reward_test                 -1.039381
round_time_test        0 days 00:00:09.436446
round_time_total       0 days 00:10:40.776567
loss_total             33318427079265902592.0
loss_critic            41648033122305196032.0
loss_actor                -16063543498.752001
memory_size                           17689.0 

=== epoch 3/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  633
episode_length                      15.789889
returns                            -16.126182
return_std                           1.677936
average_reward                         -1.021
round_time             0 days 00:10:40.089909
episodes_test                           648.0
episode_length_test                 15.410494
returns_test                       -16.174573
return_std_test                      1.636977
average_reward_test                 -1.048899
round_time_test        0 days 00:00:09.479248
round_time_total       0 days 00:10:40.091383
loss_total             31942134431755223040.0
loss_critic            39927667377513152512.0
loss_actor                -15809200788.992001
memory_size                          17695.02 

=== epoch 3/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  642
episode_length                      15.563863
returns                             -16.29261
return_std                           1.688876
average_reward                      -1.045475
round_time             0 days 00:10:39.711889
episodes_test                           671.0
episode_length_test                 14.886736
returns_test                       -16.666316
return_std_test                      1.377466
average_reward_test                 -1.119031
round_time_test        0 days 00:00:09.335541
round_time_total       0 days 00:10:39.713019
loss_total             29811798028230205440.0
loss_critic            37264746941551476736.0
loss_actor                -15671237950.464001
memory_size                        17697.5105 

=== epoch 3/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  653
episode_length                      15.300153
returns                            -16.471333
return_std                           1.557427
average_reward                      -1.077096
round_time             0 days 00:10:41.019659
episodes_test                           666.0
episode_length_test                 15.003003
returns_test                       -16.580703
return_std_test                      1.529463
average_reward_test                  -1.10503
round_time_test        0 days 00:00:09.491855
round_time_total       0 days 00:10:41.020770
loss_total             27833887046458343424.0
loss_critic            34792358207602139136.0
loss_actor                   -15444599418.368
memory_size                           17699.0 

=== epoch 3/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  659
episode_length                      15.150228
returns                            -16.523977
return_std                           1.523742
average_reward                      -1.090334
round_time             0 days 00:10:39.721477
episodes_test                           667.0
episode_length_test                 14.974513
returns_test                       -16.759469
return_std_test                      1.438357
average_reward_test                 -1.118924
round_time_test        0 days 00:00:09.357603
round_time_total       0 days 00:10:39.722581
loss_total             27097058126296788992.0
loss_critic            33871322118697971712.0
loss_actor                    -15366862179.84
memory_size                        17699.4785 

=== epoch 3/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.14it/s]
episodes                                  661
episode_length                      15.101362
returns                            -16.532152
return_std                           1.477024
average_reward                      -1.094936
round_time             0 days 00:10:38.368970
episodes_test                           667.0
episode_length_test                 14.983508
returns_test                       -16.682864
return_std_test                      1.453769
average_reward_test                 -1.112367
round_time_test        0 days 00:00:09.442534
round_time_total       0 days 00:10:38.370063
loss_total             26363847609292050432.0
loss_critic            32954808968456318976.0
loss_actor                -15287800473.087999
memory_size                           17700.0 

=== epoch 3/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  666
episode_length                      14.987988
returns                            -16.553469
return_std                           1.467859
average_reward                      -1.103977
round_time             0 days 00:10:41.452396
episodes_test                           666.0
episode_length_test                 15.012012
returns_test                       -16.668633
return_std_test                      1.551095
average_reward_test                 -1.109962
round_time_test        0 days 00:00:09.591873
round_time_total       0 days 00:10:41.453731
loss_total             26113988553975930880.0
loss_critic            32642485145462878208.0
loss_actor                -15211967607.808001
memory_size                           17700.0 

=== epoch 3/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  668
episode_length                      14.947605
returns                            -16.550625
return_std                           1.476532
average_reward                      -1.107311
round_time             0 days 00:10:39.886180
episodes_test                           664.0
episode_length_test                 15.045181
returns_test                       -16.627232
return_std_test                      1.433355
average_reward_test                 -1.104152
round_time_test        0 days 00:00:09.424602
round_time_total       0 days 00:10:39.887283
loss_total             25271393350555680768.0
loss_critic            31589241137407995904.0
loss_actor                   -15163769837.056
memory_size                         17700.505 

=== epoch 3/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:37<00:00,  3.13it/s]
episodes                                  672
episode_length                      14.877976
returns                            -16.516465
return_std                           1.465637
average_reward                      -1.109733
round_time             0 days 00:10:38.533070
episodes_test                           667.0
episode_length_test                 14.976012
returns_test                       -16.657992
return_std_test                      1.349701
average_reward_test                 -1.111819
round_time_test        0 days 00:00:09.338068
round_time_total       0 days 00:10:38.534180
loss_total             24108403655081803776.0
loss_critic            30135504044797526016.0
loss_actor                   -14992666127.872
memory_size                           17704.0 

=== epoch 3/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  674
episode_length                      14.808605
returns                            -16.578319
return_std                           1.483634
average_reward                      -1.119761
round_time             0 days 00:10:39.316175
episodes_test                           690.0
episode_length_test                 14.485507
returns_test                       -16.644684
return_std_test                       1.56635
average_reward_test                 -1.148304
round_time_test        0 days 00:00:09.485926
round_time_total       0 days 00:10:39.317272
loss_total             23069228493414985728.0
loss_critic            28836535106664054784.0
loss_actor                -14794876591.615999
memory_size                           17704.0 

=== epoch 3/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  678
episode_length                      14.725664
returns                            -16.701393
return_std                           1.479074
average_reward                      -1.134269
round_time             0 days 00:10:40.894903
episodes_test                           689.0
episode_length_test                 14.502177
returns_test                       -16.750424
return_std_test                      1.602203
average_reward_test                 -1.154321
round_time_test        0 days 00:00:09.476917
round_time_total       0 days 00:10:40.896020
loss_total             22534931985087086592.0
loss_critic            28168664521144520704.0
loss_actor                -14655656172.544001
memory_size                           17704.0 

=== epoch 3/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:46,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  680
episode_length                      14.686765
returns                             -16.74712
return_std                           1.437586
average_reward                      -1.140591
round_time             0 days 00:10:38.786050
episodes_test                           681.0
episode_length_test                 14.676946
returns_test                       -16.586592
return_std_test                      3.420303
average_reward_test                 -1.129098
round_time_test        0 days 00:00:09.584588
round_time_total       0 days 00:10:38.787211
loss_total             21295503845975793664.0
loss_critic            26619379354058633216.0
loss_actor                -14527491260.927999
memory_size                        17704.8685 

=== epoch 3/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  684
episode_length                      14.587719
returns                            -16.712326
return_std                           1.540162
average_reward                       -1.14553
round_time             0 days 00:10:38.923294
episodes_test                           689.0
episode_length_test                 14.507983
returns_test                        -16.62442
return_std_test                      1.623194
average_reward_test                  -1.14537
round_time_test        0 days 00:00:09.417746
round_time_total       0 days 00:10:38.924430
loss_total             20634276132029038592.0
loss_critic            25792844694445318144.0
loss_actor                -14256126540.799999
memory_size                           17707.0 

=== epoch 3/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  685
episode_length                      14.566423
returns                            -16.758047
return_std                           1.520097
average_reward                      -1.150315
round_time             0 days 00:10:39.072253
episodes_test                           692.0
episode_length_test                 14.450867
returns_test                       -16.664875
return_std_test                      1.348624
average_reward_test                 -1.153209
round_time_test        0 days 00:00:09.299866
round_time_total       0 days 00:10:39.073362
loss_total             20028738372241735680.0
loss_critic            25035922505225269248.0
loss_actor                   -14053864625.152
memory_size                         17709.429 

=== epoch 3/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:12,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  689
episode_length                      14.487663
returns                            -16.735596
return_std                           1.536432
average_reward                      -1.156197
round_time             0 days 00:10:38.969682
episodes_test                           695.0
episode_length_test                 14.385612
returns_test                       -16.702997
return_std_test                      1.447703
average_reward_test                 -1.160766
round_time_test        0 days 00:00:09.468789
round_time_total       0 days 00:10:38.970800
loss_total             19212205836040757248.0
loss_critic            24015256866447568896.0
loss_actor                    -13830856171.52
memory_size                           17714.0 

=== epoch 3/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  692
episode_length                      14.434971
returns                            -16.656801
return_std                            1.59421
average_reward                      -1.153858
round_time             0 days 00:10:38.743084
episodes_test                           699.0
episode_length_test                 14.304721
returns_test                       -16.945301
return_std_test                      1.601591
average_reward_test                 -1.184379
round_time_test        0 days 00:00:09.326870
round_time_total       0 days 00:10:38.744236
loss_total             18282613389018361856.0
loss_critic            22853266350516170752.0
loss_actor                   -13659812081.152
memory_size                           17714.0 

=== epoch 3/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  694
episode_length                      14.387608
returns                            -16.614448
return_std                           1.887972
average_reward                      -1.154615
round_time             0 days 00:10:39.306155
episodes_test                           696.0
episode_length_test                 14.366379
returns_test                       -16.817141
return_std_test                      1.375881
average_reward_test                 -1.170385
round_time_test        0 days 00:00:09.535064
round_time_total       0 days 00:10:39.307372
loss_total             17981245988152037376.0
loss_critic            22476557084005740544.0
loss_actor                   -13548896631.296
memory_size                        17716.1675 

=== epoch 3/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  696
episode_length                      14.354885
returns                            -16.728129
return_std                           1.789869
average_reward                      -1.165236
round_time             0 days 00:10:39.436875
episodes_test                           690.0
episode_length_test                 14.486957
returns_test                       -16.779205
return_std_test                      1.493672
average_reward_test                 -1.157544
round_time_test        0 days 00:00:09.617161
round_time_total       0 days 00:10:39.438021
loss_total             17623327644740651008.0
loss_critic            22029159183328808960.0
loss_actor                   -13391678466.048
memory_size                           17717.0 

=== epoch 3/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  697
episode_length                      14.329986
returns                            -16.767656
return_std                           1.799563
average_reward                      -1.170246
round_time             0 days 00:10:39.808330
episodes_test                           698.0
episode_length_test                 14.326648
returns_test                       -16.876781
return_std_test                      1.510222
average_reward_test                 -1.177999
round_time_test        0 days 00:00:09.430470
round_time_total       0 days 00:10:39.809651
loss_total             17057457819032207360.0
loss_critic            21321821903083040768.0
loss_actor                   -13243456375.296
memory_size                           17717.0 

=== epoch 3/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:46,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:36<00:00,  3.14it/s]
episodes                                  695
episode_length                      14.369784
returns                            -16.778342
return_std                           1.749808
average_reward                      -1.167759
round_time             0 days 00:10:36.787330
episodes_test                           697.0
episode_length_test                 14.347202
returns_test                       -16.834692
return_std_test                       1.43708
average_reward_test                 -1.173378
round_time_test        0 days 00:00:09.518797
round_time_total       0 days 00:10:36.788450
loss_total             16555845933898981376.0
loss_critic            20694807056940068864.0
loss_actor                   -13168919118.848
memory_size                           17717.0 

=== epoch 3/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:49,  2.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  695
episode_length                      14.366906
returns                            -16.862297
return_std                           1.703973
average_reward                      -1.173715
round_time             0 days 00:10:39.400056
episodes_test                           694.0
episode_length_test                 14.402017
returns_test                       -16.928922
return_std_test                      1.378336
average_reward_test                 -1.174725
round_time_test        0 days 00:00:09.649410
round_time_total       0 days 00:10:39.401562
loss_total             16332677773898336256.0
loss_critic            20415846883499339776.0
loss_actor                   -13094295548.416
memory_size                           17717.0 

=== epoch 3/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:53,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  694
episode_length                      14.371758
returns                            -16.899086
return_std                           1.383085
average_reward                      -1.176352
round_time             0 days 00:10:40.542465
episodes_test                           694.0
episode_length_test                 14.399135
returns_test                       -16.882428
return_std_test                      1.518955
average_reward_test                 -1.171559
round_time_test        0 days 00:00:09.360060
round_time_total       0 days 00:10:40.543972
loss_total             15839239006607917056.0
loss_critic            19799048414353276928.0
loss_actor                   -12950489609.216
memory_size                           17717.0 

=== epoch 3/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:24,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  692
episode_length                      14.424855
returns                            -16.885047
return_std                            1.37832
average_reward                      -1.170334
round_time             0 days 00:10:41.007985
episodes_test                           699.0
episode_length_test                 14.298999
returns_test                       -16.786379
return_std_test                       1.45047
average_reward_test                 -1.173129
round_time_test        0 days 00:00:09.461730
round_time_total       0 days 00:10:41.009110
loss_total             14940797178104543232.0
loss_critic            18675996164423827456.0
loss_actor                   -12828810127.872
memory_size                        17717.0615 

=== epoch 3/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:42,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  690
episode_length                      14.478261
returns                            -16.854569
return_std                           1.380997
average_reward                      -1.164742
round_time             0 days 00:10:41.337914
episodes_test                           696.0
episode_length_test                 14.363506
returns_test                       -16.817881
return_std_test                      1.606935
average_reward_test                 -1.170488
round_time_test        0 days 00:00:09.454989
round_time_total       0 days 00:10:41.339128
loss_total             14971224219428759552.0
loss_critic            18714029957317361664.0
loss_actor                   -12618838240.768
memory_size                           17718.0 

=== epoch 3/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  690
episode_length                      14.473913
returns                            -16.840958
return_std                           1.413885
average_reward                      -1.163362
round_time             0 days 00:10:40.069755
episodes_test                           693.0
episode_length_test                 14.412698
returns_test                       -16.602224
return_std_test                      1.570996
average_reward_test                 -1.151769
round_time_test        0 days 00:00:09.428728
round_time_total       0 days 00:10:40.070973
loss_total             14269865366120937472.0
loss_critic            17837331403498768384.0
loss_actor                -12265871620.096001
memory_size                           17718.0 

=== epoch 3/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  686
episode_length                      14.559767
returns                            -16.754772
return_std                           1.479439
average_reward                      -1.150813
round_time             0 days 00:10:41.114299
episodes_test                           691.0
episode_length_test                 14.470333
returns_test                       -16.852757
return_std_test                      1.409141
average_reward_test                 -1.164428
round_time_test        0 days 00:00:09.383044
round_time_total       0 days 00:10:41.115401
loss_total             13813472044295262208.0
loss_critic            17266839757435785216.0
loss_actor                -12131373706.752001
memory_size                         17727.414 

=== epoch 3/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:39<00:00,  3.13it/s]
episodes                                  689
episode_length                      14.507983
returns                            -16.759811
return_std                           1.515845
average_reward                      -1.154726
round_time             0 days 00:10:39.992678
episodes_test                           697.0
episode_length_test                  14.33429
returns_test                       -16.680723
return_std_test                      1.589536
average_reward_test                  -1.16344
round_time_test        0 days 00:00:09.451139
round_time_total       0 days 00:10:39.994004
loss_total             13747016707741700096.0
loss_critic            17183770597051756544.0
loss_actor                -12080059969.535999
memory_size                           17734.0 

=== epoch 3/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  688
episode_length                      14.507267
returns                            -16.755966
return_std                           1.522077
average_reward                      -1.155612
round_time             0 days 00:10:40.796086
episodes_test                           698.0
episode_length_test                  14.30659
returns_test                       -16.881908
return_std_test                        1.5467
average_reward_test                 -1.179887
round_time_test        0 days 00:00:09.445665
round_time_total       0 days 00:10:40.797396
loss_total             13213759986806906880.0
loss_critic            16517199699697195008.0
loss_actor                    -11942763025.92
memory_size                           17734.0 

=== epoch 3/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  691
episode_length                      14.444284
returns                            -16.736047
return_std                           1.599553
average_reward                      -1.158003
round_time             0 days 00:10:41.635569
episodes_test                           699.0
episode_length_test                  14.30186
returns_test                       -16.857794
return_std_test                      1.577824
average_reward_test                 -1.178253
round_time_test        0 days 00:00:09.427590
round_time_total       0 days 00:10:41.636745
loss_total             13195219107495430144.0
loss_critic            16494023595472607232.0
loss_actor                   -12002900832.768
memory_size                           17734.0 

=== epoch 3/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:17,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
episodes                                  692
episode_length                      14.426301
returns                             -16.75726
return_std                           1.587965
average_reward                      -1.161737
round_time             0 days 00:10:40.989047
episodes_test                           698.0
episode_length_test                 14.313754
returns_test                       -16.732046
return_std_test                      1.674444
average_reward_test                  -1.16819
round_time_test        0 days 00:00:09.664317
round_time_total       0 days 00:10:40.990142
loss_total             13021967455569545216.0
loss_critic            16277459041697806336.0
loss_actor                   -12075731449.344
memory_size                           17734.0 

=== epoch 3/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:38<00:00,  3.13it/s]
episodes                                  696
episode_length                      14.347701
returns                             -16.73367
return_std                           1.637258
average_reward                      -1.166172
round_time             0 days 00:10:38.991882
episodes_test                           696.0
episode_length_test                 14.364943
returns_test                       -16.576765
return_std_test                      1.865933
average_reward_test                 -1.153568
round_time_test        0 days 00:00:09.518195
round_time_total       0 days 00:10:38.993005
loss_total             12901230217039247360.0
loss_critic            16126537509684011008.0
loss_actor                -12139536028.672001
memory_size                           17734.0 

=== epoch 3/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:41<00:00,  3.12it/s]
episodes                                  695
episode_length                      14.366906
returns                            -16.707392
return_std                           1.674632
average_reward                      -1.162777
round_time             0 days 00:10:42.208222
episodes_test                           696.0
episode_length_test                 14.366379
returns_test                        -16.58526
return_std_test                      1.730549
average_reward_test                  -1.15425
round_time_test        0 days 00:00:10.181071
round_time_total       0 days 00:10:42.209347
loss_total             12472306764830509056.0
loss_critic            15590383163980361728.0
loss_actor                -12209606236.672001
memory_size                           17734.0 

=== epoch 3/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:51,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:40<00:00,  3.12it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  700
episode_length                      14.271429
returns                            -16.469786
return_std                           1.825616
average_reward                      -1.153632
round_time             0 days 00:10:40.774754
episodes_test                           695.0
episode_length_test                 14.382734
returns_test                       -16.149203
return_std_test                      1.793797
average_reward_test                 -1.122105
round_time_test        0 days 00:00:09.365656
round_time_total       0 days 00:10:40.775863
loss_total             12454441214219466752.0
loss_critic            15568051259423461376.0
loss_actor                -12237420384.256001
memory_size                           17734.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<10:03,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  146
episode_length                      13.691781
returns                            -13.104176
return_std                           2.442759
average_reward                      -0.956203
round_time             0 days 00:10:32.189848
episodes_test                           724.0
episode_length_test                 13.808011
returns_test                       -13.969997
return_std_test                      2.527468
average_reward_test                 -1.011161
round_time_test        0 days 00:00:09.901190
round_time_total       0 days 00:10:32.191183
loss_total             12688879680734836736.0
loss_critic            15861099337774491648.0
loss_actor                   -12314199035.392
memory_size                           17734.0 

=== epoch 4/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  293
episode_length                      13.638225
returns                            -12.831663
return_std                           2.489612
average_reward                      -0.939041
round_time             0 days 00:10:32.142808
episodes_test                           734.0
episode_length_test                 13.615804
returns_test                       -12.613449
return_std_test                      2.417217
average_reward_test                  -0.92567
round_time_test        0 days 00:00:09.415701
round_time_total       0 days 00:10:32.144170
loss_total             12706585125616605184.0
loss_critic            15883231136025499648.0
loss_actor                -12390827971.072001
memory_size                           17734.0 

=== epoch 4/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  438
episode_length                      13.682648
returns                            -12.800672
return_std                           2.446109
average_reward                      -0.934192
round_time             0 days 00:10:33.592756
episodes_test                           732.0
episode_length_test                 13.648907
returns_test                       -12.596096
return_std_test                      2.301376
average_reward_test                 -0.922757
round_time_test        0 days 00:00:09.357926
round_time_total       0 days 00:10:33.593866
loss_total             12619880126570125312.0
loss_critic            15774849891959044096.0
loss_actor                    -12475302100.48
memory_size                           17734.0 

=== epoch 4/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  583
episode_length                      13.703259
returns                            -12.708867
return_std                           2.470463
average_reward                      -0.927181
round_time             0 days 00:10:32.918271
episodes_test                           729.0
episode_length_test                 13.717421
returns_test                       -12.572276
return_std_test                      2.294667
average_reward_test                 -0.916519
round_time_test        0 days 00:00:09.489271
round_time_total       0 days 00:10:32.919372
loss_total             12796627207187132416.0
loss_critic            15995783743864172544.0
loss_actor                -12609758279.167999
memory_size                           17734.0 

=== epoch 4/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                  727
episode_length                      13.740028
returns                            -12.704733
return_std                            2.41386
average_reward                      -0.924639
round_time             0 days 00:10:31.000579
episodes_test                           732.0
episode_length_test                 13.651639
returns_test                       -12.529675
return_std_test                      2.468042
average_reward_test                 -0.917101
round_time_test        0 days 00:00:09.347034
round_time_total       0 days 00:10:31.002086
loss_total             13083144651150274560.0
loss_critic            16353930550810966016.0
loss_actor                -12753468417.535999
memory_size                        17737.6315 

=== epoch 4/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                  727
episode_length                      13.740028
returns                            -12.672395
return_std                           2.393489
average_reward                      -0.922535
round_time             0 days 00:10:31.441639
episodes_test                           727.0
episode_length_test                 13.740028
returns_test                       -12.515638
return_std_test                      2.450038
average_reward_test                 -0.911216
round_time_test        0 days 00:00:09.442868
round_time_total       0 days 00:10:31.442758
loss_total             13116059999575771136.0
loss_critic            16395074729608329216.0
loss_actor                    -12934484021.76
memory_size                           17740.0 

=== epoch 4/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                  726
episode_length                      13.750689
returns                            -12.786966
return_std                           2.349471
average_reward                      -0.929731
round_time             0 days 00:10:31.418196
episodes_test                           731.0
episode_length_test                 13.663475
returns_test                       -12.554607
return_std_test                      2.390677
average_reward_test                  -0.91861
round_time_test        0 days 00:00:09.495100
round_time_total       0 days 00:10:31.419302
loss_total             13437937008786860032.0
loss_critic            16797420960748179456.0
loss_actor                -13067134956.544001
memory_size                           17740.0 

=== epoch 4/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.15it/s]
episodes                                  725
episode_length                      13.776552
returns                            -12.729387
return_std                            2.37362
average_reward                      -0.923601
round_time             0 days 00:10:34.529085
episodes_test                           727.0
episode_length_test                 13.751032
returns_test                       -12.623635
return_std_test                      2.354244
average_reward_test                 -0.917478
round_time_test        0 days 00:00:09.347500
round_time_total       0 days 00:10:34.530234
loss_total             13488870122151565312.0
loss_critic            16861087354309488640.0
loss_actor                    -13257447252.48
memory_size                         17740.851 

=== epoch 4/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  726
episode_length                      13.760331
returns                            -12.777676
return_std                           2.324253
average_reward                      -0.928183
round_time             0 days 00:10:33.683433
episodes_test                           727.0
episode_length_test                 13.740028
returns_test                       -12.647581
return_std_test                      2.296674
average_reward_test                 -0.920334
round_time_test        0 days 00:00:09.418005
round_time_total       0 days 00:10:33.684543
loss_total             13844620631466551296.0
loss_critic            17305775494148675584.0
loss_actor                     -13387994944.0
memory_size                           17741.0 

=== epoch 4/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  727
episode_length                      13.740028
returns                            -12.778233
return_std                           2.345817
average_reward                      -0.929388
round_time             0 days 00:10:32.921772
episodes_test                           730.0
episode_length_test                 13.690411
returns_test                       -12.541123
return_std_test                      2.474376
average_reward_test                 -0.915407
round_time_test        0 days 00:00:09.563902
round_time_total       0 days 00:10:32.922872
loss_total             14326161913561169920.0
loss_critic            17907702074570559488.0
loss_actor                    -13546210588.16
memory_size                         17744.591 

=== epoch 4/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  728
episode_length                      13.725275
returns                            -12.785329
return_std                           2.332211
average_reward                      -0.931186
round_time             0 days 00:10:32.667286
episodes_test                           733.0
episode_length_test                 13.633015
returns_test                       -12.636417
return_std_test                      2.224664
average_reward_test                 -0.926261
round_time_test        0 days 00:00:09.519976
round_time_total       0 days 00:10:32.668386
loss_total             14108982404032466944.0
loss_critic            17636227710543265792.0
loss_actor                -13717676236.799999
memory_size                           17745.0 

=== epoch 4/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:38,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  729
episode_length                      13.703704
returns                            -12.629825
return_std                           2.329578
average_reward                      -0.921881
round_time             0 days 00:10:33.408958
episodes_test                           735.0
episode_length_test                 13.589116
returns_test                       -12.770117
return_std_test                      2.363425
average_reward_test                 -0.939823
round_time_test        0 days 00:00:09.472807
round_time_total       0 days 00:10:33.410160
loss_total             14404243201293322240.0
loss_critic            18005303699628912640.0
loss_actor                -13839820674.559999
memory_size                           17745.0 

=== epoch 4/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  733
episode_length                      13.619372
returns                            -12.689696
return_std                           2.270945
average_reward                      -0.931765
round_time             0 days 00:10:33.502943
episodes_test                           729.0
episode_length_test                 13.699588
returns_test                       -12.666656
return_std_test                      2.433177
average_reward_test                 -0.924048
round_time_test        0 days 00:00:09.413370
round_time_total       0 days 00:10:33.504194
loss_total             14578856186740543488.0
loss_critic            18223569930441506816.0
loss_actor                -13982209943.040001
memory_size                           17745.0 

=== epoch 4/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  734
episode_length                      13.606267
returns                            -12.706667
return_std                           2.266527
average_reward                      -0.934022
round_time             0 days 00:10:34.719862
episodes_test                           742.0
episode_length_test                  13.47035
returns_test                       -12.682849
return_std_test                      2.418286
average_reward_test                 -0.940666
round_time_test        0 days 00:00:09.626557
round_time_total       0 days 00:10:34.721012
loss_total             14742239437171066880.0
loss_critic            18427798990696519680.0
loss_actor                    -14067842485.76
memory_size                           17745.0 

=== epoch 4/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  737
episode_length                      13.548168
returns                            -12.757742
return_std                           2.310355
average_reward                      -0.941374
round_time             0 days 00:10:31.612816
episodes_test                           739.0
episode_length_test                 13.523681
returns_test                        -12.84234
return_std_test                      2.256908
average_reward_test                 -0.949183
round_time_test        0 days 00:00:09.545930
round_time_total       0 days 00:10:31.614065
loss_total             14788852434934943744.0
loss_critic            18486065227730886656.0
loss_actor                -14143562652.672001
memory_size                           17745.0 

=== epoch 4/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  739
episode_length                      13.519621
returns                            -12.700175
return_std                           2.334061
average_reward                      -0.939394
round_time             0 days 00:10:31.591934
episodes_test                           730.0
episode_length_test                 13.690411
returns_test                        -12.52833
return_std_test                      2.388104
average_reward_test                 -0.914652
round_time_test        0 days 00:00:09.369993
round_time_total       0 days 00:10:31.593051
loss_total             14915700535341803520.0
loss_critic            18644625343275134976.0
loss_actor                     -14281739353.6
memory_size                           17745.0 

=== epoch 4/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  738
episode_length                       13.53252
returns                            -12.821177
return_std                           2.288334
average_reward                      -0.947557
round_time             0 days 00:10:32.464912
episodes_test                           732.0
episode_length_test                 13.651639
returns_test                       -12.735639
return_std_test                       2.30079
average_reward_test                  -0.93246
round_time_test        0 days 00:00:09.783815
round_time_total       0 days 00:10:32.466089
loss_total             14948909886854377472.0
loss_critic            18686137029436039168.0
loss_actor                   -14393802922.496
memory_size                           17745.0 

=== epoch 4/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  736
episode_length                      13.561141
returns                            -12.890232
return_std                           2.300802
average_reward                        -0.9508
round_time             0 days 00:10:33.079127
episodes_test                           740.0
episode_length_test                 13.497297
returns_test                       -12.866145
return_std_test                      2.387064
average_reward_test                 -0.953062
round_time_test        0 days 00:00:09.502687
round_time_total       0 days 00:10:33.080379
loss_total             14963344944556646400.0
loss_critic            18704180862009233408.0
loss_actor                   -14501585110.528
memory_size                           17745.0 

=== epoch 4/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  736
episode_length                      13.561141
returns                            -12.868593
return_std                             2.3125
average_reward                      -0.949575
round_time             0 days 00:10:32.695642
episodes_test                           733.0
episode_length_test                 13.639836
returns_test                       -12.819244
return_std_test                      2.415033
average_reward_test                 -0.939495
round_time_test        0 days 00:00:09.545106
round_time_total       0 days 00:10:32.696913
loss_total             15176160375336718336.0
loss_critic            18970200152752066560.0
loss_actor                    -14702256099.84
memory_size                           17745.0 

=== epoch 4/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  735
episode_length                      13.595918
returns                            -12.828222
return_std                           2.263527
average_reward                      -0.943511
round_time             0 days 00:10:34.272100
episodes_test                           735.0
episode_length_test                      13.6
returns_test                       -12.948223
return_std_test                      2.231241
average_reward_test                 -0.951448
round_time_test        0 days 00:00:09.526982
round_time_total       0 days 00:10:34.273220
loss_total             15433256050874122240.0
loss_critic            19291569713157681152.0
loss_actor                -14810097443.327999
memory_size                           17745.0 

=== epoch 4/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  734
episode_length                      13.595368
returns                            -12.858612
return_std                            2.21252
average_reward                       -0.94626
round_time             0 days 00:10:32.371567
episodes_test                           734.0
episode_length_test                 13.621253
returns_test                       -12.696246
return_std_test                      2.287526
average_reward_test                 -0.931837
round_time_test        0 days 00:00:09.526538
round_time_total       0 days 00:10:32.372692
loss_total             15910140748530434048.0
loss_critic            19887675591172304896.0
loss_actor                -14966386322.431999
memory_size                           17745.0 

=== epoch 4/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  734
episode_length                      13.594005
returns                            -12.815543
return_std                           2.343257
average_reward                      -0.942634
round_time             0 days 00:10:34.409517
episodes_test                           734.0
episode_length_test                 13.623978
returns_test                       -12.538867
return_std_test                      2.487034
average_reward_test                 -0.920353
round_time_test        0 days 00:00:09.456200
round_time_total       0 days 00:10:34.410782
loss_total             15762153111837632512.0
loss_critic            19702691047780204544.0
loss_actor                -15160344818.176001
memory_size                           17745.0 

=== epoch 4/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  735
episode_length                      13.602721
returns                            -12.708322
return_std                           2.410495
average_reward                       -0.93437
round_time             0 days 00:10:31.985810
episodes_test                           730.0
episode_length_test                 13.683562
returns_test                       -12.700544
return_std_test                      2.376381
average_reward_test                 -0.927955
round_time_test        0 days 00:00:09.539105
round_time_total       0 days 00:10:31.987073
loss_total             16140830906445596672.0
loss_critic            20176038272382820352.0
loss_actor                -15355760960.511999
memory_size                           17745.0 

=== epoch 4/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  733
episode_length                      13.622101
returns                            -12.644649
return_std                           2.395033
average_reward                      -0.928365
round_time             0 days 00:10:35.016240
episodes_test                           726.0
episode_length_test                 13.774105
returns_test                       -12.572044
return_std_test                      2.356052
average_reward_test                  -0.91273
round_time_test        0 days 00:00:09.521292
round_time_total       0 days 00:10:35.017691
loss_total             16139398232761559040.0
loss_critic            20174247432923475968.0
loss_actor                -15415541124.096001
memory_size                           17745.0 

=== epoch 4/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  733
episode_length                      13.620737
returns                            -12.674182
return_std                           2.454199
average_reward                      -0.930495
round_time             0 days 00:10:32.196108
episodes_test                           732.0
episode_length_test                 13.651639
returns_test                       -12.786803
return_std_test                      2.390316
average_reward_test                 -0.936079
round_time_test        0 days 00:00:09.388726
round_time_total       0 days 00:10:32.197197
loss_total             16254459650157400064.0
loss_critic            20318074231193993216.0
loss_actor                   -15525489758.208
memory_size                           17745.0 

=== epoch 4/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  731
episode_length                      13.663475
returns                             -12.66857
return_std                           2.537131
average_reward                      -0.926783
round_time             0 days 00:10:32.286909
episodes_test                           728.0
episode_length_test                 13.726648
returns_test                       -12.792985
return_std_test                      2.365521
average_reward_test                 -0.931661
round_time_test        0 days 00:00:09.557614
round_time_total       0 days 00:10:32.288354
loss_total             16603314056627834880.0
loss_critic            20754142206743363584.0
loss_actor                -15647547529.728001
memory_size                           17745.0 

=== epoch 4/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  733
episode_length                      13.635744
returns                            -12.844271
return_std                           2.607539
average_reward                      -0.942522
round_time             0 days 00:10:31.912242
episodes_test                           729.0
episode_length_test                 13.707819
returns_test                       -12.854881
return_std_test                      2.688183
average_reward_test                 -0.937256
round_time_test        0 days 00:00:09.430033
round_time_total       0 days 00:10:31.913328
loss_total             16545402765311539200.0
loss_critic            20681753090295894016.0
loss_actor                -15632307919.360001
memory_size                           17745.0 

=== epoch 4/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  736
episode_length                       13.58288
returns                             -13.39514
return_std                           2.854315
average_reward                      -0.985823
round_time             0 days 00:10:32.003487
episodes_test                           746.0
episode_length_test                 13.403485
returns_test                       -14.586054
return_std_test                      3.193908
average_reward_test                 -1.088039
round_time_test        0 days 00:00:09.491496
round_time_total       0 days 00:10:32.004598
loss_total             16452111984359651328.0
loss_critic            20565139634859315200.0
loss_actor                   -15795341334.528
memory_size                           17745.0 

=== epoch 4/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  737
episode_length                      13.553596
returns                            -14.067645
return_std                           3.129114
average_reward                      -1.038007
round_time             0 days 00:10:32.730230
episodes_test                           745.0
episode_length_test                 13.413423
returns_test                       -15.137903
return_std_test                      3.114777
average_reward_test                 -1.127796
round_time_test        0 days 00:00:09.445907
round_time_total       0 days 00:10:32.731487
loss_total             17078823227991842816.0
loss_critic            21348528650195091456.0
loss_actor                -16016131622.912001
memory_size                           17745.0 

=== epoch 4/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  737
episode_length                       13.53867
returns                            -14.888515
return_std                           3.283562
average_reward                      -1.100208
round_time             0 days 00:10:32.791548
episodes_test                           740.0
episode_length_test                 13.513514
returns_test                       -16.794929
return_std_test                      2.733106
average_reward_test                 -1.242825
round_time_test        0 days 00:00:09.545313
round_time_total       0 days 00:10:32.792735
loss_total             17755829184210956288.0
loss_critic            22194786098149044224.0
loss_actor                   -16378082301.952
memory_size                           17745.0 

=== epoch 4/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  738
episode_length                      13.539295
returns                            -15.457171
return_std                           3.190602
average_reward                      -1.141461
round_time             0 days 00:10:32.755789
episodes_test                           728.0
episode_length_test                 13.721154
returns_test                       -15.876715
return_std_test                      2.852857
average_reward_test                 -1.156809
round_time_test        0 days 00:00:09.540877
round_time_total       0 days 00:10:32.756886
loss_total             17983345391256066048.0
loss_critic            22479181357127233536.0
loss_actor                     -16712141862.4
memory_size                           17745.0 

=== epoch 4/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:04,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  734
episode_length                      13.622616
returns                            -15.930829
return_std                           3.049366
average_reward                      -1.169228
round_time             0 days 00:10:33.095756
episodes_test                           731.0
episode_length_test                 13.668947
returns_test                       -15.858572
return_std_test                      3.026441
average_reward_test                 -1.159871
round_time_test        0 days 00:00:09.466242
round_time_total       0 days 00:10:33.096844
loss_total             18553901137472561152.0
loss_critic            23192376013028532224.0
loss_actor                -17016386676.224001
memory_size                        17745.7475 

=== epoch 4/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                  731
episode_length                      13.649795
returns                            -16.156004
return_std                           2.994249
average_reward                      -1.183646
round_time             0 days 00:10:29.646195
episodes_test                           729.0
episode_length_test                 13.705075
returns_test                       -16.156933
return_std_test                      2.955574
average_reward_test                 -1.178709
round_time_test        0 days 00:00:09.589559
round_time_total       0 days 00:10:29.647287
loss_total             19007004298982662144.0
loss_critic            23758754950828666880.0
loss_actor                -17239104864.768002
memory_size                           17746.0 

=== epoch 4/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  733
episode_length                      13.626194
returns                             -16.17733
return_std                           2.881705
average_reward                      -1.186882
round_time             0 days 00:10:32.618135
episodes_test                           736.0
episode_length_test                 13.577446
returns_test                       -16.480162
return_std_test                      2.982191
average_reward_test                  -1.21293
round_time_test        0 days 00:00:09.590356
round_time_total       0 days 00:10:32.619269
loss_total             19409341318139215872.0
loss_critic            24261676269235859456.0
loss_actor                   -17419434946.048
memory_size                           17746.0 

=== epoch 4/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  731
episode_length                      13.668947
returns                             -16.08817
return_std                           2.878916
average_reward                       -1.17569
round_time             0 days 00:10:33.887870
episodes_test                           730.0
episode_length_test                 13.689041
returns_test                       -16.445518
return_std_test                      2.855287
average_reward_test                 -1.200826
round_time_test        0 days 00:00:09.639739
round_time_total       0 days 00:10:33.889116
loss_total             19695845773708939264.0
loss_critic            24619806812893851648.0
loss_actor                -17590668602.880001
memory_size                           17746.0 

=== epoch 4/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  730
episode_length                      13.671233
returns                             -16.19228
return_std                           2.888027
average_reward                      -1.183852
round_time             0 days 00:10:32.585184
episodes_test                           762.0
episode_length_test                 13.122047
returns_test                       -15.838363
return_std_test                      2.878551
average_reward_test                 -1.206776
round_time_test        0 days 00:00:09.551773
round_time_total       0 days 00:10:32.586442
loss_total             20197251261817933824.0
loss_critic            25246563612178997248.0
loss_actor                -17651530412.543999
memory_size                         17754.884 

=== epoch 4/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  731
episode_length                      13.655267
returns                            -16.190428
return_std                           2.936882
average_reward                      -1.185263
round_time             0 days 00:10:32.622625
episodes_test                           722.0
episode_length_test                  13.84626
returns_test                       -15.673048
return_std_test                      3.116911
average_reward_test                 -1.131273
round_time_test        0 days 00:00:09.472698
round_time_total       0 days 00:10:32.624071
loss_total             20372205883532406784.0
loss_critic            25465256933165117440.0
loss_actor                -17800071669.248001
memory_size                           17755.0 

=== epoch 4/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  729
episode_length                      13.710562
returns                            -16.248264
return_std                           2.951965
average_reward                      -1.184565
round_time             0 days 00:10:33.648124
episodes_test                           742.0
episode_length_test                 13.458221
returns_test                       -16.608819
return_std_test                      2.923179
average_reward_test                 -1.234087
round_time_test        0 days 00:00:09.418221
round_time_total       0 days 00:10:33.649640
loss_total             20601080682878922752.0
loss_critic            25751350398022885376.0
loss_actor                -17872396138.495998
memory_size                         17755.118 

=== epoch 4/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:49,  2.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  728
episode_length                      13.728022
returns                            -16.296519
return_std                           2.980412
average_reward                      -1.187878
round_time             0 days 00:10:35.113699
episodes_test                           726.0
episode_length_test                 13.758953
returns_test                       -16.714307
return_std_test                      2.890929
average_reward_test                  -1.21442
round_time_test        0 days 00:00:09.563935
round_time_total       0 days 00:10:35.114796
loss_total             20611586071317553152.0
loss_critic            25764482149892046848.0
loss_actor                -18128651593.728001
memory_size                           17756.0 

=== epoch 4/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  728
episode_length                      13.697802
returns                            -16.417614
return_std                           2.951576
average_reward                      -1.197337
round_time             0 days 00:10:33.447728
episodes_test                           715.0
episode_length_test                 13.967832
returns_test                        -16.73391
return_std_test                      2.630633
average_reward_test                  -1.19812
round_time_test        0 days 00:00:09.701838
round_time_total       0 days 00:10:33.448846
loss_total             21353625390990794752.0
loss_critic            26692031276737449984.0
loss_actor                -18351627144.192001
memory_size                        17761.0535 

=== epoch 4/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  728
episode_length                      13.725275
returns                            -16.493893
return_std                           2.922777
average_reward                      -1.201876
round_time             0 days 00:10:32.809168
episodes_test                           721.0
episode_length_test                 13.862691
returns_test                       -17.055712
return_std_test                      2.583638
average_reward_test                 -1.229418
round_time_test        0 days 00:00:09.768732
round_time_total       0 days 00:10:32.810679
loss_total             21805534251700809728.0
loss_critic            27256917342042169344.0
loss_actor                -18493803319.807999
memory_size                         17764.828 

=== epoch 4/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  727
episode_length                      13.730399
returns                            -16.751482
return_std                           2.898121
average_reward                      -1.220398
round_time             0 days 00:10:34.103131
episodes_test                           723.0
episode_length_test                 13.825726
returns_test                       -16.997562
return_std_test                      2.894528
average_reward_test                 -1.228796
round_time_test        0 days 00:00:09.617342
round_time_total       0 days 00:10:34.104238
loss_total             21973042115356332032.0
loss_critic            27466302182744129536.0
loss_actor                    -18697648491.52
memory_size                           17766.0 

=== epoch 4/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  727
episode_length                      13.729023
returns                            -16.763588
return_std                           2.881778
average_reward                      -1.220611
round_time             0 days 00:10:33.862297
episodes_test                           719.0
episode_length_test                 13.891516
returns_test                       -16.595487
return_std_test                      2.929743
average_reward_test                 -1.194192
round_time_test        0 days 00:00:09.578069
round_time_total       0 days 00:10:33.863751
loss_total             22698420995479498752.0
loss_critic            28373025762488401920.0
loss_actor                -18889166979.071999
memory_size                        17768.1675 

=== epoch 4/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  725
episode_length                      13.791724
returns                            -16.882174
return_std                           2.945885
average_reward                      -1.223871
round_time             0 days 00:10:32.740103
episodes_test                           734.0
episode_length_test                 13.617166
returns_test                        -16.47058
return_std_test                      2.987168
average_reward_test                 -1.209074
round_time_test        0 days 00:00:09.510925
round_time_total       0 days 00:10:32.741304
loss_total             23013793790055284736.0
loss_critic            28767241761274413056.0
loss_actor                -18996772339.712002
memory_size                         17769.054 

=== epoch 4/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  724
episode_length                      13.788674
returns                            -16.850403
return_std                            2.98791
average_reward                      -1.221475
round_time             0 days 00:10:33.471271
episodes_test                           728.0
episode_length_test                 13.722527
returns_test                       -16.705644
return_std_test                      2.997089
average_reward_test                 -1.217043
round_time_test        0 days 00:00:09.355568
round_time_total       0 days 00:10:33.472373
loss_total             22802707090071482368.0
loss_critic            28503383368496316416.0
loss_actor                -19184128776.192001
memory_size                           17770.0 

=== epoch 4/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  723
episode_length                       13.80083
returns                            -16.901519
return_std                             2.9981
average_reward                      -1.224874
round_time             0 days 00:10:34.426179
episodes_test                           729.0
episode_length_test                 13.710562
returns_test                       -16.719414
return_std_test                      3.075195
average_reward_test                 -1.218788
round_time_test        0 days 00:00:09.708774
round_time_total       0 days 00:10:34.427298
loss_total             23545810367564505088.0
loss_critic            29432262449213513728.0
loss_actor                -19383546082.304001
memory_size                           17770.0 

=== epoch 4/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                  729
episode_length                      13.695473
returns                            -16.680153
return_std                           3.071873
average_reward                      -1.218521
round_time             0 days 00:10:33.727060
episodes_test                           726.0
episode_length_test                 13.757576
returns_test                       -16.925621
return_std_test                      2.824524
average_reward_test                 -1.230327
round_time_test        0 days 00:00:09.478233
round_time_total       0 days 00:10:33.728168
loss_total             24286742800635723776.0
loss_critic            30358428014398197760.0
loss_actor                -19611061729.279999
memory_size                           17770.0 

=== epoch 4/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:20,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                  746
episode_length                      13.395442
returns                            -16.448083
return_std                           3.383435
average_reward                      -1.227717
round_time             0 days 00:10:35.173322
episodes_test                           762.0
episode_length_test                 13.107612
returns_test                       -16.355612
return_std_test                      3.416446
average_reward_test                 -1.247676
round_time_test        0 days 00:00:09.567069
round_time_total       0 days 00:10:35.174425
loss_total             24599962703065812992.0
loss_critic            30749952876973928448.0
loss_actor                -19823797004.287998
memory_size                           17770.0 

=== epoch 4/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  773
episode_length                      12.906856
returns                            -15.823951
return_std                           3.805793
average_reward                      -1.225798
round_time             0 days 00:10:32.921776
episodes_test                           833.0
episode_length_test                 11.995198
returns_test                       -14.539409
return_std_test                      4.162005
average_reward_test                 -1.211411
round_time_test        0 days 00:00:09.658136
round_time_total       0 days 00:10:32.922870
loss_total             25245194370832596992.0
loss_critic            31556492452405280768.0
loss_actor                -20014905256.959999
memory_size                           17770.0 

=== epoch 4/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  810
episode_length                      12.328395
returns                            -14.886928
return_std                           4.093167
average_reward                      -1.207511
round_time             0 days 00:10:35.697545
episodes_test                           895.0
episode_length_test                 11.165363
returns_test                       -13.245588
return_std_test                      4.111981
average_reward_test                 -1.185755
round_time_test        0 days 00:00:09.725121
round_time_total       0 days 00:10:35.698642
loss_total             25447234163798732800.0
loss_critic            31809042184198381568.0
loss_actor                -20262815280.127998
memory_size                           17770.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:01<10:38,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  191
episode_length                      10.418848
returns                            -11.667548
return_std                           3.324628
average_reward                      -1.116818
round_time             0 days 00:10:31.717885
episodes_test                           951.0
episode_length_test                 10.511041
returns_test                       -11.812305
return_std_test                      3.567351
average_reward_test                 -1.123428
round_time_test        0 days 00:00:09.778540
round_time_total       0 days 00:10:31.719005
loss_total             26195879949390123008.0
loss_critic            32744849376467759104.0
loss_actor                   -20514805526.528
memory_size                           17770.0 

=== epoch 5/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                  381
episode_length                      10.498688
returns                            -11.668866
return_std                           3.196048
average_reward                      -1.111459
round_time             0 days 00:10:30.437868
episodes_test                           950.0
episode_length_test                 10.521053
returns_test                       -11.904662
return_std_test                      3.465897
average_reward_test                   -1.1308
round_time_test        0 days 00:00:09.837013
round_time_total       0 days 00:10:30.439002
loss_total             26425223812629114880.0
loss_critic            33031529217473691648.0
loss_actor                -20679936622.591999
memory_size                           17770.0 

=== epoch 5/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  580
episode_length                      10.337931
returns                            -11.330638
return_std                           3.178224
average_reward                      -1.094991
round_time             0 days 00:10:33.564565
episodes_test                           970.0
episode_length_test                      10.3
returns_test                       -11.227255
return_std_test                      3.144682
average_reward_test                 -1.089481
round_time_test        0 days 00:00:09.742387
round_time_total       0 days 00:10:33.565791
loss_total             27158505139007320064.0
loss_critic            33948130849264320512.0
loss_actor                -20880562641.919998
memory_size                           17770.0 

=== epoch 5/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  780
episode_length                      10.248718
returns                            -11.190963
return_std                           3.165596
average_reward                      -1.091103
round_time             0 days 00:10:32.553755
episodes_test                           981.0
episode_length_test                 10.189602
returns_test                       -10.937067
return_std_test                      3.361589
average_reward_test                  -1.07288
round_time_test        0 days 00:00:09.745017
round_time_total       0 days 00:10:32.554855
loss_total             27408658527538880512.0
loss_critic            34260822572971585536.0
loss_actor                -21132907199.487999
memory_size                           17770.0 

=== epoch 5/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                  984
episode_length                      10.160569
returns                            -10.991952
return_std                           3.191572
average_reward                      -1.081442
round_time             0 days 00:10:32.306328
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                       -10.381523
return_std_test                      3.087645
average_reward_test                 -1.052295
round_time_test        0 days 00:00:09.673614
round_time_total       0 days 00:10:32.307444
loss_total             28497150137572651008.0
loss_critic            35621437041189736448.0
loss_actor                -21342067983.360001
memory_size                           17770.0 

=== epoch 5/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                  993
episode_length                      10.058409
returns                            -10.759343
return_std                           3.080713
average_reward                      -1.069889
round_time             0 days 00:10:32.623613
episodes_test                          1067.0
episode_length_test                  9.366448
returns_test                        -9.576828
return_std_test                      2.938664
average_reward_test                 -1.022184
round_time_test        0 days 00:00:09.849757
round_time_total       0 days 00:10:32.624770
loss_total             28520982492184223744.0
loss_critic            35651227472840613888.0
loss_actor                -21512545016.832001
memory_size                           17770.0 

=== epoch 5/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1022
episode_length                       9.779843
returns                            -10.216555
return_std                           3.055452
average_reward                      -1.043829
round_time             0 days 00:10:32.729022
episodes_test                          1033.0
episode_length_test                  9.680542
returns_test                       -10.052851
return_std_test                      2.975255
average_reward_test                 -1.038459
round_time_test        0 days 00:00:09.868309
round_time_total       0 days 00:10:32.730115
loss_total             29792337224098529280.0
loss_critic            37240420894674157568.0
loss_actor                -21660585425.919998
memory_size                           17770.0 

=== epoch 5/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1035
episode_length                       9.655072
returns                             -9.970269
return_std                           3.017215
average_reward                      -1.033351
round_time             0 days 00:10:32.257710
episodes_test                          1108.0
episode_length_test                  9.025271
returns_test                        -8.607413
return_std_test                      2.339682
average_reward_test                 -0.953701
round_time_test        0 days 00:00:10.078088
round_time_total       0 days 00:10:32.258842
loss_total             29530091442693693440.0
loss_critic            36912613668948910080.0
loss_actor                -21842262918.144001
memory_size                           17770.0 

=== epoch 5/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                 1048
episode_length                       9.531489
returns                             -9.676838
return_std                           2.915633
average_reward                      -1.015589
round_time             0 days 00:10:34.901762
episodes_test                          1049.0
episode_length_test                  9.531935
returns_test                        -9.915415
return_std_test                      3.041081
average_reward_test                 -1.040044
round_time_test        0 days 00:00:10.120896
round_time_total       0 days 00:10:34.902879
loss_total             30369380657091534848.0
loss_critic            37961725169147863040.0
loss_actor                -21885944394.751999
memory_size                           17770.0 

=== epoch 5/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1061
episode_length                       9.415646
returns                             -9.458601
return_std                           2.831015
average_reward                      -1.004875
round_time             0 days 00:10:32.310049
episodes_test                          1106.0
episode_length_test                  9.035262
returns_test                        -8.775279
return_std_test                      2.390284
average_reward_test                 -0.970456
round_time_test        0 days 00:00:09.945096
round_time_total       0 days 00:10:32.311129
loss_total             30538858907981316096.0
loss_critic            38173572931014328320.0
loss_actor                -22080712560.639999
memory_size                           17770.0 

=== epoch 5/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1076
episode_length                       9.287175
returns                             -9.228453
return_std                           2.718942
average_reward                      -0.993639
round_time             0 days 00:10:31.947252
episodes_test                          1078.0
episode_length_test                  9.272727
returns_test                        -9.099081
return_std_test                      2.721841
average_reward_test                 -0.980868
round_time_test        0 days 00:00:10.067152
round_time_total       0 days 00:10:31.948354
loss_total             31320185355364421632.0
loss_critic            39150231022884962304.0
loss_actor                -22232894218.240002
memory_size                           17770.0 

=== epoch 5/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:40,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1065
episode_length                       9.381221
returns                             -9.392572
return_std                           2.817189
average_reward                      -1.001712
round_time             0 days 00:10:32.384044
episodes_test                          1040.0
episode_length_test                    9.6125
returns_test                        -9.820623
return_std_test                      2.818568
average_reward_test                 -1.021249
round_time_test        0 days 00:00:09.929672
round_time_total       0 days 00:10:32.385157
loss_total             31747361759188049920.0
loss_critic            39684201525465464832.0
loss_actor                   -22425757707.264
memory_size                           17770.0 

=== epoch 5/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:28<00:00,  3.18it/s]
episodes                                 1071
episode_length                       9.333333
returns                             -9.264364
return_std                            2.81601
average_reward                      -0.992232
round_time             0 days 00:10:29.269942
episodes_test                          1052.0
episode_length_test                  9.499049
returns_test                        -9.534448
return_std_test                      2.917684
average_reward_test                 -1.003427
round_time_test        0 days 00:00:09.849561
round_time_total       0 days 00:10:29.271046
loss_total             32203157941059043328.0
loss_critic            40253946721330692096.0
loss_actor                -22715754099.712002
memory_size                           17770.0 

=== epoch 5/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1083
episode_length                       9.229917
returns                             -9.090254
return_std                           2.792905
average_reward                      -0.985515
round_time             0 days 00:10:33.674677
episodes_test                          1115.0
episode_length_test                  8.964126
returns_test                        -8.604517
return_std_test                      2.460356
average_reward_test                 -0.959408
round_time_test        0 days 00:00:10.077150
round_time_total       0 days 00:10:33.675773
loss_total             32528663202852106240.0
loss_critic            40660828335130779648.0
loss_actor                -22851393927.167999
memory_size                           17770.0 

=== epoch 5/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1088
episode_length                       9.180147
returns                             -9.012725
return_std                           2.729198
average_reward                      -0.981474
round_time             0 days 00:10:32.376377
episodes_test                          1080.0
episode_length_test                  9.256481
returns_test                        -9.207951
return_std_test                      2.688859
average_reward_test                 -0.994474
round_time_test        0 days 00:00:09.907260
round_time_total       0 days 00:10:32.377502
loss_total             32695725376170774528.0
loss_critic            40869656030750957568.0
loss_actor                -23077197594.624001
memory_size                           17770.0 

=== epoch 5/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1093
episode_length                       9.138152
returns                             -8.836272
return_std                           2.647667
average_reward                      -0.967159
round_time             0 days 00:10:33.882973
episodes_test                          1082.0
episode_length_test                  9.239372
returns_test                        -9.283945
return_std_test                      2.475317
average_reward_test                 -1.004362
round_time_test        0 days 00:00:10.025546
round_time_total       0 days 00:10:33.884076
loss_total             33293514707642593280.0
loss_critic            41616892655851913216.0
loss_actor                -23156418380.799999
memory_size                        17770.5745 

=== epoch 5/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1118
episode_length                       8.931127
returns                             -8.545338
return_std                           2.436022
average_reward                       -0.95625
round_time             0 days 00:10:32.970609
episodes_test                          1151.0
episode_length_test                  8.688097
returns_test                        -8.083849
return_std_test                      1.993928
average_reward_test                 -0.930451
round_time_test        0 days 00:00:10.074973
round_time_total       0 days 00:10:32.971881
loss_total             33206004982357934080.0
loss_critic            41507505490587435008.0
loss_actor                -23277053376.512001
memory_size                           17773.0 

=== epoch 5/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                 1128
episode_length                       8.859043
returns                             -8.384728
return_std                           2.308372
average_reward                      -0.946695
round_time             0 days 00:10:34.660713
episodes_test                          1114.0
episode_length_test                  8.969479
returns_test                         -8.46642
return_std_test                      2.307634
average_reward_test                 -0.944007
round_time_test        0 days 00:00:09.999662
round_time_total       0 days 00:10:34.661816
loss_total             34063194265297424384.0
loss_critic            42578992091169415168.0
loss_actor                -23494667908.096001
memory_size                           17773.0 

=== epoch 5/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1131
episode_length                       8.839965
returns                             -8.315355
return_std                           2.255911
average_reward                      -0.940334
round_time             0 days 00:10:33.207685
episodes_test                          1141.0
episode_length_test                  8.758983
returns_test                        -8.000572
return_std_test                      2.101202
average_reward_test                 -0.913069
round_time_test        0 days 00:00:09.963759
round_time_total       0 days 00:10:33.208898
loss_total             33904459435156496384.0
loss_critic            42380573564488376320.0
loss_actor                -23709499709.439999
memory_size                           17773.0 

=== epoch 5/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:24,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1129
episode_length                       8.846767
returns                             -8.286401
return_std                           2.196004
average_reward                      -0.936444
round_time             0 days 00:10:32.091991
episodes_test                          1090.0
episode_length_test                   9.17156
returns_test                        -8.870427
return_std_test                      2.554914
average_reward_test                 -0.966859
round_time_test        0 days 00:00:09.866996
round_time_total       0 days 00:10:32.093239
loss_total             34948326804476293120.0
loss_critic            43685407720956379136.0
loss_actor                     -23870281600.0
memory_size                           17773.0 

=== epoch 5/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                 1127
episode_length                       8.861579
returns                              -8.37554
return_std                           2.284541
average_reward                      -0.945081
round_time             0 days 00:10:35.751596
episodes_test                          1089.0
episode_length_test                  9.182736
returns_test                        -8.962455
return_std_test                      2.460935
average_reward_test                 -0.976011
round_time_test        0 days 00:00:09.963017
round_time_total       0 days 00:10:35.752763
loss_total             35715803273431326720.0
loss_critic            44644753352436310016.0
loss_actor                -24060747959.296001
memory_size                           17773.0 

=== epoch 5/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:46,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1113
episode_length                       8.981132
returns                             -8.517141
return_std                           2.393973
average_reward                      -0.948533
round_time             0 days 00:10:32.804617
episodes_test                          1086.0
episode_length_test                  9.208103
returns_test                        -9.058802
return_std_test                       2.60291
average_reward_test                 -0.983786
round_time_test        0 days 00:00:09.710778
round_time_total       0 days 00:10:32.805765
loss_total             35972957973501689856.0
loss_critic            44966196754943336448.0
loss_actor                -24326982171.647999
memory_size                           17773.0 

=== epoch 5/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:06,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                 1089
episode_length                       9.172635
returns                             -8.956624
return_std                           2.677946
average_reward                      -0.976184
round_time             0 days 00:10:31.187199
episodes_test                          1035.0
episode_length_test                  9.658937
returns_test                       -10.155929
return_std_test                      3.527128
average_reward_test                 -1.050956
round_time_test        0 days 00:00:09.935792
round_time_total       0 days 00:10:31.188305
loss_total             36959234816567844864.0
loss_critic            46199042754006605824.0
loss_actor                -24530937910.271999
memory_size                           17773.0 

=== epoch 5/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1068
episode_length                       9.352996
returns                             -9.372528
return_std                           2.945979
average_reward                      -1.001992
round_time             0 days 00:10:32.371934
episodes_test                           944.0
episode_length_test                 10.591102
returns_test                       -11.954245
return_std_test                      3.785782
average_reward_test                 -1.128434
round_time_test        0 days 00:00:09.833218
round_time_total       0 days 00:10:32.373237
loss_total             37204565974424010752.0
loss_critic            46505706661675671552.0
loss_actor                   -24677270480.896
memory_size                           17773.0 

=== epoch 5/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                 1070
episode_length                       9.337383
returns                             -9.321988
return_std                            2.96675
average_reward                      -0.998664
round_time             0 days 00:10:31.448308
episodes_test                          1112.0
episode_length_test                   8.98741
returns_test                        -8.631844
return_std_test                      2.521232
average_reward_test                 -0.960014
round_time_test        0 days 00:00:09.894973
round_time_total       0 days 00:10:31.449534
loss_total             37443560178652225536.0
loss_critic            46804449375866699776.0
loss_actor                -24822355793.919998
memory_size                        17773.0205 

=== epoch 5/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1071
episode_length                       9.334267
returns                              -9.26661
return_std                           3.015543
average_reward                      -0.993168
round_time             0 days 00:10:32.370600
episodes_test                          1122.0
episode_length_test                  8.912656
returns_test                        -8.275158
return_std_test                      2.442744
average_reward_test                 -0.928473
round_time_test        0 days 00:00:10.407629
round_time_total       0 days 00:10:32.371691
loss_total             37949731479983169536.0
loss_critic            47437163527063232512.0
loss_actor                   -24983218010.112
memory_size                           17774.0 

=== epoch 5/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.15it/s]
episodes                                 1070
episode_length                       9.335514
returns                             -9.323452
return_std                           3.121935
average_reward                      -0.998283
round_time             0 days 00:10:34.448316
episodes_test                          1110.0
episode_length_test                  9.008108
returns_test                        -8.701311
return_std_test                      2.728712
average_reward_test                 -0.965783
round_time_test        0 days 00:00:09.831984
round_time_total       0 days 00:10:34.449609
loss_total             38473902966436782080.0
loss_critic            48092377884099452928.0
loss_actor                -25114526503.936001
memory_size                           17774.0 

=== epoch 5/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:47,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                 1075
episode_length                       9.293953
returns                             -9.170599
return_std                           3.087166
average_reward                      -0.986248
round_time             0 days 00:10:31.097313
episodes_test                          1038.0
episode_length_test                  9.632948
returns_test                        -9.889593
return_std_test                      3.557705
average_reward_test                 -1.026456
round_time_test        0 days 00:00:09.922045
round_time_total       0 days 00:10:31.098560
loss_total             38904162053419016192.0
loss_critic            48630201765642108928.0
loss_actor                -25228509282.304001
memory_size                           17774.0 

=== epoch 5/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:19,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1090
episode_length                       9.163303
returns                             -8.866857
return_std                            2.97185
average_reward                      -0.968079
round_time             0 days 00:10:33.840151
episodes_test                          1102.0
episode_length_test                  9.068966
returns_test                        -8.734702
return_std_test                      2.806489
average_reward_test                 -0.962622
round_time_test        0 days 00:00:10.072032
round_time_total       0 days 00:10:33.841395
loss_total             39344239569230569472.0
loss_critic            49180298622748278784.0
loss_actor                -25407453966.335999
memory_size                           17774.0 

=== epoch 5/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1093
episode_length                       9.147301
returns                             -8.849317
return_std                           3.075256
average_reward                      -0.967756
round_time             0 days 00:10:34.238872
episodes_test                          1132.0
episode_length_test                  8.833922
returns_test                        -8.196409
return_std_test                      2.673129
average_reward_test                 -0.927834
round_time_test        0 days 00:00:09.855677
round_time_total       0 days 00:10:34.240124
loss_total             40396586501966151680.0
loss_critic            50495732290660614144.0
loss_actor                -25531466732.543999
memory_size                           17774.0 

=== epoch 5/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1097
episode_length                       9.111212
returns                             -8.762856
return_std                           3.128931
average_reward                      -0.961141
round_time             0 days 00:10:32.104918
episodes_test                          1145.0
episode_length_test                  8.727511
returns_test                        -7.966706
return_std_test                      2.576611
average_reward_test                  -0.91234
round_time_test        0 days 00:00:09.942392
round_time_total       0 days 00:10:32.106197
loss_total             40338026883481092096.0
loss_critic            50422532796072886272.0
loss_actor                -25789632891.903999
memory_size                           17774.0 

=== epoch 5/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:40,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1098
episode_length                       9.102004
returns                             -8.592558
return_std                           3.115986
average_reward                       -0.94461
round_time             0 days 00:10:31.843437
episodes_test                          1131.0
episode_length_test                  8.836428
returns_test                        -8.088322
return_std_test                      2.932481
average_reward_test                 -0.914781
round_time_test        0 days 00:00:10.246224
round_time_total       0 days 00:10:31.844532
loss_total             40704021224186011648.0
loss_critic            50880025636535721984.0
loss_actor                -25896118756.352001
memory_size                           17774.0 

=== epoch 5/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:31,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1111
episode_length                       8.990099
returns                             -8.421056
return_std                             3.1017
average_reward                      -0.937522
round_time             0 days 00:10:34.077586
episodes_test                          1114.0
episode_length_test                  8.968582
returns_test                         -8.43369
return_std_test                      2.737333
average_reward_test                 -0.940102
round_time_test        0 days 00:00:09.907253
round_time_total       0 days 00:10:34.078689
loss_total             41104017985894285312.0
loss_critic            51380021522837798912.0
loss_actor                -26069943104.512001
memory_size                           17774.0 

=== epoch 5/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1112
episode_length                       8.989209
returns                             -8.455364
return_std                           3.164594
average_reward                      -0.940506
round_time             0 days 00:10:33.540903
episodes_test                          1114.0
episode_length_test                  8.974865
returns_test                        -8.488525
return_std_test                       2.75403
average_reward_test                 -0.945516
round_time_test        0 days 00:00:10.169769
round_time_total       0 days 00:10:33.542007
loss_total             41489769631633498112.0
loss_critic            51862211178418102272.0
loss_actor                -26141434485.759998
memory_size                           17774.0 

=== epoch 5/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1103
episode_length                        9.06437
returns                             -8.632301
return_std                           3.391126
average_reward                      -0.952021
round_time             0 days 00:10:32.668680
episodes_test                          1068.0
episode_length_test                  9.358614
returns_test                        -9.810263
return_std_test                      4.062267
average_reward_test                  -1.04777
round_time_test        0 days 00:00:09.888796
round_time_total       0 days 00:10:32.669807
loss_total             41112648943539994624.0
loss_critic            51390810291019595776.0
loss_actor                    -26219660764.16
memory_size                           17774.0 

=== epoch 5/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1107
episode_length                        9.02981
returns                               -8.6815
return_std                           3.408626
average_reward                       -0.96188
round_time             0 days 00:10:33.450731
episodes_test                          1154.0
episode_length_test                  8.664645
returns_test                         -8.10227
return_std_test                      2.745135
average_reward_test                  -0.93492
round_time_test        0 days 00:00:10.048176
round_time_total       0 days 00:10:33.451943
loss_total             42367562876546007040.0
loss_critic            52959452692914749440.0
loss_actor                    -26354253880.32
memory_size                           17774.0 

=== epoch 5/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1126
episode_length                       8.873002
returns                             -8.484984
return_std                           3.230062
average_reward                      -0.955769
round_time             0 days 00:10:32.860830
episodes_test                          1106.0
episode_length_test                  9.035262
returns_test                        -8.612157
return_std_test                      3.095017
average_reward_test                 -0.952722
round_time_test        0 days 00:00:10.037449
round_time_total       0 days 00:10:32.862028
loss_total             42055132996310220800.0
loss_critic            52568915347499098112.0
loss_actor                   -26598796849.152
memory_size                           17774.0 

=== epoch 5/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1145
episode_length                       8.727511
returns                             -8.202702
return_std                           2.993114
average_reward                      -0.939968
round_time             0 days 00:10:33.027333
episodes_test                          1172.0
episode_length_test                  8.528157
returns_test                        -7.518584
return_std_test                      2.151148
average_reward_test                 -0.881144
round_time_test        0 days 00:00:10.189210
round_time_total       0 days 00:10:33.028438
loss_total             43239701891433906176.0
loss_critic            54049626474650034176.0
loss_actor                    -26828010040.32
memory_size                           17774.0 

=== epoch 5/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1161
episode_length                       8.602929
returns                             -7.935456
return_std                            2.68036
average_reward                       -0.92211
round_time             0 days 00:10:33.217243
episodes_test                          1200.0
episode_length_test                    8.3325
returns_test                        -7.221051
return_std_test                      1.788766
average_reward_test                 -0.866429
round_time_test        0 days 00:00:09.941461
round_time_total       0 days 00:10:33.218573
loss_total             43587551192406507520.0
loss_critic            54484438025755402240.0
loss_actor                    -26987279523.84
memory_size                           17774.0 

=== epoch 5/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1180
episode_length                       8.463559
returns                             -7.685398
return_std                           2.174086
average_reward                      -0.907919
round_time             0 days 00:10:32.444148
episodes_test                          1174.0
episode_length_test                  8.516184
returns_test                        -7.729069
return_std_test                      2.014361
average_reward_test                 -0.907188
round_time_test        0 days 00:00:10.153491
round_time_total       0 days 00:10:32.445313
loss_total             44499438122460405760.0
loss_critic            55624296738487992320.0
loss_actor                    -27148304711.68
memory_size                           17774.0 

=== epoch 5/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1184
episode_length                       8.441723
returns                              -7.62142
return_std                            1.97758
average_reward                      -0.902238
round_time             0 days 00:10:33.525499
episodes_test                          1138.0
episode_length_test                  8.785589
returns_test                        -8.363079
return_std_test                      2.664154
average_reward_test                 -0.951639
round_time_test        0 days 00:00:09.975467
round_time_total       0 days 00:10:33.526596
loss_total             45078917419835777024.0
loss_critic            56348645858626658304.0
loss_actor                   -27337905782.784
memory_size                           17774.0 

=== epoch 5/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1176
episode_length                       8.494048
returns                             -7.794309
return_std                           2.013235
average_reward                      -0.917385
round_time             0 days 00:10:33.893395
episodes_test                          1177.0
episode_length_test                  8.492778
returns_test                        -7.596935
return_std_test                      1.887878
average_reward_test                 -0.893884
round_time_test        0 days 00:00:10.024927
round_time_total       0 days 00:10:33.894509
loss_total             45368922458882007040.0
loss_critic            56711152078681931776.0
loss_actor                -27499866912.768002
memory_size                           17774.0 

=== epoch 5/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                 1171
episode_length                       8.533732
returns                              -7.94046
return_std                           2.080341
average_reward                      -0.930256
round_time             0 days 00:10:32.351860
episodes_test                          1102.0
episode_length_test                  9.071688
returns_test                        -8.965252
return_std_test                      2.865636
average_reward_test                 -0.987821
round_time_test        0 days 00:00:09.872515
round_time_total       0 days 00:10:32.352944
loss_total             45143562341774729216.0
loss_critic            56429451958273785856.0
loss_actor                -27454462738.431999
memory_size                        17774.3725 

=== epoch 5/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                 1169
episode_length                       8.549187
returns                             -8.091424
return_std                           2.130924
average_reward                      -0.946976
round_time             0 days 00:10:34.633305
episodes_test                          1235.0
episode_length_test                  8.093117
returns_test                        -7.393135
return_std_test                      1.506408
average_reward_test                 -0.913435
round_time_test        0 days 00:00:10.006819
round_time_total       0 days 00:10:34.634385
loss_total             44277715845951373312.0
loss_critic            55347143857323728896.0
loss_actor                -27033546699.776001
memory_size                           17775.0 

=== epoch 5/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                 1164
episode_length                        8.58677
returns                             -8.221884
return_std                           2.190008
average_reward                      -0.957335
round_time             0 days 00:10:35.882799
episodes_test                          1175.0
episode_length_test                  8.504681
returns_test                          -8.2459
return_std_test                      2.131305
average_reward_test                 -0.969005
round_time_test        0 days 00:00:10.005156
round_time_total       0 days 00:10:35.883921
loss_total             45033474550383722496.0
loss_critic            56291842267138670592.0
loss_actor                -27002342103.040001
memory_size                           17775.0 

=== epoch 5/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                 1162
episode_length                       8.600688
returns                             -8.157125
return_std                           2.154927
average_reward                      -0.948878
round_time             0 days 00:10:33.003262
episodes_test                          1124.0
episode_length_test                  8.890569
returns_test                        -8.300431
return_std_test                      2.470129
average_reward_test                 -0.933545
round_time_test        0 days 00:00:09.796195
round_time_total       0 days 00:10:33.004542
loss_total             56201633592101789696.0
loss_critic            70252040807327596544.0
loss_actor                -30035751904.256001
memory_size                           17775.0 

=== epoch 5/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                 1146
episode_length                        8.72164
returns                             -8.172261
return_std                           2.235569
average_reward                      -0.937009
round_time             0 days 00:10:35.181255
episodes_test                          1120.0
episode_length_test                  8.925893
returns_test                        -8.144474
return_std_test                       2.27164
average_reward_test                 -0.912012
round_time_test        0 days 00:00:09.913232
round_time_total       0 days 00:10:35.182346
loss_total             64732063906182955008.0
loss_critic            80915078412819087360.0
loss_actor                -32464622732.287998
memory_size                           17775.0 

=== epoch 5/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                 1127
episode_length                       8.866016
returns                             -8.115424
return_std                           2.330823
average_reward                      -0.915501
round_time             0 days 00:10:33.867642
episodes_test                          1091.0
episode_length_test                  9.160403
returns_test                        -8.067568
return_std_test                      2.323749
average_reward_test                 -0.880167
round_time_test        0 days 00:00:09.847097
round_time_total       0 days 00:10:33.868888
loss_total             70349975650517303296.0
loss_critic            87937468032626442240.0
loss_actor                   -34225319273.472
memory_size                           17775.0 

=== epoch 5/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:34<00:00,  3.15it/s]
episodes                                 1098
episode_length                       9.094718
returns                             -8.062066
return_std                           2.481245
average_reward                      -0.886241
round_time             0 days 00:10:34.811721
episodes_test                          1073.0
episode_length_test                  9.316869
returns_test                        -7.500973
return_std_test                      2.456032
average_reward_test                 -0.804753
round_time_test        0 days 00:00:09.775631
round_time_total       0 days 00:10:34.812834
loss_total             73986478493857398784.0
loss_critic            92483096484272095232.0
loss_actor                -35554154611.711998
memory_size                           17775.0 

=== epoch 5/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  1056
episode_length                        9.464962
returns                              -8.368512
return_std                             2.77275
average_reward                        -0.88467
round_time              0 days 00:10:33.947818
episodes_test                           1009.0
episode_length_test                   9.909812
returns_test                         -8.779042
return_std_test                       3.015422
average_reward_test                  -0.885714
round_time_test         0 days 00:00:09.586002
round_time_total        0 days 00:10:33.949021
loss_total              80263551521738915840.0
loss_critic            100329437745072193536.0
loss_actor                 -36359052760.064003
memory_size                            17775.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:01,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                   183
episode_length                       10.901639
returns                             -11.244214
return_std                             2.91087
average_reward                       -1.029164
round_time              0 days 00:10:33.570022
episodes_test                            955.0
episode_length_test                  10.471204
returns_test                         -9.756371
return_std_test                       3.228159
average_reward_test                  -0.931733
round_time_test         0 days 00:00:09.568114
round_time_total        0 days 00:10:33.571157
loss_total              82033113572361863168.0
loss_critic            102541390228223950848.0
loss_actor                 -36941935490.047997
memory_size                            17775.0 

=== epoch 6/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:35<00:00,  3.15it/s]
episodes                                   369
episode_length                       10.826558
returns                             -11.364602
return_std                            2.803258
average_reward                       -1.047477
round_time              0 days 00:10:35.504084
episodes_test                            922.0
episode_length_test                  10.835141
returns_test                        -11.497861
return_std_test                       2.822696
average_reward_test                  -1.060962
round_time_test         0 days 00:00:09.578094
round_time_total        0 days 00:10:35.505327
loss_total              86594257147924660224.0
loss_critic            108242819636104806400.0
loss_actor                 -37373753290.751999
memory_size                            17775.0 

=== epoch 6/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:15,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                   572
episode_length                       10.487762
returns                             -10.731346
return_std                            2.691914
average_reward                       -1.022897
round_time              0 days 00:10:31.323152
episodes_test                            959.0
episode_length_test                  10.423358
returns_test                        -10.639437
return_std_test                       2.279738
average_reward_test                  -1.020168
round_time_test         0 days 00:00:09.620810
round_time_total        0 days 00:10:31.324264
loss_total              87231379820404703232.0
loss_critic            109039222882421719040.0
loss_actor                 -37923994357.760002
memory_size                            17775.0 

=== epoch 6/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                   770
episode_length                       10.388312
returns                             -10.534622
return_std                            2.736757
average_reward                       -1.013838
round_time              0 days 00:10:30.918258
episodes_test                           1055.0
episode_length_test                   9.477725
returns_test                         -8.981582
return_std_test                       1.904959
average_reward_test                  -0.947465
round_time_test         0 days 00:00:09.791338
round_time_total        0 days 00:10:30.919424
loss_total              92560233983658557440.0
loss_critic            115700290413865730048.0
loss_actor                 -38327327088.639999
memory_size                            17775.0 

=== epoch 6/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                   952
episode_length                       10.496849
returns                             -10.512428
return_std                            2.688114
average_reward                       -1.000953
round_time              0 days 00:10:32.188913
episodes_test                            934.0
episode_length_test                  10.694861
returns_test                        -10.697843
return_std_test                       2.249894
average_reward_test                  -1.000059
round_time_test         0 days 00:00:09.430821
round_time_total        0 days 00:10:32.190017
loss_total              94527147755132764160.0
loss_critic            118158932706823569408.0
loss_actor                 -38837606522.879997
memory_size                            17775.0 

=== epoch 6/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   960
episode_length                       10.410417
returns                             -10.448633
return_std                            2.572946
average_reward                       -1.003956
round_time              0 days 00:10:33.096468
episodes_test                            931.0
episode_length_test                  10.736842
returns_test                        -11.026405
return_std_test                       2.727845
average_reward_test                  -1.026527
round_time_test         0 days 00:00:09.622962
round_time_total        0 days 00:10:33.097738
loss_total              98754880130598551552.0
loss_critic            123443598034318786560.0
loss_actor                 -39022424942.592003
memory_size                         17775.4015 

=== epoch 6/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   966
episode_length                       10.334369
returns                             -10.235318
return_std                            2.399619
average_reward                       -0.991244
round_time              0 days 00:10:33.490311
episodes_test                            985.0
episode_length_test                  10.145178
returns_test                        -10.498999
return_std_test                       2.150604
average_reward_test                  -1.034393
round_time_test         0 days 00:00:09.551097
round_time_total        0 days 00:10:33.491520
loss_total              99916058801503731712.0
loss_critic            124895071364978819072.0
loss_actor                 -39429225037.823997
memory_size                          17779.175 

=== epoch 6/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                   934
episode_length                       10.690578
returns                             -10.742091
return_std                             2.51838
average_reward                        -1.00452
round_time              0 days 00:10:34.096953
episodes_test                            880.0
episode_length_test                  11.360227
returns_test                         -11.80751
return_std_test                       2.061281
average_reward_test                  -1.038922
round_time_test         0 days 00:00:09.508904
round_time_total        0 days 00:10:34.098274
loss_total             102989807283545980928.0
loss_critic            128737256852357775360.0
loss_actor                 -39952293691.391998
memory_size                            17781.0 

=== epoch 6/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:17,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:30<00:00,  3.17it/s]
episodes                                   911
episode_length                       10.967069
returns                             -10.986568
return_std                            2.317585
average_reward                       -1.002109
round_time              0 days 00:10:30.932150
episodes_test                            810.0
episode_length_test                  12.337037
returns_test                        -11.861607
return_std_test                       1.770958
average_reward_test                  -0.960801
round_time_test         0 days 00:00:09.323972
round_time_total        0 days 00:10:30.933270
loss_total             104762745191657701376.0
loss_critic            130953429214957453312.0
loss_actor                 -40550919544.832001
memory_size                            17781.0 

=== epoch 6/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   896
episode_length                       11.149554
returns                             -11.197267
return_std                            2.206321
average_reward                       -1.004334
round_time              0 days 00:10:33.536268
episodes_test                            866.0
episode_length_test                  11.542725
returns_test                        -11.166621
return_std_test                       2.063104
average_reward_test                  -0.966796
round_time_test         0 days 00:00:09.589955
round_time_total        0 days 00:10:33.537404
loss_total             107711799323332739072.0
loss_critic            134639746875153186816.0
loss_actor                 -41043299137.536003
memory_size                            17781.0 

=== epoch 6/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:29<00:00,  3.18it/s]
episodes                                   865
episode_length                       11.543353
returns                             -11.358662
return_std                            2.078052
average_reward                       -0.983782
round_time              0 days 00:10:30.455225
episodes_test                            838.0
episode_length_test                  11.923628
returns_test                        -11.214445
return_std_test                       1.927936
average_reward_test                   -0.94047
round_time_test         0 days 00:00:09.629983
round_time_total        0 days 00:10:30.456326
loss_total             111780670259893649408.0
loss_critic            139725835396595630080.0
loss_actor                 -41485296308.223999
memory_size                            17781.0 

=== epoch 6/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:31<00:00,  3.17it/s]
episodes                                   836
episode_length                       11.955742
returns                             -11.635647
return_std                            1.943516
average_reward                       -0.972929
round_time              0 days 00:10:31.557073
episodes_test                            838.0
episode_length_test                  11.930788
returns_test                        -11.226635
return_std_test                       1.708095
average_reward_test                  -0.940612
round_time_test         0 days 00:00:09.508901
round_time_total        0 days 00:10:31.558309
loss_total             114252545469960880128.0
loss_critic            142815679370971643904.0
loss_actor                 -41961831942.143997
memory_size                            17781.0 

=== epoch 6/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:33<00:00,  3.16it/s]
episodes                                   829
episode_length                       12.041013
returns                             -11.518089
return_std                            1.822119
average_reward                         -0.9566
round_time              0 days 00:10:33.765822
episodes_test                            818.0
episode_length_test                  12.218826
returns_test                        -11.652312
return_std_test                       1.729459
average_reward_test                  -0.953333
round_time_test         0 days 00:00:09.530575
round_time_total        0 days 00:10:33.766934
loss_total             117781869473435303936.0
loss_critic            147227334392357093376.0
loss_actor                 -42248692701.183998
memory_size                            17781.0 

=== epoch 6/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:53,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   825
episode_length                       12.118788
returns                             -11.515308
return_std                            1.779937
average_reward                        -0.94995
round_time              0 days 00:10:33.159471
episodes_test                            826.0
episode_length_test                  12.102906
returns_test                        -11.604199
return_std_test                       1.735333
average_reward_test                  -0.958268
round_time_test         0 days 00:00:09.326404
round_time_total        0 days 00:10:33.160704
loss_total             118823182356461256704.0
loss_critic            148528975415325425664.0
loss_actor                 -42556200548.351997
memory_size                            17781.0 

=== epoch 6/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:32<00:00,  3.16it/s]
episodes                                   830
episode_length                       12.033735
returns                             -11.357368
return_std                             1.73406
average_reward                       -0.943953
round_time              0 days 00:10:32.985712
episodes_test                            858.0
episode_length_test                  11.642191
returns_test                        -10.937614
return_std_test                       1.720387
average_reward_test                  -0.939189
round_time_test         0 days 00:00:09.385525
round_time_total        0 days 00:10:32.986999
loss_total             120909073043127009280.0
loss_critic            151136338780254699520.0
loss_actor                 -42800383901.695999
memory_size                            17781.0 

=== epoch 6/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:59,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:24<00:00,  3.20it/s]
episodes                                   841
episode_length                       11.881094
returns                             -11.234217
return_std                            1.676024
average_reward                       -0.946224
round_time              0 days 00:10:25.211569
episodes_test                            876.0
episode_length_test                  11.407534
returns_test                        -10.730307
return_std_test                       1.730096
average_reward_test                  -0.939817
round_time_test         0 days 00:00:09.353037
round_time_total        0 days 00:10:25.212662
loss_total             125967347675218919424.0
loss_critic            157459181940077461504.0
loss_actor                    -42996964046.848
memory_size                            17781.0 

=== epoch 6/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:06,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:56<00:00,  3.35it/s]
episodes                                   847
episode_length                        11.79693
returns                             -11.121401
return_std                            1.646225
average_reward                        -0.94315
round_time              0 days 00:09:57.007637
episodes_test                            850.0
episode_length_test                  11.763529
returns_test                        -11.080167
return_std_test                       1.717967
average_reward_test                  -0.941742
round_time_test         0 days 00:00:08.972958
round_time_total        0 days 00:09:57.008727
loss_total             127432662309160271872.0
loss_critic            159290825065378381824.0
loss_actor                 -43291985160.192001
memory_size                            17781.0 

=== epoch 6/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:54<00:00,  3.36it/s]
episodes                                   849
episode_length                       11.766784
returns                             -11.031328
return_std                            1.614248
average_reward                       -0.937416
round_time              0 days 00:09:55.371498
episodes_test                            830.0
episode_length_test                  12.036145
returns_test                        -11.159667
return_std_test                       1.637148
average_reward_test                  -0.927253
round_time_test         0 days 00:00:08.953738
round_time_total        0 days 00:09:55.372704
loss_total             129543529410703130624.0
loss_critic            161929408946430115840.0
loss_actor                 -43566954278.912003
memory_size                            17781.0 

=== epoch 6/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:22,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   840
episode_length                       11.886905
returns                             -11.076075
return_std                            1.619202
average_reward                       -0.931351
round_time              0 days 00:09:55.693763
episodes_test                            804.0
episode_length_test                  12.436567
returns_test                        -11.387222
return_std_test                       1.664053
average_reward_test                  -0.915427
round_time_test         0 days 00:00:09.062902
round_time_total        0 days 00:09:55.694883
loss_total             131741758264706007040.0
loss_critic            164677194983147405312.0
loss_actor                    -43817995892.736
memory_size                            17781.0 

=== epoch 6/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   825
episode_length                       12.113939
returns                             -11.218402
return_std                            1.586838
average_reward                       -0.926589
round_time              0 days 00:09:56.104045
episodes_test                            802.0
episode_length_test                   12.46384
returns_test                        -11.488452
return_std_test                       1.665864
average_reward_test                  -0.921288
round_time_test         0 days 00:00:08.941396
round_time_total        0 days 00:09:56.105123
loss_total             135383978934647914496.0
loss_critic            169229970727666024448.0
loss_actor                 -44092963491.839996
memory_size                            17781.0 

=== epoch 6/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:56<00:00,  3.35it/s]
episodes                                   814
episode_length                       12.281327
returns                             -11.251206
return_std                            1.593583
average_reward                       -0.915606
round_time              0 days 00:09:57.149555
episodes_test                            795.0
episode_length_test                  12.578616
returns_test                        -11.574442
return_std_test                       1.636121
average_reward_test                  -0.920168
round_time_test         0 days 00:00:08.878792
round_time_total        0 days 00:09:57.150638
loss_total             136234322530374631424.0
loss_critic            170292900347943649280.0
loss_actor                 -44323704043.519997
memory_size                            17781.0 

=== epoch 6/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   807
episode_length                       12.374226
returns                             -11.361677
return_std                            1.624786
average_reward                       -0.917736
round_time              0 days 00:09:56.134899
episodes_test                            829.0
episode_length_test                  12.062726
returns_test                        -11.346651
return_std_test                       1.820262
average_reward_test                  -0.940637
round_time_test         0 days 00:00:09.020636
round_time_total        0 days 00:09:56.135992
loss_total             140251443332593647616.0
loss_critic            175314301188539449344.0
loss_actor                 -44590784057.344002
memory_size                            17781.0 

=== epoch 6/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:20,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   801
episode_length                       12.476904
returns                              -11.46639
return_std                            1.639513
average_reward                       -0.918673
round_time              0 days 00:09:55.901954
episodes_test                            815.0
episode_length_test                  12.257669
returns_test                        -11.519561
return_std_test                       1.804137
average_reward_test                  -0.939506
round_time_test         0 days 00:00:09.026879
round_time_total        0 days 00:09:55.903046
loss_total             142386302002154930176.0
loss_critic            177982874436155736064.0
loss_actor                 -44901615218.688004
memory_size                            17781.0 

=== epoch 6/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   796
episode_length                       12.547739
returns                             -11.512372
return_std                            1.633299
average_reward                       -0.917354
round_time              0 days 00:09:55.742686
episodes_test                            793.0
episode_length_test                  12.605296
returns_test                        -11.532328
return_std_test                       1.727617
average_reward_test                  -0.914251
round_time_test         0 days 00:00:09.165649
round_time_total        0 days 00:09:55.743772
loss_total             144522846015233916928.0
loss_critic            180653554345851846656.0
loss_actor                 -45242145497.087997
memory_size                            17781.0 

=== epoch 6/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   793
episode_length                       12.593947
returns                              -11.60022
return_std                            1.647923
average_reward                       -0.921197
round_time              0 days 00:09:56.190294
episodes_test                            762.0
episode_length_test                   13.11811
returns_test                        -11.646595
return_std_test                        1.49462
average_reward_test                  -0.887289
round_time_test         0 days 00:00:09.019169
round_time_total        0 days 00:09:56.191414
loss_total             146958713733518999552.0
loss_critic            183698388962921873408.0
loss_actor                 -45565748439.040001
memory_size                            17781.0 

=== epoch 6/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:54<00:00,  3.36it/s]
episodes                                   783
episode_length                       12.753512
returns                              -11.70654
return_std                            1.637118
average_reward                       -0.917863
round_time              0 days 00:09:55.470963
episodes_test                            767.0
episode_length_test                  13.032595
returns_test                        -11.708925
return_std_test                       1.470826
average_reward_test                  -0.897743
round_time_test         0 days 00:00:08.874854
round_time_total        0 days 00:09:55.472047
loss_total             149326558480499933184.0
loss_critic            186658194822980730880.0
loss_actor                 -45942509580.288002
memory_size                            17781.0 

=== epoch 6/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   772
episode_length                       12.927461
returns                             -11.742115
return_std                            1.548215
average_reward                       -0.908786
round_time              0 days 00:09:56.141996
episodes_test                            785.0
episode_length_test                  12.735032
returns_test                         -11.51253
return_std_test                       1.482065
average_reward_test                  -0.903729
round_time_test         0 days 00:00:09.008353
round_time_total        0 days 00:09:56.143122
loss_total             153508880325349638144.0
loss_critic            191886096987480752128.0
loss_actor                 -46343121534.975998
memory_size                            17781.0 

=== epoch 6/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:54<00:00,  3.37it/s]
episodes                                   767
episode_length                        13.01043
returns                                -11.743
return_std                            1.531386
average_reward                       -0.903043
round_time              0 days 00:09:54.843797
episodes_test                            772.0
episode_length_test                  12.953368
returns_test                        -11.877502
return_std_test                       1.481422
average_reward_test                  -0.916943
round_time_test         0 days 00:00:08.969812
round_time_total        0 days 00:09:54.844880
loss_total             157052099405920567296.0
loss_critic            196315120814829797376.0
loss_actor                 -46658276243.456001
memory_size                            17781.0 

=== epoch 6/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   766
episode_length                       13.037859
returns                             -11.752436
return_std                            1.502976
average_reward                       -0.901208
round_time              0 days 00:09:56.532085
episodes_test                            764.0
episode_length_test                  13.089005
returns_test                         -11.72019
return_std_test                       1.530296
average_reward_test                  -0.895423
round_time_test         0 days 00:00:09.019682
round_time_total        0 days 00:09:56.533173
loss_total             156756874597599412224.0
loss_critic            195946089786561298432.0
loss_actor                 -46862909431.807999
memory_size                            17781.0 

=== epoch 6/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:56<00:00,  3.35it/s]
episodes                                   766
episode_length                       13.045692
returns                             -11.730528
return_std                            1.565762
average_reward                       -0.898595
round_time              0 days 00:09:56.705708
episodes_test                            765.0
episode_length_test                  13.069281
returns_test                        -11.670221
return_std_test                       1.579642
average_reward_test                  -0.892589
round_time_test         0 days 00:00:08.911099
round_time_total        0 days 00:09:56.706803
loss_total             159108436308548222976.0
loss_critic            198885541746851512320.0
loss_actor                 -47039703425.024002
memory_size                            17781.0 

=== epoch 6/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   777
episode_length                        12.85843
returns                             -11.744385
return_std                            1.674282
average_reward                       -0.913837
round_time              0 days 00:09:55.721327
episodes_test                            861.0
episode_length_test                  11.606272
returns_test                        -11.741701
return_std_test                       2.035055
average_reward_test                  -1.011506
round_time_test         0 days 00:00:09.069788
round_time_total        0 days 00:09:55.722407
loss_total             161347837965346996224.0
loss_critic            201684794007515758592.0
loss_actor                 -47373491654.655998
memory_size                            17781.0 

=== epoch 6/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:45<00:00,  3.42it/s]
episodes                                   784
episode_length                       12.746173
returns                             -11.801179
return_std                            1.731525
average_reward                       -0.925824
round_time              0 days 00:09:45.563551
episodes_test                            815.0
episode_length_test                  12.260123
returns_test                        -12.153797
return_std_test                        1.90394
average_reward_test                  -0.991039
round_time_test         0 days 00:00:09.070519
round_time_total        0 days 00:09:45.564639
loss_total             165148542562373894144.0
loss_critic            206435674639999959040.0
loss_actor                 -47858735040.512001
memory_size                            17781.0 

=== epoch 6/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                   782
episode_length                       12.774936
returns                             -11.869678
return_std                            1.702461
average_reward                       -0.928974
round_time              0 days 00:09:17.674852
episodes_test                            783.0
episode_length_test                  12.765006
returns_test                        -11.997444
return_std_test                       1.611203
average_reward_test                  -0.939324
round_time_test         0 days 00:00:08.926422
round_time_total        0 days 00:09:17.675936
loss_total             170748867377978671104.0
loss_critic            213436080533062090752.0
loss_actor                 -48272972861.440002
memory_size                            17781.0 

=== epoch 6/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:59,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   783
episode_length                       12.758621
returns                             -11.959357
return_std                            1.726164
average_reward                       -0.937478
round_time              0 days 00:09:17.397542
episodes_test                            763.0
episode_length_test                  13.104849
returns_test                        -12.151618
return_std_test                       1.674123
average_reward_test                  -0.927078
round_time_test         0 days 00:00:08.898819
round_time_total        0 days 00:09:17.398616
loss_total             173427895276932104192.0
loss_critic            216784865213239820288.0
loss_actor                    -48439084337.152
memory_size                            17781.0 

=== epoch 6/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:50,  3.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   783
episode_length                       12.759898
returns                             -12.000893
return_std                            1.694219
average_reward                       -0.940288
round_time              0 days 00:09:16.108826
episodes_test                            761.0
episode_length_test                  13.127464
returns_test                        -12.186406
return_std_test                       1.802901
average_reward_test                  -0.928344
round_time_test         0 days 00:00:08.886187
round_time_total        0 days 00:09:16.109911
loss_total             174580516197050155008.0
loss_critic            218225641448049770496.0
loss_actor                 -48935935746.047997
memory_size                            17781.0 

=== epoch 6/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:46,  3.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   776
episode_length                       12.871134
returns                             -12.079373
return_std                            1.618981
average_reward                       -0.938505
round_time              0 days 00:09:16.482855
episodes_test                            765.0
episode_length_test                  13.060131
returns_test                        -12.291728
return_std_test                       1.651197
average_reward_test                  -0.940951
round_time_test         0 days 00:00:08.748015
round_time_total        0 days 00:09:16.483936
loss_total             178341986837100265472.0
loss_critic            222927479778348990464.0
loss_actor                 -49518492975.103996
memory_size                            17781.0 

=== epoch 6/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:26,  3.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   780
episode_length                       12.801282
returns                             -12.015335
return_std                            1.701314
average_reward                       -0.938749
round_time              0 days 00:09:16.152866
episodes_test                            777.0
episode_length_test                  12.861004
returns_test                        -12.363219
return_std_test                       1.664329
average_reward_test                  -0.960842
round_time_test         0 days 00:00:08.947980
round_time_total        0 days 00:09:16.153932
loss_total             180048003236069212160.0
loss_critic            225059999924116946944.0
loss_actor                 -50245466689.536003
memory_size                          17781.126 

=== epoch 6/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   793
episode_length                       12.604035
returns                             -11.978472
return_std                            1.793645
average_reward                       -0.950446
round_time              0 days 00:09:16.878224
episodes_test                            835.0
episode_length_test                  11.968862
returns_test                         -11.74574
return_std_test                       2.111382
average_reward_test                  -0.980857
round_time_test         0 days 00:00:08.864927
round_time_total        0 days 00:09:16.879305
loss_total             187819009965654966272.0
loss_critic            234773758355890339840.0
loss_actor                 -51056784250.879997
memory_size                          17782.898 

=== epoch 6/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:41,  3.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   802
episode_length                       12.445137
returns                             -11.905786
return_std                            1.807957
average_reward                       -0.956607
round_time              0 days 00:09:16.337349
episodes_test                            845.0
episode_length_test                  11.827219
returns_test                        -11.474019
return_std_test                       1.953468
average_reward_test                  -0.969792
round_time_test         0 days 00:00:08.947977
round_time_total        0 days 00:09:16.338453
loss_total             186965136204228820992.0
loss_critic            233706416238220279808.0
loss_actor                 -51876313309.183998
memory_size                            17783.0 

=== epoch 6/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   820
episode_length                       12.173171
returns                             -11.961858
return_std                            1.999374
average_reward                       -0.983073
round_time              0 days 00:09:16.976736
episodes_test                            871.0
episode_length_test                  11.475316
returns_test                        -11.720571
return_std_test                       2.180546
average_reward_test                  -1.020811
round_time_test         0 days 00:00:08.999999
round_time_total        0 days 00:09:16.977797
loss_total             196950915226685931520.0
loss_critic            246188639909089312768.0
loss_actor                 -52541353154.559998
memory_size                            17783.0 

=== epoch 6/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   836
episode_length                       11.953349
returns                             -11.876181
return_std                            2.134252
average_reward                         -0.9929
round_time              0 days 00:09:15.884977
episodes_test                            837.0
episode_length_test                  11.942652
returns_test                          -12.0967
return_std_test                        2.11272
average_reward_test                   -1.01226
round_time_test         0 days 00:00:08.925052
round_time_total        0 days 00:09:15.886065
loss_total             197026798276763582464.0
loss_critic            246283493609536192512.0
loss_actor                 -53436133795.839996
memory_size                            17783.0 

=== epoch 6/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:52,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   844
episode_length                       11.831754
returns                             -11.845053
return_std                            2.169804
average_reward                       -1.000936
round_time              0 days 00:09:16.168958
episodes_test                            836.0
episode_length_test                  11.948565
returns_test                        -12.101429
return_std_test                       2.134714
average_reward_test                  -1.012444
round_time_test         0 days 00:00:08.958268
round_time_total        0 days 00:09:16.170029
loss_total             206288734226010210304.0
loss_critic            257860913305851166720.0
loss_actor                    -54096586940.416
memory_size                            17783.0 

=== epoch 6/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   853
episode_length                       11.713951
returns                             -11.786645
return_std                            2.306019
average_reward                       -1.006355
round_time              0 days 00:09:15.955274
episodes_test                            907.0
episode_length_test                  11.016538
returns_test                        -10.948687
return_std_test                       2.390994
average_reward_test                  -0.993727
round_time_test         0 days 00:00:08.961094
round_time_total        0 days 00:09:15.956327
loss_total             211637836995401187328.0
loss_critic            264547291621354831872.0
loss_actor                 -54806377357.311996
memory_size                          17786.215 

=== epoch 6/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:36,  3.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   851
episode_length                       11.737955
returns                              -11.89838
return_std                            2.357845
average_reward                       -1.013395
round_time              0 days 00:09:16.098679
episodes_test                            815.0
episode_length_test                  12.268712
returns_test                        -12.213236
return_std_test                       2.067284
average_reward_test                  -0.995278
round_time_test         0 days 00:00:09.023217
round_time_total        0 days 00:09:16.099781
loss_total             217527638965173846016.0
loss_critic            271909544134697975808.0
loss_actor                 -55641079347.199997
memory_size                            17788.0 

=== epoch 6/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:43,  3.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   843
episode_length                       11.857651
returns                             -11.987373
return_std                            2.290679
average_reward                       -1.010926
round_time              0 days 00:09:15.823892
episodes_test                            851.0
episode_length_test                  11.742656
returns_test                        -12.483822
return_std_test                       2.362554
average_reward_test                  -1.062825
round_time_test         0 days 00:00:09.012521
round_time_total        0 days 00:09:15.824961
loss_total             221160454522602913792.0
loss_critic            276450563450092650496.0
loss_actor                 -56502628454.400002
memory_size                            17788.0 

=== epoch 6/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   837
episode_length                       11.934289
returns                             -12.173755
return_std                            2.243336
average_reward                       -1.020323
round_time              0 days 00:09:17.457338
episodes_test                            796.0
episode_length_test                  12.551508
returns_test                         -12.82028
return_std_test                       2.083368
average_reward_test                   -1.02128
round_time_test         0 days 00:00:08.973534
round_time_total        0 days 00:09:17.458397
loss_total             228453317090213560320.0
loss_critic            285566641560100175872.0
loss_actor                 -57348773521.407997
memory_size                            17788.0 

=== epoch 6/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:53,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   827
episode_length                       12.077388
returns                             -12.410699
return_std                            2.184552
average_reward                       -1.027806
round_time              0 days 00:09:17.361592
episodes_test                            806.0
episode_length_test                    12.3933
returns_test                        -13.044959
return_std_test                       2.057952
average_reward_test                  -1.052494
round_time_test         0 days 00:00:09.001905
round_time_total        0 days 00:09:17.362656
loss_total             231883659443960381440.0
loss_critic            289854569037190266880.0
loss_actor                    -58176171636.736
memory_size                            17788.0 

=== epoch 6/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:41,  3.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   811
episode_length                       12.316893
returns                             -12.803012
return_std                            1.991753
average_reward                       -1.039259
round_time              0 days 00:09:16.160059
episodes_test                            797.0
episode_length_test                  12.538269
returns_test                        -13.277842
return_std_test                       1.975584
average_reward_test                  -1.058454
round_time_test         0 days 00:00:08.862081
round_time_total        0 days 00:09:16.161146
loss_total             239730151780956667904.0
loss_critic            299662684922429505536.0
loss_actor                 -58817417715.711998
memory_size                            17788.0 

=== epoch 6/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   805
episode_length                       12.413665
returns                              -13.16369
return_std                            2.012389
average_reward                        -1.06054
round_time              0 days 00:09:15.918419
episodes_test                            774.0
episode_length_test                  12.918605
returns_test                        -13.587008
return_std_test                        1.85649
average_reward_test                  -1.051568
round_time_test         0 days 00:00:08.867070
round_time_total        0 days 00:09:15.919492
loss_total             244961150429685252096.0
loss_critic            306201432788038057984.0
loss_actor                 -59614886932.480003
memory_size                            17788.0 

=== epoch 6/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:46,  3.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   791
episode_length                       12.620733
returns                             -13.250335
return_std                            1.945837
average_reward                       -1.050141
round_time              0 days 00:09:16.939289
episodes_test                            775.0
episode_length_test                  12.900645
returns_test                        -14.047091
return_std_test                       1.811169
average_reward_test                  -1.088508
round_time_test         0 days 00:00:08.795115
round_time_total        0 days 00:09:16.940355
loss_total             250144566947461758976.0
loss_critic            312680703451751317504.0
loss_actor                    -60565424590.848
memory_size                           17790.47 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   150
episode_length                       13.293333
returns                             -12.741487
return_std                            1.684824
average_reward                       -0.956002
round_time              0 days 00:09:15.882447
episodes_test                            753.0
episode_length_test                  13.276228
returns_test                        -12.942543
return_std_test                       1.704351
average_reward_test                  -0.974591
round_time_test         0 days 00:00:08.863261
round_time_total        0 days 00:09:15.883536
loss_total             257683137422511964160.0
loss_critic            322103916261340348416.0
loss_actor                    -61333593905.152
memory_size                            17793.0 

=== epoch 7/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:17,  3.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   302
episode_length                       13.235099
returns                             -13.122366
return_std                            1.759646
average_reward                       -0.990725
round_time              0 days 00:09:15.764170
episodes_test                            758.0
episode_length_test                  13.188654
returns_test                        -13.278313
return_std_test                       1.649797
average_reward_test                  -1.006254
round_time_test         0 days 00:00:09.003891
round_time_total        0 days 00:09:15.765246
loss_total             262324943284349763584.0
loss_critic            327906173327503589376.0
loss_actor                 -62091621715.968002
memory_size                            17793.0 

=== epoch 7/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:39,  3.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                   455
episode_length                       13.184615
returns                             -13.348196
return_std                            1.785374
average_reward                       -1.012085
round_time              0 days 00:09:14.790892
episodes_test                            764.0
episode_length_test                  13.077225
returns_test                        -13.790934
return_std_test                       1.670004
average_reward_test                   -1.05422
round_time_test         0 days 00:00:08.814761
round_time_total        0 days 00:09:14.791970
loss_total             268343914398011883520.0
loss_critic            335429886891377033216.0
loss_actor                 -63016078581.760002
memory_size                            17793.0 

=== epoch 7/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   606
episode_length                       13.183168
returns                             -13.529584
return_std                            1.775265
average_reward                       -1.026215
round_time              0 days 00:09:15.610372
episodes_test                            762.0
episode_length_test                  13.111549
returns_test                        -13.942932
return_std_test                       1.806321
average_reward_test                  -1.063168
round_time_test         0 days 00:00:09.117990
round_time_total        0 days 00:09:15.611447
loss_total             277156798189830995968.0
loss_critic            346445991581672931328.0
loss_actor                    -63975049310.208
memory_size                            17793.0 

=== epoch 7/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:57,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   758
episode_length                       13.188654
returns                             -13.654657
return_std                            1.749063
average_reward                       -1.034959
round_time              0 days 00:09:16.350815
episodes_test                            758.0
episode_length_test                   13.17942
returns_test                        -14.066939
return_std_test                       1.713637
average_reward_test                  -1.066791
round_time_test         0 days 00:00:08.938586
round_time_total        0 days 00:09:16.351883
loss_total             288760319685258936320.0
loss_critic            360950393426218844160.0
loss_actor                 -64706842750.975998
memory_size                         17794.8305 

=== epoch 7/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:38,  3.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   760
episode_length                       13.139474
returns                             -13.841907
return_std                            1.709108
average_reward                       -1.053481
round_time              0 days 00:09:16.103930
episodes_test                            764.0
episode_length_test                  13.079843
returns_test                        -13.961174
return_std_test                        1.87749
average_reward_test                  -1.066387
round_time_test         0 days 00:00:08.961258
round_time_total        0 days 00:09:16.105031
loss_total             296153699126493773824.0
loss_critic            370192117855855509504.0
loss_actor                 -65692696649.727997
memory_size                         17805.6205 

=== epoch 7/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   761
episode_length                       13.127464
returns                             -13.905145
return_std                             1.75789
average_reward                       -1.059575
round_time              0 days 00:09:16.692432
episodes_test                            769.0
episode_length_test                  12.997399
returns_test                        -14.070815
return_std_test                       1.688534
average_reward_test                  -1.082046
round_time_test         0 days 00:00:08.934829
round_time_total        0 days 00:09:16.693520
loss_total             303675912562881789952.0
loss_critic            379594884235175329792.0
loss_actor                 -66516965648.384003
memory_size                            17806.0 

=== epoch 7/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:22,  3.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   763
episode_length                       13.074705
returns                             -13.984888
return_std                            1.804461
average_reward                       -1.069714
round_time              0 days 00:09:17.273270
episodes_test                            770.0
episode_length_test                  12.977922
returns_test                        -14.266675
return_std_test                       1.939113
average_reward_test                  -1.098717
round_time_test         0 days 00:00:08.981412
round_time_total        0 days 00:09:17.274382
loss_total             311282926706574229504.0
loss_critic            389103651517867163648.0
loss_actor                 -67404949420.031998
memory_size                            17806.0 

=== epoch 7/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                   768
episode_length                       13.014323
returns                             -13.922449
return_std                             1.84138
average_reward                        -1.06961
round_time              0 days 00:09:18.120038
episodes_test                            767.0
episode_length_test                  13.028683
returns_test                        -13.970256
return_std_test                       1.950238
average_reward_test                  -1.071632
round_time_test         0 days 00:00:08.794891
round_time_total        0 days 00:09:18.121120
loss_total             318896045637891260416.0
loss_critic            398620050044574498816.0
loss_actor                 -68362339645.440002
memory_size                            17806.0 

=== epoch 7/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   776
episode_length                          12.875
returns                             -13.748612
return_std                            1.881187
average_reward                       -1.068429
round_time              0 days 00:09:15.754005
episodes_test                            777.0
episode_length_test                  12.863578
returns_test                        -14.054549
return_std_test                       1.821754
average_reward_test                  -1.092166
round_time_test         0 days 00:00:08.928112
round_time_total        0 days 00:09:15.755074
loss_total             326297068639870844928.0
loss_critic            407871329098315137024.0
loss_actor                 -69091869413.376007
memory_size                            17806.0 

=== epoch 7/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:57,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                   778
episode_length                       12.835476
returns                             -13.794001
return_std                            1.881356
average_reward                       -1.074521
round_time              0 days 00:09:17.898700
episodes_test                            773.0
episode_length_test                  12.927555
returns_test                        -13.667058
return_std_test                       1.822478
average_reward_test                  -1.056648
round_time_test         0 days 00:00:08.944959
round_time_total        0 days 00:09:17.899786
loss_total             332775235112434860032.0
loss_critic            415969036664553144320.0
loss_actor                    -69837216700.416
memory_size                            17806.0 

=== epoch 7/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   777
episode_length                       12.853282
returns                             -13.770213
return_std                            1.845821
average_reward                       -1.071318
round_time              0 days 00:09:16.856877
episodes_test                            762.0
episode_length_test                  13.115486
returns_test                        -13.952129
return_std_test                       1.830203
average_reward_test                  -1.063258
round_time_test         0 days 00:00:08.856424
round_time_total        0 days 00:09:16.857972
loss_total             341496574027920179200.0
loss_critic            426870710176968474624.0
loss_actor                 -70661295296.511993
memory_size                            17806.0 

=== epoch 7/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   773
episode_length                       12.927555
returns                             -13.746883
return_std                            1.809689
average_reward                       -1.062642
round_time              0 days 00:09:16.583442
episodes_test                            756.0
episode_length_test                  13.222222
returns_test                          -13.7188
return_std_test                       1.794411
average_reward_test                  -1.037165
round_time_test         0 days 00:00:08.915022
round_time_total        0 days 00:09:16.584523
loss_total             352042025205704622080.0
loss_critic            440052523643973337088.0
loss_actor                 -71558138890.240005
memory_size                         17807.9335 

=== epoch 7/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:51,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   770
episode_length                       12.958442
returns                             -13.836995
return_std                            1.867375
average_reward                       -1.068739
round_time              0 days 00:09:17.362812
episodes_test                            760.0
episode_length_test                  13.155263
returns_test                        -14.004859
return_std_test                       2.098746
average_reward_test                  -1.064266
round_time_test         0 days 00:00:09.021356
round_time_total        0 days 00:09:17.363895
loss_total             358286354483191611392.0
loss_critic            447857935869202989056.0
loss_actor                 -72507939473.408005
memory_size                            17808.0 

=== epoch 7/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   764
episode_length                       13.081152
returns                             -14.021906
return_std                            1.882794
average_reward                       -1.071424
round_time              0 days 00:09:17.017001
episodes_test                            764.0
episode_length_test                  13.082461
returns_test                        -14.256455
return_std_test                        1.89771
average_reward_test                  -1.089319
round_time_test         0 days 00:00:09.050676
round_time_total        0 days 00:09:17.018136
loss_total             367642160054585065472.0
loss_critic            459552692154496385024.0
loss_actor                 -73432036804.608002
memory_size                            17808.0 

=== epoch 7/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:18,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   758
episode_length                       13.168865
returns                             -14.151777
return_std                            1.922448
average_reward                        -1.07495
round_time              0 days 00:09:15.928818
episodes_test                            758.0
episode_length_test                  13.183377
returns_test                        -14.384216
return_std_test                       1.799206
average_reward_test                  -1.090581
round_time_test         0 days 00:00:09.127422
round_time_total        0 days 00:09:15.929894
loss_total             380205129144703778816.0
loss_critic            475256403013018648576.0
loss_actor                 -74445476843.520004
memory_size                            17808.0 

=== epoch 7/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:54,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   757
episode_length                       13.178336
returns                             -14.133704
return_std                            1.932644
average_reward                       -1.072374
round_time              0 days 00:09:17.025643
episodes_test                            752.0
episode_length_test                  13.281915
returns_test                        -14.273811
return_std_test                        1.79298
average_reward_test                  -1.074304
round_time_test         0 days 00:00:08.996991
round_time_total        0 days 00:09:17.026743
loss_total             385531382967355965440.0
loss_critic            481914220968633106432.0
loss_actor                 -75379681206.272003
memory_size                         17811.6375 

=== epoch 7/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:22,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   760
episode_length                       13.138158
returns                             -14.117659
return_std                            1.890923
average_reward                       -1.074706
round_time              0 days 00:09:17.001307
episodes_test                            763.0
episode_length_test                  13.094364
returns_test                        -13.976899
return_std_test                       1.907315
average_reward_test                  -1.067014
round_time_test         0 days 00:00:08.895286
round_time_total        0 days 00:09:17.002379
loss_total             399852815699849183232.0
loss_critic            499816010725364400128.0
loss_actor                 -76460048113.664001
memory_size                            17814.0 

=== epoch 7/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:25,  3.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   760
episode_length                       13.151316
returns                             -14.077048
return_std                            1.883078
average_reward                        -1.06987
round_time              0 days 00:09:16.091588
episodes_test                            763.0
episode_length_test                  13.095675
returns_test                         -14.27989
return_std_test                       1.927686
average_reward_test                  -1.090263
round_time_test         0 days 00:00:08.934674
round_time_total        0 days 00:09:16.092665
loss_total             405770401547802509312.0
loss_critic            507212993118868930560.0
loss_actor                 -77425118498.815994
memory_size                            17814.0 

=== epoch 7/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:20,  3.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   760
episode_length                       13.138158
returns                              -14.06224
return_std                            1.859543
average_reward                       -1.070371
round_time              0 days 00:09:16.821327
episodes_test                            754.0
episode_length_test                  13.248011
returns_test                        -13.318388
return_std_test                       1.860939
average_reward_test                  -1.004588
round_time_test         0 days 00:00:08.906648
round_time_total        0 days 00:09:16.822395
loss_total             413665149521598480384.0
loss_critic            517081428185619693568.0
loss_actor                 -78230055284.735992
memory_size                            17814.0 

=== epoch 7/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:52,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   764
episode_length                       13.086387
returns                             -13.906492
return_std                            1.886016
average_reward                       -1.062818
round_time              0 days 00:09:16.646085
episodes_test                            770.0
episode_length_test                  12.987013
returns_test                        -13.953812
return_std_test                       1.803568
average_reward_test                  -1.074443
round_time_test         0 days 00:00:08.924798
round_time_total        0 days 00:09:16.647335
loss_total             434769155896329568256.0
loss_critic            543461435731821068288.0
loss_actor                 -79138102956.031998
memory_size                            17814.0 

=== epoch 7/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:03,  3.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                   767
episode_length                       13.024772
returns                             -13.987183
return_std                            1.874925
average_reward                       -1.073293
round_time              0 days 00:09:15.191752
episodes_test                            758.0
episode_length_test                   13.17942
returns_test                        -14.277085
return_std_test                       1.789197
average_reward_test                   -1.08304
round_time_test         0 days 00:00:08.889537
round_time_total        0 days 00:09:15.192837
loss_total             439033090514574573568.0
loss_critic            548791353989234163712.0
loss_actor                 -79986408161.279999
memory_size                            17814.0 

=== epoch 7/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:54,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   766
episode_length                        13.04047
returns                             -14.131349
return_std                             1.94286
average_reward                       -1.084033
round_time              0 days 00:09:17.106795
episodes_test                            763.0
episode_length_test                  13.100917
returns_test                        -13.897775
return_std_test                       1.793601
average_reward_test                  -1.060311
round_time_test         0 days 00:00:08.765443
round_time_total        0 days 00:09:17.107897
loss_total             449189928451060203520.0
loss_critic            561487401385102213120.0
loss_actor                 -80905743597.567993
memory_size                         17815.1865 

=== epoch 7/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:30,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   766
episode_length                       13.030026
returns                             -14.268642
return_std                            1.929288
average_reward                       -1.095701
round_time              0 days 00:09:16.466302
episodes_test                            772.0
episode_length_test                   12.94171
returns_test                        -14.692838
return_std_test                       1.764306
average_reward_test                  -1.134561
round_time_test         0 days 00:00:08.988608
round_time_total        0 days 00:09:16.467405
loss_total             460313408115404505088.0
loss_critic            575391750003459883008.0
loss_actor                 -81955367497.727997
memory_size                            17816.0 

=== epoch 7/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:14,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                   768
episode_length                       13.002604
returns                             -14.414309
return_std                            1.918408
average_reward                       -1.108779
round_time              0 days 00:09:15.084390
episodes_test                            766.0
episode_length_test                   13.05483
returns_test                        -14.812548
return_std_test                       1.765903
average_reward_test                  -1.134641
round_time_test         0 days 00:00:08.806472
round_time_total        0 days 00:09:15.085481
loss_total             475343509789831725056.0
loss_critic            594179376463175221248.0
loss_actor                 -82823370305.535995
memory_size                            17816.0 

=== epoch 7/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   772
episode_length                       12.952073
returns                              -14.55177
return_std                            1.777512
average_reward                       -1.123319
round_time              0 days 00:09:17.000469
episodes_test                            776.0
episode_length_test                  12.886598
returns_test                        -14.674115
return_std_test                        1.60821
average_reward_test                  -1.138711
round_time_test         0 days 00:00:08.921921
round_time_total        0 days 00:09:17.001550
loss_total             482023583835515256832.0
loss_critic            602529469816326062080.0
loss_actor                    -83394233298.944
memory_size                            17816.0 

=== epoch 7/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:43,  3.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   768
episode_length                        13.00651
returns                             -14.615812
return_std                            1.710276
average_reward                       -1.123475
round_time              0 days 00:09:16.423409
episodes_test                            773.0
episode_length_test                  12.923674
returns_test                        -14.319548
return_std_test                       1.678142
average_reward_test                  -1.107825
round_time_test         0 days 00:00:08.823566
round_time_total        0 days 00:09:16.424487
loss_total             480606214492420571136.0
loss_critic            600757758200129781760.0
loss_actor                 -84100408524.800003
memory_size                          17816.515 

=== epoch 7/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:12,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   768
episode_length                            13.0
returns                             -14.487426
return_std                            1.691239
average_reward                       -1.114327
round_time              0 days 00:09:15.658130
episodes_test                            768.0
episode_length_test                  13.016927
returns_test                        -14.153711
return_std_test                       1.749391
average_reward_test                  -1.086881
round_time_test         0 days 00:00:08.836027
round_time_total        0 days 00:09:15.659210
loss_total             497630686505085173760.0
loss_critic            622038347281375690752.0
loss_actor                 -85244045627.391998
memory_size                            17821.0 

=== epoch 7/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   769
episode_length                            13.0
returns                             -14.397277
return_std                            1.738414
average_reward                       -1.107392
round_time              0 days 00:09:16.849249
episodes_test                            763.0
episode_length_test                  13.093054
returns_test                         -14.34382
return_std_test                       1.848373
average_reward_test                  -1.095317
round_time_test         0 days 00:00:08.959468
round_time_total        0 days 00:09:16.850344
loss_total             513236031587725934592.0
loss_critic            641545028946938036224.0
loss_actor                 -86357684850.688004
memory_size                            17821.0 

=== epoch 7/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:33,  3.48it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   765
episode_length                       13.053595
returns                             -14.309386
return_std                            1.793001
average_reward                       -1.096017
round_time              0 days 00:09:16.221445
episodes_test                            762.0
episode_length_test                  13.107612
returns_test                        -14.517731
return_std_test                       1.844548
average_reward_test                  -1.107514
round_time_test         0 days 00:00:08.897537
round_time_total        0 days 00:09:16.222521
loss_total             524161462868007190528.0
loss_critic            655201817499732803584.0
loss_actor                 -87399400939.520004
memory_size                         17823.0865 

=== epoch 7/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   760
episode_length                       13.127632
returns                             -14.321213
return_std                             1.84447
average_reward                       -1.091035
round_time              0 days 00:09:15.687502
episodes_test                            768.0
episode_length_test                  13.005208
returns_test                        -14.578962
return_std_test                       1.840261
average_reward_test                   -1.12106
round_time_test         0 days 00:00:08.952961
round_time_total        0 days 00:09:15.688579
loss_total             541402527447529029632.0
loss_critic            676753146827755225088.0
loss_actor                 -88482856890.367996
memory_size                            17824.0 

=== epoch 7/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   761
episode_length                       13.122208
returns                             -14.280265
return_std                             1.96307
average_reward                       -1.088637
round_time              0 days 00:09:16.075560
episodes_test                            765.0
episode_length_test                  13.069281
returns_test                        -14.383428
return_std_test                       1.793159
average_reward_test                  -1.100203
round_time_test         0 days 00:00:08.920100
round_time_total        0 days 00:09:16.076636
loss_total             554244887902440325120.0
loss_critic            692806097913164857344.0
loss_actor                 -89523485806.591995
memory_size                         17825.9305 

=== epoch 7/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:52,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   762
episode_length                       13.103675
returns                             -14.451227
return_std                            1.965452
average_reward                        -1.10264
round_time              0 days 00:09:16.422177
episodes_test                            770.0
episode_length_test                  12.981818
returns_test                        -14.400014
return_std_test                       1.971425
average_reward_test                  -1.108511
round_time_test         0 days 00:00:08.810681
round_time_total        0 days 00:09:16.423280
loss_total             566771480026386989056.0
loss_critic            708464337157702549504.0
loss_actor                 -90538348847.104004
memory_size                            17827.0 

=== epoch 7/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   765
episode_length                       13.056209
returns                             -14.516475
return_std                            1.948087
average_reward                       -1.111674
round_time              0 days 00:09:15.853485
episodes_test                            775.0
episode_length_test                  12.901935
returns_test                        -14.815615
return_std_test                       1.837462
average_reward_test                  -1.148124
round_time_test         0 days 00:00:08.952025
round_time_total        0 days 00:09:15.854550
loss_total             572568806152335589376.0
loss_critic            715710994695291535360.0
loss_actor                    -91273849532.416
memory_size                            17827.0 

=== epoch 7/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:13,  3.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   766
episode_length                       13.039164
returns                             -14.519063
return_std                            1.909376
average_reward                       -1.113113
round_time              0 days 00:09:16.687609
episodes_test                            764.0
episode_length_test                  13.087696
returns_test                        -14.781677
return_std_test                       1.807482
average_reward_test                  -1.129223
round_time_test         0 days 00:00:08.891608
round_time_total        0 days 00:09:16.688692
loss_total             581786735056877060096.0
loss_critic            727233405937018994688.0
loss_actor                 -92456924753.919998
memory_size                            17827.0 

=== epoch 7/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   767
episode_length                       13.028683
returns                             -14.551434
return_std                            1.927962
average_reward                       -1.116161
round_time              0 days 00:09:16.097680
episodes_test                            765.0
episode_length_test                  13.069281
returns_test                        -14.624735
return_std_test                       1.756278
average_reward_test                  -1.118696
round_time_test         0 days 00:00:08.820204
round_time_total        0 days 00:09:16.098767
loss_total             601792005579654299648.0
loss_critic            752239993783726768128.0
loss_actor                 -93685040648.192001
memory_size                            17827.0 

=== epoch 7/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:20,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   768
episode_length                       12.998698
returns                             -14.670796
return_std                            1.796457
average_reward                       -1.128956
round_time              0 days 00:09:16.844801
episodes_test                            768.0
episode_length_test                  13.011719
returns_test                        -14.403624
return_std_test                       1.801957
average_reward_test                  -1.106404
round_time_test         0 days 00:00:08.823493
round_time_total        0 days 00:09:16.845885
loss_total             624040248869643091968.0
loss_critic            780050297679609069568.0
loss_actor                 -94674767712.255997
memory_size                            17827.0 

=== epoch 7/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                   766
episode_length                       13.039164
returns                             -14.546433
return_std                            1.837281
average_reward                       -1.115905
round_time              0 days 00:09:16.527284
episodes_test                            763.0
episode_length_test                  13.096986
returns_test                         -14.07052
return_std_test                       1.814402
average_reward_test                  -1.073523
round_time_test         0 days 00:00:08.935059
round_time_total        0 days 00:09:16.528374
loss_total             628988719798859857920.0
loss_critic            786235886399404244992.0
loss_actor                 -95544380432.384003
memory_size                          17827.914 

=== epoch 7/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:52,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   765
episode_length                       13.058824
returns                             -14.471319
return_std                            1.801609
average_reward                       -1.108019
round_time              0 days 00:09:16.770892
episodes_test                            762.0
episode_length_test                  13.111549
returns_test                        -14.264622
return_std_test                       1.921783
average_reward_test                  -1.087692
round_time_test         0 days 00:00:08.805314
round_time_total        0 days 00:09:16.771972
loss_total             640326129646217592832.0
loss_critic            800407647469451608064.0
loss_actor                 -96712115679.231995
memory_size                            17828.0 

=== epoch 7/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   768
episode_length                            13.0
returns                              -14.44191
return_std                            1.847061
average_reward                       -1.111099
round_time              0 days 00:09:16.855120
episodes_test                            771.0
episode_length_test                  12.968872
returns_test                        -14.462564
return_std_test                       1.880905
average_reward_test                  -1.114998
round_time_test         0 days 00:00:09.002588
round_time_total        0 days 00:09:16.856201
loss_total             659532472946457313280.0
loss_critic            824415576519984545792.0
loss_actor                 -97946511503.360001
memory_size                            17828.0 

=== epoch 7/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:56,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   768
episode_length                       12.997396
returns                             -14.376534
return_std                            1.810564
average_reward                       -1.106362
round_time              0 days 00:09:16.611027
episodes_test                            769.0
episode_length_test                  12.997399
returns_test                        -14.489786
return_std_test                       1.719618
average_reward_test                  -1.114383
round_time_test         0 days 00:00:08.836319
round_time_total        0 days 00:09:16.612119
loss_total             669804995482088898560.0
loss_critic            837256229780783497216.0
loss_actor                 -99176308465.664001
memory_size                            17828.0 

=== epoch 7/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   766
episode_length                       13.044386
returns                             -14.416329
return_std                            1.852399
average_reward                       -1.104671
round_time              0 days 00:09:16.678927
episodes_test                            767.0
episode_length_test                   13.03781
returns_test                        -14.691266
return_std_test                       1.891549
average_reward_test                   -1.12682
round_time_test         0 days 00:00:08.792877
round_time_total        0 days 00:09:16.680007
loss_total             694846345655546150912.0
loss_critic            868557916649881731072.0
loss_actor                -100250823606.272003
memory_size                         17829.9155 

=== epoch 7/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:18<00:00,  3.58it/s]
episodes                                   769
episode_length                       12.988296
returns                             -14.497563
return_std                            1.848573
average_reward                       -1.115741
round_time              0 days 00:09:18.827046
episodes_test                            774.0
episode_length_test                  12.913437
returns_test                        -14.855821
return_std_test                       1.811006
average_reward_test                  -1.149614
round_time_test         0 days 00:00:08.962882
round_time_total        0 days 00:09:18.828119
loss_total             703275549547614699520.0
loss_critic            879094421534758404096.0
loss_actor                -101526345670.656006
memory_size                            17835.0 

=== epoch 7/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                   769
episode_length                       12.997399
returns                             -14.607829
return_std                            1.806057
average_reward                       -1.124155
round_time              0 days 00:09:17.569740
episodes_test                            771.0
episode_length_test                  12.955901
returns_test                        -14.664293
return_std_test                       1.891266
average_reward_test                  -1.131879
round_time_test         0 days 00:00:08.800297
round_time_total        0 days 00:09:17.570809
loss_total             726736676134064357376.0
loss_critic            908420829295030566912.0
loss_actor                -102713098805.248001
memory_size                            17835.0 

=== epoch 7/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   768
episode_length                       13.015625
returns                              -14.68113
return_std                            1.823583
average_reward                       -1.128352
round_time              0 days 00:09:17.275201
episodes_test                            769.0
episode_length_test                  12.993498
returns_test                        -14.684923
return_std_test                       1.906998
average_reward_test                  -1.129637
round_time_test         0 days 00:00:08.912433
round_time_total        0 days 00:09:17.276263
loss_total             744401041228970721280.0
loss_critic            930501285426169118720.0
loss_actor                -103728662962.175995
memory_size                            17835.0 

=== epoch 7/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:18,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                   768
episode_length                       13.014323
returns                             -14.733125
return_std                            1.807473
average_reward                       -1.132247
round_time              0 days 00:09:16.934608
episodes_test                            770.0
episode_length_test                  12.987013
returns_test                        -14.523003
return_std_test                        1.96525
average_reward_test                  -1.118271
round_time_test         0 days 00:00:08.823757
round_time_total        0 days 00:09:16.935673
loss_total             761567178321980424192.0
loss_critic            951958957526904864768.0
loss_actor                   -104815410778.112
memory_size                            17835.0 

=== epoch 7/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:31,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                   769
episode_length                       12.985696
returns                             -14.735488
return_std                            1.817145
average_reward                       -1.134379
round_time              0 days 00:09:17.644118
episodes_test                            767.0
episode_length_test                  13.036506
returns_test                        -14.951307
return_std_test                       1.810831
average_reward_test                  -1.146671
round_time_test         0 days 00:00:08.823414
round_time_total        0 days 00:09:17.645197
loss_total             768520319238219497472.0
loss_critic            960650382662852018176.0
loss_actor                -106362584240.128006
memory_size                            17835.0 

=== epoch 7/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:42,  3.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                   768
episode_length                       12.998698
returns                             -14.761626
return_std                            1.823818
average_reward                       -1.136184
round_time              0 days 00:09:16.588383
episodes_test                            766.0
episode_length_test                  13.048303
returns_test                         -14.72894
return_std_test                       1.837932
average_reward_test                  -1.128332
round_time_test         0 days 00:00:08.987989
round_time_total        0 days 00:09:16.589463
loss_total             796173437352017133568.0
loss_critic            995216779677278011392.0
loss_actor                -107553228603.391998
memory_size                            17835.0 

=== epoch 7/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:51,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    768
episode_length                        12.988281
returns                              -14.684908
return_std                             1.906357
average_reward                        -1.130861
round_time               0 days 00:09:17.794914
episodes_test                             772.0
episode_length_test                    12.94171
returns_test                         -14.729222
return_std_test                        1.916218
average_reward_test                   -1.137441
round_time_test          0 days 00:00:09.080982
round_time_total         0 days 00:09:17.795991
loss_total              807291509574195478528.0
loss_critic            1009114369738397188096.0
loss_actor                 -108782526664.703995
memory_size                             17835.0 

=== epoch 7/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    768
episode_length                        13.020833
returns                              -14.755975
return_std                             1.892786
average_reward                        -1.133259
round_time               0 days 00:09:16.597981
episodes_test                             769.0
episode_length_test                   12.994798
returns_test                         -14.811391
return_std_test                        1.864459
average_reward_test                   -1.139056
round_time_test          0 days 00:00:08.896778
round_time_total         0 days 00:09:16.599043
loss_total              819954848671772180480.0
loss_critic            1024943543027626934272.0
loss_actor                 -110165063139.328003
memory_size                             17835.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<09:49,  3.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                    155
episode_length                        12.832258
returns                              -14.348669
return_std                             2.178292
average_reward                        -1.117357
round_time               0 days 00:09:14.816320
episodes_test                             771.0
episode_length_test                   12.967575
returns_test                         -14.878547
return_std_test                        1.843486
average_reward_test                    -1.14698
round_time_test          0 days 00:00:08.978178
round_time_total         0 days 00:09:14.817417
loss_total              839900298808686936064.0
loss_critic            1049875355265562705920.0
loss_actor                 -111101758603.264008
memory_size                             17835.0 

=== epoch 8/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:16,  3.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    310
episode_length                        12.874194
returns                              -14.651364
return_std                             2.070666
average_reward                        -1.137174
round_time               0 days 00:09:15.495880
episodes_test                             770.0
episode_length_test                   12.975325
returns_test                         -14.857689
return_std_test                        1.884744
average_reward_test                   -1.145076
round_time_test          0 days 00:00:08.889570
round_time_total         0 days 00:09:15.496961
loss_total              846011568608461586432.0
loss_critic            1057514442739581452288.0
loss_actor                 -112268279304.192001
memory_size                             17835.0 

=== epoch 8/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                    463
episode_length                        12.943844
returns                              -14.715005
return_std                             1.983782
average_reward                        -1.136575
round_time               0 days 00:09:15.224522
episodes_test                             773.0
episode_length_test                    12.92238
returns_test                         -14.771918
return_std_test                        2.040063
average_reward_test                   -1.142853
round_time_test          0 days 00:00:08.988329
round_time_total         0 days 00:09:15.225607
loss_total              882145240661677047808.0
loss_critic            1102681531638419423232.0
loss_actor                 -113368796217.343994
memory_size                          17836.0135 

=== epoch 8/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:48,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    618
episode_length                        12.923948
returns                              -14.568987
return_std                             2.102052
average_reward                        -1.126901
round_time               0 days 00:09:15.280638
episodes_test                             767.0
episode_length_test                    13.03781
returns_test                         -14.799101
return_std_test                        1.863914
average_reward_test                   -1.135091
round_time_test          0 days 00:00:08.936265
round_time_total         0 days 00:09:15.281741
loss_total              895653586172707209216.0
loss_critic            1119566963729517248512.0
loss_actor                 -114409688563.712006
memory_size                             17837.0 

=== epoch 8/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:31,  3.49it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    772
episode_length                        12.948187
returns                              -14.656113
return_std                             2.033093
average_reward                        -1.131264
round_time               0 days 00:09:16.037577
episodes_test                             767.0
episode_length_test                   13.027379
returns_test                         -14.732394
return_std_test                        1.883156
average_reward_test                    -1.13008
round_time_test          0 days 00:00:08.797194
round_time_total         0 days 00:09:16.038716
loss_total              912516808982144876544.0
loss_critic            1140645991372700123136.0
loss_actor                 -115720072531.968002
memory_size                             17837.0 

=== epoch 8/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    772
episode_length                        12.943005
returns                              -14.735449
return_std                             2.016043
average_reward                        -1.137836
round_time               0 days 00:09:15.475960
episodes_test                             762.0
episode_length_test                   13.110236
returns_test                         -14.841089
return_std_test                        1.883184
average_reward_test                   -1.131496
round_time_test          0 days 00:00:08.929024
round_time_total         0 days 00:09:15.477060
loss_total              930223368741764726784.0
loss_critic            1162779191701145649152.0
loss_actor                    -117053775917.056
memory_size                             17837.0 

=== epoch 8/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                    771
episode_length                        12.958495
returns                              -14.690959
return_std                             1.985227
average_reward                        -1.133494
round_time               0 days 00:09:15.091324
episodes_test                             771.0
episode_length_test                   12.958495
returns_test                         -14.849084
return_std_test                        1.913925
average_reward_test                   -1.145637
round_time_test          0 days 00:00:08.899145
round_time_total         0 days 00:09:15.092448
loss_total              936803837800389672960.0
loss_critic            1171004778072805212160.0
loss_actor                 -118459729510.399994
memory_size                             17837.0 

=== epoch 8/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    774
episode_length                        12.913437
returns                              -14.603064
return_std                             2.029056
average_reward                        -1.131306
round_time               0 days 00:09:15.695359
episodes_test                             769.0
episode_length_test                   13.003901
returns_test                         -14.963391
return_std_test                        1.759146
average_reward_test                   -1.150685
round_time_test          0 days 00:00:08.901860
round_time_total         0 days 00:09:15.696437
loss_total              978037352708058644480.0
loss_critic            1222546667999838339072.0
loss_actor                 -119659520737.279999
memory_size                             17837.0 

=== epoch 8/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:20,  3.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    773
episode_length                         12.92238
returns                              -14.699292
return_std                             1.910396
average_reward                         -1.13773
round_time               0 days 00:09:15.619611
episodes_test                             766.0
episode_length_test                   13.048303
returns_test                         -14.546192
return_std_test                        1.927337
average_reward_test                   -1.113955
round_time_test          0 days 00:00:08.933367
round_time_total         0 days 00:09:15.620696
loss_total              990504315651214802944.0
loss_critic            1238130372740911726592.0
loss_actor                 -121139322335.231995
memory_size                             17837.0 

=== epoch 8/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    772
episode_length                         12.94171
returns                              -14.657041
return_std                             1.943411
average_reward                        -1.133158
round_time               0 days 00:09:16.281880
episodes_test                             768.0
episode_length_test                   13.016927
returns_test                         -14.817878
return_std_test                        1.668819
average_reward_test                    -1.13774
round_time_test          0 days 00:00:09.040281
round_time_total         0 days 00:09:16.282959
loss_total             1017635955573797683200.0
loss_critic            1272044922210932752384.0
loss_actor                 -122551931981.824005
memory_size                             17837.0 

=== epoch 8/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    769
episode_length                        12.980494
returns                              -14.656773
return_std                               1.8923
average_reward                        -1.129927
round_time               0 days 00:09:15.478752
episodes_test                             766.0
episode_length_test                   13.052219
returns_test                         -14.609506
return_std_test                        1.916442
average_reward_test                   -1.118971
round_time_test          0 days 00:00:08.998104
round_time_total         0 days 00:09:15.479824
loss_total             1036867933345524219904.0
loss_critic            1296084892807109804032.0
loss_actor                 -123862273937.408005
memory_size                             17837.0 

=== epoch 8/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    770
episode_length                         12.97013
returns                              -14.622128
return_std                             1.894137
average_reward                        -1.127596
round_time               0 days 00:09:16.775899
episodes_test                             765.0
episode_length_test                   13.065359
returns_test                          -14.68747
return_std_test                        1.773917
average_reward_test                   -1.123176
round_time_test          0 days 00:00:08.853730
round_time_total         0 days 00:09:16.776974
loss_total             1050463812252675080192.0
loss_critic            1313079741944624578560.0
loss_actor                 -124999280693.248001
memory_size                             17837.0 

=== epoch 8/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:19,  3.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    768
episode_length                        13.016927
returns                              -14.673169
return_std                             1.843166
average_reward                        -1.126782
round_time               0 days 00:09:16.006503
episodes_test                             766.0
episode_length_test                   13.050914
returns_test                          -14.69729
return_std_test                        1.797328
average_reward_test                   -1.125702
round_time_test          0 days 00:00:09.127871
round_time_total         0 days 00:09:16.007601
loss_total             1088372731678802837504.0
loss_critic            1360465891669287895040.0
loss_actor                 -126276198412.287994
memory_size                             17837.0 

=== epoch 8/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:14,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    768
episode_length                        13.016927
returns                              -14.663197
return_std                             1.879151
average_reward                        -1.126359
round_time               0 days 00:09:16.441009
episodes_test                             774.0
episode_length_test                   12.916021
returns_test                         -14.685494
return_std_test                        1.970373
average_reward_test                   -1.136458
round_time_test          0 days 00:00:08.949376
round_time_total         0 days 00:09:16.442081
loss_total             1104046886521278300160.0
loss_critic            1380058583836997713920.0
loss_actor                 -127406206078.975998
memory_size                             17837.0 

=== epoch 8/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:03,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.032595
returns                              -14.620962
return_std                             1.823768
average_reward                        -1.121302
round_time               0 days 00:09:16.173867
episodes_test                             761.0
episode_length_test                   13.123522
returns_test                         -14.234628
return_std_test                        1.741961
average_reward_test                   -1.084571
round_time_test          0 days 00:00:08.858619
round_time_total         0 days 00:09:16.174954
loss_total             1129537481519585689600.0
loss_critic            1411921826579928317952.0
loss_actor                 -128778689245.184006
memory_size                           17847.221 

=== epoch 8/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:03,  3.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                    763
episode_length                        13.096986
returns                              -14.525748
return_std                              1.80571
average_reward                         -1.10867
round_time               0 days 00:09:15.182679
episodes_test                             771.0
episode_length_test                   12.964981
returns_test                           -14.7515
return_std_test                        1.731762
average_reward_test                   -1.137362
round_time_test          0 days 00:00:08.820186
round_time_total         0 days 00:09:15.183746
loss_total             1152037107503233236992.0
loss_critic            1440046359499292540928.0
loss_actor                 -130251911262.207993
memory_size                          17851.5875 

=== epoch 8/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:11,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    762
episode_length                        13.111549
returns                              -14.482291
return_std                             1.847583
average_reward                         -1.10494
round_time               0 days 00:09:17.502445
episodes_test                             764.0
episode_length_test                   13.081152
returns_test                         -14.354237
return_std_test                        1.902454
average_reward_test                    -1.09679
round_time_test          0 days 00:00:08.919561
round_time_total         0 days 00:09:17.503537
loss_total             1179170422457014288384.0
loss_critic            1473963000926524669952.0
loss_actor                 -131323358928.895996
memory_size                             17855.0 

=== epoch 8/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:03,  3.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    761
episode_length                        13.120894
returns                              -14.493158
return_std                             1.845387
average_reward                        -1.104231
round_time               0 days 00:09:16.628270
episodes_test                             761.0
episode_length_test                   13.130092
returns_test                         -14.537412
return_std_test                        1.899466
average_reward_test                   -1.106403
round_time_test          0 days 00:00:08.938683
round_time_total         0 days 00:09:16.629346
loss_total             1192913761811970392064.0
loss_critic            1491142175676572893184.0
loss_actor                 -132619171074.048004
memory_size                             17855.0 

=== epoch 8/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:17,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.158103
returns                              -14.467622
return_std                             1.765212
average_reward                        -1.099268
round_time               0 days 00:09:15.878831
episodes_test                             756.0
episode_length_test                   13.223545
returns_test                         -14.137114
return_std_test                        1.839808
average_reward_test                   -1.068714
round_time_test          0 days 00:00:08.886012
round_time_total         0 days 00:09:15.879906
loss_total             1229452380162599157760.0
loss_critic            1536815446884227481600.0
loss_actor                 -133813633597.440002
memory_size                             17855.0 

=== epoch 8/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.155468
returns                              -14.474344
return_std                              1.81944
average_reward                        -1.100095
round_time               0 days 00:09:15.859504
episodes_test                             764.0
episode_length_test                   13.075916
returns_test                          -14.62501
return_std_test                        1.824066
average_reward_test                   -1.118219
round_time_test          0 days 00:00:08.953973
round_time_total         0 days 00:09:15.860586
loss_total             1240189601660386672640.0
loss_critic            1550236975291380006912.0
loss_actor                 -135306480803.839996
memory_size                             17855.0 

=== epoch 8/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:25,  3.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    761
episode_length                        13.123522
returns                              -14.611796
return_std                             1.800643
average_reward                        -1.113287
round_time               0 days 00:09:15.964854
episodes_test                             761.0
episode_length_test                    13.13272
returns_test                         -14.795643
return_std_test                         1.73696
average_reward_test                   -1.126163
round_time_test          0 days 00:00:08.821246
round_time_total         0 days 00:09:15.965960
loss_total             1257902674170425376768.0
loss_critic            1572378313290100506624.0
loss_actor                 -136913920524.287994
memory_size                             17855.0 

=== epoch 8/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:05,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    761
episode_length                        13.114323
returns                              -14.651127
return_std                             1.810749
average_reward                         -1.11703
round_time               0 days 00:09:16.313990
episodes_test                             763.0
episode_length_test                    13.10616
returns_test                         -14.759188
return_std_test                        1.865686
average_reward_test                   -1.126126
round_time_test          0 days 00:00:09.014125
round_time_total         0 days 00:09:16.315093
loss_total             1317950151493317033984.0
loss_critic            1647437659569881350144.0
loss_actor                 -138057849737.216003
memory_size                             17855.0 

=== epoch 8/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<08:58,  3.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    762
episode_length                        13.099738
returns                              -14.720015
return_std                             1.776519
average_reward                        -1.123793
round_time               0 days 00:09:16.360729
episodes_test                             767.0
episode_length_test                   13.035202
returns_test                         -14.771107
return_std_test                        1.873849
average_reward_test                   -1.132828
round_time_test          0 days 00:00:08.836020
round_time_total         0 days 00:09:16.361827
loss_total             1325647252393798926336.0
loss_critic            1657059036203457970176.0
loss_actor                 -139464837017.600006
memory_size                             17855.0 

=== epoch 8/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:35,  3.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    764
episode_length                        13.066754
returns                              -14.798553
return_std                             1.809247
average_reward                        -1.132564
round_time               0 days 00:09:16.409184
episodes_test                             765.0
episode_length_test                   13.060131
returns_test                          -14.64488
return_std_test                        1.916275
average_reward_test                   -1.120991
round_time_test          0 days 00:00:08.899461
round_time_total         0 days 00:09:16.410316
loss_total             1354205690310334349312.0
loss_critic            1692757082409455583232.0
loss_actor                 -140617870471.167999
memory_size                             17855.0 

=== epoch 8/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    768
episode_length                         13.00651
returns                              -14.792702
return_std                               1.8738
average_reward                        -1.137342
round_time               0 days 00:09:15.665365
episodes_test                             765.0
episode_length_test                   13.070588
returns_test                          -14.50654
return_std_test                        2.005326
average_reward_test                   -1.109663
round_time_test          0 days 00:00:08.752576
round_time_total         0 days 00:09:15.666431
loss_total             1371552744811306156032.0
loss_critic            1714440898939179499520.0
loss_actor                 -142248434626.559998
memory_size                             17855.0 

=== epoch 8/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:12,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    770
episode_length                        12.974026
returns                              -14.795723
return_std                               1.9209
average_reward                        -1.140847
round_time               0 days 00:09:16.727159
episodes_test                             766.0
episode_length_test                   13.044386
returns_test                         -14.835018
return_std_test                        1.812054
average_reward_test                   -1.136687
round_time_test          0 days 00:00:08.996300
round_time_total         0 days 00:09:16.728262
loss_total             1412147952862576246784.0
loss_critic            1765184910230322151424.0
loss_actor                 -143688598659.071991
memory_size                             17855.0 

=== epoch 8/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:20,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.014342
returns                               -14.83514
return_std                             1.845165
average_reward                        -1.140143
round_time               0 days 00:09:16.378738
episodes_test                             770.0
episode_length_test                   12.975325
returns_test                         -15.039277
return_std_test                        1.639727
average_reward_test                   -1.158642
round_time_test          0 days 00:00:08.912401
round_time_total         0 days 00:09:16.379812
loss_total             1429552370925862649856.0
loss_critic            1786940432290460729344.0
loss_actor                 -145220838187.007996
memory_size                           17855.292 

=== epoch 8/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:43,  3.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    769
episode_length                        12.992198
returns                              -14.779751
return_std                             1.872194
average_reward                        -1.137995
round_time               0 days 00:09:15.971138
episodes_test                             763.0
episode_length_test                   13.093054
returns_test                         -14.713421
return_std_test                        1.885576
average_reward_test                   -1.124047
round_time_test          0 days 00:00:08.940263
round_time_total         0 days 00:09:15.972241
loss_total             1472637268809944399872.0
loss_critic            1840796556558713225216.0
loss_actor                 -146419260182.528015
memory_size                             17856.0 

=== epoch 8/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:32,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.016949
returns                              -14.706674
return_std                             1.851387
average_reward                        -1.129663
round_time               0 days 00:09:15.305221
episodes_test                             760.0
episode_length_test                   13.156579
returns_test                         -14.833368
return_std_test                        1.756246
average_reward_test                   -1.127247
round_time_test          0 days 00:00:08.925129
round_time_total         0 days 00:09:15.306312
loss_total             1502462168437311930368.0
loss_critic            1878077680626729484288.0
loss_actor                 -147923483373.567993
memory_size                             17856.0 

=== epoch 8/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:53,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    765
episode_length                        13.048366
returns                              -14.685745
return_std                             1.788388
average_reward                        -1.125889
round_time               0 days 00:09:16.564755
episodes_test                             768.0
episode_length_test                   13.010417
returns_test                         -14.545064
return_std_test                         1.87998
average_reward_test                   -1.117328
round_time_test          0 days 00:00:08.987571
round_time_total         0 days 00:09:16.565825
loss_total             1509365038230573678592.0
loss_critic            1886706267309754810368.0
loss_actor                 -149694354190.335999
memory_size                             17856.0 

=== epoch 8/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:48,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    763
episode_length                         13.08519
returns                              -14.607961
return_std                             1.732556
average_reward                        -1.116011
round_time               0 days 00:09:15.893976
episodes_test                             760.0
episode_length_test                   13.148684
returns_test                          -14.52101
return_std_test                        1.728963
average_reward_test                   -1.103699
round_time_test          0 days 00:00:08.897171
round_time_total         0 days 00:09:15.895060
loss_total             1561453034050287304704.0
loss_critic            1951816256912294150144.0
loss_actor                 -151222529040.384003
memory_size                             17856.0 

=== epoch 8/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    764
episode_length                        13.082461
returns                              -14.616361
return_std                             1.798779
average_reward                        -1.116531
round_time               0 days 00:09:16.130865
episodes_test                             758.0
episode_length_test                   13.182058
returns_test                         -14.634308
return_std_test                        1.819132
average_reward_test                   -1.109253
round_time_test          0 days 00:00:08.882201
round_time_total         0 days 00:09:16.131937
loss_total             1595908528586883072000.0
loss_critic            1994885626754296709120.0
loss_actor                 -152385934376.959991
memory_size                             17856.0 

=== epoch 8/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:59,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    762
episode_length                        13.097113
returns                              -14.615703
return_std                             1.844049
average_reward                        -1.116193
round_time               0 days 00:09:15.634426
episodes_test                             754.0
episode_length_test                   13.259947
returns_test                         -14.564943
return_std_test                        1.890444
average_reward_test                   -1.098057
round_time_test          0 days 00:00:08.898674
round_time_total         0 days 00:09:15.635507
loss_total             1646780994106260717568.0
loss_critic            2058476207149386825728.0
loss_actor                 -153505917452.287994
memory_size                             17856.0 

=== epoch 8/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:49,  3.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    762
episode_length                        13.108924
returns                              -14.679283
return_std                               1.8601
average_reward                        -1.119733
round_time               0 days 00:09:15.453569
episodes_test                             763.0
episode_length_test                   13.100917
returns_test                         -14.775377
return_std_test                        1.725501
average_reward_test                   -1.127391
round_time_test          0 days 00:00:08.955879
round_time_total         0 days 00:09:15.454640
loss_total             1631845037156187766784.0
loss_critic            2039806262162462081024.0
loss_actor                 -155141088051.200012
memory_size                             17856.0 

=== epoch 8/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    763
episode_length                        13.098296
returns                              -14.734232
return_std                             1.856267
average_reward                        -1.125194
round_time               0 days 00:09:16.591938
episodes_test                             762.0
episode_length_test                   13.115486
returns_test                         -14.758588
return_std_test                        1.807833
average_reward_test                   -1.124601
round_time_test          0 days 00:00:08.873767
round_time_total         0 days 00:09:16.593022
loss_total             1674892018232252170240.0
loss_critic            2093614985560851480576.0
loss_actor                 -156635610038.272003
memory_size                             17856.0 

=== epoch 8/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:37,  3.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    763
episode_length                        13.081258
returns                              -14.751383
return_std                             1.872321
average_reward                        -1.128188
round_time               0 days 00:09:15.945492
episodes_test                             756.0
episode_length_test                   13.223545
returns_test                          -14.77975
return_std_test                        1.873577
average_reward_test                   -1.117172
round_time_test          0 days 00:00:08.999126
round_time_total         0 days 00:09:15.946564
loss_total             1711877351752532754432.0
loss_critic            2139846653397986181120.0
loss_actor                  -158276029128.70401
memory_size                             17856.0 

=== epoch 8/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:11,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    764
episode_length                        13.065445
returns                              -14.730574
return_std                             1.866531
average_reward                        -1.127894
round_time               0 days 00:09:15.538862
episodes_test                             758.0
episode_length_test                   13.183377
returns_test                         -14.768587
return_std_test                        1.842601
average_reward_test                   -1.119557
round_time_test          0 days 00:00:08.870421
round_time_total         0 days 00:09:15.539930
loss_total             1736009930325244248064.0
loss_critic            2170012374801880449024.0
loss_actor                    -159795407314.944
memory_size                             17856.0 

=== epoch 8/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    764
episode_length                        13.087696
returns                              -14.762383
return_std                             1.871305
average_reward                        -1.127977
round_time               0 days 00:09:16.554263
episodes_test                             764.0
episode_length_test                   13.089005
returns_test                         -14.647823
return_std_test                        1.790753
average_reward_test                   -1.119094
round_time_test          0 days 00:00:08.843264
round_time_total         0 days 00:09:16.555332
loss_total             1805902387264339836928.0
loss_critic            2257377944599161470976.0
loss_actor                 -161539201122.303986
memory_size                             17856.0 

=== epoch 8/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:11,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    765
episode_length                        13.049673
returns                              -14.756874
return_std                              1.88554
average_reward                        -1.130865
round_time               0 days 00:09:16.872992
episodes_test                             766.0
episode_length_test                   13.045692
returns_test                         -14.839775
return_std_test                        1.755402
average_reward_test                   -1.136826
round_time_test          0 days 00:00:08.861646
round_time_total         0 days 00:09:16.874057
loss_total             1839678052793911607296.0
loss_critic            2299597525741467860992.0
loss_actor                 -162964091822.079987
memory_size                             17856.0 

=== epoch 8/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.024804
returns                              -14.735874
return_std                             1.885794
average_reward                        -1.131179
round_time               0 days 00:09:16.530602
episodes_test                             764.0
episode_length_test                   13.078534
returns_test                         -14.718809
return_std_test                        1.760292
average_reward_test                   -1.125098
round_time_test          0 days 00:00:08.998186
round_time_total         0 days 00:09:16.531681
loss_total             1874176131345482514432.0
loss_critic            2342720123715427237888.0
loss_actor                 -164476896231.424011
memory_size                             17856.0 

=== epoch 8/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:49,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.036506
returns                              -14.758014
return_std                             1.867412
average_reward                        -1.132053
round_time               0 days 00:09:16.084200
episodes_test                             759.0
episode_length_test                    13.16469
returns_test                          -14.69049
return_std_test                        1.854789
average_reward_test                   -1.115453
round_time_test          0 days 00:00:08.863683
round_time_total         0 days 00:09:16.085272
loss_total             1867209648172299714560.0
loss_critic            2334012019225581060096.0
loss_actor                 -166107472764.928009
memory_size                             17856.0 

=== epoch 8/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:26,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.031291
returns                              -14.753703
return_std                             1.861815
average_reward                        -1.132378
round_time               0 days 00:09:16.576446
episodes_test                             768.0
episode_length_test                   13.016927
returns_test                         -14.910302
return_std_test                        1.832754
average_reward_test                   -1.145045
round_time_test          0 days 00:00:08.907833
round_time_total         0 days 00:09:16.577526
loss_total             1908005971730503041024.0
loss_critic            2385007424834419818496.0
loss_actor                  -167332420296.70401
memory_size                             17856.0 

=== epoch 8/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:10,  3.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    769
episode_length                        12.988296
returns                              -14.738123
return_std                             1.817196
average_reward                        -1.134437
round_time               0 days 00:09:16.482238
episodes_test                             765.0
episode_length_test                   13.064052
returns_test                         -14.887156
return_std_test                        1.849855
average_reward_test                   -1.138711
round_time_test          0 days 00:00:09.033366
round_time_total         0 days 00:09:16.483316
loss_total             1943010462872605818880.0
loss_critic            2428763037671332773888.0
loss_actor                 -169107211403.264008
memory_size                             17856.0 

=== epoch 8/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    768
episode_length                        13.010417
returns                              -14.730532
return_std                             1.804852
average_reward                        -1.132334
round_time               0 days 00:09:16.790575
episodes_test                             766.0
episode_length_test                   13.043081
returns_test                         -14.812133
return_std_test                        1.895332
average_reward_test                   -1.135447
round_time_test          0 days 00:00:08.922055
round_time_total         0 days 00:09:16.791713
loss_total             2020210691948299943936.0
loss_critic            2525263320218237140992.0
loss_actor                 -170441174802.432007
memory_size                             17856.0 

=== epoch 8/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.045692
returns                              -14.739873
return_std                             1.784914
average_reward                        -1.129644
round_time               0 days 00:09:16.455098
episodes_test                             761.0
episode_length_test                   13.127464
returns_test                         -14.532729
return_std_test                        1.836744
average_reward_test                   -1.106606
round_time_test          0 days 00:00:08.920343
round_time_total         0 days 00:09:16.456196
loss_total             2023582861204504969216.0
loss_critic            2529478533479542030336.0
loss_actor                  -172087975780.35199
memory_size                             17856.0 

=== epoch 8/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.029987
returns                              -14.722716
return_std                             1.849026
average_reward                        -1.129465
round_time               0 days 00:09:16.536177
episodes_test                             763.0
episode_length_test                   13.094364
returns_test                         -14.644154
return_std_test                        1.857584
average_reward_test                   -1.118156
round_time_test          0 days 00:00:09.009205
round_time_total         0 days 00:09:16.537253
loss_total             2053091030568970747904.0
loss_critic            2566363744468242792448.0
loss_actor                 -174107411693.567993
memory_size                             17856.0 

=== epoch 8/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.028721
returns                              -14.787793
return_std                             1.804993
average_reward                         -1.13448
round_time               0 days 00:09:16.686356
episodes_test                             770.0
episode_length_test                   12.987013
returns_test                         -14.705611
return_std_test                        1.956879
average_reward_test                   -1.132332
round_time_test          0 days 00:00:08.937428
round_time_total         0 days 00:09:16.687415
loss_total             2106652199601665474560.0
loss_critic            2633315205099404394496.0
loss_actor                 -175312052756.480011
memory_size                             17856.0 

=== epoch 8/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:54,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    767
episode_length                        13.032595
returns                              -14.779414
return_std                              1.81035
average_reward                        -1.133756
round_time               0 days 00:09:17.244796
episodes_test                             767.0
episode_length_test                    13.03781
returns_test                         -14.941635
return_std_test                        1.683793
average_reward_test                   -1.146023
round_time_test          0 days 00:00:08.930214
round_time_total         0 days 00:09:17.245862
loss_total             2163601419245312802816.0
loss_critic            2704501727780395810816.0
loss_actor                 -177069280534.528015
memory_size                             17856.0 

=== epoch 8/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:21,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.030026
returns                              -14.821217
return_std                             1.779265
average_reward                        -1.136822
round_time               0 days 00:09:16.540054
episodes_test                             773.0
episode_length_test                   12.924968
returns_test                         -14.830513
return_std_test                        1.856966
average_reward_test                    -1.14709
round_time_test          0 days 00:00:09.068498
round_time_total         0 days 00:09:16.541129
loss_total             2177605477251557097472.0
loss_critic            2722006796017698078720.0
loss_actor                 -178270181679.104004
memory_size                             17856.0 

=== epoch 8/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:14,  3.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    766
episode_length                         13.04047
returns                              -14.784907
return_std                             1.802502
average_reward                         -1.13431
round_time               0 days 00:09:16.530339
episodes_test                             767.0
episode_length_test                   13.031291
returns_test                         -14.595558
return_std_test                        1.832265
average_reward_test                   -1.119119
round_time_test          0 days 00:00:08.935336
round_time_total         0 days 00:09:16.531408
loss_total             2201873517697504116736.0
loss_critic            2752341849231551627264.0
loss_actor                 -180314737025.023987
memory_size                          17857.6525 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<09:56,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    153
episode_length                        13.071895
returns                              -14.810598
return_std                              1.76639
average_reward                        -1.133011
round_time               0 days 00:09:15.073641
episodes_test                             768.0
episode_length_test                   13.013021
returns_test                         -14.752485
return_std_test                        1.776083
average_reward_test                    -1.13281
round_time_test          0 days 00:00:08.963804
round_time_total         0 days 00:09:15.074736
loss_total             2262735836789738045440.0
loss_critic            2828419744591601205248.0
loss_actor                 -182261921210.368011
memory_size                           17858.854 

=== epoch 9/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:57,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    305
episode_length                        13.114754
returns                              -14.614264
return_std                              1.81783
average_reward                        -1.114338
round_time               0 days 00:09:15.792601
episodes_test                             757.0
episode_length_test                   13.207398
returns_test                         -14.073612
return_std_test                        1.803128
average_reward_test                   -1.065167
round_time_test          0 days 00:00:08.833189
round_time_total         0 days 00:09:15.793680
loss_total             2340198842485827960832.0
loss_critic            2925248501909625503744.0
loss_actor                 -183935784280.063995
memory_size                             17859.0 

=== epoch 9/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:51,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    457
episode_length                        13.122538
returns                              -14.558014
return_std                             1.774535
average_reward                        -1.108564
round_time               0 days 00:09:16.195228
episodes_test                             757.0
episode_length_test                   13.195509
returns_test                          -14.30288
return_std_test                        1.863348
average_reward_test                   -1.084121
round_time_test          0 days 00:00:08.903530
round_time_total         0 days 00:09:16.196312
loss_total             2342220241555460980736.0
loss_critic            2927775250139736702976.0
loss_actor                 -185803444617.216003
memory_size                             17859.0 

=== epoch 9/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    607
episode_length                        13.159802
returns                              -14.569002
return_std                             1.737112
average_reward                        -1.107298
round_time               0 days 00:09:16.613243
episodes_test                             762.0
episode_length_test                   13.106299
returns_test                         -14.514962
return_std_test                        1.813934
average_reward_test                   -1.107398
round_time_test          0 days 00:00:08.873281
round_time_total         0 days 00:09:16.614324
loss_total             2425595792164146642944.0
loss_critic            3031994686597394071552.0
loss_actor                 -187702300508.160004
memory_size                          17859.2455 

=== epoch 9/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    760
episode_length                        13.152632
returns                              -14.541553
return_std                             1.762894
average_reward                         -1.10482
round_time               0 days 00:09:16.382697
episodes_test                             755.0
episode_length_test                   13.233113
returns_test                         -14.439509
return_std_test                        1.843014
average_reward_test                   -1.090896
round_time_test          0 days 00:00:08.957563
round_time_total         0 days 00:09:16.383861
loss_total             2454988769319360397312.0
loss_critic            3068735910996899004416.0
loss_actor                 -189336160083.967987
memory_size                             17861.0 

=== epoch 9/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:02,  3.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    760
episode_length                        13.144737
returns                               -14.48955
return_std                             1.775426
average_reward                        -1.101965
round_time               0 days 00:09:16.698844
episodes_test                             768.0
episode_length_test                   13.019531
returns_test                         -14.743626
return_std_test                        1.686206
average_reward_test                   -1.132204
round_time_test          0 days 00:00:08.936368
round_time_total         0 days 00:09:16.699929
loss_total             2504650582728447098880.0
loss_critic            3130813176170562453504.0
loss_actor                 -189658539581.440002
memory_size                             17861.0 

=== epoch 9/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.166008
returns                              -14.520967
return_std                              1.72927
average_reward                        -1.102277
round_time               0 days 00:09:16.697815
episodes_test                             760.0
episode_length_test                   13.157895
returns_test                         -14.989088
return_std_test                          1.6575
average_reward_test                   -1.139171
round_time_test          0 days 00:00:08.957378
round_time_total         0 days 00:09:16.698901
loss_total             2483607039887910371328.0
loss_critic            3104508745579197628416.0
loss_actor                 -191606144303.104004
memory_size                             17861.0 

=== epoch 9/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    758
episode_length                        13.175462
returns                              -14.546585
return_std                             1.727059
average_reward                        -1.104212
round_time               0 days 00:09:16.021299
episodes_test                             761.0
episode_length_test                   13.128778
returns_test                         -14.593841
return_std_test                        1.747591
average_reward_test                     -1.1112
round_time_test          0 days 00:00:08.929606
round_time_total         0 days 00:09:16.022379
loss_total             2569646345457759682560.0
loss_critic            3212057874335333351424.0
loss_actor                 -193789098041.343994
memory_size                             17861.0 

=== epoch 9/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.168643
returns                              -14.554108
return_std                             1.728474
average_reward                         -1.10465
round_time               0 days 00:09:16.409587
episodes_test                             761.0
episode_length_test                   13.131406
returns_test                         -14.492787
return_std_test                        1.828358
average_reward_test                   -1.102797
round_time_test          0 days 00:00:08.929165
round_time_total         0 days 00:09:16.410696
loss_total             2623247913080933318656.0
loss_critic            3279059833881893011456.0
loss_actor                 -195462352150.528015
memory_size                          17866.1015 

=== epoch 9/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:01,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    757
episode_length                        13.194188
returns                              -14.547217
return_std                             1.917779
average_reward                        -1.102734
round_time               0 days 00:09:17.121827
episodes_test                             754.0
episode_length_test                   13.262599
returns_test                         -14.590439
return_std_test                        1.789221
average_reward_test                   -1.100119
round_time_test          0 days 00:00:08.917606
round_time_total         0 days 00:09:17.122912
loss_total             2671630600753792942080.0
loss_critic            3339538193662083596288.0
loss_actor                 -196966704021.503998
memory_size                             17867.0 

=== epoch 9/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:37,  3.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    753
episode_length                        13.260292
returns                                -14.4277
return_std                             1.942959
average_reward                        -1.088212
round_time               0 days 00:09:15.542532
episodes_test                             746.0
episode_length_test                   13.394102
returns_test                          -13.45412
return_std_test                        1.800105
average_reward_test                   -1.004023
round_time_test          0 days 00:00:08.827085
round_time_total         0 days 00:09:15.543629
loss_total             2719564757294799912960.0
loss_critic            3399455891088764567552.0
loss_actor                 -198853032632.320007
memory_size                          17867.9085 

=== epoch 9/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    753
episode_length                        13.258964
returns                              -14.316457
return_std                             1.949311
average_reward                        -1.080102
round_time               0 days 00:09:16.912951
episodes_test                             756.0
episode_length_test                   13.210317
returns_test                         -14.436983
return_std_test                        1.774711
average_reward_test                   -1.092904
round_time_test          0 days 00:00:08.981125
round_time_total         0 days 00:09:16.914037
loss_total             2813265131282549440512.0
loss_critic            3516581353304591695872.0
loss_actor                 -200552902762.496002
memory_size                             17868.0 

=== epoch 9/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:46,  3.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    752
episode_length                        13.275266
returns                              -14.194592
return_std                             1.963036
average_reward                        -1.068912
round_time               0 days 00:09:17.123605
episodes_test                             752.0
episode_length_test                   13.292553
returns_test                         -14.074013
return_std_test                        1.726746
average_reward_test                   -1.058066
round_time_test          0 days 00:00:08.947544
round_time_total         0 days 00:09:17.124689
loss_total             2820383000810959667200.0
loss_critic            3525478692013905281024.0
loss_actor                 -202670550908.928009
memory_size                             17868.0 

=== epoch 9/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:14,  3.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    752
episode_length                        13.268617
returns                              -14.131288
return_std                             1.963583
average_reward                        -1.065332
round_time               0 days 00:09:16.274146
episodes_test                             750.0
episode_length_test                   13.333333
returns_test                         -14.038956
return_std_test                        1.722858
average_reward_test                   -1.052922
round_time_test          0 days 00:00:08.928862
round_time_total         0 days 00:09:16.275247
loss_total             2866489316573169844224.0
loss_critic            3583111584662780641280.0
loss_actor                 -204616083275.776001
memory_size                             17868.0 

=== epoch 9/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:26,  3.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    754
episode_length                        13.240053
returns                              -14.124316
return_std                             1.757347
average_reward                         -1.06676
round_time               0 days 00:09:16.860405
episodes_test                             753.0
episode_length_test                   13.269588
returns_test                         -14.291317
return_std_test                         1.72869
average_reward_test                   -1.076931
round_time_test          0 days 00:00:08.950595
round_time_total         0 days 00:09:16.861479
loss_total             2988015088957047963648.0
loss_critic            3735018798181099503616.0
loss_actor                 -206583828242.432007
memory_size                          17868.2395 

=== epoch 9/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:09,  3.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    757
episode_length                        13.200793
returns                              -14.245276
return_std                             1.742769
average_reward                        -1.078845
round_time               0 days 00:09:16.470623
episodes_test                             761.0
episode_length_test                   13.123522
returns_test                         -14.520076
return_std_test                        1.751587
average_reward_test                   -1.106191
round_time_test          0 days 00:00:08.865865
round_time_total         0 days 00:09:16.471695
loss_total             2994435052698032668672.0
loss_critic            3743043752786961891328.0
loss_actor                 -208516141916.160004
memory_size                             17869.0 

=== epoch 9/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    759
episode_length                        13.162055
returns                              -14.354569
return_std                             1.759002
average_reward                        -1.090171
round_time               0 days 00:09:17.799828
episodes_test                             759.0
episode_length_test                    13.15942
returns_test                         -14.581054
return_std_test                        1.608238
average_reward_test                   -1.107514
round_time_test          0 days 00:00:08.907775
round_time_total         0 days 00:09:17.800911
loss_total             3057470424528393863168.0
loss_critic            3821837968753590009856.0
loss_actor                 -210314768719.872009
memory_size                             17869.0 

=== epoch 9/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    756
episode_length                        13.202381
returns                              -14.489496
return_std                             1.734256
average_reward                        -1.098397
round_time               0 days 00:09:16.427134
episodes_test                             762.0
episode_length_test                   13.122047
returns_test                         -14.691893
return_std_test                        1.689585
average_reward_test                   -1.119425
round_time_test          0 days 00:00:09.133828
round_time_total         0 days 00:09:16.428229
loss_total             3125695930782626349056.0
loss_critic            3907119848787416711168.0
loss_actor                 -212500480974.847992
memory_size                          17872.1735 

=== epoch 9/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:37,  3.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    758
episode_length                        13.176781
returns                              -14.539614
return_std                             1.741472
average_reward                        -1.103368
round_time               0 days 00:09:16.879175
episodes_test                             755.0
episode_length_test                    13.24106
returns_test                         -14.620575
return_std_test                         1.77156
average_reward_test                   -1.103577
round_time_test          0 days 00:00:08.875838
round_time_total         0 days 00:09:16.880238
loss_total             3210131954392550604800.0
loss_critic            4012664874759394689024.0
loss_actor                 -214273263247.359985
memory_size                             17878.0 

=== epoch 9/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:49,  3.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    757
episode_length                        13.191546
returns                              -14.641059
return_std                             1.731749
average_reward                        -1.109659
round_time               0 days 00:09:16.661989
episodes_test                             771.0
episode_length_test                   12.970169
returns_test                          -14.74324
return_std_test                        1.710757
average_reward_test                   -1.136704
round_time_test          0 days 00:00:09.058277
round_time_total         0 days 00:09:16.663064
loss_total             3275621144919875780608.0
loss_critic            4094526360174169948160.0
loss_actor                  -216136306999.29599
memory_size                             17878.0 

=== epoch 9/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:54,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    757
episode_length                        13.180978
returns                              -14.629563
return_std                             1.718502
average_reward                         -1.11088
round_time               0 days 00:09:17.655860
episodes_test                             758.0
episode_length_test                   13.183377
returns_test                          -14.35117
return_std_test                        1.689145
average_reward_test                   -1.088098
round_time_test          0 days 00:00:08.970636
round_time_total         0 days 00:09:17.656942
loss_total             3294684473194577920000.0
loss_critic            4118355522619814445056.0
loss_actor                 -218266736992.256012
memory_size                             17878.0 

=== epoch 9/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    757
episode_length                         13.18362
returns                              -14.730004
return_std                             1.722486
average_reward                        -1.118155
round_time               0 days 00:09:16.862910
episodes_test                             765.0
episode_length_test                   13.061438
returns_test                         -14.850368
return_std_test                        1.700189
average_reward_test                   -1.136223
round_time_test          0 days 00:00:08.987930
round_time_total         0 days 00:09:16.863992
loss_total             3394177137698952708096.0
loss_critic            4242721347154589777920.0
loss_actor                 -220253545922.559998
memory_size                             17878.0 

=== epoch 9/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    765
episode_length                        13.069281
returns                              -14.768243
return_std                             1.717201
average_reward                        -1.129781
round_time               0 days 00:09:16.614949
episodes_test                             766.0
episode_length_test                    13.05483
returns_test                         -14.959745
return_std_test                        1.632045
average_reward_test                   -1.145916
round_time_test          0 days 00:00:08.972210
round_time_total         0 days 00:09:16.616026
loss_total             3412494905505102168064.0
loss_critic            4265618557985400291328.0
loss_actor                 -222551281491.967987
memory_size                             17878.0 

=== epoch 9/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    767
episode_length                        13.026076
returns                              -14.830042
return_std                             1.734247
average_reward                        -1.138324
round_time               0 days 00:09:17.344453
episodes_test                             770.0
episode_length_test                   12.985714
returns_test                          -14.96762
return_std_test                        1.724315
average_reward_test                   -1.152428
round_time_test          0 days 00:00:08.997356
round_time_total         0 days 00:09:17.345525
loss_total             3499020409575215464448.0
loss_critic            4373775436568910102528.0
loss_actor                 -224887640391.679993
memory_size                             17878.0 

=== epoch 9/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:29,  3.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.026076
returns                              -14.809728
return_std                             1.744552
average_reward                         -1.13697
round_time               0 days 00:09:16.141430
episodes_test                             765.0
episode_length_test                   13.065359
returns_test                         -14.797269
return_std_test                        1.761378
average_reward_test                   -1.131937
round_time_test          0 days 00:00:08.964338
round_time_total         0 days 00:09:16.142507
loss_total             3577020065100104466432.0
loss_critic            4471275001638547357696.0
loss_actor                 -226906397933.567993
memory_size                             17878.0 

=== epoch 9/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:52,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.033898
returns                              -14.860396
return_std                             1.723283
average_reward                        -1.139743
round_time               0 days 00:09:16.675219
episodes_test                             760.0
episode_length_test                   13.143421
returns_test                         -14.685515
return_std_test                        1.791587
average_reward_test                   -1.117155
round_time_test          0 days 00:00:08.936038
round_time_total         0 days 00:09:16.676287
loss_total             3608910216472576917504.0
loss_critic            4511137696518821642240.0
loss_actor                 -229206542196.735992
memory_size                             17878.0 

=== epoch 9/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    767
episode_length                        13.027379
returns                              -14.793857
return_std                             1.735448
average_reward                        -1.135352
round_time               0 days 00:09:17.137563
episodes_test                             762.0
episode_length_test                   13.116798
returns_test                         -14.806107
return_std_test                        1.683192
average_reward_test                   -1.128407
round_time_test          0 days 00:00:09.122281
round_time_total         0 days 00:09:17.138634
loss_total             3712948457600917176320.0
loss_critic            4641185498465808941056.0
loss_actor                 -230382365900.799988
memory_size                             17878.0 

=== epoch 9/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:26,  3.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    765
episode_length                         13.05098
returns                              -14.780736
return_std                             1.748817
average_reward                        -1.132223
round_time               0 days 00:09:16.105078
episodes_test                             762.0
episode_length_test                   13.114173
returns_test                         -14.361733
return_std_test                        1.831212
average_reward_test                   -1.095043
round_time_test          0 days 00:00:08.770012
round_time_total         0 days 00:09:16.106160
loss_total             3773133642281263300608.0
loss_critic            4716416973414062882816.0
loss_actor                 -230143955845.119995
memory_size                             17878.0 

=== epoch 9/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:18,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    760
episode_length                        13.140789
returns                              -14.702183
return_std                             1.750548
average_reward                        -1.119041
round_time               0 days 00:09:17.916202
episodes_test                             764.0
episode_length_test                   13.085079
returns_test                         -14.609738
return_std_test                        1.756449
average_reward_test                   -1.116183
round_time_test          0 days 00:00:08.832864
round_time_total         0 days 00:09:17.917288
loss_total             3834002284444966191104.0
loss_critic            4792502772063692390400.0
loss_actor                 -232156799393.791992
memory_size                             17878.0 

=== epoch 9/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    760
episode_length                        13.127632
returns                              -14.751857
return_std                             1.729476
average_reward                        -1.124212
round_time               0 days 00:09:17.655076
episodes_test                             760.0
episode_length_test                   13.155263
returns_test                         -14.889549
return_std_test                        1.680823
average_reward_test                   -1.131411
round_time_test          0 days 00:00:08.982432
round_time_total         0 days 00:09:17.656158
loss_total             3863619121463907844096.0
loss_critic            4829523815909648498688.0
loss_actor                 -234715340070.911987
memory_size                             17878.0 

=== epoch 9/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:59,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    762
episode_length                        13.108924
returns                              -14.724923
return_std                             1.749553
average_reward                        -1.123548
round_time               0 days 00:09:16.085650
episodes_test                             771.0
episode_length_test                   12.958495
returns_test                         -14.877939
return_std_test                        1.671955
average_reward_test                   -1.147557
round_time_test          0 days 00:00:08.821432
round_time_total         0 days 00:09:16.086729
loss_total             3936482318558755291136.0
loss_critic            4920602813685582266368.0
loss_actor                 -237612512067.584015
memory_size                             17878.0 

=== epoch 9/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:05,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    762
episode_length                        13.108924
returns                              -14.763801
return_std                             1.790063
average_reward                        -1.126055
round_time               0 days 00:09:16.861186
episodes_test                             772.0
episode_length_test                   12.946891
returns_test                         -14.993665
return_std_test                        1.732593
average_reward_test                   -1.157459
round_time_test          0 days 00:00:08.892889
round_time_total         0 days 00:09:16.862268
loss_total             3999125620058959642624.0
loss_critic            4998906938792822702080.0
loss_actor                 -240011525488.640015
memory_size                             17878.0 

=== epoch 9/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    762
episode_length                        13.103675
returns                              -14.768216
return_std                             1.789488
average_reward                        -1.127081
round_time               0 days 00:09:17.649884
episodes_test                             763.0
episode_length_test                   13.098296
returns_test                         -14.709717
return_std_test                        1.742702
average_reward_test                   -1.122007
round_time_test          0 days 00:00:08.901913
round_time_total         0 days 00:09:17.650962
loss_total             4046764357634384461824.0
loss_critic            5058455361105151655936.0
loss_actor                 -242862253981.696014
memory_size                             17878.0 

=== epoch 9/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:48,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    765
episode_length                        13.061438
returns                              -14.829458
return_std                             1.722715
average_reward                        -1.135707
round_time               0 days 00:09:16.655840
episodes_test                             765.0
episode_length_test                   13.071895
returns_test                         -15.093127
return_std_test                        1.600232
average_reward_test                   -1.154624
round_time_test          0 days 00:00:09.118075
round_time_total         0 days 00:09:16.657088
loss_total             4216171322961014292480.0
loss_critic            5270214061570989752320.0
loss_actor                  -245066999005.18399
memory_size                             17878.0 

=== epoch 9/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    767
episode_length                        13.022164
returns                              -14.811348
return_std                             1.742505
average_reward                        -1.137301
round_time               0 days 00:09:16.168874
episodes_test                             765.0
episode_length_test                   13.071895
returns_test                         -14.900724
return_std_test                        1.695031
average_reward_test                   -1.139905
round_time_test          0 days 00:00:08.866503
round_time_total         0 days 00:09:16.169954
loss_total             4277631418887945846784.0
loss_critic            5347039179685250465792.0
loss_actor                 -246778568908.799988
memory_size                             17878.0 

=== epoch 9/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:21,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    770
episode_length                        12.987013
returns                              -14.878946
return_std                             1.745651
average_reward                        -1.145679
round_time               0 days 00:09:17.363293
episodes_test                             765.0
episode_length_test                   13.070588
returns_test                          -14.89844
return_std_test                        1.596507
average_reward_test                   -1.139653
round_time_test          0 days 00:00:08.768256
round_time_total         0 days 00:09:17.364357
loss_total             4332861737312099762176.0
loss_critic            5416077073792386465792.0
loss_actor                 -249426915131.391998
memory_size                             17878.0 

=== epoch 9/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:14,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    769
episode_length                        12.983095
returns                              -14.823316
return_std                             1.754273
average_reward                        -1.142269
round_time               0 days 00:09:16.062761
episodes_test                             767.0
episode_length_test                   13.028683
returns_test                         -14.341094
return_std_test                        1.770677
average_reward_test                   -1.099936
round_time_test          0 days 00:00:08.974140
round_time_total         0 days 00:09:16.063853
loss_total             4374072775686414663680.0
loss_critic            5467590875155571671040.0
loss_actor                 -251942749601.791992
memory_size                             17878.0 

=== epoch 9/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:00,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    768
episode_length                        13.007812
returns                              -14.798431
return_std                             1.771978
average_reward                         -1.13746
round_time               0 days 00:09:17.585674
episodes_test                             767.0
episode_length_test                   13.032595
returns_test                         -14.598982
return_std_test                        1.774226
average_reward_test                   -1.119662
round_time_test          0 days 00:00:08.932383
round_time_total         0 days 00:09:17.586752
loss_total             4477346652902964330496.0
loss_critic            5596683216662485270528.0
loss_actor                 -253986147614.720001
memory_size                             17878.0 

=== epoch 9/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    767
episode_length                        13.016949
returns                              -14.768428
return_std                             1.838936
average_reward                        -1.133954
round_time               0 days 00:09:16.886013
episodes_test                             764.0
episode_length_test                   13.086387
returns_test                         -14.682356
return_std_test                        1.821753
average_reward_test                   -1.121605
round_time_test          0 days 00:00:08.968096
round_time_total         0 days 00:09:16.887090
loss_total             4538179187808498352128.0
loss_critic            5672723888082764431360.0
loss_actor                 -256733940228.096008
memory_size                             17878.0 

=== epoch 9/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:08,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.049608
returns                              -14.764801
return_std                             1.807282
average_reward                        -1.131283
round_time               0 days 00:09:16.738517
episodes_test                             761.0
episode_length_test                   13.127464
returns_test                         -14.696382
return_std_test                        1.663183
average_reward_test                   -1.119235
round_time_test          0 days 00:00:08.855927
round_time_total         0 days 00:09:16.739597
loss_total             4667319647286759784448.0
loss_critic            5834149462474571644928.0
loss_actor                 -258929370218.496002
memory_size                             17878.0 

=== epoch 9/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    764
episode_length                        13.087696
returns                              -14.749078
return_std                             1.772422
average_reward                        -1.126768
round_time               0 days 00:09:17.117835
episodes_test                             764.0
episode_length_test                   13.078534
returns_test                         -14.768913
return_std_test                        1.688972
average_reward_test                   -1.128949
round_time_test          0 days 00:00:08.908536
round_time_total         0 days 00:09:17.118919
loss_total             4788313593883135049728.0
loss_critic            5985391891876148150272.0
loss_actor                 -261495520845.824005
memory_size                             17878.0 

=== epoch 9/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    762
episode_length                        13.108924
returns                              -14.819842
return_std                             1.700467
average_reward                        -1.130155
round_time               0 days 00:09:17.326260
episodes_test                             760.0
episode_length_test                   13.146053
returns_test                         -14.771636
return_std_test                        1.754195
average_reward_test                    -1.12334
round_time_test          0 days 00:00:08.860025
round_time_total         0 days 00:09:17.327349
loss_total             4842119180902615482368.0
loss_critic            6052648870583949328384.0
loss_actor                    -263822940069.888
memory_size                             17878.0 

=== epoch 9/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    760
episode_length                        13.131579
returns                              -14.786155
return_std                             1.694189
average_reward                        -1.126774
round_time               0 days 00:09:16.384265
episodes_test                             764.0
episode_length_test                   13.089005
returns_test                         -14.516904
return_std_test                          1.7185
average_reward_test                   -1.109091
round_time_test          0 days 00:00:08.915955
round_time_total         0 days 00:09:16.385344
loss_total             4947392010685306109952.0
loss_critic            6184239911339545329664.0
loss_actor                 -265536817635.328003
memory_size                             17878.0 

=== epoch 9/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    760
episode_length                        13.134211
returns                              -14.788851
return_std                             1.621714
average_reward                        -1.126439
round_time               0 days 00:09:16.823280
episodes_test                             760.0
episode_length_test                   13.151316
returns_test                          -14.55488
return_std_test                        1.746427
average_reward_test                   -1.106076
round_time_test          0 days 00:00:09.038218
round_time_total         0 days 00:09:16.824354
loss_total             5006769253497966690304.0
loss_critic            6258461462550794993664.0
loss_actor                 -267960298872.832001
memory_size                             17878.0 

=== epoch 9/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:15,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    761
episode_length                        13.107753
returns                              -14.616594
return_std                             1.699678
average_reward                        -1.115349
round_time               0 days 00:09:16.964811
episodes_test                             755.0
episode_length_test                   13.242384
returns_test                         -14.587816
return_std_test                        1.636461
average_reward_test                   -1.101233
round_time_test          0 days 00:00:08.858434
round_time_total         0 days 00:09:16.965885
loss_total             5099323265086938677248.0
loss_critic            6374153968874235625472.0
loss_actor                 -270628355735.552002
memory_size                             17878.0 

=== epoch 9/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:14,  3.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    762
episode_length                        13.106299
returns                              -14.552757
return_std                             1.714761
average_reward                        -1.110708
round_time               0 days 00:09:15.771661
episodes_test                             757.0
episode_length_test                    13.19683
returns_test                          -14.44582
return_std_test                        1.653803
average_reward_test                   -1.094649
round_time_test          0 days 00:00:08.818209
round_time_total         0 days 00:09:15.772732
loss_total             5215233061507412525056.0
loss_critic            6519041207626836738048.0
loss_actor                  -273254172049.40799
memory_size                             17878.0 

=== epoch 9/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:04,  3.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    763
episode_length                        13.102228
returns                              -14.492763
return_std                             1.695517
average_reward                        -1.106355
round_time               0 days 00:09:16.742940
episodes_test                             761.0
episode_length_test                   13.122208
returns_test                         -14.439186
return_std_test                        1.710408
average_reward_test                   -1.100096
round_time_test          0 days 00:00:08.975646
round_time_total         0 days 00:09:16.744000
loss_total             5258777576661595455488.0
loss_critic            6573471855360681705472.0
loss_actor                 -275853160038.400024
memory_size                             17878.0 

=== epoch 9/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:54,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    766
episode_length                        13.039164
returns                              -14.484671
return_std                             1.702754
average_reward                        -1.110833
round_time               0 days 00:09:16.584811
episodes_test                             761.0
episode_length_test                    13.12615
returns_test                         -14.746702
return_std_test                        1.711506
average_reward_test                   -1.123498
round_time_test          0 days 00:00:08.930532
round_time_total         0 days 00:09:16.585873
loss_total             5381963298673609670656.0
loss_critic            6727454009696490356736.0
loss_actor                 -277668684005.375977
memory_size                             17878.0 

=== epoch 9/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    764
episode_length                        13.068063
returns                              -14.450435
return_std                             1.768223
average_reward                        -1.106026
round_time               0 days 00:09:17.073512
episodes_test                             761.0
episode_length_test                   13.131406
returns_test                         -14.689622
return_std_test                        1.814462
average_reward_test                   -1.117919
round_time_test          0 days 00:00:08.903114
round_time_total         0 days 00:09:17.074607
loss_total             5400981528710884622336.0
loss_critic            6751226794287596306432.0
loss_actor                 -278913086046.208008
memory_size                             17878.0 

=== epoch 9/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:22,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    761
episode_length                        13.122208
returns                              -14.529943
return_std                             1.695589
average_reward                        -1.107189
round_time               0 days 00:09:16.842853
episodes_test                             758.0
episode_length_test                   13.192612
returns_test                         -14.216665
return_std_test                        1.654012
average_reward_test                   -1.077623
round_time_test          0 days 00:00:08.891533
round_time_total         0 days 00:09:16.843928
loss_total             5500417083912808824832.0
loss_critic            6875521244438471901184.0
loss_actor                 -281874092548.096008
memory_size                             17878.0 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  0%|          | 7/2000 [00:01<09:43,  3.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    148
episode_length                        13.486486
returns                              -14.120262
return_std                             1.614471
average_reward                         -1.04374
round_time               0 days 00:09:15.181813
episodes_test                             746.0
episode_length_test                   13.395442
returns_test                         -14.140528
return_std_test                        1.735245
average_reward_test                   -1.054934
round_time_test          0 days 00:00:08.889314
round_time_total         0 days 00:09:15.182905
loss_total             5595530593873197793280.0
loss_critic            6994413124473850953728.0
loss_actor                 -284556972195.840027
memory_size                             17878.0 

=== epoch 10/10 ==== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:51,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    298
episode_length                        13.412752
returns                              -14.273398
return_std                             1.767131
average_reward                        -1.063019
round_time               0 days 00:09:15.747971
episodes_test                             749.0
episode_length_test                   13.339119
returns_test                         -14.357115
return_std_test                        1.675848
average_reward_test                   -1.076008
round_time_test          0 days 00:00:08.867555
round_time_total         0 days 00:09:15.749070
loss_total             5672072051076079026176.0
loss_critic            7090089947525564858368.0
loss_actor                 -287341971963.903992
memory_size                             17878.0 

=== epoch 10/10 ==== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:22,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    447
episode_length                        13.407159
returns                              -14.374131
return_std                             1.774021
average_reward                        -1.070902
round_time               0 days 00:09:16.138462
episodes_test                             758.0
episode_length_test                   13.184697
returns_test                         -14.483335
return_std_test                        1.754039
average_reward_test                   -1.097562
round_time_test          0 days 00:00:08.902114
round_time_total         0 days 00:09:16.139556
loss_total             5810585598497988280320.0
loss_critic            7263231868353725333504.0
loss_actor                 -290301689405.440002
memory_size                           17880.175 

=== epoch 10/10 ==== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    599
episode_length                        13.343907
returns                              -14.429795
return_std                             1.748573
average_reward                        -1.080469
round_time               0 days 00:09:16.430015
episodes_test                             757.0
episode_length_test                    13.19683
returns_test                          -14.56868
return_std_test                         1.77218
average_reward_test                   -1.103304
round_time_test          0 days 00:00:08.927752
round_time_total         0 days 00:09:16.431082
loss_total             5969475474142453039104.0
loss_critic            7461844215222678585344.0
loss_actor                 -292620404457.471985
memory_size                             17883.0 

=== epoch 10/10 ==== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:21,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    752
episode_length                        13.291223
returns                              -14.375193
return_std                             1.773104
average_reward                        -1.081099
round_time               0 days 00:09:17.251247
episodes_test                             760.0
episode_length_test                   13.151316
returns_test                         -14.345922
return_std_test                        1.775046
average_reward_test                   -1.090052
round_time_test          0 days 00:00:08.877105
round_time_total         0 days 00:09:17.252336
loss_total             5983034009878793289728.0
loss_critic            7478792389968449765376.0
loss_actor                    -295327692890.112
memory_size                             17883.0 

=== epoch 10/10 ==== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:36,  3.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    754
episode_length                        13.237401
returns                              -14.418321
return_std                             1.792064
average_reward                        -1.089864
round_time               0 days 00:09:16.828762
episodes_test                             754.0
episode_length_test                   13.251989
returns_test                         -14.253576
return_std_test                        1.807279
average_reward_test                   -1.075241
round_time_test          0 days 00:00:09.002395
round_time_total         0 days 00:09:16.830049
loss_total             6151898407418015514624.0
loss_critic            7689872877111221420032.0
loss_actor                 -298731715387.392029
memory_size                             17883.0 

=== epoch 10/10 ==== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:53,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    755
episode_length                         13.22649
returns                              -14.420245
return_std                             1.744691
average_reward                        -1.090219
round_time               0 days 00:09:17.150886
episodes_test                             761.0
episode_length_test                   13.124836
returns_test                          -14.34391
return_std_test                        1.646247
average_reward_test                   -1.092864
round_time_test          0 days 00:00:08.981917
round_time_total         0 days 00:09:17.152021
loss_total             6299985961170062278656.0
loss_critic            7874982305552986537984.0
loss_actor                 -301423465152.512024
memory_size                             17883.0 

=== epoch 10/10 ==== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:17,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                    757
episode_length                        13.188904
returns                              -14.410717
return_std                             1.693627
average_reward                        -1.092401
round_time               0 days 00:09:15.462572
episodes_test                             758.0
episode_length_test                   13.184697
returns_test                         -14.314876
return_std_test                         1.65378
average_reward_test                   -1.085368
round_time_test          0 days 00:00:09.020182
round_time_total         0 days 00:09:15.463661
loss_total             6378592856700772941824.0
loss_critic            7973240930877349822464.0
loss_actor                 -304143008727.039978
memory_size                             17883.0 

=== epoch 10/10 ==== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:17,  3.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    758
episode_length                        13.183377
returns                              -14.357102
return_std                             1.700329
average_reward                         -1.08988
round_time               0 days 00:09:15.621549
episodes_test                             762.0
episode_length_test                   13.122047
returns_test                         -14.551119
return_std_test                        1.528323
average_reward_test                   -1.108725
round_time_test          0 days 00:00:08.913882
round_time_total         0 days 00:09:15.622627
loss_total             6499297756749117784064.0
loss_critic            8124122048513877999616.0
loss_actor                  -307180061163.52002
memory_size                             17883.0 

=== epoch 10/10 ==== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:37,  3.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    757
episode_length                        13.194188
returns                              -14.408488
return_std                             1.631012
average_reward                        -1.091983
round_time               0 days 00:09:16.408509
episodes_test                             764.0
episode_length_test                   13.077225
returns_test                         -14.630852
return_std_test                        1.737152
average_reward_test                    -1.11886
round_time_test          0 days 00:00:09.162810
round_time_total         0 days 00:09:16.409599
loss_total             6562217544961679687680.0
loss_critic            8202771793683981271040.0
loss_actor                 -310319466807.296021
memory_size                             17883.0 

=== epoch 10/10 ==== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:42,  3.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    756
episode_length                        13.220899
returns                              -14.380934
return_std                             1.666162
average_reward                        -1.087857
round_time               0 days 00:09:16.637133
episodes_test                             761.0
episode_length_test                   13.140604
returns_test                         -14.621024
return_std_test                        1.752619
average_reward_test                    -1.11266
round_time_test          0 days 00:00:09.008182
round_time_total         0 days 00:09:16.638200
loss_total             6754590432081237508096.0
loss_critic            8443237900789026390016.0
loss_actor                 -312620542148.607971
memory_size                           17883.787 

=== epoch 10/10 ==== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    755
episode_length                         13.22649
returns                              -14.479739
return_std                             1.690006
average_reward                        -1.094867
round_time               0 days 00:09:16.420292
episodes_test                             759.0
episode_length_test                    13.16469
returns_test                          -14.65615
return_std_test                        1.663333
average_reward_test                   -1.112905
round_time_test          0 days 00:00:08.834260
round_time_total         0 days 00:09:16.421381
loss_total             6831399984347812462592.0
loss_critic            8539249833663156715520.0
loss_actor                 -315657301688.320007
memory_size                             17889.0 

=== epoch 10/10 ==== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:41,  3.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    758
episode_length                        13.187335
returns                              -14.469182
return_std                             1.763787
average_reward                        -1.097655
round_time               0 days 00:09:16.058573
episodes_test                             762.0
episode_length_test                   13.112861
returns_test                         -14.747217
return_std_test                        1.901671
average_reward_test                   -1.123898
round_time_test          0 days 00:00:08.986197
round_time_total         0 days 00:09:16.059653
loss_total             7018505962579762872320.0
loss_critic            8773132300031947177984.0
loss_actor                 -318367733825.536011
memory_size                             17889.0 

=== epoch 10/10 ==== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:45,  3.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.171278
returns                                -14.5166
return_std                             1.746293
average_reward                        -1.101549
round_time               0 days 00:09:16.365150
episodes_test                             760.0
episode_length_test                   13.157895
returns_test                         -14.441957
return_std_test                        1.865338
average_reward_test                   -1.097589
round_time_test          0 days 00:00:08.980626
round_time_total         0 days 00:09:16.366241
loss_total             7060810070604798492672.0
loss_critic            8826012443067687108608.0
loss_actor                  -320996944248.83197
memory_size                             17889.0 

=== epoch 10/10 ==== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    758
episode_length                        13.170185
returns                              -14.568286
return_std                              1.77503
average_reward                        -1.106579
round_time               0 days 00:09:15.922177
episodes_test                             763.0
episode_length_test                   13.102228
returns_test                         -14.840962
return_std_test                        1.677669
average_reward_test                   -1.132313
round_time_test          0 days 00:00:08.952806
round_time_total         0 days 00:09:15.923260
loss_total             7221660385157290393600.0
loss_critic            9027075329590862807040.0
loss_actor                 -324094091296.768005
memory_size                             17889.0 

=== epoch 10/10 ==== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:03,  3.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    761
episode_length                        13.116951
returns                              -14.661227
return_std                             1.704124
average_reward                        -1.117351
round_time               0 days 00:09:16.144202
episodes_test                             764.0
episode_length_test                   13.089005
returns_test                         -14.691443
return_std_test                        1.576576
average_reward_test                   -1.122426
round_time_test          0 days 00:00:08.846974
round_time_total         0 days 00:09:16.145281
loss_total             7395067774183923515392.0
loss_critic            9243834563200142016512.0
loss_actor                 -326578487705.599976
memory_size                             17889.0 

=== epoch 10/10 ==== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:05,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                    764
episode_length                        13.066754
returns                              -14.628443
return_std                             1.673196
average_reward                        -1.119569
round_time               0 days 00:09:17.729581
episodes_test                             763.0
episode_length_test                   13.096986
returns_test                         -14.701848
return_std_test                        1.706063
average_reward_test                   -1.121879
round_time_test          0 days 00:00:08.827239
round_time_total         0 days 00:09:17.730659
loss_total             7482636710144459669504.0
loss_critic            9353295732297591422976.0
loss_actor                 -329503619547.135986
memory_size                             17889.0 

=== epoch 10/10 ==== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:10,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                    763
episode_length                         13.10616
returns                              -14.698702
return_std                             1.599057
average_reward                        -1.121511
round_time               0 days 00:09:17.007105
episodes_test                             763.0
episode_length_test                   13.100917
returns_test                         -14.588702
return_std_test                        1.760949
average_reward_test                      -1.113
round_time_test          0 days 00:00:08.872429
round_time_total         0 days 00:09:17.008173
loss_total             7591602341736168292352.0
loss_critic            9489502764794334150656.0
loss_actor                 -332236492980.223999
memory_size                             17889.0 

=== epoch 10/10 ==== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                    760
episode_length                        13.135526
returns                               -14.75871
return_std                             1.582324
average_reward                        -1.123942
round_time               0 days 00:09:16.726835
episodes_test                             760.0
episode_length_test                   13.142105
returns_test                         -14.579314
return_std_test                        1.715962
average_reward_test                   -1.109183
round_time_test          0 days 00:00:08.959202
round_time_total         0 days 00:09:16.727914
loss_total             7813447658797041451008.0
loss_critic            9766809408551966998528.0
loss_actor                 -335150240694.271973
memory_size                             17889.0 

=== epoch 10/10 ==== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:59,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                    759
episode_length                        13.160738
returns                              -14.763095
return_std                             1.567008
average_reward                        -1.121587
round_time               0 days 00:09:16.496867
episodes_test                             758.0
episode_length_test                   13.186016
returns_test                         -14.568476
return_std_test                        1.714678
average_reward_test                   -1.104332
round_time_test          0 days 00:00:08.955115
round_time_total         0 days 00:09:16.497957
loss_total             7956172988247513759744.0
loss_critic            9945216062255058124800.0
loss_actor                 -337690279149.567993
memory_size                             17889.0 

=== epoch 10/10 ==== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:28,  3.51it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                     758
episode_length                         13.167546
returns                               -14.734894
return_std                              1.537819
average_reward                         -1.119598
round_time                0 days 00:09:17.189386
episodes_test                              760.0
episode_length_test                    13.146053
returns_test                          -14.489505
return_std_test                         1.665194
average_reward_test                    -1.101204
round_time_test           0 days 00:00:08.902053
round_time_total          0 days 00:09:17.190459
loss_total              8011182210681105547264.0
loss_critic            10013977589100778094592.0
loss_actor                  -340753857167.359985
memory_size                              17889.0 

=== epoch 10/10 ==== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:16,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     758
episode_length                         13.183377
returns                                -14.74759
return_std                              1.571383
average_reward                         -1.118732
round_time                0 days 00:09:16.424461
episodes_test                              758.0
episode_length_test                     13.17942
returns_test                          -14.849497
return_std_test                         1.604963
average_reward_test                    -1.126184
round_time_test           0 days 00:00:08.948055
round_time_total          0 days 00:09:16.425538
loss_total              8107405618168144592896.0
loss_critic            10134256854511290155008.0
loss_actor                  -343753183707.135986
memory_size                              17889.0 

=== epoch 10/10 ==== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:50,  3.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     759
episode_length                         13.166008
returns                               -14.728545
return_std                              1.578138
average_reward                         -1.118055
round_time                0 days 00:09:16.698431
episodes_test                              761.0
episode_length_test                    13.130092
returns_test                          -14.813406
return_std_test                         1.733957
average_reward_test                    -1.127769
round_time_test           0 days 00:00:08.878554
round_time_total          0 days 00:09:16.699517
loss_total              8267900314983260487680.0
loss_critic            10334875212599928029184.0
loss_actor                  -346344194883.583984
memory_size                              17889.0 

=== epoch 10/10 ==== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                     759
episode_length                          13.15415
returns                                -14.70091
return_std                               1.62275
average_reward                         -1.117554
round_time                0 days 00:09:17.361631
episodes_test                              760.0
episode_length_test                        13.15
returns_test                          -14.603493
return_std_test                         1.721919
average_reward_test                    -1.109765
round_time_test           0 days 00:00:08.823318
round_time_total          0 days 00:09:17.362745
loss_total              8425567376177204559872.0
loss_critic            10531959040781206945792.0
loss_actor                  -349523880017.919983
memory_size                              17889.0 

=== epoch 10/10 ==== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     760
episode_length                         13.147368
returns                               -14.675004
return_std                              1.644867
average_reward                          -1.11591
round_time                0 days 00:09:16.130351
episodes_test                              765.0
episode_length_test                    13.067974
returns_test                          -14.797817
return_std_test                         1.687853
average_reward_test                    -1.131899
round_time_test           0 days 00:00:08.849988
round_time_total          0 days 00:09:16.131418
loss_total              8527582141020952854528.0
loss_critic            10659477483782490030080.0
loss_actor                  -352728646205.440002
memory_size                              17889.0 

=== epoch 10/10 ==== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     761
episode_length                         13.130092
returns                               -14.704006
return_std                              1.721209
average_reward                           -1.1191
round_time                0 days 00:09:16.384515
episodes_test                              767.0
episode_length_test                    13.027379
returns_test                           -14.72681
return_std_test                         1.753355
average_reward_test                    -1.130102
round_time_test           0 days 00:00:08.932483
round_time_total          0 days 00:09:16.385610
loss_total              8653697937214603788288.0
loss_critic            10817122231329130283008.0
loss_actor                  -355530807705.599976
memory_size                              17889.0 

=== epoch 10/10 ==== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:51,  3.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     761
episode_length                         13.110381
returns                                -14.70853
return_std                              1.690807
average_reward                         -1.121856
round_time                0 days 00:09:16.693574
episodes_test                              763.0
episode_length_test                    13.104849
returns_test                          -14.837607
return_std_test                          1.67546
average_reward_test                    -1.132041
round_time_test           0 days 00:00:08.980743
round_time_total          0 days 00:09:16.694645
loss_total              8895952102005778939904.0
loss_critic            11119939936966257147904.0
loss_actor                  -358434863185.919983
memory_size                              17889.0 

=== epoch 10/10 ==== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:12,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     762
episode_length                         13.112861
returns                               -14.743477
return_std                              1.730367
average_reward                         -1.124703
round_time                0 days 00:09:16.697887
episodes_test                              766.0
episode_length_test                    13.039164
returns_test                          -14.754848
return_std_test                         1.669111
average_reward_test                    -1.131386
round_time_test           0 days 00:00:09.011400
round_time_total          0 days 00:09:16.698949
loss_total              8920338340338740494336.0
loss_critic            11150422743291522973696.0
loss_actor                  -360911424962.559998
memory_size                              17889.0 

=== epoch 10/10 ==== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:59,  3.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     762
episode_length                         13.099738
returns                               -14.732588
return_std                              1.753999
average_reward                         -1.123706
round_time                0 days 00:09:16.452923
episodes_test                              763.0
episode_length_test                    13.089122
returns_test                          -14.805318
return_std_test                         1.668757
average_reward_test                    -1.131142
round_time_test           0 days 00:00:08.968172
round_time_total          0 days 00:09:16.453992
loss_total              9037269266050226061312.0
loss_critic            11296586385354376871936.0
loss_actor                  -363040168902.656006
memory_size                              17889.0 

=== epoch 10/10 ==== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:05,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     765
episode_length                         13.058824
returns                                   -14.84
return_std                              1.724013
average_reward                         -1.137098
round_time                0 days 00:09:16.492547
episodes_test                              766.0
episode_length_test                    13.048303
returns_test                          -14.876109
return_std_test                         1.722511
average_reward_test                    -1.139536
round_time_test           0 days 00:00:08.898333
round_time_total          0 days 00:09:16.493624
loss_total              9183773030371512287232.0
loss_critic            11479716071510133178368.0
loss_actor                   -366251666030.59198
memory_size                              17889.0 

=== epoch 10/10 ==== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:44,  3.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     765
episode_length                         13.054902
returns                               -14.868343
return_std                              1.721431
average_reward                         -1.139069
round_time                0 days 00:09:16.090777
episodes_test                              762.0
episode_length_test                     13.12336
returns_test                          -14.907856
return_std_test                         1.689116
average_reward_test                    -1.135979
round_time_test           0 days 00:00:08.946542
round_time_total          0 days 00:09:16.091862
loss_total              9389374109585590190080.0
loss_critic            11736717436395882283008.0
loss_actor                  -369695554732.031982
memory_size                              17889.0 

=== epoch 10/10 ==== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     765
episode_length                         13.057516
returns                               -14.847014
return_std                              1.764484
average_reward                         -1.137043
round_time                0 days 00:09:15.857821
episodes_test                              766.0
episode_length_test                     13.04047
returns_test                          -14.789337
return_std_test                         1.600124
average_reward_test                    -1.133917
round_time_test           0 days 00:00:09.024745
round_time_total          0 days 00:09:15.858886
loss_total              9644676682422316695552.0
loss_critic            12055845630645073084416.0
loss_actor                  -372201951100.927979
memory_size                             17889.85 

=== epoch 10/10 ==== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     767
episode_length                         13.014342
returns                               -14.807382
return_std                              1.718196
average_reward                         -1.137226
round_time                0 days 00:09:16.504401
episodes_test                              771.0
episode_length_test                    12.967575
returns_test                            -14.8471
return_std_test                         1.806134
average_reward_test                    -1.144506
round_time_test           0 days 00:00:08.964701
round_time_total          0 days 00:09:16.505465
loss_total              9736700388892347465728.0
loss_critic            12170875279372063866880.0
loss_actor                  -375327616827.392029
memory_size                              17890.0 

=== epoch 10/10 ==== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:55,  3.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     769
episode_length                         12.983095
returns                               -14.859908
return_std                              1.696998
average_reward                         -1.144812
round_time                0 days 00:09:15.803216
episodes_test                              767.0
episode_length_test                    13.024772
returns_test                          -14.888134
return_std_test                          1.68701
average_reward_test                    -1.141747
round_time_test           0 days 00:00:09.099117
round_time_total          0 days 00:09:15.804294
loss_total              9873349775570170281984.0
loss_critic            12341686996657676222464.0
loss_actor                  -378177393033.216003
memory_size                              17890.0 

=== epoch 10/10 ==== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:43,  3.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.60it/s]
episodes                                     769
episode_length                         12.997399
returns                               -14.791954
return_std                              1.801879
average_reward                         -1.137591
round_time                0 days 00:09:15.324088
episodes_test                              759.0
episode_length_test                    13.173913
returns_test                          -14.866386
return_std_test                          1.65239
average_reward_test                    -1.128286
round_time_test           0 days 00:00:08.951674
round_time_total          0 days 00:09:15.325214
loss_total             10083677442547344474112.0
loss_critic            12604596571407128723456.0
loss_actor                  -381685808087.039978
memory_size                              17890.0 

=== epoch 10/10 ==== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:48,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     768
episode_length                         13.001302
returns                               -14.772659
return_std                              1.779196
average_reward                         -1.136169
round_time                0 days 00:09:16.672939
episodes_test                              761.0
episode_length_test                    13.140604
returns_test                          -14.747605
return_std_test                         1.754875
average_reward_test                    -1.122293
round_time_test           0 days 00:00:08.948260
round_time_total          0 days 00:09:16.674005
loss_total             10259212405995496013824.0
loss_critic            12824015266516605861888.0
loss_actor                   -384604145041.40802
memory_size                              17890.0 

=== epoch 10/10 ==== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     766
episode_length                         13.048303
returns                               -14.812231
return_std                              1.761901
average_reward                         -1.134952
round_time                0 days 00:09:16.488732
episodes_test                              763.0
episode_length_test                    13.104849
returns_test                          -14.579626
return_std_test                         1.755559
average_reward_test                    -1.112333
round_time_test           0 days 00:00:08.959846
round_time_total          0 days 00:09:16.489795
loss_total             10427211531160437915648.0
loss_critic            13034014184794730004480.0
loss_actor                  -387248034480.127991
memory_size                              17890.0 

=== epoch 10/10 ==== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:03,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     763
episode_length                         13.096986
returns                               -14.843954
return_std                              1.768862
average_reward                         -1.134263
round_time                0 days 00:09:15.922594
episodes_test                              770.0
episode_length_test                    12.971429
returns_test                          -14.775421
return_std_test                         1.731131
average_reward_test                    -1.139068
round_time_test           0 days 00:00:08.939672
round_time_total          0 days 00:09:15.923659
loss_total             10507866327579816361984.0
loss_critic            13134832693372356919296.0
loss_actor                  -390372562239.487976
memory_size                              17890.0 

=== epoch 10/10 ==== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:06,  3.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     760
episode_length                         13.138158
returns                               -14.794559
return_std                              1.770896
average_reward                         -1.126488
round_time                0 days 00:09:16.091868
episodes_test                              763.0
episode_length_test                    13.090433
returns_test                          -15.015289
return_std_test                         1.705492
average_reward_test                    -1.146671
round_time_test           0 days 00:00:08.963026
round_time_total          0 days 00:09:16.092949
loss_total             10787857394249689464832.0
loss_critic            13484821506689790377984.0
loss_actor                  -393566407966.719971
memory_size                           17890.5475 

=== epoch 10/10 ==== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:15,  3.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     760
episode_length                         13.135526
returns                               -14.855294
return_std                              1.683085
average_reward                         -1.131267
round_time                0 days 00:09:15.825613
episodes_test                              768.0
episode_length_test                    13.015625
returns_test                          -14.715238
return_std_test                         1.743611
average_reward_test                    -1.130088
round_time_test           0 days 00:00:09.013438
round_time_total          0 days 00:09:15.826697
loss_total             10956225858775644372992.0
loss_critic            13695282084884330315776.0
loss_actor                  -397069685866.495972
memory_size                              17893.0 

=== epoch 10/10 ==== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     761
episode_length                          13.12615
returns                               -14.842502
return_std                               1.71251
average_reward                         -1.131053
round_time                0 days 00:09:16.236887
episodes_test                              760.0
episode_length_test                    13.143421
returns_test                          -14.842557
return_std_test                         1.684793
average_reward_test                    -1.129374
round_time_test           0 days 00:00:08.815917
round_time_total          0 days 00:09:16.237954
loss_total             11193493084820329201664.0
loss_critic            13991866114097668947968.0
loss_actor                  -400278967369.728027
memory_size                              17893.0 

=== epoch 10/10 ==== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 6/2000 [00:01<09:36,  3.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.59it/s]
episodes                                     763
episode_length                         13.089122
returns                               -14.834869
return_std                              1.773801
average_reward                         -1.133174
round_time                0 days 00:09:17.782926
episodes_test                              766.0
episode_length_test                    13.050914
returns_test                          -14.850907
return_std_test                         1.709275
average_reward_test                    -1.137311
round_time_test           0 days 00:00:08.849849
round_time_total          0 days 00:09:17.783992
loss_total             11443051356583266091008.0
loss_critic            14303813942208089817088.0
loss_actor                  -403195275984.895996
memory_size                              17893.0 

=== epoch 10/10 ==== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:56,  3.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     764
episode_length                         13.077225
returns                               -14.815407
return_std                               1.85521
average_reward                         -1.132588
round_time                0 days 00:09:16.105736
episodes_test                              762.0
episode_length_test                    13.112861
returns_test                          -14.917494
return_std_test                         1.798753
average_reward_test                    -1.137397
round_time_test           0 days 00:00:08.939978
round_time_total          0 days 00:09:16.106810
loss_total             11556504064200867315712.0
loss_critic            14445629832412365062144.0
loss_actor                  -406583194828.799988
memory_size                           17893.4645 

=== epoch 10/10 ==== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<09:53,  3.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.59it/s]
episodes                                     766
episode_length                         13.037859
returns                               -14.839431
return_std                              1.862221
average_reward                         -1.138293
round_time                0 days 00:09:17.402913
episodes_test                              761.0
episode_length_test                     13.13272
returns_test                          -14.861654
return_std_test                         1.752336
average_reward_test                    -1.130669
round_time_test           0 days 00:00:08.979480
round_time_total          0 days 00:09:17.403994
loss_total             11645424043281944150016.0
loss_critic            14556779797819464613888.0
loss_actor                  -409537689305.088013
memory_size                              17894.0 

=== epoch 10/10 ==== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:13,  3.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:16<00:00,  3.60it/s]
episodes                                     764
episode_length                         13.082461
returns                               -14.784362
return_std                              1.861606
average_reward                         -1.130027
round_time                0 days 00:09:16.766896
episodes_test                              760.0
episode_length_test                    13.157895
returns_test                          -14.725119
return_std_test                         1.721417
average_reward_test                    -1.119109
round_time_test           0 days 00:00:08.898710
round_time_total          0 days 00:09:16.767969
loss_total             11848429109607330217984.0
loss_critic            14810536139451919237120.0
loss_actor                  -413560787271.679993
memory_size                           17894.4635 

=== epoch 10/10 ==== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     763
episode_length                         13.094364
returns                                -14.73831
return_std                              1.827655
average_reward                         -1.126008
round_time                0 days 00:09:15.755850
episodes_test                              764.0
episode_length_test                    13.089005
returns_test                          -14.829871
return_std_test                         1.689044
average_reward_test                    -1.133002
round_time_test           0 days 00:00:08.981421
round_time_total          0 days 00:09:15.756925
loss_total             12016128493758420353024.0
loss_critic            15020160361478009913344.0
loss_actor                  -417354254811.135986
memory_size                              17896.0 

=== epoch 10/10 ==== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:15<00:00,  3.60it/s]
episodes                                     764
episode_length                         13.064136
returns                               -14.672947
return_std                              1.770921
average_reward                         -1.123337
round_time                0 days 00:09:15.820128
episodes_test                              758.0
episode_length_test                    13.192612
returns_test                          -14.802559
return_std_test                         1.622604
average_reward_test                    -1.122034
round_time_test           0 days 00:00:09.126712
round_time_total          0 days 00:09:15.821205
loss_total             12253977636666356006912.0
loss_critic            15317471791274012966912.0
loss_actor                  -420965695160.320007
memory_size                              17896.0 

=== epoch 10/10 ==== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:40,  3.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:17<00:00,  3.58it/s]
episodes                                     763
episode_length                         13.091743
returns                               -14.649873
return_std                              1.706116
average_reward                         -1.118515
round_time                0 days 00:09:18.501676
episodes_test                              763.0
episode_length_test                    13.091743
returns_test                           -14.66259
return_std_test                          1.76922
average_reward_test                    -1.119951
round_time_test           0 days 00:00:08.834129
round_time_total          0 days 00:09:18.502749
loss_total             12573893596340241301504.0
loss_critic            15717366723837134110720.0
loss_actor                  -423686437044.223999
memory_size                           17897.1185 

=== epoch 10/10 ==== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
episodes                                     761
episode_length                         13.127464
returns                               -14.625881
return_std                              1.678353
average_reward                         -1.113824
round_time                0 days 00:09:14.510633
episodes_test                              757.0
episode_length_test                    13.207398
returns_test                          -14.747472
return_std_test                         1.716385
average_reward_test                     -1.11619
round_time_test           0 days 00:00:08.867752
round_time_total          0 days 00:09:14.511698
loss_total             12813336529027463118848.0
loss_critic            16016670392282211418112.0
loss_actor                  -426762928357.375977
memory_size                              17898.0 

=== epoch 10/10 ==== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:34,  3.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:14<00:00,  3.61it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                     762
episode_length                          13.10105
returns                               -14.631644
return_std                              1.684165
average_reward                         -1.116619
round_time                0 days 00:09:14.844586
episodes_test                              766.0
episode_length_test                    13.053525
returns_test                          -14.684089
return_std_test                         1.713808
average_reward_test                    -1.124727
round_time_test           0 days 00:00:08.931057
round_time_total          0 days 00:09:14.845675
loss_total             12984442402370805039104.0
loss_critic            16230552741438069669888.0
loss_actor                  -430021658607.616028
memory_size                              17898.0 


