/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Walker2d-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 5
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>

<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:01<00:00, 1143.29it/s]
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  113
episode_length                      17.663717
returns                             -2.036096
return_std                           5.117879
average_reward                      -0.114821
round_time             0 days 00:00:01.765817
episodes_test                           233.0
episode_length_test                  42.88412
returns_test                         5.892178
return_std_test                     11.928236
average_reward_test                  0.137369
round_time_test        0 days 00:00:08.483814
round_time_total       0 days 00:00:10.231786 

=== epoch 1/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 96%|█████████▌| 1917/2000 [00:01<00:00, 1187.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1184.24it/s]
episodes                                  212
episode_length                      18.820755
returns                             -2.169102
return_std                           5.061533
average_reward                      -0.115142
round_time             0 days 00:00:02.188083
episodes_test                           236.0
episode_length_test                 42.190678
returns_test                         6.737754
return_std_test                     10.904686
average_reward_test                   0.16142
round_time_test        0 days 00:00:08.643305
round_time_total       0 days 00:00:10.403328 

=== epoch 1/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 88%|████████▊ | 1751/2000 [00:01<00:00, 1034.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1120.01it/s]
episodes                                  313
episode_length                      19.099042
returns                             -1.944345
return_std                           5.766504
average_reward                      -0.100835
round_time             0 days 00:00:02.345372
episodes_test                           227.0
episode_length_test                 43.845815
returns_test                         7.031277
return_std_test                     10.679614
average_reward_test                  0.160692
round_time_test        0 days 00:00:08.635601
round_time_total       0 days 00:00:10.378492 

=== epoch 1/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
 88%|████████▊ | 1768/2000 [00:01<00:00, 1061.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:01<00:00, 1137.36it/s]
episodes                                  420
episode_length                      19.047619
returns                              -1.78439
return_std                           5.693828
average_reward                       -0.09368
round_time             0 days 00:00:02.266436
episodes_test                           224.0
episode_length_test                 44.428571
returns_test                         6.396139
return_std_test                       11.4993
average_reward_test                  0.146186
round_time_test        0 days 00:00:08.724929
round_time_total       0 days 00:00:10.477104 

=== epoch 1/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:01<00:00, 1186.81it/s]
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  532
episode_length                      18.770677
returns                             -1.751683
return_std                           5.538724
average_reward                      -0.092923
round_time             0 days 00:00:02.183361
episodes_test                           225.0
episode_length_test                 44.395556
returns_test                         4.782382
return_std_test                     11.622808
average_reward_test                  0.108295
round_time_test        0 days 00:00:08.734777
round_time_total       0 days 00:00:10.500247 

=== epoch 1/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 9/2000 [00:01<05:40,  5.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:42<00:00,  5.85it/s]
starting training
episodes                                  463
episode_length                      21.429806
returns                              0.476184
return_std                          16.543705
average_reward                       0.022899
round_time             0 days 00:05:42.603542
episodes_test                           225.0
episode_length_test                 44.342222
returns_test                         7.670123
return_std_test                     11.736637
average_reward_test                   0.17408
round_time_test        0 days 00:00:08.575317
round_time_total       0 days 00:05:42.604640
loss_total                        1769.356993
loss_critic                       2243.907995
loss_actor                        -128.847161
memory_size                          2943.635 

=== epoch 1/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:49,  5.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:45<00:00,  5.78it/s]
episodes                                  456
episode_length                      21.883772
returns                              2.369714
return_std                          17.357135
average_reward                       0.107544
round_time             0 days 00:05:46.480893
episodes_test                           531.0
episode_length_test                 18.821092
returns_test                         8.130221
return_std_test                      8.469129
average_reward_test                  0.432189
round_time_test        0 days 00:00:09.057432
round_time_total       0 days 00:05:46.481978
loss_total                        5385.778859
loss_critic                       6818.264857
loss_actor                        -344.165585
memory_size                         3857.8745 

=== epoch 1/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:54,  5.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.74it/s]
episodes                                  504
episode_length                      19.819444
returns                              2.369259
return_std                          16.333278
average_reward                       0.118477
round_time             0 days 00:05:48.973860
episodes_test                           503.0
episode_length_test                 19.874751
returns_test                         4.551128
return_std_test                      7.491004
average_reward_test                  0.229213
round_time_test        0 days 00:00:08.915938
round_time_total       0 days 00:05:48.975083
loss_total                       30703.975411
loss_critic                      38657.301195
loss_actor                       -1109.330371
memory_size                           4260.42 

=== epoch 1/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:41,  4.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  536
episode_length                      18.643657
returns                              2.000279
return_std                          15.861501
average_reward                       0.107378
round_time             0 days 00:05:50.136112
episodes_test                           768.0
episode_length_test                 13.010417
returns_test                        -1.276465
return_std_test                      3.429107
average_reward_test                 -0.098037
round_time_test        0 days 00:00:09.136944
round_time_total       0 days 00:05:50.137445
loss_total                     1728414.576586
loss_critic                    2162325.856184
loss_actor                       -7230.689676
memory_size                         4383.7505 

=== epoch 1/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  532
episode_length                      18.719925
returns                               2.10401
return_std                          15.907782
average_reward                       0.112488
round_time             0 days 00:05:52.081899
episodes_test                           608.0
episode_length_test                 16.442434
returns_test                        -1.416414
return_std_test                      4.815614
average_reward_test                 -0.085883
round_time_test        0 days 00:00:09.012106
round_time_total       0 days 00:05:52.083049
loss_total                       37905109.165
loss_critic                      47388445.836
loss_actor                      -28241.106961
memory_size                         4689.2525 

=== epoch 1/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:12,  4.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  581
episode_length                      17.177281
returns                              0.260413
return_std                            6.87701
average_reward                       0.016749
round_time             0 days 00:05:54.978977
episodes_test                           426.0
episode_length_test                 23.450704
returns_test                        -0.642259
return_std_test                      6.961712
average_reward_test                 -0.026912
round_time_test        0 days 00:00:08.721037
round_time_total       0 days 00:05:54.980296
loss_total                      261397304.942
loss_critic                      326762972.22
loss_actor                      -65385.016527
memory_size                         5302.7205 

=== epoch 1/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:30,  5.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  567
episode_length                      17.633157
returns                             -0.311589
return_std                           5.958981
average_reward                      -0.017498
round_time             0 days 00:05:54.227763
episodes_test                           457.0
episode_length_test                 21.820569
returns_test                         0.710025
return_std_test                        7.8432
average_reward_test                  0.033658
round_time_test        0 days 00:00:08.871671
round_time_total       0 days 00:05:54.229383
loss_total                      733527843.504
loss_critic                     916936512.912
loss_actor                     -106894.973922
memory_size                          5991.768 

=== epoch 1/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:10,  5.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  469
episode_length                      21.275053
returns                              2.025996
return_std                          26.849324
average_reward                       0.095224
round_time             0 days 00:05:53.788764
episodes_test                           171.0
episode_length_test                 58.099415
returns_test                        24.482295
return_std_test                     45.455789
average_reward_test                  0.426893
round_time_test        0 days 00:00:08.690577
round_time_total       0 days 00:05:53.789853
loss_total                     1162483765.792
loss_critic                     1453139998.56
loss_actor                     -141265.808797
memory_size                          7220.596 

=== epoch 1/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:30,  5.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  472
episode_length                      21.112288
returns                              2.064513
return_std                          26.746888
average_reward                       0.097298
round_time             0 days 00:05:55.333384
episodes_test                           637.0
episode_length_test                 15.690738
returns_test                        -0.214284
return_std_test                      4.768164
average_reward_test                 -0.013703
round_time_test        0 days 00:00:09.049970
round_time_total       0 days 00:05:55.334534
loss_total                     1806176580.896
loss_critic                      2257764472.8
loss_actor                     -175139.050328
memory_size                         7856.7855 

=== epoch 1/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:16,  5.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  517
episode_length                      19.297872
returns                              1.639139
return_std                          25.572674
average_reward                       0.083385
round_time             0 days 00:05:55.898083
episodes_test                           702.0
episode_length_test                 14.202279
returns_test                        -1.302815
return_std_test                      4.492942
average_reward_test                 -0.091171
round_time_test        0 days 00:00:09.163939
round_time_total       0 days 00:05:55.899189
loss_total                      2758735562.56
loss_critic                    3448473591.296
loss_actor                     -216797.159695
memory_size                         8026.7345 

=== epoch 1/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:36,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  562
episode_length                       17.69573
returns                              1.191668
return_std                          24.565039
average_reward                       0.069831
round_time             0 days 00:05:55.190966
episodes_test                           746.0
episode_length_test                 13.400804
returns_test                        -1.978997
return_std_test                      4.091188
average_reward_test                 -0.147445
round_time_test        0 days 00:00:08.981673
round_time_total       0 days 00:05:55.192058
loss_total                     4289186676.352
loss_critic                     5361550015.68
loss_actor                     -267060.987797
memory_size                         8174.9715 

=== epoch 1/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:23,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  626
episode_length                      15.789137
returns                              0.208764
return_std                           23.11756
average_reward                       0.016246
round_time             0 days 00:05:55.894838
episodes_test                           682.0
episode_length_test                  14.64956
returns_test                        -1.538083
return_std_test                      4.736625
average_reward_test                 -0.104425
round_time_test        0 days 00:00:08.885363
round_time_total       0 days 00:05:55.895962
loss_total                     6218037595.776
loss_critic                     7772624862.72
loss_actor                     -312027.208469
memory_size                         8427.3125 

=== epoch 1/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<06:15,  5.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  716
episode_length                      13.942737
returns                             -1.664268
return_std                           4.562394
average_reward                      -0.119776
round_time             0 days 00:05:55.976553
episodes_test                           682.0
episode_length_test                 14.659824
returns_test                        -2.200103
return_std_test                      4.801172
average_reward_test                 -0.149867
round_time_test        0 days 00:00:09.124136
round_time_total       0 days 00:05:55.977645
loss_total                       8285665446.4
loss_critic                10357170894.464001
loss_actor                     -357077.190359
memory_size                         8657.5595 

=== epoch 1/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  713
episode_length                      13.964937
returns                              -1.87681
return_std                           4.817948
average_reward                      -0.134664
round_time             0 days 00:05:56.529399
episodes_test                           686.0
episode_length_test                 14.575802
returns_test                        -2.026416
return_std_test                      4.876002
average_reward_test                  -0.13891
round_time_test        0 days 00:00:08.909908
round_time_total       0 days 00:05:56.530491
loss_total                     10336874961.92
loss_critic                   12921192091.392
loss_actor                     -394446.201172
memory_size                         8896.6745 

=== epoch 1/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:40,  5.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  683
episode_length                      14.622255
returns                             -1.883956
return_std                           5.132534
average_reward                      -0.129319
round_time             0 days 00:05:59.361598
episodes_test                           684.0
episode_length_test                 14.618421
returns_test                        -1.740592
return_std_test                       4.76133
average_reward_test                 -0.118943
round_time_test        0 days 00:00:09.060842
round_time_total       0 days 00:05:59.362700
loss_total                    12590109722.368
loss_critic                   15737743553.024
loss_actor                     -426677.587094
memory_size                           9205.59 

=== epoch 1/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:13,  4.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  662
episode_length                      15.072508
returns                             -1.973202
return_std                           5.394416
average_reward                      -0.131695
round_time             0 days 00:05:59.472065
episodes_test                           655.0
episode_length_test                 15.236641
returns_test                        -2.622409
return_std_test                      4.975481
average_reward_test                 -0.172515
round_time_test        0 days 00:00:08.729445
round_time_total       0 days 00:05:59.473222
loss_total                    14583707322.368
loss_critic                18229746484.223999
loss_actor                        -450586.923
memory_size                          9569.649 

=== epoch 1/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:47,  4.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  634
episode_length                      15.750789
returns                             -1.853874
return_std                            5.77487
average_reward                       -0.11848
round_time             0 days 00:05:58.347159
episodes_test                           600.0
episode_length_test                 16.648333
returns_test                        -1.655865
return_std_test                      5.918092
average_reward_test                 -0.099652
round_time_test        0 days 00:00:09.028121
round_time_total       0 days 00:05:58.348755
loss_total                 15523747651.072001
loss_critic                19404800078.080002
loss_actor                     -463389.595531
memory_size                         9953.7625 

=== epoch 1/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  587
episode_length                       17.02385
returns                             -1.576808
return_std                           6.178352
average_reward                      -0.092729
round_time             0 days 00:06:00.128867
episodes_test                           528.0
episode_length_test                  18.92803
returns_test                        -0.490647
return_std_test                       6.31437
average_reward_test                 -0.025235
round_time_test        0 days 00:00:08.941835
round_time_total       0 days 00:06:00.129970
loss_total                 17119307587.327999
loss_critic                21399253680.639999
loss_actor                     -478246.663766
memory_size                         10503.554 

=== epoch 1/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  559
episode_length                      17.828265
returns                             -1.145823
return_std                           6.780679
average_reward                      -0.064734
round_time             0 days 00:05:58.245033
episodes_test                           557.0
episode_length_test                  17.94614
returns_test                        -0.418645
return_std_test                      7.400546
average_reward_test                 -0.023356
round_time_test        0 days 00:00:08.930526
round_time_total       0 days 00:05:58.246261
loss_total                    17080041577.216
loss_critic                21350170317.824001
loss_actor                     -474759.287266
memory_size                        11022.9755 

=== epoch 1/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  549
episode_length                      18.178506
returns                             -0.864499
return_std                           6.949036
average_reward                      -0.046422
round_time             0 days 00:05:59.081421
episodes_test                           565.0
episode_length_test                 17.649558
returns_test                        -1.129477
return_std_test                      6.280135
average_reward_test                  -0.06265
round_time_test        0 days 00:00:08.954265
round_time_total       0 days 00:05:59.082642
loss_total                    17420329317.632
loss_critic                21775530193.664001
loss_actor                     -475679.124344
memory_size                         11503.061 

=== epoch 1/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:45,  4.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  569
episode_length                      17.541301
returns                             -0.781094
return_std                           6.774156
average_reward                      -0.046481
round_time             0 days 00:06:00.470169
episodes_test                           661.0
episode_length_test                  15.12708
returns_test                        -1.751833
return_std_test                      5.119931
average_reward_test                 -0.115695
round_time_test        0 days 00:00:08.908917
round_time_total       0 days 00:06:00.471385
loss_total                    16862180072.448
loss_critic                   21077842083.584
loss_actor                     -469361.717625
memory_size                         11865.457 

=== epoch 1/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.54it/s]
episodes                                  619
episode_length                      16.145396
returns                             -1.218551
return_std                           6.373058
average_reward                      -0.075783
round_time             0 days 00:06:01.413123
episodes_test                           688.0
episode_length_test                 14.527616
returns_test                        -1.751249
return_std_test                      5.445768
average_reward_test                 -0.120391
round_time_test        0 days 00:00:09.342072
round_time_total       0 days 00:06:01.414260
loss_total                    17251018123.264
loss_critic                21563891846.655998
loss_actor                     -478172.378172
memory_size                         12103.325 

=== epoch 1/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.48it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  698
episode_length                      14.282235
returns                             -1.803953
return_std                             5.9351
average_reward                      -0.126767
round_time             0 days 00:06:00.269373
episodes_test                           878.0
episode_length_test                 11.389522
returns_test                        -3.059661
return_std_test                      3.555924
average_reward_test                 -0.268638
round_time_test        0 days 00:00:09.089315
round_time_total       0 days 00:06:00.270490
loss_total                 19205335607.551998
loss_critic                24006797317.119999
loss_actor                     -512858.421313
memory_size                         12213.645 

=== epoch 1/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:10,  5.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.53it/s]
episodes                                  778
episode_length                      12.830334
returns                             -2.223174
return_std                           4.919027
average_reward                      -0.173593
round_time             0 days 00:06:02.256183
episodes_test                           899.0
episode_length_test                 11.120133
returns_test                          -2.6077
return_std_test                      3.667926
average_reward_test                 -0.234283
round_time_test        0 days 00:00:09.201028
round_time_total       0 days 00:06:02.257265
loss_total                 22069471003.647999
loss_critic                   27586978735.104
loss_actor                      -561870.81075
memory_size                        12264.2715 

=== epoch 1/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  859
episode_length                      11.618161
returns                             -2.762487
return_std                           4.294377
average_reward                      -0.237887
round_time             0 days 00:05:59.818017
episodes_test                           971.0
episode_length_test                 10.298661
returns_test                        -2.777454
return_std_test                      3.198295
average_reward_test                 -0.269691
round_time_test        0 days 00:00:09.266303
round_time_total       0 days 00:05:59.819136
loss_total                 26898868654.080002
loss_critic                33623743792.127998
loss_actor                     -634316.386438
memory_size                         12278.439 

=== epoch 1/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:31,  4.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  916
episode_length                      10.912664
returns                             -3.067077
return_std                           3.597645
average_reward                      -0.281166
round_time             0 days 00:05:59.589513
episodes_test                           966.0
episode_length_test                 10.350932
returns_test                        -3.187335
return_std_test                      3.263387
average_reward_test                 -0.307806
round_time_test        0 days 00:00:09.182733
round_time_total       0 days 00:05:59.590622
loss_total                 35002873733.632004
loss_critic                43753772748.800003
loss_actor                     -725156.051406
memory_size                        12297.4625 

=== epoch 1/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:47,  4.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  952
episode_length                      10.496849
returns                             -3.277637
return_std                           3.313843
average_reward                      -0.312246
round_time             0 days 00:06:00.861590
episodes_test                           989.0
episode_length_test                 10.106168
returns_test                        -3.238307
return_std_test                      2.991343
average_reward_test                  -0.32014
round_time_test        0 days 00:00:09.248236
round_time_total       0 days 00:06:00.862742
loss_total                 45209184345.599998
loss_critic                56511684361.216003
loss_actor                        -819391.864
memory_size                        12319.0755 

=== epoch 1/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:40,  4.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  977
episode_length                      10.223132
returns                             -3.357245
return_std                           3.000007
average_reward                      -0.327832
round_time             0 days 00:05:59.573824
episodes_test                          1009.0
episode_length_test                   9.90783
returns_test                        -4.460122
return_std_test                      2.724882
average_reward_test                 -0.450016
round_time_test        0 days 00:00:09.525756
round_time_total       0 days 00:05:59.574934
loss_total                 53044261410.816002
loss_critic                66305549788.160004
loss_actor                     -896617.690563
memory_size                        12324.9625 

=== epoch 1/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:55,  4.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.54it/s]
episodes                                  985
episode_length                      10.142132
returns                             -3.313396
return_std                           3.057202
average_reward                       -0.32599
round_time             0 days 00:06:01.720511
episodes_test                           961.0
episode_length_test                 10.405827
returns_test                        -3.452871
return_std_test                      3.106849
average_reward_test                 -0.331821
round_time_test        0 days 00:00:09.258691
round_time_total       0 days 00:06:01.721695
loss_total                    67698849490.944
loss_critic                84623813898.240005
loss_actor                    -1014334.350062
memory_size                        12348.2485 

=== epoch 1/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:30,  5.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  987
episode_length                       10.12462
returns                             -3.278935
return_std                           3.020896
average_reward                      -0.324029
round_time             0 days 00:06:00.183251
episodes_test                           968.0
episode_length_test                 10.328512
returns_test                        -3.181801
return_std_test                      3.069124
average_reward_test                 -0.307814
round_time_test        0 days 00:00:09.285408
round_time_total       0 days 00:06:00.184354
loss_total                 85930977891.328003
loss_critic               107414006284.287994
loss_actor                     -1143477.47675
memory_size                        12363.9135 

=== epoch 1/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:56,  5.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  995
episode_length                      10.037186
returns                             -3.286696
return_std                           2.998978
average_reward                      -0.327674
round_time             0 days 00:06:00.811900
episodes_test                           972.0
episode_length_test                 10.286008
returns_test                        -3.147268
return_std_test                      3.023511
average_reward_test                 -0.305703
round_time_test        0 days 00:00:09.220339
round_time_total       0 days 00:06:00.813159
loss_total                109173640499.199997
loss_critic               136467368873.983994
loss_actor                    -1281648.633375
memory_size                        12371.5235 

=== epoch 1/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:02,  5.49it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  995
episode_length                      10.031156
returns                             -3.164824
return_std                           2.960216
average_reward                      -0.315149
round_time             0 days 00:06:00.246661
episodes_test                           951.0
episode_length_test                 10.512093
returns_test                        -3.212298
return_std_test                      3.222204
average_reward_test                 -0.305304
round_time_test        0 days 00:00:09.174978
round_time_total       0 days 00:06:00.247770
loss_total                141078204786.687988
loss_critic               176348115582.976013
loss_actor                    -1451013.720312
memory_size                        12381.9665 

=== epoch 1/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  986
episode_length                       10.12069
returns                             -3.127399
return_std                           3.038181
average_reward                      -0.308417
round_time             0 days 00:06:01.222406
episodes_test                           968.0
episode_length_test                  10.32438
returns_test                        -3.164691
return_std_test                       3.24471
average_reward_test                 -0.305696
round_time_test        0 days 00:00:09.214317
round_time_total       0 days 00:06:01.223511
loss_total                   176543460818.944
loss_critic               220679726632.959991
loss_actor                    -1618014.431125
memory_size                        12393.9605 

=== epoch 1/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:17,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:03<00:00,  5.51it/s]
episodes                                  977
episode_length                      10.220061
returns                             -3.137523
return_std                           3.146771
average_reward                      -0.306558
round_time             0 days 00:06:03.750230
episodes_test                           975.0
episode_length_test                 10.247179
returns_test                        -2.981898
return_std_test                      3.104857
average_reward_test                 -0.290742
round_time_test        0 days 00:00:09.218734
round_time_total       0 days 00:06:03.751519
loss_total                216653460070.399994
loss_critic               270817267843.071991
loss_actor                    -1788907.514125
memory_size                        12434.2835 

=== epoch 1/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:15,  5.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.54it/s]
episodes                                  973
episode_length                      10.276465
returns                             -3.084923
return_std                           3.236803
average_reward                      -0.300168
round_time             0 days 00:06:01.339496
episodes_test                           981.0
episode_length_test                 10.191641
returns_test                        -3.197242
return_std_test                      2.870533
average_reward_test                 -0.313493
round_time_test        0 days 00:00:09.179943
round_time_total       0 days 00:06:01.340605
loss_total                258900215410.687988
loss_critic               323625747783.679993
loss_actor                    -1936065.798937
memory_size                         12471.705 

=== epoch 1/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  966
episode_length                      10.343685
returns                             -3.073352
return_std                           3.295479
average_reward                       -0.29742
round_time             0 days 00:06:00.195250
episodes_test                           961.0
episode_length_test                 10.396462
returns_test                        -2.891883
return_std_test                      3.153498
average_reward_test                 -0.277837
round_time_test        0 days 00:00:09.110690
round_time_total       0 days 00:06:00.196388
loss_total                300533579411.455994
loss_critic               375667490508.799988
loss_actor                    -2092400.199062
memory_size                         12499.668 

=== epoch 1/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.54it/s]
episodes                                  957
episode_length                      10.444096
returns                             -3.003049
return_std                           3.307913
average_reward                      -0.288221
round_time             0 days 00:06:01.817732
episodes_test                           965.0
episode_length_test                 10.362694
returns_test                        -3.060608
return_std_test                      3.117451
average_reward_test                 -0.295349
round_time_test        0 days 00:00:09.262481
round_time_total       0 days 00:06:01.818822
loss_total                355404944105.471985
loss_critic               444256739942.400024
loss_actor                    -2273195.428313
memory_size                         12534.349 

=== epoch 1/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  955
episode_length                      10.470157
returns                             -3.037727
return_std                            3.32541
average_reward                      -0.290103
round_time             0 days 00:06:00.241328
episodes_test                           953.0
episode_length_test                 10.493179
returns_test                         -2.97459
return_std_test                      3.098361
average_reward_test                 -0.283478
round_time_test        0 days 00:00:09.154213
round_time_total       0 days 00:06:00.242409
loss_total                414549786525.695984
loss_critic               518187829968.895996
loss_actor                     -2427565.11125
memory_size                        12582.4955 

=== epoch 1/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.53it/s]
episodes                                  961
episode_length                        10.3923
returns                             -3.153922
return_std                           3.172319
average_reward                        -0.3026
round_time             0 days 00:06:02.160065
episodes_test                           940.0
episode_length_test                 10.638298
returns_test                        -3.177831
return_std_test                      3.257206
average_reward_test                 -0.298716
round_time_test        0 days 00:00:09.343731
round_time_total       0 days 00:06:02.161174
loss_total                458921853902.848022
loss_critic               573652949786.624023
loss_actor                     -2565886.57775
memory_size                         12594.541 

=== epoch 1/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  954
episode_length                      10.480084
returns                             -3.104869
return_std                           3.274102
average_reward                      -0.296024
round_time             0 days 00:05:59.800117
episodes_test                           956.0
episode_length_test                 10.460251
returns_test                        -3.161323
return_std_test                       3.24804
average_reward_test                 -0.302222
round_time_test        0 days 00:00:09.332476
round_time_total       0 days 00:05:59.801201
loss_total                524904937021.440002
loss_critic               656131838140.416016
loss_actor                    -2722882.937125
memory_size                         12634.865 

=== epoch 1/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:34,  5.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:03<00:00,  5.49it/s]
episodes                                  949
episode_length                      10.534247
returns                             -3.059866
return_std                            3.39776
average_reward                      -0.290294
round_time             0 days 00:06:04.550074
episodes_test                           945.0
episode_length_test                 10.574603
returns_test                        -3.177995
return_std_test                       3.16107
average_reward_test                 -0.300711
round_time_test        0 days 00:00:09.223880
round_time_total       0 days 00:06:04.551184
loss_total                 583110011371.52002
loss_critic               728888222441.472046
loss_actor                    -2871438.734125
memory_size                        12677.2265 

=== epoch 1/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:16,  5.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:06<00:00,  5.45it/s]
episodes                                  947
episode_length                       10.55227
returns                             -3.143443
return_std                           3.410045
average_reward                      -0.297432
round_time             0 days 00:06:07.205183
episodes_test                           963.0
episode_length_test                 10.381101
returns_test                        -3.169689
return_std_test                      3.076113
average_reward_test                   -0.3051
round_time_test        0 days 00:00:09.252173
round_time_total       0 days 00:06:07.206279
loss_total                642761084690.432007
loss_critic               803452091432.959961
loss_actor                       -3009496.801
memory_size                         12695.687 

=== epoch 1/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:16,  5.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:03<00:00,  5.50it/s]
episodes                                  941
episode_length                       10.62593
returns                             -3.078512
return_std                           3.423969
average_reward                      -0.289628
round_time             0 days 00:06:04.450712
episodes_test                           950.0
episode_length_test                 10.507368
returns_test                        -3.405382
return_std_test                      3.244101
average_reward_test                 -0.324216
round_time_test        0 days 00:00:09.242658
round_time_total       0 days 00:06:04.451805
loss_total                691278730346.495972
loss_critic               864099183919.104004
loss_actor                    -3139052.194875
memory_size                        12731.5705 

=== epoch 1/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:22,  5.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:05<00:00,  5.47it/s]
episodes                                  927
episode_length                      10.773463
returns                             -3.086675
return_std                           3.510369
average_reward                      -0.286861
round_time             0 days 00:06:05.929213
episodes_test                           926.0
episode_length_test                 10.798056
returns_test                        -3.606203
return_std_test                      3.225674
average_reward_test                 -0.333868
round_time_test        0 days 00:00:09.216727
round_time_total       0 days 00:06:05.930297
loss_total                787695563767.807983
loss_critic               984620273303.552002
loss_actor                      -3343498.8515
memory_size                          12778.69 

=== epoch 1/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:04<00:00,  5.48it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  927
episode_length                      10.774542
returns                             -3.260698
return_std                           3.389825
average_reward                      -0.302327
round_time             0 days 00:06:05.503552
episodes_test                           902.0
episode_length_test                  11.08204
returns_test                        -3.578925
return_std_test                      3.437603
average_reward_test                 -0.322514
round_time_test        0 days 00:00:09.158402
round_time_total       0 days 00:06:05.504663
loss_total                883574012985.343994
loss_critic              1104468373028.864014
loss_actor                    -3514547.476625
memory_size                          12817.79 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  1%|          | 11/2000 [00:02<06:27,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.73it/s]
episodes                                  180
episode_length                      11.094444
returns                             -3.385174
return_std                           3.766853
average_reward                      -0.303734
round_time             0 days 00:05:48.950968
episodes_test                           948.0
episode_length_test                  10.53903
returns_test                        -3.183728
return_std_test                      3.293986
average_reward_test                 -0.300946
round_time_test        0 days 00:00:09.212205
round_time_total       0 days 00:05:48.952065
loss_total                987471329591.296021
loss_critic              1234340070440.959961
loss_actor                    -3707324.604875
memory_size                         12859.591 

=== epoch 2/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<06:30,  5.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  359
episode_length                      11.111421
returns                             -3.480617
return_std                           3.716884
average_reward                      -0.312288
round_time             0 days 00:05:50.946975
episodes_test                           945.0
episode_length_test                 10.577778
returns_test                        -3.872548
return_std_test                      3.132062
average_reward_test                 -0.365741
round_time_test        0 days 00:00:09.370315
round_time_total       0 days 00:05:50.948163
loss_total               1095472328097.791992
loss_critic              1369341370695.679932
loss_actor                     -3914363.61325
memory_size                         12909.887 

=== epoch 2/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:27,  4.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  547
episode_length                      10.968921
returns                             -3.489719
return_std                           3.648245
average_reward                      -0.318146
round_time             0 days 00:05:50.780837
episodes_test                           938.0
episode_length_test                 10.659915
returns_test                        -3.619278
return_std_test                      3.561792
average_reward_test                 -0.339421
round_time_test        0 days 00:00:09.274261
round_time_total       0 days 00:05:50.781949
loss_total               1244041873424.384033
loss_critic              1555053337493.503906
loss_actor                     -4143912.02375
memory_size                         12976.638 

=== epoch 2/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:40,  5.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  725
episode_length                      11.031724
returns                             -3.549443
return_std                           3.626284
average_reward                      -0.321526
round_time             0 days 00:05:52.052928
episodes_test                           925.0
episode_length_test                      10.8
returns_test                        -3.732145
return_std_test                      3.408969
average_reward_test                 -0.345964
round_time_test        0 days 00:00:09.354335
round_time_total       0 days 00:05:52.054064
loss_total               1401532922740.736084
loss_critic              1751917225902.080078
loss_actor                    -4369849.252625
memory_size                        13025.4765 

=== epoch 2/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:20,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                  912
episode_length                      10.964912
returns                             -3.762588
return_std                           3.597981
average_reward                      -0.343148
round_time             0 days 00:05:52.467428
episodes_test                           916.0
episode_length_test                 10.908297
returns_test                        -4.090447
return_std_test                      3.294585
average_reward_test                 -0.374639
round_time_test        0 days 00:00:09.414334
round_time_total       0 days 00:05:52.468547
loss_total               1555363932078.080078
loss_critic              1944206023131.135986
loss_actor                    -4597690.124875
memory_size                        13066.1335 

=== epoch 2/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:50,  5.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                  918
episode_length                      10.881264
returns                             -3.852114
return_std                           3.506805
average_reward                      -0.354436
round_time             0 days 00:05:52.417861
episodes_test                           930.0
episode_length_test                 10.752688
returns_test                        -4.017107
return_std_test                      3.100959
average_reward_test                 -0.373591
round_time_test        0 days 00:00:09.243486
round_time_total       0 days 00:05:52.418976
loss_total               1757205518188.543945
loss_critic              2196508077457.407959
loss_actor                     -4847556.83175
memory_size                        13099.5895 

=== epoch 2/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:49,  5.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  931
episode_length                      10.729323
returns                             -3.927459
return_std                           3.398357
average_reward                      -0.366126
round_time             0 days 00:05:52.584644
episodes_test                           937.0
episode_length_test                 10.667022
returns_test                        -4.310624
return_std_test                       3.55527
average_reward_test                 -0.403709
round_time_test        0 days 00:00:09.408581
round_time_total       0 days 00:05:52.585775
loss_total               1954410703781.887939
loss_critic              2443014641680.383789
loss_actor                     -5115766.60275
memory_size                         13143.746 

=== epoch 2/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  929
episode_length                      10.763186
returns                             -4.001602
return_std                           3.376471
average_reward                      -0.371657
round_time             0 days 00:05:54.688094
episodes_test                           916.0
episode_length_test                 10.914847
returns_test                          -4.2347
return_std_test                      3.423815
average_reward_test                 -0.387772
round_time_test        0 days 00:00:09.430405
round_time_total       0 days 00:05:54.689241
loss_total               2158839821697.023926
loss_critic              2698551054303.231934
loss_actor                      -5329067.9425
memory_size                        13181.5625 

=== epoch 2/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  928
episode_length                      10.751078
returns                             -4.130144
return_std                           3.462001
average_reward                      -0.383814
round_time             0 days 00:05:53.305369
episodes_test                           880.0
episode_length_test                 11.363636
returns_test                        -4.161529
return_std_test                      3.662189
average_reward_test                 -0.366215
round_time_test        0 days 00:00:09.251261
round_time_total       0 days 00:05:53.306458
loss_total               2373804847398.912109
loss_critic              2967257371181.056152
loss_actor                      -5582042.9305
memory_size                        13228.8435 

=== epoch 2/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  927
episode_length                      10.776699
returns                             -4.080737
return_std                           3.509431
average_reward                       -0.37947
round_time             0 days 00:05:52.106508
episodes_test                           926.0
episode_length_test                 10.794816
returns_test                        -4.257674
return_std_test                      3.507828
average_reward_test                 -0.394164
round_time_test        0 days 00:00:09.419085
round_time_total       0 days 00:05:52.107671
loss_total               2667270570082.304199
loss_critic              3334089605840.895996
loss_actor                     -5875109.99825
memory_size                        13279.2975 

=== epoch 2/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  922
episode_length                      10.835141
returns                              -4.15583
return_std                           3.569584
average_reward                      -0.383689
round_time             0 days 00:05:52.804427
episodes_test                           936.0
episode_length_test                 10.663462
returns_test                        -4.016957
return_std_test                      3.540718
average_reward_test                 -0.374808
round_time_test        0 days 00:00:09.368346
round_time_total       0 days 00:05:52.805541
loss_total               2937459370229.759766
loss_critic              3671825728602.111816
loss_actor                     -6129890.49975
memory_size                         13325.028 

=== epoch 2/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  910
episode_length                      10.973626
returns                             -4.239725
return_std                           3.526631
average_reward                      -0.385822
round_time             0 days 00:05:55.900566
episodes_test                           917.0
episode_length_test                 10.899673
returns_test                        -4.299245
return_std_test                      3.285639
average_reward_test                  -0.39389
round_time_test        0 days 00:00:09.371258
round_time_total       0 days 00:05:55.901911
loss_total               3211079923662.848145
loss_critic              4013851449163.775879
loss_actor                      -6405507.0635
memory_size                        13376.8245 

=== epoch 2/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:44,  5.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  902
episode_length                      11.068736
returns                             -4.321809
return_std                           3.549157
average_reward                      -0.390703
round_time             0 days 00:05:54.104233
episodes_test                           882.0
episode_length_test                 11.337868
returns_test                        -3.937026
return_std_test                      3.497962
average_reward_test                 -0.347246
round_time_test        0 days 00:00:09.235781
round_time_total       0 days 00:05:54.105452
loss_total               3523262033297.408203
loss_critic              4404079093415.935547
loss_actor                       -6659259.701
memory_size                         13438.253 

=== epoch 2/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  906
episode_length                      11.027594
returns                             -4.255048
return_std                           3.423616
average_reward                      -0.385639
round_time             0 days 00:05:52.717700
episodes_test                           886.0
episode_length_test                  11.27991
returns_test                        -4.424776
return_std_test                      3.675445
average_reward_test                 -0.391779
round_time_test        0 days 00:00:09.402439
round_time_total       0 days 00:05:52.719135
loss_total                3861367757275.13623
loss_critic              4826711361454.080078
loss_actor                     -6958165.58875
memory_size                        13495.9285 

=== epoch 2/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:39,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  907
episode_length                      11.022051
returns                             -4.354976
return_std                           3.413855
average_reward                      -0.395011
round_time             0 days 00:05:53.719977
episodes_test                           895.0
episode_length_test                 11.172067
returns_test                        -4.497898
return_std_test                      3.524596
average_reward_test                 -0.402481
round_time_test        0 days 00:00:09.253779
round_time_total       0 days 00:05:53.721074
loss_total               4123297315225.600098
loss_critic              5154123425906.688477
loss_actor                      -7175585.8505
memory_size                         13542.141 

=== epoch 2/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:56,  5.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  905
episode_length                      11.039779
returns                             -4.465186
return_std                           3.334858
average_reward                      -0.404395
round_time             0 days 00:05:54.864677
episodes_test                           874.0
episode_length_test                 11.440503
returns_test                        -4.775154
return_std_test                      3.611531
average_reward_test                  -0.41726
round_time_test        0 days 00:00:09.322606
round_time_total       0 days 00:05:54.865851
loss_total                4465017691766.78418
loss_critic              5581273966706.688477
loss_actor                       -7406849.962
memory_size                        13582.6785 

=== epoch 2/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:28,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                  894
episode_length                      11.177852
returns                             -4.540937
return_std                           3.511331
average_reward                      -0.406199
round_time             0 days 00:05:52.414912
episodes_test                           896.0
episode_length_test                  11.15067
returns_test                        -4.574893
return_std_test                      3.633955
average_reward_test                  -0.40991
round_time_test        0 days 00:00:09.177153
round_time_total       0 days 00:05:52.416037
loss_total               4759943592411.135742
loss_critic              5949931346460.671875
loss_actor                      -7607511.9675
memory_size                        13679.4205 

=== epoch 2/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  894
episode_length                      11.175615
returns                             -4.617022
return_std                           3.501025
average_reward                      -0.413314
round_time             0 days 00:05:53.911748
episodes_test                           878.0
episode_length_test                  11.38041
returns_test                        -4.882164
return_std_test                      3.640137
average_reward_test                 -0.428386
round_time_test        0 days 00:00:09.257983
round_time_total       0 days 00:05:53.912858
loss_total               5136326780256.255859
loss_critic              6420410325729.280273
loss_actor                      -7850489.5755
memory_size                         13775.559 

=== epoch 2/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  889
episode_length                      11.239595
returns                             -4.667161
return_std                           3.562638
average_reward                      -0.415666
round_time             0 days 00:05:54.628943
episodes_test                           889.0
episode_length_test                  11.23847
returns_test                        -4.913282
return_std_test                       3.51155
average_reward_test                 -0.436464
round_time_test        0 days 00:00:09.150069
round_time_total       0 days 00:05:54.630062
loss_total               5429559809015.807617
loss_critic              6786951610499.072266
loss_actor                       -8075729.631
memory_size                         13854.583 

=== epoch 2/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:26,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  876
episode_length                      11.390411
returns                             -4.557667
return_std                           3.721996
average_reward                      -0.400152
round_time             0 days 00:05:55.554827
episodes_test                           891.0
episode_length_test                 11.223345
returns_test                        -4.751261
return_std_test                      3.633117
average_reward_test                 -0.423337
round_time_test        0 days 00:00:09.210616
round_time_total       0 days 00:05:55.556021
loss_total               5887121511153.664062
loss_critic              7358903770284.032227
loss_actor                     -8299602.15475
memory_size                        13940.3455 

=== epoch 2/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:17,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  868
episode_length                      11.514977
returns                             -4.505434
return_std                           3.809589
average_reward                      -0.391312
round_time             0 days 00:05:54.248198
episodes_test                           888.0
episode_length_test                 11.261261
returns_test                        -4.629151
return_std_test                      3.743476
average_reward_test                 -0.411069
round_time_test        0 days 00:00:09.016602
round_time_total       0 days 00:05:54.249350
loss_total               6247562369171.456055
loss_critic              7809454868267.007812
loss_actor                       -8562894.027
memory_size                         14025.744 

=== epoch 2/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  879
episode_length                      11.367463
returns                             -4.480594
return_std                           3.737392
average_reward                      -0.394679
round_time             0 days 00:05:55.467459
episodes_test                           873.0
episode_length_test                 11.454754
returns_test                        -4.607214
return_std_test                      3.854239
average_reward_test                  -0.40221
round_time_test        0 days 00:00:09.187571
round_time_total       0 days 00:05:55.468540
loss_total               6721515030315.007812
loss_critic              8401895750631.423828
loss_actor                      -8795693.2815
memory_size                         14073.773 

=== epoch 2/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  879
episode_length                      11.361775
returns                             -4.464254
return_std                           3.725331
average_reward                      -0.392797
round_time             0 days 00:05:55.954951
episodes_test                           930.0
episode_length_test                 10.752688
returns_test                        -4.383278
return_std_test                      3.402472
average_reward_test                 -0.407645
round_time_test        0 days 00:00:09.069810
round_time_total       0 days 00:05:55.956045
loss_total               7036498816139.263672
loss_critic              8795625632563.200195
loss_actor                      -9021171.4395
memory_size                        14119.2165 

=== epoch 2/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:18,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  880
episode_length                      11.346591
returns                             -4.525189
return_std                            3.64007
average_reward                      -0.399565
round_time             0 days 00:05:55.313833
episodes_test                           917.0
episode_length_test                 10.904035
returns_test                         -4.66846
return_std_test                      3.476125
average_reward_test                 -0.428019
round_time_test        0 days 00:00:09.286357
round_time_total       0 days 00:05:55.315398
loss_total               7482086097158.143555
loss_critic              9352609903017.984375
loss_actor                       -9222995.506
memory_size                        14202.8055 

=== epoch 2/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:15,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  889
episode_length                      11.237345
returns                             -4.553047
return_std                           3.424477
average_reward                      -0.405975
round_time             0 days 00:05:56.636468
episodes_test                           933.0
episode_length_test                 10.718114
returns_test                        -4.427921
return_std_test                      3.350489
average_reward_test                 -0.413125
round_time_test        0 days 00:00:09.197418
round_time_total       0 days 00:05:56.637649
loss_total               7829746297733.120117
loss_critic              9787185277566.976562
loss_actor                       -9478738.874
memory_size                         14265.794 

=== epoch 2/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:22,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  898
episode_length                      11.114699
returns                             -4.527442
return_std                           3.419535
average_reward                      -0.406786
round_time             0 days 00:05:54.967064
episodes_test                           874.0
episode_length_test                 11.433638
returns_test                        -4.357265
return_std_test                      3.874593
average_reward_test                 -0.380285
round_time_test        0 days 00:00:09.192756
round_time_total       0 days 00:05:54.968341
loss_total               8207817869754.368164
loss_critic             10259774779031.552734
loss_actor                      -9654128.1795
memory_size                         14332.372 

=== epoch 2/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:50,  5.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  906
episode_length                       11.02649
returns                              -4.41577
return_std                           3.448764
average_reward                      -0.399068
round_time             0 days 00:05:55.372809
episodes_test                           929.0
episode_length_test                  10.74704
returns_test                        -4.474627
return_std_test                      3.500009
average_reward_test                 -0.414395
round_time_test        0 days 00:00:09.140314
round_time_total       0 days 00:05:55.373924
loss_total               8798198473424.896484
loss_critic             10997750509862.912109
loss_actor                     -9935879.12625
memory_size                         14398.465 

=== epoch 2/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:48,  5.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  922
episode_length                      10.836226
returns                              -4.28925
return_std                           3.449059
average_reward                      -0.396126
round_time             0 days 00:05:57.488449
episodes_test                           930.0
episode_length_test                 10.744086
returns_test                        -4.160874
return_std_test                      3.445969
average_reward_test                 -0.386873
round_time_test        0 days 00:00:09.284317
round_time_total       0 days 00:05:57.489655
loss_total               9184029697245.183594
loss_critic             11480039541047.296875
loss_actor                      -10140999.167
memory_size                         14443.223 

=== epoch 2/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:39,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  943
episode_length                       10.59491
returns                             -4.266835
return_std                           3.484761
average_reward                      -0.403061
round_time             0 days 00:05:55.912391
episodes_test                           937.0
episode_length_test                 10.672359
returns_test                        -4.307367
return_std_test                       3.45983
average_reward_test                   -0.4036
round_time_test        0 days 00:00:09.265947
round_time_total       0 days 00:05:55.913855
loss_total               9511017897787.392578
loss_critic             11888774774259.712891
loss_actor                      -10471008.053
memory_size                         14476.649 

=== epoch 2/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  944
episode_length                      10.574153
returns                              -4.26723
return_std                           3.552865
average_reward                      -0.402136
round_time             0 days 00:05:57.015720
episodes_test                           928.0
episode_length_test                 10.773707
returns_test                        -4.316998
return_std_test                         3.359
average_reward_test                 -0.400461
round_time_test        0 days 00:00:09.157637
round_time_total       0 days 00:05:57.016915
loss_total              10060573943398.400391
loss_critic             12575719808303.103516
loss_actor                      -10678032.891
memory_size                        14523.0235 

=== epoch 2/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:41,  4.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  949
episode_length                       10.51844
returns                             -4.156339
return_std                           3.510755
average_reward                      -0.394232
round_time             0 days 00:05:55.426717
episodes_test                           939.0
episode_length_test                 10.640043
returns_test                        -4.212617
return_std_test                      3.464476
average_reward_test                 -0.395677
round_time_test        0 days 00:00:09.377064
round_time_total       0 days 00:05:55.428095
loss_total               10637442102329.34375
loss_critic              13296805033017.34375
loss_actor                     -10993105.1185
memory_size                         14584.233 

=== epoch 2/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  949
episode_length                      10.505796
returns                             -4.170631
return_std                           3.475679
average_reward                      -0.395727
round_time             0 days 00:05:56.412170
episodes_test                           905.0
episode_length_test                 11.045304
returns_test                        -4.580978
return_std_test                      3.774654
average_reward_test                 -0.414502
round_time_test        0 days 00:00:09.241701
round_time_total       0 days 00:05:56.413328
loss_total              11346934138142.720703
loss_critic             14183670079553.535156
loss_actor                      -11300992.372
memory_size                         14637.319 

=== epoch 2/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  936
episode_length                       10.67094
returns                             -4.181545
return_std                           3.443023
average_reward                      -0.391675
round_time             0 days 00:05:56.052138
episodes_test                           910.0
episode_length_test                 10.989011
returns_test                        -4.229059
return_std_test                      3.370647
average_reward_test                 -0.384844
round_time_test        0 days 00:00:09.199841
round_time_total       0 days 00:05:56.053243
loss_total              11759394129182.720703
loss_critic             14699245113442.304688
loss_actor                     -11580286.9315
memory_size                        14696.2435 

=== epoch 2/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  923
episode_length                      10.832069
returns                             -4.084747
return_std                           3.415848
average_reward                      -0.377095
round_time             0 days 00:05:57.664084
episodes_test                           913.0
episode_length_test                 10.946331
returns_test                        -3.894725
return_std_test                       3.72552
average_reward_test                 -0.355986
round_time_test        0 days 00:00:09.278471
round_time_total       0 days 00:05:57.665226
loss_total              12261632273022.976562
loss_critic             15327042785509.376953
loss_actor                      -11769416.231
memory_size                        14758.0665 

=== epoch 2/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  920
episode_length                      10.856522
returns                             -3.962437
return_std                           3.452002
average_reward                      -0.363983
round_time             0 days 00:05:57.653985
episodes_test                           959.0
episode_length_test                 10.423358
returns_test                        -3.901113
return_std_test                      3.375495
average_reward_test                 -0.374345
round_time_test        0 days 00:00:09.315129
round_time_total       0 days 00:05:57.655226
loss_total              12796635776548.863281
loss_critic             15995797155348.480469
loss_actor                     -11925612.8205
memory_size                         14828.272 

=== epoch 2/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  926
episode_length                      10.792657
returns                             -4.005954
return_std                           3.478416
average_reward                      -0.371571
round_time             0 days 00:05:56.239651
episodes_test                           897.0
episode_length_test                 11.147157
returns_test                        -3.822421
return_std_test                      3.713413
average_reward_test                 -0.342802
round_time_test        0 days 00:00:09.353877
round_time_total       0 days 00:05:56.240811
loss_total               13229968515923.96875
loss_critic             16537463034478.591797
loss_actor                     -12212237.0095
memory_size                        14892.3065 

=== epoch 2/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:45,  5.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  932
episode_length                      10.726395
returns                             -3.848119
return_std                            3.39688
average_reward                      -0.358943
round_time             0 days 00:05:56.768929
episodes_test                           957.0
episode_length_test                 10.435737
returns_test                        -4.003329
return_std_test                      3.556248
average_reward_test                 -0.383945
round_time_test        0 days 00:00:09.371920
round_time_total       0 days 00:05:56.770025
loss_total               13766618373292.03125
loss_critic             17208275426344.960938
loss_actor                     -12532294.6845
memory_size                         14935.475 

=== epoch 2/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:02,  5.49it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  946
episode_length                      10.567653
returns                             -3.784705
return_std                           3.404142
average_reward                      -0.358305
round_time             0 days 00:05:58.096046
episodes_test                           946.0
episode_length_test                 10.566596
returns_test                        -3.877828
return_std_test                      3.825106
average_reward_test                 -0.366705
round_time_test        0 days 00:00:09.301330
round_time_total       0 days 00:05:58.097256
loss_total              14546588911796.224609
loss_critic             18183238830522.367188
loss_actor                       -12898986.42
memory_size                         14967.318 

=== epoch 2/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:39,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  956
episode_length                      10.457113
returns                             -3.839192
return_std                           3.430047
average_reward                      -0.366897
round_time             0 days 00:05:56.853814
episodes_test                           939.0
episode_length_test                 10.640043
returns_test                        -3.829517
return_std_test                      3.600436
average_reward_test                 -0.359791
round_time_test        0 days 00:00:09.295611
round_time_total       0 days 00:05:56.854900
loss_total              15518092381913.087891
loss_critic             19397618378211.328125
loss_actor                     -13159532.4855
memory_size                         14999.781 

=== epoch 2/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:39,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  962
episode_length                      10.382536
returns                             -3.718378
return_std                           3.367489
average_reward                      -0.357736
round_time             0 days 00:05:57.312164
episodes_test                           966.0
episode_length_test                 10.351967
returns_test                        -3.814986
return_std_test                      3.444848
average_reward_test                 -0.368528
round_time_test        0 days 00:00:09.152239
round_time_total       0 days 00:05:57.313284
loss_total              16154220439535.615234
loss_critic             20192778598481.921875
loss_actor                      -13467425.672
memory_size                        15037.3805 

=== epoch 2/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  957
episode_length                      10.431557
returns                             -3.552013
return_std                           3.388113
average_reward                      -0.340284
round_time             0 days 00:05:59.173030
episodes_test                           950.0
episode_length_test                 10.518947
returns_test                        -3.507082
return_std_test                      3.739218
average_reward_test                 -0.332839
round_time_test        0 days 00:00:09.279475
round_time_total       0 days 00:05:59.174226
loss_total              16877071036841.984375
loss_critic             21096341838495.742188
loss_actor                     -13788556.6115
memory_size                          15096.21 

=== epoch 2/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  954
episode_length                       10.47065
returns                               -3.5675
return_std                           3.605597
average_reward                      -0.340865
round_time             0 days 00:05:57.288392
episodes_test                           911.0
episode_length_test                 10.973655
returns_test                        -3.489642
return_std_test                      3.809195
average_reward_test                 -0.317697
round_time_test        0 days 00:00:09.213625
round_time_total       0 days 00:05:57.289499
loss_total              17791372159877.121094
loss_critic             22239218100862.976562
loss_actor                      -13919491.659
memory_size                         15162.001 

=== epoch 2/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:59,  4.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  946
episode_length                      10.567653
returns                             -3.536916
return_std                           3.690832
average_reward                      -0.334379
round_time             0 days 00:05:57.793206
episodes_test                           947.0
episode_length_test                 10.558606
returns_test                        -3.447282
return_std_test                      3.608766
average_reward_test                  -0.32635
round_time_test        0 days 00:00:09.414567
round_time_total       0 days 00:05:57.794390
loss_total              18029078750691.328125
loss_critic             22536351339511.808594
loss_actor                     -14196615.0175
memory_size                        15221.6895 

=== epoch 2/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:10,  5.37it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  947
episode_length                      10.548046
returns                             -3.274621
return_std                           3.624171
average_reward                      -0.310848
round_time             0 days 00:05:57.975684
episodes_test                           959.0
episode_length_test                 10.426486
returns_test                        -3.418745
return_std_test                      3.510508
average_reward_test                 -0.327782
round_time_test        0 days 00:00:09.230721
round_time_total       0 days 00:05:57.976779
loss_total              19347731065077.761719
loss_critic             24184666574553.089844
loss_actor                      -14577520.643
memory_size                        15258.2105 

=== epoch 2/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  946
episode_length                       10.55074
returns                             -3.317063
return_std                           3.790287
average_reward                      -0.313398
round_time             0 days 00:05:58.409776
episodes_test                           959.0
episode_length_test                   10.4244
returns_test                        -3.549875
return_std_test                      3.617286
average_reward_test                 -0.340148
round_time_test        0 days 00:00:09.307684
round_time_total       0 days 00:05:58.411020
loss_total              20415812750016.511719
loss_critic             25519768635703.296875
loss_actor                     -15028815.8175
memory_size                         15310.626 

=== epoch 2/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  952
episode_length                      10.495798
returns                             -3.366026
return_std                           3.798661
average_reward                       -0.31987
round_time             0 days 00:05:58.866213
episodes_test                           947.0
episode_length_test                 10.551214
returns_test                        -3.363708
return_std_test                      3.678483
average_reward_test                 -0.319195
round_time_test        0 days 00:00:09.338371
round_time_total       0 days 00:05:58.867305
loss_total              21504113247780.863281
loss_critic             26880144483287.039062
loss_actor                     -15407980.7695
memory_size                         15384.716 

=== epoch 2/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:53,  5.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  956
episode_length                      10.444561
returns                             -3.376976
return_std                           3.644154
average_reward                      -0.323686
round_time             0 days 00:05:56.962789
episodes_test                           959.0
episode_length_test                 10.425443
returns_test                        -3.558226
return_std_test                      3.446055
average_reward_test                 -0.341103
round_time_test        0 days 00:00:09.266338
round_time_total       0 days 00:05:56.963897
loss_total              22450317481213.953125
loss_critic             28062900175241.214844
loss_actor                      -15694565.289
memory_size                        15410.6225 

=== epoch 2/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:54,  4.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  959
episode_length                      10.420229
returns                             -3.322317
return_std                           3.604826
average_reward                       -0.31886
round_time             0 days 00:06:00.269735
episodes_test                           952.0
episode_length_test                 10.504202
returns_test                        -3.279467
return_std_test                      3.444152
average_reward_test                 -0.312205
round_time_test        0 days 00:00:09.488240
round_time_total       0 days 00:06:00.270832
loss_total              23543250658787.328125
loss_critic             29429067152883.710938
loss_actor                      -15995917.064
memory_size                         15454.479 

=== epoch 2/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  951
episode_length                      10.505783
returns                             -3.352272
return_std                             3.7951
average_reward                      -0.319272
round_time             0 days 00:05:57.930289
episodes_test                           984.0
episode_length_test                 10.151423
returns_test                        -3.651054
return_std_test                       3.29033
average_reward_test                 -0.359486
round_time_test        0 days 00:00:09.258391
round_time_total       0 days 00:05:57.931571
loss_total              23982196937457.664062
loss_critic             29977750347448.320312
loss_actor                      -16219749.398
memory_size                         15499.885 

=== epoch 2/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:01,  4.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  956
episode_length                      10.451883
returns                             -3.342268
return_std                           3.768899
average_reward                      -0.319717
round_time             0 days 00:05:59.936705
episodes_test                           932.0
episode_length_test                 10.727468
returns_test                        -3.262174
return_std_test                      3.624275
average_reward_test                   -0.3039
round_time_test        0 days 00:00:09.254222
round_time_total       0 days 00:05:59.937871
loss_total              24798293896200.191406
loss_critic             30997871680552.960938
loss_actor                      -16416427.234
memory_size                         15534.969 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  1%|          | 12/2000 [00:02<06:12,  5.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.75it/s]
episodes                                  199
episode_length                       10.01005
returns                             -3.624327
return_std                           3.404431
average_reward                      -0.360648
round_time             0 days 00:05:48.183496
episodes_test                           960.0
episode_length_test                 10.416667
returns_test                        -3.146363
return_std_test                      3.409926
average_reward_test                 -0.302051
round_time_test        0 days 00:00:09.286491
round_time_total       0 days 00:05:48.184695
loss_total              26192252724314.113281
loss_critic             32740320442056.703125
loss_actor                      -16883823.315
memory_size                        15577.2215 

=== epoch 3/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:52,  5.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  401
episode_length                       9.962594
returns                             -3.514847
return_std                           3.234252
average_reward                      -0.352316
round_time             0 days 00:05:50.059073
episodes_test                           991.0
episode_length_test                 10.082745
returns_test                        -3.506622
return_std_test                      3.418641
average_reward_test                 -0.347089
round_time_test        0 days 00:00:09.265248
round_time_total       0 days 00:05:50.060205
loss_total              27597624103665.664062
loss_critic             34497034725621.761719
loss_actor                     -17271131.6265
memory_size                         15603.853 

=== epoch 3/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:54,  5.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  602
episode_length                       9.950166
returns                              -3.56048
return_std                           3.133339
average_reward                      -0.357224
round_time             0 days 00:05:50.333098
episodes_test                           994.0
episode_length_test                 10.055332
returns_test                        -3.414769
return_std_test                      3.197857
average_reward_test                 -0.339459
round_time_test        0 days 00:00:09.333475
round_time_total       0 days 00:05:50.334192
loss_total              29034697810509.824219
loss_critic             36293376907673.601562
loss_actor                     -17820395.4515
memory_size                         15643.538 

=== epoch 3/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  808
episode_length                       9.892327
returns                             -3.564134
return_std                           3.112139
average_reward                      -0.359331
round_time             0 days 00:05:52.287968
episodes_test                           977.0
episode_length_test                 10.226203
returns_test                        -3.496284
return_std_test                       3.72351
average_reward_test                 -0.341382
round_time_test        0 days 00:00:09.350437
round_time_total       0 days 00:05:52.289047
loss_total              31082481056743.425781
loss_critic             38853105882759.171875
loss_actor                      -18312232.156
memory_size                         15656.488 

=== epoch 3/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.74it/s]
episodes                                 1012
episode_length                       9.881423
returns                             -3.549512
return_std                             3.1903
average_reward                      -0.359211
round_time             0 days 00:05:49.204435
episodes_test                           989.0
episode_length_test                 10.104146
returns_test                        -3.551835
return_std_test                      3.288397
average_reward_test                 -0.351415
round_time_test        0 days 00:00:09.425422
round_time_total       0 days 00:05:49.205621
loss_total              32457175810965.503906
loss_critic             40571474278350.851562
loss_actor                     -18740570.9725
memory_size                         15676.289 

=== epoch 3/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1009
episode_length                       9.905847
returns                              -3.41872
return_std                           3.138236
average_reward                       -0.34528
round_time             0 days 00:05:52.318825
episodes_test                          1008.0
episode_length_test                  9.914683
returns_test                        -3.575775
return_std_test                      3.397894
average_reward_test                 -0.360294
round_time_test        0 days 00:00:09.281028
round_time_total       0 days 00:05:52.319941
loss_total              33801249913044.992188
loss_critic                  42251566841856.0
loss_actor                      -19288731.827
memory_size                         15693.445 

=== epoch 3/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:34,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1005
episode_length                       9.943284
returns                             -3.456949
return_std                           3.152337
average_reward                      -0.347598
round_time             0 days 00:05:51.294528
episodes_test                          1005.0
episode_length_test                  9.941294
returns_test                        -3.227311
return_std_test                      3.259081
average_reward_test                 -0.324827
round_time_test        0 days 00:00:09.470897
round_time_total       0 days 00:05:51.295628
loss_total              36246365614899.203125
loss_critic             45307961437847.554688
loss_actor                      -19841889.887
memory_size                        15718.8925 

=== epoch 3/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1004
episode_length                       9.947211
returns                             -3.363083
return_std                           3.196044
average_reward                      -0.338756
round_time             0 days 00:05:51.837997
episodes_test                          1006.0
episode_length_test                  9.935388
returns_test                        -3.484786
return_std_test                      3.298553
average_reward_test                 -0.350501
round_time_test        0 days 00:00:09.131577
round_time_total       0 days 00:05:51.839113
loss_total              38431831549804.546875
loss_critic              48039793830395.90625
loss_actor                      -20337310.384
memory_size                        15737.8035 

=== epoch 3/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:48,  4.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1010
episode_length                       9.885149
returns                             -3.326464
return_std                           3.132287
average_reward                      -0.335958
round_time             0 days 00:05:53.304235
episodes_test                          1007.0
episode_length_test                    9.9285
returns_test                        -3.466742
return_std_test                      3.229329
average_reward_test                  -0.34895
round_time_test        0 days 00:00:09.232284
round_time_total       0 days 00:05:53.305784
loss_total              40501965028327.421875
loss_critic               50627460556914.6875
loss_actor                      -20861075.325
memory_size                        15759.3645 

=== epoch 3/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:48,  4.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1012
episode_length                       9.874506
returns                             -3.359112
return_std                           3.036148
average_reward                      -0.340021
round_time             0 days 00:05:52.535478
episodes_test                          1012.0
episode_length_test                  9.880435
returns_test                        -3.378826
return_std_test                      3.005832
average_reward_test                 -0.341835
round_time_test        0 days 00:00:09.424003
round_time_total       0 days 00:05:52.536579
loss_total              42980542545657.859375
loss_critic             53725682433261.570312
loss_actor                      -21486517.539
memory_size                         15779.811 

=== epoch 3/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:44,  5.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1017
episode_length                       9.824975
returns                             -3.432366
return_std                           3.081319
average_reward                      -0.349484
round_time             0 days 00:05:53.285350
episodes_test                          1029.0
episode_length_test                  9.714286
returns_test                        -3.553164
return_std_test                      3.135633
average_reward_test                 -0.365589
round_time_test        0 days 00:00:09.211839
round_time_total       0 days 00:05:53.286499
loss_total                45663632677666.8125
loss_critic             57079545055936.515625
loss_actor                      -22239970.289
memory_size                        15801.0755 

=== epoch 3/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:00,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1018
episode_length                       9.799607
returns                             -3.376699
return_std                           3.106327
average_reward                      -0.342889
round_time             0 days 00:05:52.961594
episodes_test                          1022.0
episode_length_test                   9.77593
returns_test                        -3.775462
return_std_test                      3.101885
average_reward_test                 -0.384766
round_time_test        0 days 00:00:09.350060
round_time_total       0 days 00:05:52.962676
loss_total              49338909738401.789062
loss_critic                61673641577807.875
loss_actor                      -22900638.801
memory_size                         15827.998 

=== epoch 3/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1029
episode_length                       9.707483
returns                              -3.39241
return_std                           3.141532
average_reward                      -0.349102
round_time             0 days 00:05:52.524080
episodes_test                          1029.0
episode_length_test                  9.710398
returns_test                        -3.511964
return_std_test                      3.051684
average_reward_test                 -0.361755
round_time_test        0 days 00:00:09.456625
round_time_total       0 days 00:05:52.525283
loss_total              53414137694257.148438
loss_critic             66767676795518.976562
loss_actor                      -23582559.959
memory_size                         15867.784 

=== epoch 3/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:47,  5.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1020
episode_length                       9.793137
returns                             -3.343056
return_std                           3.208632
average_reward                      -0.341226
round_time             0 days 00:05:52.880393
episodes_test                          1008.0
episode_length_test                  9.914683
returns_test                        -3.084046
return_std_test                      3.241161
average_reward_test                 -0.310705
round_time_test        0 days 00:00:09.194027
round_time_total       0 days 00:05:52.881647
loss_total                55676467606978.5625
loss_critic               69595589285773.3125
loss_actor                      -24261631.347
memory_size                         15885.754 

=== epoch 3/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:27,  5.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1022
episode_length                       9.782779
returns                             -3.297342
return_std                           3.204763
average_reward                      -0.337295
round_time             0 days 00:05:52.769952
episodes_test                          1007.0
episode_length_test                  9.929494
returns_test                        -3.189145
return_std_test                      3.287406
average_reward_test                 -0.321085
round_time_test        0 days 00:00:09.450022
round_time_total       0 days 00:05:52.771162
loss_total              58893439403556.867188
loss_critic              73616804396662.78125
loss_actor                      -24912170.397
memory_size                         15895.399 

=== epoch 3/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:28,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1028
episode_length                       9.705253
returns                             -3.335571
return_std                           3.125674
average_reward                      -0.342285
round_time             0 days 00:05:52.900641
episodes_test                          1052.0
episode_length_test                  9.502852
returns_test                        -3.449772
return_std_test                      3.015158
average_reward_test                  -0.36289
round_time_test        0 days 00:00:09.298519
round_time_total       0 days 00:05:52.901733
loss_total              63305792883785.726562
loss_critic              79132246625222.65625
loss_actor                       -25661208.03
memory_size                        15909.3605 

=== epoch 3/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<05:58,  5.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1044
episode_length                       9.565134
returns                             -3.341383
return_std                             3.0111
average_reward                       -0.34883
round_time             0 days 00:05:53.486810
episodes_test                          1015.0
episode_length_test                   9.84335
returns_test                        -3.387254
return_std_test                      3.111816
average_reward_test                 -0.343008
round_time_test        0 days 00:00:09.122953
round_time_total       0 days 00:05:53.487898
loss_total              66041489647468.546875
loss_critic              82551867778793.46875
loss_actor                      -26170301.574
memory_size                        15931.7375 

=== epoch 3/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1043
episode_length                       9.577181
returns                             -3.422929
return_std                           2.914251
average_reward                      -0.357411
round_time             0 days 00:05:52.108718
episodes_test                          1053.0
episode_length_test                  9.487179
returns_test                        -3.539426
return_std_test                      2.973013
average_reward_test                 -0.372546
round_time_test        0 days 00:00:09.325155
round_time_total       0 days 00:05:52.109841
loss_total              70454713755107.328125
loss_critic                 88068398310752.25
loss_actor                      -27070946.284
memory_size                         15938.303 

=== epoch 3/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:43,  5.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1041
episode_length                        9.59366
returns                             -3.503068
return_std                           2.886538
average_reward                      -0.364418
round_time             0 days 00:05:53.132608
episodes_test                          1039.0
episode_length_test                  9.618864
returns_test                        -3.336581
return_std_test                      3.049105
average_reward_test                 -0.346233
round_time_test        0 days 00:00:09.321665
round_time_total       0 days 00:05:53.133774
loss_total                   75169105575936.0
loss_critic             93961388673728.515625
loss_actor                      -27853261.858
memory_size                         15958.622 

=== epoch 3/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:29,  5.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1044
episode_length                       9.573755
returns                             -3.423165
return_std                           2.873467
average_reward                      -0.357527
round_time             0 days 00:05:54.295215
episodes_test                          1014.0
episode_length_test                  9.857002
returns_test                        -3.411773
return_std_test                      3.367923
average_reward_test                 -0.345383
round_time_test        0 days 00:00:09.154903
round_time_total       0 days 00:05:54.296302
loss_total              79382966934962.171875
loss_critic             99228715570954.234375
loss_actor                      -28591001.561
memory_size                         15994.346 

=== epoch 3/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1045
episode_length                       9.557895
returns                             -3.402273
return_std                           2.851326
average_reward                      -0.356265
round_time             0 days 00:05:52.018020
episodes_test                          1041.0
episode_length_test                  9.605187
returns_test                        -3.568302
return_std_test                      3.083143
average_reward_test                 -0.371372
round_time_test        0 days 00:00:09.239779
round_time_total       0 days 00:05:52.019139
loss_total              83203231355764.734375
loss_critic             104004046477590.53125
loss_actor                      -29190138.693
memory_size                        16002.6755 

=== epoch 3/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:39,  4.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1039
episode_length                       9.620789
returns                             -3.420987
return_std                           2.919746
average_reward                      -0.355676
round_time             0 days 00:05:54.352047
episodes_test                          1045.0
episode_length_test                  9.563636
returns_test                        -3.554913
return_std_test                      2.927036
average_reward_test                 -0.371732
round_time_test        0 days 00:00:09.264302
round_time_total       0 days 00:05:54.353166
loss_total               86409471684771.84375
loss_critic            108011847227015.171875
loss_actor                      -30087387.508
memory_size                        16018.4315 

=== epoch 3/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:31,  5.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1041
episode_length                       9.598463
returns                             -3.315221
return_std                           2.952689
average_reward                      -0.345237
round_time             0 days 00:05:52.176260
episodes_test                          1056.0
episode_length_test                  9.463068
returns_test                        -3.528583
return_std_test                      2.891404
average_reward_test                 -0.372849
round_time_test        0 days 00:00:09.338073
round_time_total       0 days 00:05:52.177400
loss_total              93942751789318.140625
loss_critic            117428447619317.765625
loss_actor                      -31030327.556
memory_size                           16021.0 

=== epoch 3/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1057
episode_length                       9.456008
returns                             -3.324222
return_std                           2.906136
average_reward                      -0.351186
round_time             0 days 00:05:52.819997
episodes_test                          1043.0
episode_length_test                  9.576222
returns_test                        -3.228564
return_std_test                      3.002769
average_reward_test                 -0.336393
round_time_test        0 days 00:00:09.408127
round_time_total       0 days 00:05:52.821119
loss_total              96485807637921.796875
loss_critic                120607267511861.25
loss_actor                      -31600136.904
memory_size                         16027.297 

=== epoch 3/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:09,  5.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1058
episode_length                       9.444234
returns                             -3.414122
return_std                           2.956882
average_reward                      -0.361098
round_time             0 days 00:05:54.712960
episodes_test                          1054.0
episode_length_test                  9.482922
returns_test                        -3.204969
return_std_test                      2.750844
average_reward_test                 -0.337802
round_time_test        0 days 00:00:09.217519
round_time_total       0 days 00:05:54.714054
loss_total             100873477589827.578125
loss_critic              126091854673346.5625
loss_actor                      -32520189.263
memory_size                         16035.283 

=== epoch 3/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1064
episode_length                       9.382519
returns                             -3.436525
return_std                           2.929746
average_reward                       -0.36532
round_time             0 days 00:05:54.776934
episodes_test                          1037.0
episode_length_test                  9.639344
returns_test                        -3.428089
return_std_test                      3.195928
average_reward_test                 -0.355494
round_time_test        0 days 00:00:09.253462
round_time_total       0 days 00:05:54.778084
loss_total             108081783219683.328125
loss_critic            135102235924234.234375
loss_actor                      -33301099.637
memory_size                        16040.5925 

=== epoch 3/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1072
episode_length                       9.317164
returns                             -3.454871
return_std                           2.836017
average_reward                      -0.370866
round_time             0 days 00:05:54.964531
episodes_test                          1046.0
episode_length_test                  9.550669
returns_test                         -3.44767
return_std_test                      2.953001
average_reward_test                 -0.360429
round_time_test        0 days 00:00:09.449489
round_time_total       0 days 00:05:54.965737
loss_total             113527522011906.046875
loss_critic                 141909409136640.0
loss_actor                      -34323605.591
memory_size                        16042.3665 

=== epoch 3/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:01,  5.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1078
episode_length                       9.267161
returns                             -3.507792
return_std                           2.822817
average_reward                      -0.378286
round_time             0 days 00:05:54.315536
episodes_test                          1055.0
episode_length_test                  9.473934
returns_test                        -3.279621
return_std_test                      2.881932
average_reward_test                 -0.346257
round_time_test        0 days 00:00:09.402281
round_time_total       0 days 00:05:54.316703
loss_total             118380027959050.234375
loss_critic             147975040734855.15625
loss_actor                      -35056341.284
memory_size                        16051.6855 

=== epoch 3/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:12,  4.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1065
episode_length                       9.371831
returns                             -3.470052
return_std                           2.868875
average_reward                      -0.370487
round_time             0 days 00:05:53.703767
episodes_test                          1036.0
episode_length_test                  9.646718
returns_test                        -3.254861
return_std_test                      3.013195
average_reward_test                 -0.337162
round_time_test        0 days 00:00:09.276920
round_time_total       0 days 00:05:53.704925
loss_total              122257724657369.09375
loss_critic              152822161239703.5625
loss_actor                      -35524436.201
memory_size                         16063.528 

=== epoch 3/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:22,  5.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1064
episode_length                       9.390977
returns                             -3.429187
return_std                            2.82622
average_reward                      -0.364971
round_time             0 days 00:05:53.050143
episodes_test                          1042.0
episode_length_test                  9.596929
returns_test                        -3.101844
return_std_test                        2.8693
average_reward_test                 -0.323212
round_time_test        0 days 00:00:09.268967
round_time_total       0 days 00:05:53.051624
loss_total              126758675189596.15625
loss_critic                 158448348464939.0
loss_actor                      -36210959.661
memory_size                        16071.7205 

=== epoch 3/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:53,  4.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1059
episode_length                       9.433428
returns                             -3.449479
return_std                           2.803346
average_reward                      -0.366267
round_time             0 days 00:05:54.539885
episodes_test                          1044.0
episode_length_test                   9.57567
returns_test                        -3.443453
return_std_test                      2.976202
average_reward_test                 -0.359367
round_time_test        0 days 00:00:09.351070
round_time_total       0 days 00:05:54.541444
loss_total             135176647209910.265625
loss_critic             168970812156542.96875
loss_actor                      -37318781.784
memory_size                        16075.9605 

=== epoch 3/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1053
episode_length                        9.48433
returns                             -3.379517
return_std                           2.860565
average_reward                       -0.35573
round_time             0 days 00:05:54.149716
episodes_test                          1055.0
episode_length_test                  9.472038
returns_test                        -3.343996
return_std_test                      2.994146
average_reward_test                 -0.353017
round_time_test        0 days 00:00:09.249378
round_time_total       0 days 00:05:54.150889
loss_total               141576031480840.1875
loss_critic             176970041827262.46875
loss_actor                      -38173644.858
memory_size                        16082.8265 

=== epoch 3/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1049
episode_length                       9.521449
returns                             -3.307565
return_std                           2.897071
average_reward                      -0.347675
round_time             0 days 00:05:53.503878
episodes_test                           992.0
episode_length_test                 10.074597
returns_test                         -3.04097
return_std_test                      3.509897
average_reward_test                  -0.30113
round_time_test        0 days 00:00:09.278152
round_time_total       0 days 00:05:53.505028
loss_total               145159841852162.0625
loss_critic                181449804111413.25
loss_actor                      -38871361.856
memory_size                        16086.0385 

=== epoch 3/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1060
episode_length                       9.427358
returns                             -3.314125
return_std                           2.876812
average_reward                      -0.351257
round_time             0 days 00:05:54.077216
episodes_test                          1044.0
episode_length_test                  9.576628
returns_test                        -3.218526
return_std_test                       3.12624
average_reward_test                 -0.335935
round_time_test        0 days 00:00:09.200598
round_time_total       0 days 00:05:54.078360
loss_total               154766621292888.0625
loss_critic             193458277491671.03125
loss_actor                      -39991350.538
memory_size                        16088.9695 

=== epoch 3/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1061
episode_length                       9.412818
returns                             -3.316638
return_std                           2.848601
average_reward                      -0.351993
round_time             0 days 00:05:55.067874
episodes_test                          1049.0
episode_length_test                  9.529075
returns_test                        -3.113057
return_std_test                       3.07232
average_reward_test                 -0.326503
round_time_test        0 days 00:00:09.326333
round_time_total       0 days 00:05:55.069065
loss_total              161321004570771.46875
loss_critic                201651257381748.75
loss_actor                      -40937399.882
memory_size                        16095.7715 

=== epoch 3/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1061
episode_length                       9.419416
returns                             -3.228561
return_std                           3.045584
average_reward                      -0.342747
round_time             0 days 00:05:54.447244
episodes_test                          1058.0
episode_length_test                   9.44518
returns_test                        -3.294158
return_std_test                      2.916727
average_reward_test                 -0.348372
round_time_test        0 days 00:00:09.314588
round_time_total       0 days 00:05:54.448406
loss_total              171948790301851.65625
loss_critic                214935994116341.75
loss_actor                      -42041528.478
memory_size                         16127.516 

=== epoch 3/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1065
episode_length                       9.380282
returns                             -3.229116
return_std                           3.035349
average_reward                      -0.344153
round_time             0 days 00:05:54.040066
episodes_test                          1051.0
episode_length_test                   9.50999
returns_test                        -3.176446
return_std_test                      2.985487
average_reward_test                 -0.333592
round_time_test        0 days 00:00:09.303176
round_time_total       0 days 00:05:54.041345
loss_total                 182546868218101.75
loss_critic             228183596672745.46875
loss_actor                      -43283256.996
memory_size                        16137.4805 

=== epoch 3/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1068
episode_length                       9.353933
returns                             -3.314551
return_std                           3.055695
average_reward                        -0.3547
round_time             0 days 00:05:52.836131
episodes_test                          1073.0
episode_length_test                  9.318733
returns_test                        -3.270593
return_std_test                      2.782986
average_reward_test                 -0.350868
round_time_test        0 days 00:00:09.461565
round_time_total       0 days 00:05:52.837241
loss_total                192378208664944.625
loss_critic              240472774513000.4375
loss_actor                       -44209491.51
memory_size                           16139.0 

=== epoch 3/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1072
episode_length                       9.314366
returns                              -3.40554
return_std                           2.983809
average_reward                      -0.365295
round_time             0 days 00:05:53.633367
episodes_test                          1062.0
episode_length_test                  9.416196
returns_test                        -3.376721
return_std_test                      2.999957
average_reward_test                 -0.358608
round_time_test        0 days 00:00:09.456706
round_time_total       0 days 00:05:53.634465
loss_total              200504894585569.28125
loss_critic             250631132628910.09375
loss_actor                      -45229010.826
memory_size                         16140.477 

=== epoch 3/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1070
episode_length                       9.342056
returns                             -3.399203
return_std                           3.037706
average_reward                      -0.363864
round_time             0 days 00:05:55.059186
episodes_test                          1068.0
episode_length_test                   9.36236
returns_test                        -3.572849
return_std_test                      2.878455
average_reward_test                 -0.381489
round_time_test        0 days 00:00:09.491342
round_time_total       0 days 00:05:55.060271
loss_total              211799791275868.15625
loss_critic               264749753177210.875
loss_actor                      -46321015.298
memory_size                        16151.9855 

=== epoch 3/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:00,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1074
episode_length                       9.306331
returns                             -3.412689
return_std                           2.850888
average_reward                      -0.366426
round_time             0 days 00:05:54.857835
episodes_test                          1037.0
episode_length_test                  9.636451
returns_test                        -3.287096
return_std_test                      2.944282
average_reward_test                 -0.341072
round_time_test        0 days 00:00:09.365581
round_time_total       0 days 00:05:54.858912
loss_total              219861031625687.03125
loss_critic              274826302823858.1875
loss_actor                      -47401444.302
memory_size                         16157.033 

=== epoch 3/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:39,  5.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1068
episode_length                       9.356742
returns                             -3.467069
return_std                           2.858895
average_reward                      -0.370397
round_time             0 days 00:05:54.690141
episodes_test                          1049.0
episode_length_test                  9.524309
returns_test                        -3.268704
return_std_test                      2.908785
average_reward_test                 -0.343262
round_time_test        0 days 00:00:09.232200
round_time_total       0 days 00:05:54.691460
loss_total               227992741701746.6875
loss_critic              284990939842215.9375
loss_actor                      -48138683.196
memory_size                        16165.0425 

=== epoch 3/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:09,  4.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1059
episode_length                       9.433428
returns                              -3.41318
return_std                           2.798123
average_reward                      -0.362076
round_time             0 days 00:05:53.332689
episodes_test                          1050.0
episode_length_test                  9.521905
returns_test                        -3.173666
return_std_test                      3.126134
average_reward_test                 -0.333056
round_time_test        0 days 00:00:09.322050
round_time_total       0 days 00:05:53.333818
loss_total              240319619582132.21875
loss_critic               300399535627173.875
loss_actor                      -49289586.586
memory_size                         16178.597 

=== epoch 3/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:58,  4.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                 1059
episode_length                       9.439093
returns                             -3.366214
return_std                            2.82529
average_reward                      -0.356926
round_time             0 days 00:05:56.315678
episodes_test                          1062.0
episode_length_test                  9.411488
returns_test                        -3.143921
return_std_test                      2.913501
average_reward_test                  -0.33407
round_time_test        0 days 00:00:09.302226
round_time_total       0 days 00:05:56.316751
loss_total              250473480179417.09375
loss_critic                 313091859821887.5
loss_actor                      -50116059.186
memory_size                         16196.235 

=== epoch 3/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:15,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1063
episode_length                        9.39793
returns                             -3.402937
return_std                           2.784206
average_reward                      -0.361512
round_time             0 days 00:05:54.932991
episodes_test                          1068.0
episode_length_test                  9.356742
returns_test                        -3.418854
return_std_test                      2.817839
average_reward_test                 -0.365442
round_time_test        0 days 00:00:09.424804
round_time_total       0 days 00:05:54.934096
loss_total                261937557552496.625
loss_critic              327421954799697.9375
loss_actor                       -51390314.97
memory_size                        16203.7765 

=== epoch 3/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1062
episode_length                       9.403955
returns                             -3.434909
return_std                           2.820071
average_reward                      -0.365326
round_time             0 days 00:05:54.341774
episodes_test                          1070.0
episode_length_test                  9.342056
returns_test                        -3.141329
return_std_test                      2.872843
average_reward_test                  -0.33595
round_time_test        0 days 00:00:09.518353
round_time_total       0 days 00:05:54.342865
loss_total               267286400519372.8125
loss_critic              334108007648460.8125
loss_actor                      -51915246.276
memory_size                        16208.1625 

=== epoch 3/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:39,  5.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1063
episode_length                       9.402634
returns                             -3.404854
return_std                           2.913254
average_reward                      -0.362037
round_time             0 days 00:05:55.053613
episodes_test                          1070.0
episode_length_test                  9.340187
returns_test                        -3.408285
return_std_test                      2.827443
average_reward_test                 -0.364746
round_time_test        0 days 00:00:09.288951
round_time_total       0 days 00:05:55.055123
loss_total                276972539614330.875
loss_critic              346215679822659.5625
loss_actor                       -53107470.47
memory_size                         16214.593 

=== epoch 3/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1067
episode_length                       9.368322
returns                             -3.391491
return_std                           2.926075
average_reward                      -0.361819
round_time             0 days 00:05:54.145807
episodes_test                          1034.0
episode_length_test                   9.66441
returns_test                        -3.440308
return_std_test                      2.976873
average_reward_test                 -0.355636
round_time_test        0 days 00:00:09.434666
round_time_total       0 days 00:05:54.146900
loss_total               290581849284018.1875
loss_critic              363227315557105.6875
loss_actor                      -54377101.952
memory_size                        16236.0855 

=== epoch 3/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                 1066
episode_length                       9.374296
returns                             -3.354945
return_std                           2.919382
average_reward                      -0.357633
round_time             0 days 00:05:55.682068
episodes_test                          1058.0
episode_length_test                  9.446125
returns_test                        -3.353173
return_std_test                      3.027999
average_reward_test                 -0.355119
round_time_test        0 days 00:00:09.281256
round_time_total       0 days 00:05:55.683133
loss_total                  299991152797417.5
loss_critic               374988943139012.625
loss_actor                      -55478669.422
memory_size                         16238.866 

=== epoch 3/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:20,  5.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                 1066
episode_length                       9.378987
returns                             -3.302835
return_std                           2.924401
average_reward                      -0.352334
round_time             0 days 00:05:53.804646
episodes_test                          1058.0
episode_length_test                  9.449905
returns_test                        -3.263723
return_std_test                      2.948947
average_reward_test                  -0.34515
round_time_test        0 days 00:00:09.287905
round_time_total       0 days 00:05:53.805799
loss_total                  318310431850496.0
loss_critic              397888039512178.6875
loss_actor                      -56817663.924
memory_size                         16253.997 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  1%|          | 12/2000 [00:02<05:41,  5.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  208
episode_length                       9.596154
returns                             -3.427347
return_std                           3.362537
average_reward                      -0.354974
round_time             0 days 00:05:49.539343
episodes_test                          1066.0
episode_length_test                  9.368668
returns_test                        -3.375889
return_std_test                       2.87899
average_reward_test                 -0.359922
round_time_test        0 days 00:00:09.094557
round_time_total       0 days 00:05:49.540484
loss_total                 331176644267999.25
loss_critic               413970803139280.875
loss_actor                      -57879221.922
memory_size                        16271.8375 

=== epoch 4/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:58,  5.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.73it/s]
episodes                                  425
episode_length                       9.395294
returns                             -3.520036
return_std                           2.978744
average_reward                      -0.374809
round_time             0 days 00:05:49.709834
episodes_test                          1057.0
episode_length_test                  9.454115
returns_test                        -3.490115
return_std_test                      2.965229
average_reward_test                 -0.369404
round_time_test        0 days 00:00:09.400871
round_time_total       0 days 00:05:49.710927
loss_total               349570958211678.1875
loss_critic              436963694333657.0625
loss_actor                       -59225550.73
memory_size                           16284.0 

=== epoch 4/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  643
episode_length                       9.326594
returns                              -3.41834
return_std                           2.979346
average_reward                      -0.366211
round_time             0 days 00:05:50.078182
episodes_test                          1069.0
episode_length_test                  9.354537
returns_test                        -3.432802
return_std_test                      2.795505
average_reward_test                 -0.366967
round_time_test        0 days 00:00:09.369288
round_time_total       0 days 00:05:50.079312
loss_total                  367008789775253.5
loss_critic              458760982798270.4375
loss_actor                       -60630960.17
memory_size                        16285.5415 

=== epoch 4/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  853
episode_length                       9.377491
returns                              -3.36176
return_std                           2.967811
average_reward                      -0.358353
round_time             0 days 00:05:50.584005
episodes_test                          1074.0
episode_length_test                  9.301676
returns_test                        -3.341911
return_std_test                      2.818015
average_reward_test                  -0.35929
round_time_test        0 days 00:00:09.348350
round_time_total       0 days 00:05:50.585122
loss_total               380512068178018.3125
loss_critic                 475640079767830.5
loss_actor                      -61507698.226
memory_size                        16301.5845 

=== epoch 4/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1066
episode_length                       9.376173
returns                             -3.319526
return_std                           2.957013
average_reward                      -0.353951
round_time             0 days 00:05:51.905288
episodes_test                          1045.0
episode_length_test                  9.562679
returns_test                         -3.38359
return_std_test                      3.199256
average_reward_test                  -0.35365
round_time_test        0 days 00:00:09.391020
round_time_total       0 days 00:05:51.906530
loss_total                391068396201967.625
loss_critic                488835489067958.25
loss_actor                      -62728739.088
memory_size                         16320.286 

=== epoch 4/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:48,  5.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                 1071
episode_length                       9.327731
returns                             -3.279163
return_std                           2.890941
average_reward                      -0.351753
round_time             0 days 00:05:51.605064
episodes_test                          1058.0
episode_length_test                  9.442344
returns_test                        -3.321284
return_std_test                      2.971814
average_reward_test                 -0.351707
round_time_test        0 days 00:00:09.312531
round_time_total       0 days 00:05:51.606205
loss_total                407245998696431.625
loss_critic              509057491481395.1875
loss_actor                      -64010242.144
memory_size                         16332.373 

=== epoch 4/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1066
episode_length                       9.378987
returns                             -3.168319
return_std                           2.905568
average_reward                      -0.337922
round_time             0 days 00:05:51.296281
episodes_test                          1043.0
episode_length_test                  9.583893
returns_test                        -3.300428
return_std_test                      3.090701
average_reward_test                 -0.344094
round_time_test        0 days 00:00:09.309164
round_time_total       0 days 00:05:51.297368
loss_total               423740724396163.0625
loss_critic              529675898278903.8125
loss_actor                      -65242630.298
memory_size                        16339.3425 

=== epoch 4/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.71it/s]
episodes                                 1066
episode_length                       9.365854
returns                             -3.235284
return_std                           2.829634
average_reward                      -0.345991
round_time             0 days 00:05:50.547195
episodes_test                          1071.0
episode_length_test                  9.331466
returns_test                        -3.580919
return_std_test                          2.74
average_reward_test                 -0.383683
round_time_test        0 days 00:00:09.351357
round_time_total       0 days 00:05:50.548467
loss_total                440796248488804.375
loss_critic              550995302409043.9375
loss_actor                      -66274935.362
memory_size                           16340.0 

=== epoch 4/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                 1063
episode_length                        9.39699
returns                             -3.287342
return_std                           2.746914
average_reward                      -0.349659
round_time             0 days 00:05:50.369912
episodes_test                          1077.0
episode_length_test                  9.284123
returns_test                        -3.295243
return_std_test                      2.874109
average_reward_test                 -0.354792
round_time_test        0 days 00:00:09.402603
round_time_total       0 days 00:05:50.371369
loss_total                455478506462117.875
loss_critic               569348124426895.375
loss_actor                      -67505711.846
memory_size                        16350.6395 

=== epoch 4/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1059
episode_length                       9.435316
returns                             -3.386426
return_std                           2.780233
average_reward                      -0.358784
round_time             0 days 00:05:51.100589
episodes_test                          1044.0
episode_length_test                  9.577586
returns_test                        -3.369662
return_std_test                      3.053537
average_reward_test                 -0.351692
round_time_test        0 days 00:00:09.276549
round_time_total       0 days 00:05:51.101787
loss_total                  473219988690305.0
loss_critic                591524976398434.25
loss_actor                      -68974759.964
memory_size                        16361.3815 

=== epoch 4/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1057
episode_length                       9.443709
returns                             -3.387341
return_std                           2.729004
average_reward                      -0.359131
round_time             0 days 00:05:52.143155
episodes_test                          1075.0
episode_length_test                  9.298605
returns_test                         -3.57497
return_std_test                      2.857839
average_reward_test                 -0.384309
round_time_test        0 days 00:00:09.423026
round_time_total       0 days 00:05:52.144271
loss_total               497438250990829.5625
loss_critic                621797803202445.25
loss_actor                      -69886849.632
memory_size                         16365.892 

=== epoch 4/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                 1061
episode_length                       9.418473
returns                              -3.52025
return_std                           2.766894
average_reward                      -0.372771
round_time             0 days 00:05:50.690882
episodes_test                          1050.0
episode_length_test                  9.518095
returns_test                         -3.31894
return_std_test                      3.016445
average_reward_test                 -0.348653
round_time_test        0 days 00:00:09.296949
round_time_total       0 days 00:05:50.692118
loss_total                493296502207152.125
loss_critic               616620617713582.125
loss_actor                      -70359093.944
memory_size                         16367.542 

=== epoch 4/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                 1051
episode_length                       9.511893
returns                             -3.505611
return_std                           2.823335
average_reward                      -0.368622
round_time             0 days 00:05:50.677791
episodes_test                          1066.0
episode_length_test                  9.374296
returns_test                        -3.296017
return_std_test                      2.984094
average_reward_test                 -0.351153
round_time_test        0 days 00:00:09.389909
round_time_total       0 days 00:05:50.678907
loss_total               517983896797708.3125
loss_critic                647479860427489.25
loss_actor                      -72055616.214
memory_size                         16378.588 

=== epoch 4/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                 1060
episode_length                       9.416981
returns                             -3.459535
return_std                           2.890378
average_reward                      -0.366468
round_time             0 days 00:05:51.072443
episodes_test                          1068.0
episode_length_test                  9.363296
returns_test                        -3.419558
return_std_test                      2.811479
average_reward_test                 -0.365209
round_time_test        0 days 00:00:09.406678
round_time_total       0 days 00:05:51.073742
loss_total                541976480184270.875
loss_critic               677470588368846.875
loss_actor                      -73385665.168
memory_size                        16386.6855 

=== epoch 4/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1064
episode_length                       9.390977
returns                              -3.38892
return_std                           2.889876
average_reward                      -0.360391
round_time             0 days 00:05:51.744891
episodes_test                          1066.0
episode_length_test                  9.375235
returns_test                        -3.483075
return_std_test                      2.940903
average_reward_test                 -0.371573
round_time_test        0 days 00:00:09.420617
round_time_total       0 days 00:05:51.746174
loss_total                 556107250708512.75
loss_critic               695134051970842.625
loss_actor                      -74863650.556
memory_size                        16402.3795 

=== epoch 4/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1061
episode_length                       9.418473
returns                             -3.370557
return_std                           2.886321
average_reward                      -0.358124
round_time             0 days 00:05:51.881643
episodes_test                          1031.0
episode_length_test                  9.698351
returns_test                        -3.138208
return_std_test                      3.190032
average_reward_test                 -0.323482
round_time_test        0 days 00:00:09.380068
round_time_total       0 days 00:05:51.882725
loss_total                  587641196856213.5
loss_critic               734551483529297.875
loss_actor                      -76603475.682
memory_size                         16410.262 

=== epoch 4/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1057
episode_length                       9.446547
returns                              -3.36225
return_std                           2.915152
average_reward                      -0.356375
round_time             0 days 00:05:51.130542
episodes_test                          1046.0
episode_length_test                  9.554493
returns_test                        -3.304389
return_std_test                      2.935033
average_reward_test                 -0.345897
round_time_test        0 days 00:00:09.326875
round_time_total       0 days 00:05:51.131637
loss_total                 605717178141900.75
loss_critic                 757146460281110.5
loss_actor                      -77487033.032
memory_size                           16415.0 

=== epoch 4/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:02,  5.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                 1056
episode_length                       9.454545
returns                             -3.334016
return_std                           2.881561
average_reward                      -0.352123
round_time             0 days 00:05:51.653957
episodes_test                          1082.0
episode_length_test                  9.235675
returns_test                        -3.575058
return_std_test                      2.637219
average_reward_test                 -0.386836
round_time_test        0 days 00:00:09.369910
round_time_total       0 days 00:05:51.655167
loss_total                 625145112480972.75
loss_critic                 781431377915543.5
loss_actor                      -78819941.084
memory_size                         16417.633 

=== epoch 4/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:08,  5.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1049
episode_length                       9.522402
returns                             -3.416151
return_std                           2.893235
average_reward                      -0.357652
round_time             0 days 00:05:52.361862
episodes_test                          1059.0
episode_length_test                  9.438149
returns_test                         -3.32515
return_std_test                      2.782722
average_reward_test                  -0.35188
round_time_test        0 days 00:00:09.305617
round_time_total       0 days 00:05:52.363066
loss_total                 651243014320029.75
loss_critic                 814053756881600.5
loss_actor                      -80274835.684
memory_size                         16435.307 

=== epoch 4/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1044
episode_length                       9.564176
returns                             -3.451682
return_std                           2.955067
average_reward                      -0.360996
round_time             0 days 00:05:51.501771
episodes_test                          1052.0
episode_length_test                  9.505703
returns_test                        -3.390334
return_std_test                      2.955309
average_reward_test                 -0.356663
round_time_test        0 days 00:00:09.239727
round_time_total       0 days 00:05:51.502880
loss_total                670732239570468.875
loss_critic               838415289350619.125
loss_actor                        -81684885.8
memory_size                        16450.6585 

=== epoch 4/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1056
episode_length                       9.462121
returns                             -3.562372
return_std                           2.891248
average_reward                      -0.376316
round_time             0 days 00:05:52.792279
episodes_test                          1040.0
episode_length_test                  9.610577
returns_test                        -3.328311
return_std_test                      2.979934
average_reward_test                 -0.345904
round_time_test        0 days 00:00:09.266883
round_time_total       0 days 00:05:52.793478
loss_total                685496679780057.125
loss_critic                 856870841483264.0
loss_actor                      -82652400.004
memory_size                        16454.7445 

=== epoch 4/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:01,  5.51it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1063
episode_length                       9.405456
returns                             -3.528805
return_std                           2.918781
average_reward                      -0.375322
round_time             0 days 00:05:52.251787
episodes_test                          1061.0
episode_length_test                  9.419416
returns_test                        -3.441447
return_std_test                      2.860543
average_reward_test                 -0.364632
round_time_test        0 days 00:00:09.376767
round_time_total       0 days 00:05:52.252883
loss_total                715357787898511.375
loss_critic                894197225263988.75
loss_actor                       -84415367.86
memory_size                         16461.774 

=== epoch 4/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1065
episode_length                       9.379343
returns                             -3.531099
return_std                           2.958665
average_reward                      -0.377078
round_time             0 days 00:05:53.294897
episodes_test                          1045.0
episode_length_test                  9.564593
returns_test                        -3.430967
return_std_test                      3.099451
average_reward_test                 -0.358031
round_time_test        0 days 00:00:09.569447
round_time_total       0 days 00:05:53.296016
loss_total                744369018776846.375
loss_critic                930461262507147.25
loss_actor                      -86048714.204
memory_size                        16469.8865 

=== epoch 4/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.48it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                 1064
episode_length                       9.383459
returns                             -3.574436
return_std                           2.933017
average_reward                      -0.381496
round_time             0 days 00:05:59.979842
episodes_test                          1040.0
episode_length_test                  9.610577
returns_test                        -3.351491
return_std_test                      2.866959
average_reward_test                 -0.348314
round_time_test        0 days 00:00:09.166628
round_time_total       0 days 00:05:59.981068
loss_total                 775952116974878.75
loss_critic                 969940133119787.0
loss_actor                      -87572968.432
memory_size                         16486.603 

=== epoch 4/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:49,  5.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:08<00:00,  5.43it/s]
episodes                                 1065
episode_length                       9.381221
returns                             -3.494525
return_std                           2.905751
average_reward                      -0.372003
round_time             0 days 00:06:08.849679
episodes_test                          1045.0
episode_length_test                  9.568421
returns_test                        -3.507953
return_std_test                      2.899987
average_reward_test                 -0.366493
round_time_test        0 days 00:00:09.376307
round_time_total       0 days 00:06:08.850948
loss_total                 803012179426017.25
loss_critic              1003765210202243.125
loss_actor                      -89238621.204
memory_size                         16495.481 

=== epoch 4/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                 1064
episode_length                       9.384398
returns                             -3.469593
return_std                           2.970226
average_reward                      -0.370666
round_time             0 days 00:05:59.074465
episodes_test                          1056.0
episode_length_test                  9.463068
returns_test                        -3.401561
return_std_test                      2.836033
average_reward_test                 -0.359726
round_time_test        0 days 00:00:09.398189
round_time_total       0 days 00:05:59.075596
loss_total                837573295640412.125
loss_critic               1046966605189218.25
loss_actor                      -90726778.176
memory_size                           16503.0 

=== epoch 4/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1057
episode_length                       9.459792
returns                             -3.384995
return_std                           2.975331
average_reward                      -0.357719
round_time             0 days 00:05:52.740457
episodes_test                          1027.0
episode_length_test                  9.731256
returns_test                        -3.235351
return_std_test                       3.12592
average_reward_test                 -0.332484
round_time_test        0 days 00:00:09.575359
round_time_total       0 days 00:05:52.741544
loss_total                842551459038887.875
loss_critic                1053189308946579.5
loss_actor                      -91399033.336
memory_size                        16506.1565 

=== epoch 4/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:38,  5.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                 1057
episode_length                       9.456008
returns                             -3.418431
return_std                           2.948118
average_reward                      -0.361157
round_time             0 days 00:05:51.582337
episodes_test                          1057.0
episode_length_test                  9.451277
returns_test                        -3.336762
return_std_test                      2.893707
average_reward_test                 -0.353042
round_time_test        0 days 00:00:09.259876
round_time_total       0 days 00:05:51.583439
loss_total                878730990703345.625
loss_critic                1098413721711018.0
loss_actor                      -92837877.856
memory_size                         16513.062 

=== epoch 4/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:50,  5.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1061
episode_length                       9.420358
returns                              -3.33595
return_std                           2.940432
average_reward                      -0.354042
round_time             0 days 00:05:52.849108
episodes_test                          1039.0
episode_length_test                  9.614052
returns_test                        -3.430239
return_std_test                      3.029564
average_reward_test                 -0.355997
round_time_test        0 days 00:00:09.356498
round_time_total       0 days 00:05:52.850222
loss_total                  906081644534825.0
loss_critic               1132602037565915.25
loss_actor                      -94517682.932
memory_size                        16517.8485 

=== epoch 4/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1057
episode_length                       9.448439
returns                             -3.360119
return_std                           2.957678
average_reward                      -0.355418
round_time             0 days 00:05:54.187140
episodes_test                          1067.0
episode_length_test                   9.36926
returns_test                        -3.474869
return_std_test                      2.789213
average_reward_test                  -0.37055
round_time_test        0 days 00:00:09.287521
round_time_total       0 days 00:05:54.188226
loss_total                937466686708121.625
loss_critic               1171833340727132.25
loss_actor                      -96245782.356
memory_size                         16534.587 

=== epoch 4/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1054
episode_length                       9.476281
returns                             -3.301327
return_std                           2.849998
average_reward                       -0.34866
round_time             0 days 00:05:53.839160
episodes_test                          1061.0
episode_length_test                  9.420358
returns_test                         -3.49411
return_std_test                      2.826339
average_reward_test                 -0.370548
round_time_test        0 days 00:00:09.301863
round_time_total       0 days 00:05:53.840336
loss_total                 969087740773662.75
loss_critic                1211359655880556.5
loss_actor                      -97501223.088
memory_size                        16544.9875 

=== epoch 4/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:23,  5.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1056
episode_length                       9.458333
returns                             -3.416725
return_std                           2.840669
average_reward                       -0.36091
round_time             0 days 00:05:53.473910
episodes_test                          1042.0
episode_length_test                  9.595969
returns_test                        -3.453074
return_std_test                      2.841593
average_reward_test                 -0.359708
round_time_test        0 days 00:00:09.370123
round_time_total       0 days 00:05:53.475121
loss_total                  991930266488406.0
loss_critic                1239912813145620.5
loss_actor                      -98536990.348
memory_size                         16557.234 

=== epoch 4/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1061
episode_length                       9.411876
returns                              -3.32662
return_std                            2.82486
average_reward                      -0.353898
round_time             0 days 00:05:52.467858
episodes_test                          1072.0
episode_length_test                  9.323694
returns_test                        -3.591021
return_std_test                       2.61667
average_reward_test                 -0.384536
round_time_test        0 days 00:00:09.227540
round_time_total       0 days 00:05:52.468973
loss_total               1015621459638222.875
loss_critic                1269526803173605.5
loss_actor                     -100321752.948
memory_size                        16574.1725 

=== epoch 4/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:27,  5.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1054
episode_length                       9.480076
returns                             -3.380077
return_std                           2.813743
average_reward                      -0.356643
round_time             0 days 00:05:52.331886
episodes_test                          1038.0
episode_length_test                  9.633911
returns_test                        -3.351057
return_std_test                      3.040038
average_reward_test                  -0.34784
round_time_test        0 days 00:00:09.328626
round_time_total       0 days 00:05:52.333374
loss_total               1060722155248943.125
loss_critic                1325902671206416.5
loss_actor                     -102091096.504
memory_size                        16596.4275 

=== epoch 4/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1055
episode_length                       9.472038
returns                             -3.440456
return_std                           2.755974
average_reward                      -0.362848
round_time             0 days 00:05:51.968255
episodes_test                          1063.0
episode_length_test                  9.406397
returns_test                        -3.380931
return_std_test                       2.85828
average_reward_test                 -0.359318
round_time_test        0 days 00:00:09.421809
round_time_total       0 days 00:05:51.969364
loss_total                 1081435016984002.5
loss_critic               1351793746437472.25
loss_actor                     -103254949.072
memory_size                         16622.907 

=== epoch 4/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1047
episode_length                       9.541547
returns                             -3.452973
return_std                           2.834923
average_reward                      -0.361317
round_time             0 days 00:05:53.437335
episodes_test                          1052.0
episode_length_test                  9.500951
returns_test                        -3.284129
return_std_test                      2.781372
average_reward_test                 -0.345299
round_time_test        0 days 00:00:09.310580
round_time_total       0 days 00:05:53.438452
loss_total                1129284719314206.75
loss_critic               1411605873821745.25
loss_actor                     -104636649.624
memory_size                        16656.7335 

=== epoch 4/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:16,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1040
episode_length                       9.605769
returns                             -3.359091
return_std                           2.844248
average_reward                      -0.349933
round_time             0 days 00:05:52.825033
episodes_test                          1027.0
episode_length_test                  9.735151
returns_test                        -3.506692
return_std_test                       2.97807
average_reward_test                 -0.359891
round_time_test        0 days 00:00:09.323875
round_time_total       0 days 00:05:52.826143
loss_total                1137594086929203.25
loss_critic               1421992585114681.25
loss_actor                     -106062836.032
memory_size                         16660.367 

=== epoch 4/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1037
episode_length                       9.639344
returns                                -3.458
return_std                           2.839775
average_reward                      -0.358708
round_time             0 days 00:05:53.191197
episodes_test                          1026.0
episode_length_test                  9.744639
returns_test                        -3.153011
return_std_test                      3.012623
average_reward_test                  -0.32335
round_time_test        0 days 00:00:09.302507
round_time_total       0 days 00:05:53.192447
loss_total                 1194849491954434.0
loss_critic               1493561838636367.75
loss_actor                      -108046576.02
memory_size                         16662.281 

=== epoch 4/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1043
episode_length                       9.579099
returns                             -3.407439
return_std                           3.002729
average_reward                      -0.355479
round_time             0 days 00:05:53.403820
episodes_test                          1051.0
episode_length_test                  9.511893
returns_test                        -3.477555
return_std_test                      3.042957
average_reward_test                 -0.365416
round_time_test        0 days 00:00:09.396051
round_time_total       0 days 00:05:53.405092
loss_total                1228289122498510.75
loss_critic               1535361376229261.25
loss_actor                      -109018650.14
memory_size                         16687.013 

=== epoch 4/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1047
episode_length                       9.518625
returns                             -3.389889
return_std                            2.97353
average_reward                      -0.355584
round_time             0 days 00:05:51.916862
episodes_test                          1041.0
episode_length_test                  9.603266
returns_test                        -3.489704
return_std_test                      2.880644
average_reward_test                 -0.363063
round_time_test        0 days 00:00:09.384657
round_time_total       0 days 00:05:51.918019
loss_total                 1259612583866925.0
loss_critic                1574515702419685.5
loss_actor                     -110959163.492
memory_size                        16705.8485 

=== epoch 4/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:28,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1043
episode_length                       9.577181
returns                             -3.370205
return_std                           3.008148
average_reward                      -0.351752
round_time             0 days 00:05:54.795906
episodes_test                          1051.0
episode_length_test                  9.512845
returns_test                        -3.504614
return_std_test                       2.87233
average_reward_test                 -0.368189
round_time_test        0 days 00:00:09.354767
round_time_total       0 days 00:05:54.797004
loss_total                 1299301148137619.5
loss_critic                1624126406705283.0
loss_actor                      -111833309.54
memory_size                         16737.011 

=== epoch 4/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1053
episode_length                       9.490028
returns                             -3.516063
return_std                           2.964408
average_reward                      -0.370177
round_time             0 days 00:05:54.644250
episodes_test                          1030.0
episode_length_test                  9.701942
returns_test                        -3.146405
return_std_test                      3.087282
average_reward_test                 -0.324316
round_time_test        0 days 00:00:09.390453
round_time_total       0 days 00:05:54.645354
loss_total                 1305364763487764.5
loss_critic               1631705926920568.75
loss_actor                     -113233679.244
memory_size                        16753.7825 

=== epoch 4/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:53,  5.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1050
episode_length                       9.510476
returns                              -3.45369
return_std                           2.977502
average_reward                      -0.363268
round_time             0 days 00:05:54.869329
episodes_test                          1043.0
episode_length_test                  9.580058
returns_test                        -3.501512
return_std_test                      2.901316
average_reward_test                 -0.365292
round_time_test        0 days 00:00:09.333000
round_time_total       0 days 00:05:54.870471
loss_total                1340327797322678.25
loss_critic               1675409716747960.25
loss_actor                     -114423076.032
memory_size                         16760.192 

=== epoch 4/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1059
episode_length                       9.434372
returns                             -3.518277
return_std                            2.74612
average_reward                      -0.372867
round_time             0 days 00:05:53.415517
episodes_test                          1083.0
episode_length_test                  9.228994
returns_test                        -3.927982
return_std_test                      2.688491
average_reward_test                  -0.42563
round_time_test        0 days 00:00:09.394988
round_time_total       0 days 00:05:53.416615
loss_total                 1357562443009622.0
loss_critic               1696953023831474.25
loss_actor                     -114502466.028
memory_size                        16764.1635 

=== epoch 4/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1053
episode_length                       9.488129
returns                             -3.516951
return_std                           2.832978
average_reward                      -0.370463
round_time             0 days 00:05:53.692398
episodes_test                          1039.0
episode_length_test                  9.617902
returns_test                        -3.688215
return_std_test                      3.084186
average_reward_test                 -0.383082
round_time_test        0 days 00:00:09.355015
round_time_total       0 days 00:05:53.693487
loss_total                1385396190937350.25
loss_critic                1731745210309804.0
loss_actor                     -116262331.276
memory_size                        16783.5785 

=== epoch 4/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1057
episode_length                         9.4579
returns                             -3.624614
return_std                           2.792485
average_reward                      -0.383644
round_time             0 days 00:05:53.537524
episodes_test                          1045.0
episode_length_test                  9.564593
returns_test                        -3.608131
return_std_test                      2.756998
average_reward_test                 -0.377115
round_time_test        0 days 00:00:09.427619
round_time_total       0 days 00:05:53.538604
loss_total                 1436998366666948.5
loss_critic               1796247925936881.75
loss_actor                     -118277481.236
memory_size                         16800.513 

=== epoch 4/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                 1053
episode_length                       9.492877
returns                             -3.576656
return_std                           2.758272
average_reward                      -0.377494
round_time             0 days 00:05:56.275491
episodes_test                          1043.0
episode_length_test                   9.58581
returns_test                         -3.42611
return_std_test                      2.934433
average_reward_test                 -0.357205
round_time_test        0 days 00:00:09.387702
round_time_total       0 days 00:05:56.276586
loss_total                1510061819396358.25
loss_critic               1887577240732958.75
loss_actor                     -120066011.324
memory_size                         16809.699 

=== epoch 4/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:15,  5.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                 1057
episode_length                       9.451277
returns                             -3.660554
return_std                           2.798024
average_reward                      -0.387268
round_time             0 days 00:05:54.722995
episodes_test                          1059.0
episode_length_test                  9.439093
returns_test                        -3.649427
return_std_test                      2.788354
average_reward_test                 -0.386599
round_time_test        0 days 00:00:09.375034
round_time_total       0 days 00:05:54.724255
loss_total                 1531612140631228.5
loss_critic               1914515143878770.75
loss_actor                     -121395844.408
memory_size                        16826.8815 

=== epoch 4/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                 1053
episode_length                       9.482431
returns                              -3.65804
return_std                            2.88445
average_reward                      -0.386113
round_time             0 days 00:05:55.656889
episodes_test                          1074.0
episode_length_test                  9.309125
returns_test                        -3.620401
return_std_test                      2.791332
average_reward_test                 -0.388746
round_time_test        0 days 00:00:09.255391
round_time_total       0 days 00:05:55.658109
loss_total                1553754947739189.25
loss_critic               1942193653501919.25
loss_actor                     -122039406.324
memory_size                         16833.909 

=== epoch 4/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:52,  5.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                 1060
episode_length                       9.424528
returns                             -3.692306
return_std                           2.822843
average_reward                      -0.392052
round_time             0 days 00:05:54.709439
episodes_test                          1067.0
episode_length_test                  9.372071
returns_test                        -3.617087
return_std_test                      2.787215
average_reward_test                 -0.385943
round_time_test        0 days 00:00:09.224338
round_time_total       0 days 00:05:54.710527
loss_total                 1579435371802919.0
loss_critic                1974294180922392.5
loss_actor                     -124060945.312
memory_size                        16845.1505 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  1%|          | 12/2000 [00:02<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  202
episode_length                       9.831683
returns                             -3.351169
return_std                           2.908734
average_reward                      -0.340063
round_time             0 days 00:05:49.752266
episodes_test                          1054.0
episode_length_test                  9.482922
returns_test                        -3.581261
return_std_test                      2.864556
average_reward_test                 -0.377446
round_time_test        0 days 00:00:09.308302
round_time_total       0 days 00:05:49.753400
loss_total                1593286225727324.25
loss_critic                1991607748311122.0
loss_actor                     -123989950.312
memory_size                         16857.733 

=== epoch 5/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:42,  4.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                  408
episode_length                       9.796569
returns                             -3.498173
return_std                           2.747709
average_reward                      -0.356313
round_time             0 days 00:05:51.393441
episodes_test                          1032.0
episode_length_test                  9.687016
returns_test                         -3.44155
return_std_test                      3.041075
average_reward_test                 -0.354956
round_time_test        0 days 00:00:09.356138
round_time_total       0 days 00:05:51.394946
loss_total                1641765116808527.75
loss_critic               2052206362288455.75
loss_actor                     -126036968.676
memory_size                        16876.6935 

=== epoch 5/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:27,  5.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  625
episode_length                         9.5968
returns                             -3.669545
return_std                           2.757267
average_reward                      -0.381881
round_time             0 days 00:05:53.573898
episodes_test                          1042.0
episode_length_test                  9.595969
returns_test                        -3.481033
return_std_test                      3.012494
average_reward_test                 -0.362643
round_time_test        0 days 00:00:09.308753
round_time_total       0 days 00:05:53.575004
loss_total                 1698263975591936.0
loss_critic               2122829936539467.75
loss_actor                     -127838416.008
memory_size                        16881.8915 

=== epoch 5/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  823
episode_length                       9.708384
returns                             -3.600157
return_std                           2.967498
average_reward                      -0.370358
round_time             0 days 00:05:50.934796
episodes_test                          1050.0
episode_length_test                  9.520952
returns_test                        -3.532836
return_std_test                      2.977702
average_reward_test                  -0.37091
round_time_test        0 days 00:00:09.369694
round_time_total       0 days 00:05:50.936131
loss_total                 1722727496002044.0
loss_critic                2153409333612773.5
loss_actor                     -129371366.128
memory_size                        16899.0775 

=== epoch 5/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1033
episode_length                       9.679574
returns                             -3.577264
return_std                           2.951846
average_reward                      -0.369438
round_time             0 days 00:05:51.091081
episodes_test                          1034.0
episode_length_test                   9.67118
returns_test                        -3.494649
return_std_test                      2.961644
average_reward_test                 -0.361347
round_time_test        0 days 00:00:09.551589
round_time_total       0 days 00:05:51.092165
loss_total                1780152407682973.75
loss_critic               2225190472450572.25
loss_actor                     -130486689.464
memory_size                        16928.0475 

=== epoch 5/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1036
episode_length                       9.643822
returns                             -3.679579
return_std                           2.989231
average_reward                      -0.381852
round_time             0 days 00:05:51.306036
episodes_test                          1033.0
episode_length_test                  9.673766
returns_test                        -3.674428
return_std_test                      2.789843
average_reward_test                 -0.379628
round_time_test        0 days 00:00:09.538221
round_time_total       0 days 00:05:51.307216
loss_total                1824811440884678.75
loss_critic                2281014262467920.0
loss_actor                     -132115901.976
memory_size                         16944.679 

=== epoch 5/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:50,  4.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                 1036
episode_length                       9.643822
returns                              -3.59854
return_std                           3.071665
average_reward                      -0.373728
round_time             0 days 00:05:51.759482
episodes_test                          1055.0
episode_length_test                  9.472986
returns_test                         -3.58978
return_std_test                      2.806577
average_reward_test                 -0.378416
round_time_test        0 days 00:00:09.540535
round_time_total       0 days 00:05:51.760582
loss_total                 1823051795199426.5
loss_critic                2278814708045709.5
loss_actor                     -133102131.004
memory_size                        16980.5065 

=== epoch 5/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                 1028
episode_length                       9.713035
returns                             -3.547012
return_std                           3.101966
average_reward                      -0.364845
round_time             0 days 00:05:50.315351
episodes_test                          1046.0
episode_length_test                  9.556405
returns_test                        -3.630153
return_std_test                      2.996462
average_reward_test                 -0.379482
round_time_test        0 days 00:00:09.198577
round_time_total       0 days 00:05:50.316532
loss_total                 1904908469329199.0
loss_critic                2381135545909641.0
loss_actor                     -135353543.204
memory_size                        16998.8545 

=== epoch 5/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:56,  5.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1039
episode_length                        9.61309
returns                             -3.653524
return_std                           3.008355
average_reward                      -0.380117
round_time             0 days 00:05:52.774654
episodes_test                          1060.0
episode_length_test                  9.430189
returns_test                         -3.64685
return_std_test                      2.892707
average_reward_test                 -0.386347
round_time_test        0 days 00:00:09.339335
round_time_total       0 days 00:05:52.775971
loss_total                 1951010203520467.0
loss_critic                2438762714680525.0
loss_actor                      -136396545.76
memory_size                        17014.7325 

=== epoch 5/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:00,  5.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                 1039
episode_length                       9.612127
returns                             -3.702952
return_std                           3.076255
average_reward                      -0.385034
round_time             0 days 00:05:49.935987
episodes_test                          1030.0
episode_length_test                  9.702913
returns_test                        -3.704834
return_std_test                      2.988375
average_reward_test                 -0.381027
round_time_test        0 days 00:00:09.326068
round_time_total       0 days 00:05:49.937070
loss_total                1951823810046984.25
loss_critic                2439779723207770.0
loss_actor                      -136576016.62
memory_size                        17023.4065 

=== epoch 5/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1040
episode_length                       9.609615
returns                             -3.637739
return_std                           3.019667
average_reward                      -0.378099
round_time             0 days 00:05:51.787501
episodes_test                          1022.0
episode_length_test                  9.779843
returns_test                        -3.688855
return_std_test                      3.052137
average_reward_test                  -0.37738
round_time_test        0 days 00:00:09.187577
round_time_total       0 days 00:05:51.788674
loss_total                 2009840131376152.5
loss_critic                2512300122486866.0
loss_actor                     -138567884.364
memory_size                         17047.361 

=== epoch 5/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1047
episode_length                       9.531996
returns                             -3.711083
return_std                           2.958917
average_reward                      -0.389652
round_time             0 days 00:05:53.291078
episodes_test                          1032.0
episode_length_test                  9.687984
returns_test                        -3.567224
return_std_test                      3.127404
average_reward_test                 -0.367984
round_time_test        0 days 00:00:09.505591
round_time_total       0 days 00:05:53.292272
loss_total                 2063272829717053.5
loss_critic                2579090992141435.0
loss_actor                     -141099168.188
memory_size                         17065.799 

=== epoch 5/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1051
episode_length                        9.50999
returns                             -3.741046
return_std                           2.940413
average_reward                       -0.39326
round_time             0 days 00:05:53.110923
episodes_test                          1050.0
episode_length_test                  9.519048
returns_test                        -3.736933
return_std_test                      2.864638
average_reward_test                 -0.391884
round_time_test        0 days 00:00:09.430399
round_time_total       0 days 00:05:53.112023
loss_total                2105037562841137.25
loss_critic                2631296908269715.5
loss_actor                     -141975465.292
memory_size                        17088.2025 

=== epoch 5/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1051
episode_length                        9.50999
returns                             -3.617564
return_std                           2.886218
average_reward                      -0.380158
round_time             0 days 00:05:52.097649
episodes_test                          1033.0
episode_length_test                  9.677638
returns_test                        -3.774018
return_std_test                      3.035763
average_reward_test                 -0.389605
round_time_test        0 days 00:00:09.299362
round_time_total       0 days 00:05:52.098749
loss_total                2161814744565022.75
loss_critic                2702268383528747.0
loss_actor                     -143653077.836
memory_size                        17092.9445 

=== epoch 5/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:29,  5.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                 1046
episode_length                       9.550669
returns                             -3.566673
return_std                            2.85201
average_reward                       -0.37303
round_time             0 days 00:05:51.673675
episodes_test                          1043.0
episode_length_test                  9.581016
returns_test                        -3.786093
return_std_test                      3.001579
average_reward_test                 -0.394805
round_time_test        0 days 00:00:09.440325
round_time_total       0 days 00:05:51.674750
loss_total                2178942346782048.25
loss_critic                2723677885838655.5
loss_actor                     -144985882.552
memory_size                         17099.952 

=== epoch 5/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1047
episode_length                       9.542502
returns                             -3.591829
return_std                           2.854966
average_reward                      -0.376898
round_time             0 days 00:05:53.136546
episodes_test                          1052.0
episode_length_test                  9.499049
returns_test                        -3.866633
return_std_test                      2.875709
average_reward_test                 -0.406897
round_time_test        0 days 00:00:09.262565
round_time_total       0 days 00:05:53.137669
loss_total                 2264392716450267.0
loss_critic                2830490846204264.5
loss_actor                     -147038750.588
memory_size                         17111.146 

=== epoch 5/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1047
episode_length                       9.547278
returns                             -3.642282
return_std                           2.893592
average_reward                      -0.381576
round_time             0 days 00:05:52.109767
episodes_test                          1046.0
episode_length_test                  9.557361
returns_test                        -3.940015
return_std_test                      3.085004
average_reward_test                 -0.411809
round_time_test        0 days 00:00:09.401624
round_time_total       0 days 00:05:52.110888
loss_total                 2307364288542540.0
loss_critic                2884205310505910.5
loss_actor                      -148840960.38
memory_size                        17127.0435 

=== epoch 5/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1040
episode_length                       9.610577
returns                             -3.646896
return_std                           2.968783
average_reward                      -0.379623
round_time             0 days 00:05:52.597050
episodes_test                          1047.0
episode_length_test                  9.548233
returns_test                        -3.737202
return_std_test                       2.85543
average_reward_test                 -0.391019
round_time_test        0 days 00:00:09.336022
round_time_total       0 days 00:05:52.598189
loss_total                 2378751524219126.0
loss_critic                2973439352115298.5
loss_actor                      -150478325.44
memory_size                        17145.5415 

=== epoch 5/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:34,  5.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1030
episode_length                       9.704854
returns                             -3.655558
return_std                           3.103971
average_reward                      -0.377277
round_time             0 days 00:05:52.484042
episodes_test                          1016.0
episode_length_test                  9.840551
returns_test                        -3.870625
return_std_test                      2.976635
average_reward_test                 -0.393109
round_time_test        0 days 00:00:09.372283
round_time_total       0 days 00:05:52.485143
loss_total                 2434301201035034.5
loss_critic                3042876447422152.5
loss_actor                     -151882149.716
memory_size                         17189.346 

=== epoch 5/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<06:03,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                 1031
episode_length                       9.690592
returns                             -3.696145
return_std                           3.071434
average_reward                      -0.381629
round_time             0 days 00:05:50.912435
episodes_test                          1024.0
episode_length_test                  9.763672
returns_test                        -3.646552
return_std_test                      2.948717
average_reward_test                 -0.373182
round_time_test        0 days 00:00:09.263700
round_time_total       0 days 00:05:50.913520
loss_total                 2468366647639933.0
loss_critic                3085458254956855.5
loss_actor                     -153192043.032
memory_size                        17211.2975 

=== epoch 5/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1025
episode_length                       9.742439
returns                             -3.693024
return_std                           3.109148
average_reward                      -0.379654
round_time             0 days 00:05:51.330810
episodes_test                          1064.0
episode_length_test                  9.391917
returns_test                        -3.648922
return_std_test                      2.784041
average_reward_test                 -0.388124
round_time_test        0 days 00:00:09.409923
round_time_total       0 days 00:05:51.331957
loss_total                 2523230336742064.0
loss_critic                3154037865411772.5
loss_actor                     -155172250.832
memory_size                         17221.808 

=== epoch 5/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1019
episode_length                       9.807655
returns                             -3.609373
return_std                           3.132711
average_reward                      -0.368354
round_time             0 days 00:05:52.449665
episodes_test                          1039.0
episode_length_test                  9.624639
returns_test                        -3.840016
return_std_test                       2.79065
average_reward_test                 -0.398978
round_time_test        0 days 00:00:09.304374
round_time_total       0 days 00:05:52.450788
loss_total                 2607574447556657.0
loss_critic                3259467999786041.5
loss_actor                     -156961633.008
memory_size                        17237.8205 

=== epoch 5/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1023
episode_length                       9.769306
returns                             -3.594796
return_std                           3.064962
average_reward                      -0.367776
round_time             0 days 00:05:53.390613
episodes_test                          1034.0
episode_length_test                   9.65764
returns_test                        -3.609521
return_std_test                      2.989295
average_reward_test                 -0.372019
round_time_test        0 days 00:00:09.263109
round_time_total       0 days 00:05:53.391736
loss_total                 2678429491755221.0
loss_critic                3348036806930071.5
loss_actor                      -159626706.16
memory_size                        17249.5475 

=== epoch 5/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:31,  5.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1034
episode_length                       9.665377
returns                             -3.756434
return_std                           2.943827
average_reward                      -0.388702
round_time             0 days 00:05:51.903849
episodes_test                          1019.0
episode_length_test                  9.812561
returns_test                        -3.824001
return_std_test                      3.005597
average_reward_test                 -0.389591
round_time_test        0 days 00:00:09.487234
round_time_total       0 days 00:05:51.904928
loss_total                 2776283385222398.0
loss_critic                3470354173696934.0
loss_actor                     -161688522.816
memory_size                           17254.0 

=== epoch 5/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:04,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1029
episode_length                       9.709427
returns                             -3.809916
return_std                           2.926351
average_reward                      -0.393264
round_time             0 days 00:05:52.352986
episodes_test                          1009.0
episode_length_test                   9.89891
returns_test                        -3.842645
return_std_test                      3.193707
average_reward_test                  -0.38801
round_time_test        0 days 00:00:09.337437
round_time_total       0 days 00:05:52.354064
loss_total                 2816656334294876.0
loss_critic                3520820354568159.0
loss_actor                      -163558419.32
memory_size                         17258.333 

=== epoch 5/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:36,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1029
episode_length                       9.702624
returns                             -3.842167
return_std                           2.867205
average_reward                      -0.396249
round_time             0 days 00:05:54.523863
episodes_test                          1027.0
episode_length_test                  9.726388
returns_test                        -4.075683
return_std_test                      3.156148
average_reward_test                 -0.418556
round_time_test        0 days 00:00:09.260462
round_time_total       0 days 00:05:54.525162
loss_total                 2880718768462364.5
loss_critic                3600898399123013.5
loss_actor                     -165440251.784
memory_size                        17273.3635 

=== epoch 5/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                 1027
episode_length                       9.725414
returns                               -3.9545
return_std                           2.816683
average_reward                      -0.406985
round_time             0 days 00:05:50.851352
episodes_test                          1016.0
episode_length_test                  9.834646
returns_test                         -3.67489
return_std_test                      2.975469
average_reward_test                   -0.3734
round_time_test        0 days 00:00:09.329408
round_time_total       0 days 00:05:50.852476
loss_total                 2950892028545728.5
loss_critic                3688614971207319.5
loss_actor                     -166738449.576
memory_size                        17299.6185 

=== epoch 5/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1020
episode_length                       9.796078
returns                             -4.033913
return_std                           2.811218
average_reward                      -0.411787
round_time             0 days 00:05:52.882575
episodes_test                          1020.0
episode_length_test                  9.797059
returns_test                        -3.779662
return_std_test                       3.07433
average_reward_test                 -0.385821
round_time_test        0 days 00:00:09.614042
round_time_total       0 days 00:05:52.884067
loss_total                 3023078342753517.5
loss_critic                3778847865107906.5
loss_actor                     -169053788.792
memory_size                         17328.584 

=== epoch 5/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:07,  5.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                 1004
episode_length                       9.958167
returns                             -4.006152
return_std                           2.856127
average_reward                      -0.402291
round_time             0 days 00:05:52.472079
episodes_test                          1039.0
episode_length_test                  9.618864
returns_test                        -3.471845
return_std_test                      2.852323
average_reward_test                 -0.360993
round_time_test        0 days 00:00:09.337880
round_time_total       0 days 00:05:52.473183
loss_total                 3120044705992147.0
loss_critic                3900055819609178.0
loss_actor                       -170217759.4
memory_size                         17347.346 

=== epoch 5/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                 1007
episode_length                       9.929494
returns                              -4.01007
return_std                           2.912669
average_reward                      -0.403723
round_time             0 days 00:05:51.375392
episodes_test                          1000.0
episode_length_test                     9.999
returns_test                        -3.761868
return_std_test                      3.043932
average_reward_test                 -0.376115
round_time_test        0 days 00:00:09.400745
round_time_total       0 days 00:05:51.376476
loss_total                 3151017130489544.5
loss_critic                3938771347445907.5
loss_actor                     -172076642.264
memory_size                          17355.96 

=== epoch 5/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                 1006
episode_length                       9.929423
returns                             -3.936299
return_std                           2.944191
average_reward                      -0.396471
round_time             0 days 00:05:54.503161
episodes_test                          1020.0
episode_length_test                  9.801961
returns_test                        -3.927019
return_std_test                      2.962401
average_reward_test                 -0.400427
round_time_test        0 days 00:00:09.260082
round_time_total       0 days 00:05:54.504294
loss_total                 3235606041756434.5
loss_critic                4044507483274739.5
loss_actor                     -173894518.584
memory_size                         17371.002 

=== epoch 5/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:36,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                 1006
episode_length                       9.929423
returns                             -3.865091
return_std                           2.968202
average_reward                      -0.389875
round_time             0 days 00:05:53.020805
episodes_test                          1022.0
episode_length_test                  9.779843
returns_test                        -3.686006
return_std_test                      3.163253
average_reward_test                 -0.376707
round_time_test        0 days 00:00:09.362744
round_time_total       0 days 00:05:53.021894
loss_total                 3344273295666053.0
loss_critic                4180341549067927.5
loss_actor                     -176683203.792
memory_size                         17381.087 

=== epoch 5/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:49,  4.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1012
episode_length                       9.854743
returns                             -3.854446
return_std                           3.051952
average_reward                       -0.38969
round_time             0 days 00:05:53.325767
episodes_test                          1029.0
episode_length_test                  9.709427
returns_test                        -3.754606
return_std_test                      3.027719
average_reward_test                 -0.387133
round_time_test        0 days 00:00:09.208795
round_time_total       0 days 00:05:53.327159
loss_total                 3452451838735941.5
loss_critic                4315564723341885.5
loss_actor                       -178719712.6
memory_size                         17388.192 

=== epoch 5/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:30,  5.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1022
episode_length                       9.770059
returns                             -3.798513
return_std                           3.099223
average_reward                      -0.389164
round_time             0 days 00:05:53.211456
episodes_test                           999.0
episode_length_test                  9.998999
returns_test                        -3.938416
return_std_test                      3.095174
average_reward_test                 -0.394162
round_time_test        0 days 00:00:09.314880
round_time_total       0 days 00:05:53.212581
loss_total                 3510160693758263.5
loss_critic                4387700793612959.5
loss_actor                     -180803012.704
memory_size                        17407.6695 

=== epoch 5/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:32,  5.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                 1022
episode_length                       9.763209
returns                             -3.730432
return_std                           3.142392
average_reward                       -0.38063
round_time             0 days 00:05:52.044658
episodes_test                           995.0
episode_length_test                 10.047236
returns_test                         -3.84817
return_std_test                      3.252976
average_reward_test                 -0.382677
round_time_test        0 days 00:00:09.143063
round_time_total       0 days 00:05:52.045751
loss_total                 3602404012390875.0
loss_critic                4503004938950934.5
loss_actor                     -182378316.944
memory_size                        17433.5035 

=== epoch 5/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:45,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1015
episode_length                       9.832512
returns                             -3.844922
return_std                           3.210997
average_reward                       -0.39125
round_time             0 days 00:05:53.347646
episodes_test                           980.0
episode_length_test                  10.20102
returns_test                        -3.867776
return_std_test                      3.407299
average_reward_test                 -0.378814
round_time_test        0 days 00:00:09.279315
round_time_total       0 days 00:05:53.348720
loss_total                 3673258062371619.0
loss_critic                4591572496997679.0
loss_actor                     -183493632.088
memory_size                         17460.842 

=== epoch 5/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:25,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                 1010
episode_length                        9.89703
returns                             -3.997848
return_std                           3.262749
average_reward                      -0.404141
round_time             0 days 00:05:54.105162
episodes_test                          1007.0
episode_length_test                  9.929494
returns_test                         -3.91172
return_std_test                       3.09109
average_reward_test                 -0.393831
round_time_test        0 days 00:00:09.366958
round_time_total       0 days 00:05:54.106249
loss_total                 3779688727966646.5
loss_critic                4724610831877145.0
loss_actor                     -185564327.048
memory_size                         17487.592 

=== epoch 5/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:31,  5.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                 1000
episode_length                          9.986
returns                             -4.024143
return_std                           3.285603
average_reward                      -0.403086
round_time             0 days 00:05:55.397179
episodes_test                          1002.0
episode_length_test                  9.974052
returns_test                        -3.933741
return_std_test                      3.034798
average_reward_test                 -0.394275
round_time_test        0 days 00:00:09.273269
round_time_total       0 days 00:05:55.398269
loss_total                 3831548176297361.5
loss_critic                4789435144253473.0
loss_actor                     -186181451.504
memory_size                         17509.742 

=== epoch 5/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 7/2000 [00:01<08:01,  4.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  986
episode_length                      10.136917
returns                             -4.077265
return_std                           3.236541
average_reward                      -0.402236
round_time             0 days 00:05:54.592799
episodes_test                           992.0
episode_length_test                 10.074597
returns_test                        -4.085959
return_std_test                      3.112772
average_reward_test                  -0.40585
round_time_test        0 days 00:00:09.164009
round_time_total       0 days 00:05:54.594257
loss_total                 3928847640153817.0
loss_critic                4911059467514151.0
loss_actor                     -189704223.096
memory_size                        17531.8785 

=== epoch 5/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:38,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  985
episode_length                      10.144162
returns                             -4.229995
return_std                            3.15462
average_reward                      -0.417632
round_time             0 days 00:05:54.674524
episodes_test                          1014.0
episode_length_test                  9.856016
returns_test                        -3.897074
return_std_test                      2.982665
average_reward_test                 -0.394853
round_time_test        0 days 00:00:09.342960
round_time_total       0 days 00:05:54.675769
loss_total                 3991242333017342.0
loss_critic                4989052833392230.0
loss_actor                      -191042244.76
memory_size                         17564.595 

=== epoch 5/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  993
episode_length                      10.062437
returns                             -4.159043
return_std                           3.090384
average_reward                      -0.413517
round_time             0 days 00:05:55.809112
episodes_test                          1014.0
episode_length_test                   9.85503
returns_test                        -4.029973
return_std_test                      3.047259
average_reward_test                  -0.40858
round_time_test        0 days 00:00:09.247999
round_time_total       0 days 00:05:55.810288
loss_total                 4102883141834768.5
loss_critic                5128603837367583.0
loss_actor                      -193363055.08
memory_size                        17576.5305 

=== epoch 5/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:16,  5.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  990
episode_length                      10.092929
returns                             -4.120068
return_std                           3.101764
average_reward                      -0.408196
round_time             0 days 00:05:54.223480
episodes_test                           982.0
episode_length_test                 10.174134
returns_test                        -4.011308
return_std_test                      3.167707
average_reward_test                 -0.394193
round_time_test        0 days 00:00:09.304418
round_time_total       0 days 00:05:54.224593
loss_total                 4236198102941303.0
loss_critic                5295247537593123.0
loss_actor                     -195444027.048
memory_size                        17588.6235 

=== epoch 5/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  997
episode_length                      10.024072
returns                             -4.186147
return_std                           3.063761
average_reward                      -0.417607
round_time             0 days 00:05:53.722361
episodes_test                          1000.0
episode_length_test                     9.996
returns_test                        -4.137972
return_std_test                      3.119452
average_reward_test                 -0.413739
round_time_test        0 days 00:00:09.398858
round_time_total       0 days 00:05:53.723637
loss_total                 4345531081345204.0
loss_critic                5431913761554301.0
loss_actor                     -197655797.048
memory_size                         17603.532 

=== epoch 5/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 7/2000 [00:01<07:18,  4.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                 1007
episode_length                       9.912612
returns                             -4.093678
return_std                           3.038835
average_reward                      -0.412616
round_time             0 days 00:05:53.457700
episodes_test                           979.0
episode_length_test                  10.21144
returns_test                         -4.14108
return_std_test                      3.256312
average_reward_test                 -0.405381
round_time_test        0 days 00:00:09.246873
round_time_total       0 days 00:05:53.459011
loss_total                 4446188662220128.5
loss_critic                5557735730970624.0
loss_actor                      -199526689.16
memory_size                        17626.5265 

=== epoch 5/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:57,  4.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  990
episode_length                      10.087879
returns                             -3.992727
return_std                           3.122857
average_reward                      -0.396091
round_time             0 days 00:05:56.571470
episodes_test                           985.0
episode_length_test                 10.140102
returns_test                        -3.922046
return_std_test                       3.06162
average_reward_test                 -0.386209
round_time_test        0 days 00:00:09.288260
round_time_total       0 days 00:05:56.572567
loss_total                 4550544805930729.0
loss_critic                5688180906129359.0
loss_actor                     -201506973.584
memory_size                        17655.1215 

=== epoch 5/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:15,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  972
episode_length                      10.279835
returns                              -4.04303
return_std                           3.252612
average_reward                      -0.393626
round_time             0 days 00:05:52.608853
episodes_test                           991.0
episode_length_test                 10.088799
returns_test                        -4.004221
return_std_test                       3.25822
average_reward_test                 -0.396623
round_time_test        0 days 00:00:09.302212
round_time_total       0 days 00:05:52.609933
loss_total                 4644947186772083.0
loss_critic                5806183883019911.0
loss_actor                     -202828324.568
memory_size                          17715.86 

=== epoch 5/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:53,  5.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  978
episode_length                      10.214724
returns                             -4.006368
return_std                           3.198572
average_reward                      -0.393201
round_time             0 days 00:05:55.385511
episodes_test                           979.0
episode_length_test                  10.21144
returns_test                        -4.058724
return_std_test                      3.207617
average_reward_test                 -0.397156
round_time_test        0 days 00:00:09.254254
round_time_total       0 days 00:05:55.386614
loss_total                 4646259416117543.0
loss_critic                5807824165792645.0
loss_actor                     -202786982.808
memory_size                         17743.041 

=== epoch 5/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  969
episode_length                      10.308566
returns                             -3.900877
return_std                           3.165506
average_reward                      -0.378357
round_time             0 days 00:05:53.722391
episodes_test                           967.0
episode_length_test                 10.331954
returns_test                        -4.009323
return_std_test                      2.996638
average_reward_test                 -0.388495
round_time_test        0 days 00:00:09.178369
round_time_total       0 days 00:05:53.723490
loss_total                 4695024446006624.0
loss_critic                5868780453976080.0
loss_actor                     -203897860.216
memory_size                         17759.726 

=== epoch 5/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  960
episode_length                      10.405208
returns                             -3.966977
return_std                           3.325097
average_reward                       -0.38159
round_time             0 days 00:05:53.513673
episodes_test                           981.0
episode_length_test                 10.184506
returns_test                        -4.227343
return_std_test                      3.356083
average_reward_test                 -0.414711
round_time_test        0 days 00:00:09.395823
round_time_total       0 days 00:05:53.514773
loss_total                 4694581007746073.0
loss_critic                5868226155160535.0
loss_actor                     -205227394.072
memory_size                        17786.1275 

=== epoch 5/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:48,  5.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  971
episode_length                      10.295572
returns                             -4.052477
return_std                           3.293047
average_reward                      -0.393408
round_time             0 days 00:05:54.044901
episodes_test                           969.0
episode_length_test                 10.317853
returns_test                        -4.123881
return_std_test                      3.286861
average_reward_test                 -0.399531
round_time_test        0 days 00:00:09.329451
round_time_total       0 days 00:05:54.046141
loss_total                 4827449751681106.0
loss_critic                6034312082680185.0
loss_actor                      -207943773.08
memory_size                         17801.156 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  1%|          | 11/2000 [00:02<06:05,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  189
episode_length                       10.57672
returns                             -4.149027
return_std                           3.400185
average_reward                      -0.391564
round_time             0 days 00:05:50.639723
episodes_test                           976.0
episode_length_test                 10.235656
returns_test                        -3.819878
return_std_test                      3.283167
average_reward_test                 -0.372375
round_time_test        0 days 00:00:09.346483
round_time_total       0 days 00:05:50.640913
loss_total                 4964839336785740.0
loss_critic                6206049055840141.0
loss_actor                     -209265140.736
memory_size                        17820.2085 

=== epoch 6/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:49<00:00,  5.72it/s]
episodes                                  384
episode_length                      10.390625
returns                             -3.954683
return_std                           3.220218
average_reward                       -0.38101
round_time             0 days 00:05:50.132868
episodes_test                           964.0
episode_length_test                 10.373444
returns_test                        -4.296772
return_std_test                       3.36988
average_reward_test                 -0.414209
round_time_test        0 days 00:00:09.092191
round_time_total       0 days 00:05:50.133971
loss_total                 4992241698231812.0
loss_critic                6240302011791704.0
loss_actor                      -210731899.16
memory_size                         17851.528 

=== epoch 6/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:53,  4.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.73it/s]
episodes                                  572
episode_length                      10.487762
returns                             -4.204885
return_std                           3.403214
average_reward                      -0.400709
round_time             0 days 00:05:49.412072
episodes_test                          1004.0
episode_length_test                  9.960159
returns_test                        -4.354027
return_std_test                      3.169347
average_reward_test                 -0.437144
round_time_test        0 days 00:00:09.239611
round_time_total       0 days 00:05:49.413206
loss_total                 5129902797545800.0
loss_critic                6412378383467938.0
loss_actor                     -213023013.656
memory_size                         17869.917 

=== epoch 6/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  765
episode_length                      10.445752
returns                             -4.201831
return_std                           3.329353
average_reward                      -0.401452
round_time             0 days 00:05:51.038182
episodes_test                           975.0
episode_length_test                  10.25641
returns_test                        -4.157036
return_std_test                      3.064965
average_reward_test                 -0.405311
round_time_test        0 days 00:00:09.417475
round_time_total       0 days 00:05:51.039438
loss_total                 5267666007291331.0
loss_critic                6584582396874588.0
loss_actor                     -215114147.256
memory_size                         17897.076 

=== epoch 6/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:04,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                  959
episode_length                      10.427529
returns                             -4.246247
return_std                           3.391902
average_reward                      -0.407215
round_time             0 days 00:05:51.256677
episodes_test                          1031.0
episode_length_test                  9.691562
returns_test                        -3.791509
return_std_test                      3.101364
average_reward_test                 -0.390735
round_time_test        0 days 00:00:09.307996
round_time_total       0 days 00:05:51.257758
loss_total                 5292827812007248.0
loss_critic                6616034640085910.0
loss_actor                     -215335445.256
memory_size                         17947.296 

=== epoch 6/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 7/2000 [00:01<07:11,  4.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  958
episode_length                        10.4238
returns                             -4.258416
return_std                           3.358383
average_reward                      -0.407896
round_time             0 days 00:05:51.976877
episodes_test                           954.0
episode_length_test                 10.477987
returns_test                        -4.506976
return_std_test                      3.346372
average_reward_test                 -0.429754
round_time_test        0 days 00:00:09.211440
round_time_total       0 days 00:05:51.977987
loss_total                 5436831091623199.0
loss_critic                6796038744941003.0
loss_actor                     -216871959.976
memory_size                         17980.482 

=== epoch 6/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:34,  5.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:48<00:00,  5.73it/s]
episodes                                  957
episode_length                      10.443051
returns                             -4.303821
return_std                           3.392888
average_reward                      -0.412201
round_time             0 days 00:05:49.372572
episodes_test                           955.0
episode_length_test                 10.464921
returns_test                        -4.037506
return_std_test                      3.236672
average_reward_test                  -0.38579
round_time_test        0 days 00:00:09.453235
round_time_total       0 days 00:05:49.373777
loss_total                 5460395903078629.0
loss_critic                6825494765065208.0
loss_actor                      -217307376.72
memory_size                         18016.719 

=== epoch 6/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:04,  5.46it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                  964
episode_length                      10.365145
returns                             -4.311615
return_std                           3.333321
average_reward                      -0.416098
round_time             0 days 00:05:51.582469
episodes_test                           956.0
episode_length_test                 10.456067
returns_test                        -4.178267
return_std_test                      3.332964
average_reward_test                 -0.399431
round_time_test        0 days 00:00:09.244644
round_time_total       0 days 00:05:51.583568
loss_total                 5564492843164107.0
loss_critic                6955615941983994.0
loss_actor                      -219183121.84
memory_size                        18034.4705 

=== epoch 6/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  963
episode_length                      10.380062
returns                              -4.28455
return_std                           3.367235
average_reward                       -0.41258
round_time             0 days 00:05:50.687444
episodes_test                           948.0
episode_length_test                 10.540084
returns_test                        -4.420281
return_std_test                      3.436481
average_reward_test                 -0.419286
round_time_test        0 days 00:00:09.460044
round_time_total       0 days 00:05:50.688701
loss_total                 5626758821494915.0
loss_critic                7033448401140187.0
loss_actor                     -220613319.312
memory_size                         18066.272 

=== epoch 6/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  960
episode_length                      10.410417
returns                             -4.190855
return_std                           3.275238
average_reward                      -0.402564
round_time             0 days 00:05:53.440367
episodes_test                           976.0
episode_length_test                 10.240779
returns_test                        -4.369217
return_std_test                      3.225661
average_reward_test                 -0.426837
round_time_test        0 days 00:00:09.355651
round_time_total       0 days 00:05:53.441812
loss_total                 5747570021412897.0
loss_critic                7184462396138717.0
loss_actor                     -222177538.144
memory_size                        18103.5385 

=== epoch 6/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  958
episode_length                      10.415449
returns                             -4.268772
return_std                           3.386069
average_reward                      -0.409867
round_time             0 days 00:05:53.226303
episodes_test                           962.0
episode_length_test                 10.390852
returns_test                        -4.211412
return_std_test                      3.294626
average_reward_test                 -0.404785
round_time_test        0 days 00:00:09.144370
round_time_total       0 days 00:05:53.227403
loss_total                 5823149402465763.0
loss_critic                7278936626951094.0
loss_actor                     -222628524.064
memory_size                        18131.8995 

=== epoch 6/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  950
episode_length                      10.513684
returns                             -4.371001
return_std                           3.420409
average_reward                      -0.415125
round_time             0 days 00:05:53.091220
episodes_test                           945.0
episode_length_test                 10.577778
returns_test                        -4.165057
return_std_test                      3.664231
average_reward_test                  -0.39339
round_time_test        0 days 00:00:09.328656
round_time_total       0 days 00:05:53.092304
loss_total                 5918603922385142.0
loss_critic                7398254781749264.0
loss_actor                     -223565871.896
memory_size                        18176.4985 

=== epoch 6/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.70it/s]
episodes                                  957
episode_length                      10.447231
returns                             -4.324361
return_std                            3.38606
average_reward                      -0.413728
round_time             0 days 00:05:51.528603
episodes_test                           949.0
episode_length_test                 10.536354
returns_test                        -4.423983
return_std_test                       3.51269
average_reward_test                 -0.419752
round_time_test        0 days 00:00:09.288733
round_time_total       0 days 00:05:51.529744
loss_total                 6040425690392691.0
loss_critic                7550531981524599.0
loss_actor                     -225510258.344
memory_size                        18211.4085 

=== epoch 6/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  956
episode_length                      10.443515
returns                              -4.43942
return_std                           3.310465
average_reward                      -0.424115
round_time             0 days 00:05:53.972305
episodes_test                           958.0
episode_length_test                 10.427975
returns_test                        -4.278599
return_std_test                      3.434745
average_reward_test                 -0.409817
round_time_test        0 days 00:00:09.314645
round_time_total       0 days 00:05:53.973453
loss_total                 6037597151297536.0
loss_critic                7546996311430529.0
loss_actor                       -225092118.8
memory_size                        18239.3225 

=== epoch 6/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:46,  5.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  956
episode_length                      10.456067
returns                              -4.47409
return_std                            3.35311
average_reward                      -0.427849
round_time             0 days 00:05:54.358922
episodes_test                           980.0
episode_length_test                  10.19898
returns_test                        -4.009224
return_std_test                      3.263035
average_reward_test                 -0.392707
round_time_test        0 days 00:00:09.196795
round_time_total       0 days 00:05:54.360012
loss_total                 6006502831400419.0
loss_critic                7508128415535333.0
loss_actor                     -225700471.872
memory_size                          18273.18 

=== epoch 6/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  962
episode_length                      10.388773
returns                             -4.479621
return_std                           3.235037
average_reward                        -0.4315
round_time             0 days 00:05:52.729706
episodes_test                           991.0
episode_length_test                 10.084763
returns_test                        -4.201672
return_std_test                      3.200161
average_reward_test                 -0.416416
round_time_test        0 days 00:00:09.430749
round_time_total       0 days 00:05:52.730793
loss_total                 6158571303952450.0
loss_critic                7698214004111442.0
loss_actor                     -227484514.016
memory_size                        18294.3215 

=== epoch 6/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:29,  5.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  976
episode_length                      10.235656
returns                             -4.387313
return_std                           3.135564
average_reward                      -0.429075
round_time             0 days 00:05:53.701099
episodes_test                           971.0
episode_length_test                 10.297631
returns_test                        -4.147353
return_std_test                      3.391737
average_reward_test                 -0.402627
round_time_test        0 days 00:00:09.338172
round_time_total       0 days 00:05:53.702217
loss_total                 6220981436082553.0
loss_critic                7776226663586595.0
loss_actor                     -229002517.128
memory_size                         18310.406 

=== epoch 6/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  975
episode_length                          10.24
returns                             -4.351287
return_std                           3.063941
average_reward                      -0.425848
round_time             0 days 00:05:53.192319
episodes_test                           980.0
episode_length_test                 10.192857
returns_test                        -4.568471
return_std_test                      3.298237
average_reward_test                 -0.447152
round_time_test        0 days 00:00:09.231540
round_time_total       0 days 00:05:53.193407
loss_total                 6278593761043808.0
loss_critic                7848242065342202.0
loss_actor                     -230477872.608
memory_size                         18319.867 

=== epoch 6/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  973
episode_length                      10.264132
returns                             -4.257577
return_std                           3.137632
average_reward                       -0.41453
round_time             0 days 00:05:54.221392
episodes_test                           960.0
episode_length_test                 10.404167
returns_test                        -4.395286
return_std_test                      3.436398
average_reward_test                 -0.422335
round_time_test        0 days 00:00:09.368591
round_time_total       0 days 00:05:54.222494
loss_total                 6367696969090990.0
loss_critic                7959621079662592.0
loss_actor                     -230939217.016
memory_size                          18329.71 

=== epoch 6/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  975
episode_length                          10.24
returns                             -4.330085
return_std                           3.082225
average_reward                      -0.422807
round_time             0 days 00:05:54.739149
episodes_test                           982.0
episode_length_test                 10.175153
returns_test                        -4.137764
return_std_test                      3.379574
average_reward_test                 -0.406505
round_time_test        0 days 00:00:09.202217
round_time_total       0 days 00:05:54.740332
loss_total                 6453224709474484.0
loss_critic                8066530755175514.0
loss_actor                     -231483893.072
memory_size                        18361.6775 

=== epoch 6/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:11,  4.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  977
episode_length                      10.229273
returns                             -4.218894
return_std                           3.109052
average_reward                      -0.412583
round_time             0 days 00:05:54.215318
episodes_test                           982.0
episode_length_test                 10.176171
returns_test                        -4.363183
return_std_test                      3.209578
average_reward_test                 -0.428467
round_time_test        0 days 00:00:09.251216
round_time_total       0 days 00:05:54.216448
loss_total                 6496377685785182.0
loss_critic                8120471971805790.0
loss_actor                     -232907763.136
memory_size                        18399.0495 

=== epoch 6/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  975
episode_length                      10.250256
returns                             -4.312172
return_std                           3.208227
average_reward                      -0.420634
round_time             0 days 00:05:54.261434
episodes_test                           965.0
episode_length_test                 10.353368
returns_test                        -4.332435
return_std_test                      3.384403
average_reward_test                  -0.41791
round_time_test        0 days 00:00:09.389463
round_time_total       0 days 00:05:54.262579
loss_total                 6579720064533004.0
loss_critic                8224649944636588.0
loss_actor                     -234491875.672
memory_size                        18419.5305 

=== epoch 6/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:18,  4.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  973
episode_length                      10.271326
returns                             -4.255753
return_std                           3.344688
average_reward                      -0.414117
round_time             0 days 00:05:54.795650
episodes_test                           990.0
episode_length_test                 10.090909
returns_test                        -4.186104
return_std_test                      3.238599
average_reward_test                 -0.415013
round_time_test        0 days 00:00:09.459036
round_time_total       0 days 00:05:54.796883
loss_total                 6700238331172618.0
loss_critic                8375297767769113.0
loss_actor                     -235885343.608
memory_size                         18434.983 

=== epoch 6/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:00,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  977
episode_length                       10.22825
returns                             -4.253692
return_std                           3.348324
average_reward                      -0.416141
round_time             0 days 00:05:55.017814
episodes_test                           953.0
episode_length_test                 10.486884
returns_test                        -4.293341
return_std_test                      3.464578
average_reward_test                 -0.409025
round_time_test        0 days 00:00:09.359298
round_time_total       0 days 00:05:55.018990
loss_total                 6778780471397450.0
loss_critic                8473475445935833.0
loss_actor                     -237454861.488
memory_size                         18460.478 

=== epoch 6/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:21,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  979
episode_length                      10.207354
returns                             -4.207384
return_std                           3.502218
average_reward                      -0.412074
round_time             0 days 00:05:55.055645
episodes_test                           965.0
episode_length_test                 10.356477
returns_test                        -4.185989
return_std_test                      3.416211
average_reward_test                 -0.403622
round_time_test        0 days 00:00:09.287570
round_time_total       0 days 00:05:55.056755
loss_total                 6805449305915130.0
loss_critic                8506811496028701.0
loss_actor                     -239083764.672
memory_size                         18478.328 

=== epoch 6/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  983
episode_length                      10.170905
returns                             -4.190054
return_std                           3.482187
average_reward                      -0.411779
round_time             0 days 00:05:53.229746
episodes_test                           950.0
episode_length_test                 10.524211
returns_test                        -4.390738
return_std_test                      3.614921
average_reward_test                 -0.416975
round_time_test        0 days 00:00:09.235229
round_time_total       0 days 00:05:53.231008
loss_total                 6956749028847518.0
loss_critic                8695936146338742.0
loss_actor                      -239834799.48
memory_size                        18515.8685 

=== epoch 6/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:38,  5.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  980
episode_length                      10.189796
returns                             -4.269492
return_std                           3.488456
average_reward                      -0.418632
round_time             0 days 00:05:55.805933
episodes_test                           977.0
episode_length_test                 10.225179
returns_test                        -4.376317
return_std_test                      3.379868
average_reward_test                 -0.426745
round_time_test        0 days 00:00:09.494698
round_time_total       0 days 00:05:55.807195
loss_total                 7062412577715782.0
loss_critic                8828015576954700.0
loss_actor                     -242416985.904
memory_size                         18531.601 

=== epoch 6/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:02,  5.49it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  977
episode_length                      10.223132
returns                             -4.344066
return_std                           3.431577
average_reward                      -0.424742
round_time             0 days 00:05:54.923182
episodes_test                           975.0
episode_length_test                 10.253333
returns_test                        -4.218923
return_std_test                       3.37153
average_reward_test                 -0.411101
round_time_test        0 days 00:00:09.226315
round_time_total       0 days 00:05:54.924505
loss_total                 7056883440427205.0
loss_critic                8821104151686545.0
loss_actor                     -241966916.072
memory_size                         18549.276 

=== epoch 6/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:36,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  978
episode_length                      10.213701
returns                             -4.443439
return_std                           3.507608
average_reward                       -0.43538
round_time             0 days 00:05:55.041205
episodes_test                           973.0
episode_length_test                 10.273381
returns_test                        -4.534847
return_std_test                      3.284368
average_reward_test                 -0.441169
round_time_test        0 days 00:00:09.218513
round_time_total       0 days 00:05:55.042325
loss_total                 7150792603196195.0
loss_critic                8938490599175094.0
loss_actor                     -242582265.448
memory_size                        18568.1095 

=== epoch 6/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  972
episode_length                      10.268519
returns                              -4.46166
return_std                           3.417313
average_reward                      -0.434743
round_time             0 days 00:05:54.493550
episodes_test                           964.0
episode_length_test                 10.372407
returns_test                        -4.556757
return_std_test                      3.442993
average_reward_test                 -0.439186
round_time_test        0 days 00:00:09.337313
round_time_total       0 days 00:05:54.494682
loss_total                 7254080493044367.0
loss_critic                9067600466988236.0
loss_actor                     -243988023.504
memory_size                         18614.347 

=== epoch 6/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  969
episode_length                       10.30547
returns                             -4.565414
return_std                           3.463742
average_reward                      -0.442703
round_time             0 days 00:05:54.825373
episodes_test                           957.0
episode_length_test                 10.448276
returns_test                        -4.405964
return_std_test                      3.463793
average_reward_test                 -0.421555
round_time_test        0 days 00:00:09.194304
round_time_total       0 days 00:05:54.826444
loss_total                 7281735714448867.0
loss_critic                9102169494683386.0
loss_actor                     -245585507.192
memory_size                         18654.595 

=== epoch 6/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  965
episode_length                      10.351295
returns                             -4.426971
return_std                            3.42995
average_reward                      -0.428482
round_time             0 days 00:05:55.430575
episodes_test                           926.0
episode_length_test                 10.796976
returns_test                        -4.281926
return_std_test                      3.490842
average_reward_test                 -0.396318
round_time_test        0 days 00:00:09.262330
round_time_total       0 days 00:05:55.432162
loss_total                 7348723081877127.0
loss_critic                9185903692962856.0
loss_actor                     -244734328.336
memory_size                        18688.5645 

=== epoch 6/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:58,  4.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  957
episode_length                      10.437827
returns                              -4.38688
return_std                           3.433313
average_reward                      -0.419857
round_time             0 days 00:05:55.821558
episodes_test                           978.0
episode_length_test                 10.223926
returns_test                        -4.417591
return_std_test                      3.282441
average_reward_test                 -0.431939
round_time_test        0 days 00:00:09.060632
round_time_total       0 days 00:05:55.822638
loss_total                 7388608749998440.0
loss_critic                9235760773114888.0
loss_actor                     -245385840.464
memory_size                         18717.937 

=== epoch 6/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:07,  5.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  953
episode_length                      10.486884
returns                             -4.315037
return_std                           3.346343
average_reward                       -0.41145
round_time             0 days 00:05:56.593639
episodes_test                           952.0
episode_length_test                 10.502101
returns_test                        -4.545196
return_std_test                      3.378437
average_reward_test                 -0.432535
round_time_test        0 days 00:00:08.889060
round_time_total       0 days 00:05:56.595064
loss_total                 7398930054626935.0
loss_critic                9248662407390168.0
loss_actor                     -247098115.256
memory_size                         18770.602 

=== epoch 6/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:21,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  965
episode_length                      10.352332
returns                             -4.337926
return_std                           3.374861
average_reward                      -0.418294
round_time             0 days 00:05:57.317710
episodes_test                           975.0
episode_length_test                 10.249231
returns_test                        -4.440487
return_std_test                      3.418532
average_reward_test                 -0.432785
round_time_test        0 days 00:00:09.394431
round_time_total       0 days 00:05:57.318841
loss_total                 7516474952183185.0
loss_critic                9395593534402200.0
loss_actor                     -247606674.608
memory_size                         18791.895 

=== epoch 6/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:06,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  964
episode_length                      10.368257
returns                             -4.282223
return_std                           3.320622
average_reward                      -0.413052
round_time             0 days 00:05:53.589204
episodes_test                           962.0
episode_length_test                 10.385655
returns_test                        -4.446648
return_std_test                      3.352032
average_reward_test                 -0.427772
round_time_test        0 days 00:00:09.312440
round_time_total       0 days 00:05:53.590653
loss_total                 7571329043174261.0
loss_critic                9464161145792234.0
loss_actor                     -248651674.552
memory_size                        18814.6835 

=== epoch 6/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:51,  4.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  964
episode_length                      10.363071
returns                             -4.363148
return_std                            3.40287
average_reward                      -0.421979
round_time             0 days 00:05:57.467317
episodes_test                           928.0
episode_length_test                 10.770474
returns_test                        -4.562299
return_std_test                      3.620603
average_reward_test                 -0.423351
round_time_test        0 days 00:00:09.342329
round_time_total       0 days 00:05:57.468419
loss_total                 7612418488086299.0
loss_critic                9515522954717298.0
loss_actor                     -249585671.096
memory_size                         18845.169 

=== epoch 6/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:39,  4.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  972
episode_length                      10.274691
returns                             -4.484998
return_std                           3.456718
average_reward                      -0.436031
round_time             0 days 00:05:56.349314
episodes_test                           980.0
episode_length_test                 10.192857
returns_test                        -4.244331
return_std_test                       3.39104
average_reward_test                 -0.416035
round_time_test        0 days 00:00:09.436460
round_time_total       0 days 00:05:56.350459
loss_total                 7605146605003473.0
loss_critic                9506433097743204.0
loss_actor                     -249246846.752
memory_size                          18869.25 

=== epoch 6/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  979
episode_length                      10.209397
returns                              -4.52314
return_std                           3.471138
average_reward                      -0.443038
round_time             0 days 00:05:55.412590
episodes_test                           965.0
episode_length_test                 10.358549
returns_test                         -4.50908
return_std_test                      3.427057
average_reward_test                 -0.434932
round_time_test        0 days 00:00:09.397715
round_time_total       0 days 00:05:55.413681
loss_total                 7726625469264560.0
loss_critic                9658281665151108.0
loss_actor                     -250099054.312
memory_size                          18886.95 

=== epoch 6/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:49,  4.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  975
episode_length                      10.237949
returns                              -4.52076
return_std                            3.44154
average_reward                      -0.442489
round_time             0 days 00:05:57.468140
episodes_test                           926.0
episode_length_test                 10.788337
returns_test                        -4.319432
return_std_test                      3.768943
average_reward_test                 -0.400115
round_time_test        0 days 00:00:09.325426
round_time_total       0 days 00:05:57.469223
loss_total                 7812690520928420.0
loss_critic                9765862992213180.0
loss_actor                     -251362309.704
memory_size                        18902.1985 

=== epoch 6/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  975
episode_length                      10.249231
returns                             -4.621005
return_std                           3.534624
average_reward                      -0.450661
round_time             0 days 00:05:55.832231
episodes_test                           987.0
episode_length_test                 10.131712
returns_test                        -4.315869
return_std_test                      3.360873
average_reward_test                 -0.425976
round_time_test        0 days 00:00:09.409941
round_time_total       0 days 00:05:55.833375
loss_total                 7738890541470843.0
loss_critic                9673613012623164.0
loss_actor                     -251379856.672
memory_size                         18917.805 

=== epoch 6/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:49,  4.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  978
episode_length                      10.212679
returns                             -4.580754
return_std                           3.455914
average_reward                      -0.449133
round_time             0 days 00:05:56.458726
episodes_test                           951.0
episode_length_test                 10.511041
returns_test                        -4.571381
return_std_test                      3.585789
average_reward_test                 -0.434508
round_time_test        0 days 00:00:09.268075
round_time_total       0 days 00:05:56.459803
loss_total                 7878934136654660.0
loss_critic                9848667502542848.0
loss_actor                     -253566944.768
memory_size                         18938.992 

=== epoch 6/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  978
episode_length                      10.218814
returns                             -4.609897
return_std                           3.551401
average_reward                      -0.450846
round_time             0 days 00:05:56.795264
episodes_test                           974.0
episode_length_test                 10.262834
returns_test                        -4.784524
return_std_test                       3.53836
average_reward_test                 -0.465853
round_time_test        0 days 00:00:09.212353
round_time_total       0 days 00:05:56.796350
loss_total                 8055296013008634.0
loss_critic               10069119846710248.0
loss_actor                     -254690059.248
memory_size                        18970.4725 

=== epoch 6/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  974
episode_length                       10.25154
returns                             -4.590809
return_std                           3.539036
average_reward                      -0.448146
round_time             0 days 00:05:56.223131
episodes_test                           972.0
episode_length_test                 10.280864
returns_test                        -4.540862
return_std_test                      3.414175
average_reward_test                 -0.441285
round_time_test        0 days 00:00:09.190950
round_time_total       0 days 00:05:56.224229
loss_total                 8051866136414781.0
loss_critic               10064832497075618.0
loss_actor                      -255247292.36
memory_size                         19020.414 

=== epoch 6/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  970
episode_length                      10.301031
returns                             -4.454799
return_std                           3.504603
average_reward                      -0.433597
round_time             0 days 00:05:57.156365
episodes_test                           972.0
episode_length_test                 10.281893
returns_test                        -4.341764
return_std_test                       3.38875
average_reward_test                 -0.421725
round_time_test        0 days 00:00:09.257230
round_time_total       0 days 00:05:57.157434
loss_total                 8066257124419699.0
loss_critic               10082821225102442.0
loss_actor                     -254424211.168
memory_size                         19058.467 

=== epoch 6/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  972
episode_length                      10.271605
returns                             -4.413239
return_std                           3.443573
average_reward                      -0.429448
round_time             0 days 00:05:57.635771
episodes_test                           979.0
episode_length_test                 10.213483
returns_test                        -4.396643
return_std_test                      3.376659
average_reward_test                 -0.430363
round_time_test        0 days 00:00:09.493237
round_time_total       0 days 00:05:57.637042
loss_total                 8150840727375446.0
loss_critic               10188550729099116.0
loss_actor                     -257114653.272
memory_size                         19071.694 

=== epoch 6/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  971
episode_length                      10.295572
returns                             -4.353639
return_std                           3.408489
average_reward                      -0.422549
round_time             0 days 00:05:54.951189
episodes_test                           953.0
episode_length_test                 10.486884
returns_test                        -4.352361
return_std_test                      3.480187
average_reward_test                 -0.414848
round_time_test        0 days 00:00:09.388133
round_time_total       0 days 00:05:54.952268
loss_total                 8209519072010830.0
loss_critic               10261898653182460.0
loss_actor                     -257801231.136
memory_size                        19096.5575 

=== epoch 6/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  964
episode_length                      10.340249
returns                             -4.297885
return_std                           3.336947
average_reward                      -0.414727
round_time             0 days 00:05:57.441499
episodes_test                           988.0
episode_length_test                 10.116397
returns_test                        -4.336283
return_std_test                      3.677952
average_reward_test                 -0.428629
round_time_test        0 days 00:00:09.309342
round_time_total       0 days 00:05:57.442596
loss_total                 8328668344534172.0
loss_critic               10410835247124972.0
loss_actor                     -258580855.376
memory_size                         19119.508 

=== epoch 6/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:16,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  969
episode_length                      10.309598
returns                             -4.300932
return_std                           3.398741
average_reward                      -0.417409
round_time             0 days 00:05:57.530584
episodes_test                           955.0
episode_length_test                 10.459686
returns_test                        -4.420589
return_std_test                      3.458048
average_reward_test                 -0.423159
round_time_test        0 days 00:00:09.280181
round_time_total       0 days 00:05:57.531802
loss_total                 8328846731840061.0
loss_critic               10411058232163304.0
loss_actor                      -258838249.52
memory_size                         19142.076 

=== epoch 6/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  969
episode_length                       10.30547
returns                             -4.423379
return_std                           3.396907
average_reward                      -0.429112
round_time             0 days 00:05:56.264016
episodes_test                           973.0
episode_length_test                 10.271326
returns_test                        -4.543064
return_std_test                       3.47566
average_reward_test                 -0.442373
round_time_test        0 days 00:00:09.302316
round_time_total       0 days 00:05:56.265101
loss_total                 8449037906789532.0
loss_critic               10561297194005038.0
loss_actor                     -261150858.888
memory_size                        19158.6045 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  1%|          | 11/2000 [00:02<06:08,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  194
episode_length                      10.273196
returns                             -4.592532
return_std                           3.616231
average_reward                      -0.447048
round_time             0 days 00:05:51.580334
episodes_test                           981.0
episode_length_test                 10.192661
returns_test                        -4.290268
return_std_test                      3.204133
average_reward_test                 -0.420764
round_time_test        0 days 00:00:09.111799
round_time_total       0 days 00:05:51.581455
loss_total                 8423165238278357.0
loss_critic               10528956368365290.0
loss_actor                     -260597207.528
memory_size                        19188.0135 

=== epoch 7/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  382
episode_length                      10.434555
returns                             -4.585748
return_std                           3.408082
average_reward                      -0.436305
round_time             0 days 00:05:50.827326
episodes_test                           968.0
episode_length_test                 10.327479
returns_test                        -4.560791
return_std_test                      3.545005
average_reward_test                  -0.44121
round_time_test        0 days 00:00:09.374641
round_time_total       0 days 00:05:50.828425
loss_total                 8449982766308655.0
loss_critic               10562478267330200.0
loss_actor                     -261114427.368
memory_size                         19221.169 

=== epoch 7/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:55,  5.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  565
episode_length                       10.60531
returns                             -4.747103
return_std                           3.647485
average_reward                      -0.446053
round_time             0 days 00:05:51.926963
episodes_test                           965.0
episode_length_test                 10.357513
returns_test                        -4.576131
return_std_test                      3.467238
average_reward_test                 -0.441435
round_time_test        0 days 00:00:09.257078
round_time_total       0 days 00:05:51.928072
loss_total                 8645369119232754.0
loss_critic               10806711206539166.0
loss_actor                      -261880428.84
memory_size                         19244.851 

=== epoch 7/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  764
episode_length                      10.469895
returns                             -4.791065
return_std                           3.662596
average_reward                      -0.457444
round_time             0 days 00:05:53.083997
episodes_test                           953.0
episode_length_test                 10.485834
returns_test                        -4.208937
return_std_test                      3.542759
average_reward_test                 -0.401459
round_time_test        0 days 00:00:09.140834
round_time_total       0 days 00:05:53.085077
loss_total                 8573842480460464.0
loss_critic               10717302920925152.0
loss_actor                     -261253562.416
memory_size                         19277.804 

=== epoch 7/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:28,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  959
episode_length                        10.4244
returns                             -4.724881
return_std                           3.542879
average_reward                       -0.45291
round_time             0 days 00:05:52.178704
episodes_test                           938.0
episode_length_test                 10.656716
returns_test                         -4.72604
return_std_test                      3.726078
average_reward_test                  -0.44314
round_time_test        0 days 00:00:09.351977
round_time_total       0 days 00:05:52.179912
loss_total                 8504085936223551.0
loss_critic               10630107243011376.0
loss_actor                       -259316845.4
memory_size                        19312.0345 

=== epoch 7/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:34,  5.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  960
episode_length                        10.4125
returns                             -4.660521
return_std                           3.470235
average_reward                      -0.447288
round_time             0 days 00:05:53.849692
episodes_test                           970.0
episode_length_test                 10.308247
returns_test                        -4.419314
return_std_test                      3.290929
average_reward_test                 -0.428583
round_time_test        0 days 00:00:09.198422
round_time_total       0 days 00:05:53.850783
loss_total                 8392478946112831.0
loss_critic               10490598500641800.0
loss_actor                     -258367165.464
memory_size                         19336.855 

=== epoch 7/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  966
episode_length                       10.34472
returns                             -4.748334
return_std                           3.660042
average_reward                      -0.458544
round_time             0 days 00:05:54.292599
episodes_test                           961.0
episode_length_test                 10.399584
returns_test                        -4.719416
return_std_test                      3.410697
average_reward_test                 -0.453521
round_time_test        0 days 00:00:09.278849
round_time_total       0 days 00:05:54.293686
loss_total                 8377001333261074.0
loss_critic               10471251483704688.0
loss_actor                     -256938309.824
memory_size                         19371.196 

=== epoch 7/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:36,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  973
episode_length                      10.264132
returns                             -4.805035
return_std                           3.547508
average_reward                      -0.467748
round_time             0 days 00:05:53.901433
episodes_test                           953.0
episode_length_test                 10.490031
returns_test                        -4.897894
return_std_test                      3.735647
average_reward_test                 -0.466673
round_time_test        0 days 00:00:09.343078
round_time_total       0 days 00:05:53.902524
loss_total                 8498703608567562.0
loss_critic               10623379321026248.0
loss_actor                     -257983702.528
memory_size                        19405.5985 

=== epoch 7/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:39,  4.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  963
episode_length                       10.36864
returns                             -4.674253
return_std                           3.543189
average_reward                      -0.450453
round_time             0 days 00:05:54.919179
episodes_test                           971.0
episode_length_test                 10.296601
returns_test                        -4.636096
return_std_test                      3.543534
average_reward_test                 -0.450052
round_time_test        0 days 00:00:09.319417
round_time_total       0 days 00:05:54.920397
loss_total                 8461880393996435.0
loss_critic               10577350315026154.0
loss_actor                     -258526322.552
memory_size                        19431.5715 

=== epoch 7/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:28,  5.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  970
episode_length                      10.298969
returns                             -4.683199
return_std                           3.625031
average_reward                      -0.454752
round_time             0 days 00:05:53.766623
episodes_test                           965.0
episode_length_test                 10.358549
returns_test                        -4.608601
return_std_test                      3.556112
average_reward_test                 -0.444698
round_time_test        0 days 00:00:09.360057
round_time_total       0 days 00:05:53.767722
loss_total                 8474158640209592.0
loss_critic               10592698115712614.0
loss_actor                     -257363907.968
memory_size                        19464.5015 

=== epoch 7/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:04,  4.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  976
episode_length                      10.231557
returns                             -4.657861
return_std                            3.59342
average_reward                      -0.454863
round_time             0 days 00:05:53.259957
episodes_test                           963.0
episode_length_test                 10.372793
returns_test                        -5.172627
return_std_test                      3.678012
average_reward_test                 -0.498092
round_time_test        0 days 00:00:09.282776
round_time_total       0 days 00:05:53.261518
loss_total                 8443222839435198.0
loss_critic               10554028377528860.0
loss_actor                     -257880351.176
memory_size                        19481.0855 

=== epoch 7/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 7/2000 [00:01<06:35,  5.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  977
episode_length                      10.230297
returns                             -4.670063
return_std                           3.478624
average_reward                      -0.456617
round_time             0 days 00:05:54.732778
episodes_test                           962.0
episode_length_test                 10.393971
returns_test                        -4.674769
return_std_test                      3.607296
average_reward_test                 -0.449633
round_time_test        0 days 00:00:09.236493
round_time_total       0 days 00:05:54.733890
loss_total                 8571549717261976.0
loss_critic               10714436962531410.0
loss_actor                     -259107143.088
memory_size                        19510.8915 

=== epoch 7/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:32,  5.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  989
episode_length                      10.105157
returns                               -4.4285
return_std                           3.513452
average_reward                      -0.438112
round_time             0 days 00:05:55.203820
episodes_test                           963.0
episode_length_test                 10.379024
returns_test                        -4.652356
return_std_test                      3.671966
average_reward_test                 -0.448024
round_time_test        0 days 00:00:09.223992
round_time_total       0 days 00:05:55.204909
loss_total                 8543740196070359.0
loss_critic               10679675043694248.0
loss_actor                     -258395974.296
memory_size                        19543.7645 

=== epoch 7/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:36,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  996
episode_length                        10.0251
returns                             -4.523106
return_std                           3.432568
average_reward                      -0.451596
round_time             0 days 00:05:55.073627
episodes_test                           956.0
episode_length_test                 10.453975
returns_test                        -4.512287
return_std_test                      3.404819
average_reward_test                 -0.430975
round_time_test        0 days 00:00:09.201070
round_time_total       0 days 00:05:55.074793
loss_total                 8489400418151956.0
loss_critic               10611750342603310.0
loss_actor                     -259603285.416
memory_size                        19564.1605 

=== epoch 7/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:00,  5.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  987
episode_length                      10.118541
returns                             -4.538072
return_std                           3.440084
average_reward                      -0.449181
round_time             0 days 00:05:55.869156
episodes_test                           997.0
episode_length_test                 10.023069
returns_test                        -4.730832
return_std_test                       3.34589
average_reward_test                 -0.471574
round_time_test        0 days 00:00:09.191672
round_time_total       0 days 00:05:55.870619
loss_total                 8566334019170468.0
loss_critic               10707917335856940.0
loss_actor                     -259882263.872
memory_size                        19575.5975 

=== epoch 7/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:28,  5.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  983
episode_length                      10.157681
returns                             -4.662015
return_std                           3.464671
average_reward                      -0.459616
round_time             0 days 00:05:53.650972
episodes_test                          1006.0
episode_length_test                  9.934394
returns_test                        -4.489614
return_std_test                       3.32674
average_reward_test                  -0.45121
round_time_test        0 days 00:00:09.340250
round_time_total       0 days 00:05:53.652080
loss_total                 8676867934653514.0
loss_critic               10846084736183436.0
loss_actor                     -261329850.008
memory_size                        19589.9285 

=== epoch 7/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:05,  4.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  983
episode_length                      10.156663
returns                             -4.512963
return_std                           3.387765
average_reward                      -0.444353
round_time             0 days 00:05:55.832597
episodes_test                           972.0
episode_length_test                 10.283951
returns_test                        -4.681063
return_std_test                      3.819044
average_reward_test                 -0.454965
round_time_test        0 days 00:00:09.361458
round_time_total       0 days 00:05:55.833713
loss_total                 8612953477007540.0
loss_critic               10766191658690150.0
loss_actor                      -260897278.76
memory_size                         19613.538 

=== epoch 7/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  976
episode_length                      10.218238
returns                             -4.610301
return_std                           3.336791
average_reward                      -0.450503
round_time             0 days 00:05:54.647455
episodes_test                           969.0
episode_length_test                 10.318885
returns_test                        -4.663411
return_std_test                      3.533707
average_reward_test                 -0.451777
round_time_test        0 days 00:00:09.348562
round_time_total       0 days 00:05:54.648566
loss_total                 8712476764267348.0
loss_critic               10890595764510130.0
loss_actor                     -261369065.064
memory_size                        19632.5285 

=== epoch 7/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  970
episode_length                      10.307216
returns                             -4.667947
return_std                           3.528692
average_reward                      -0.453039
round_time             0 days 00:05:55.230485
episodes_test                           942.0
episode_length_test                  10.61465
returns_test                        -4.524461
return_std_test                      3.592571
average_reward_test                 -0.426117
round_time_test        0 days 00:00:09.310618
round_time_total       0 days 00:05:55.231551
loss_total                 8704059151547892.0
loss_critic               10880073750154314.0
loss_actor                     -261828221.528
memory_size                        19658.3665 

=== epoch 7/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  967
episode_length                      10.327818
returns                              -4.68055
return_std                            3.58291
average_reward                      -0.453018
round_time             0 days 00:05:55.725661
episodes_test                           945.0
episode_length_test                 10.575661
returns_test                        -4.863442
return_std_test                      3.537973
average_reward_test                 -0.459273
round_time_test        0 days 00:00:09.370870
round_time_total       0 days 00:05:55.726754
loss_total                 8648072512040600.0
loss_critic               10810090460500984.0
loss_actor                      -262602736.32
memory_size                        19677.3905 

=== epoch 7/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:10,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  964
episode_length                      10.371369
returns                             -4.592575
return_std                           3.589588
average_reward                      -0.442641
round_time             0 days 00:05:56.079611
episodes_test                           964.0
episode_length_test                 10.364108
returns_test                        -4.805797
return_std_test                      3.418558
average_reward_test                 -0.463678
round_time_test        0 days 00:00:09.262849
round_time_total       0 days 00:05:56.080734
loss_total                 8728129771235443.0
loss_critic               10910162025703276.0
loss_actor                       -261918790.8
memory_size                         19712.885 

=== epoch 7/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:15,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  964
episode_length                      10.360996
returns                             -4.700419
return_std                           3.603858
average_reward                      -0.454484
round_time             0 days 00:05:56.698977
episodes_test                           968.0
episode_length_test                 10.330579
returns_test                        -4.520107
return_std_test                      3.620701
average_reward_test                 -0.437546
round_time_test        0 days 00:00:09.455840
round_time_total       0 days 00:05:56.700073
loss_total                 8801067410112119.0
loss_critic               11001334076010398.0
loss_actor                     -262354464.248
memory_size                         19735.443 

=== epoch 7/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:09,  4.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  969
episode_length                      10.304438
returns                             -4.673865
return_std                           3.590611
average_reward                      -0.453094
round_time             0 days 00:05:55.007439
episodes_test                           976.0
episode_length_test                 10.245902
returns_test                        -4.482986
return_std_test                      3.535516
average_reward_test                 -0.437539
round_time_test        0 days 00:00:09.338619
round_time_total       0 days 00:05:55.008886
loss_total                 8679679787196744.0
loss_critic               10849599548439920.0
loss_actor                     -262045719.432
memory_size                        19762.5105 

=== epoch 7/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:41,  4.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  973
episode_length                      10.273381
returns                             -4.571291
return_std                             3.4708
average_reward                      -0.444488
round_time             0 days 00:05:54.652304
episodes_test                           929.0
episode_length_test                 10.756728
returns_test                        -4.905609
return_std_test                      3.803752
average_reward_test                  -0.45543
round_time_test        0 days 00:00:09.192957
round_time_total       0 days 00:05:54.653468
loss_total                 8851722187292279.0
loss_critic               11064652553659614.0
loss_actor                     -262886968.544
memory_size                        19799.6545 

=== epoch 7/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  972
episode_length                      10.281893
returns                             -4.543277
return_std                            3.48208
average_reward                      -0.441499
round_time             0 days 00:05:56.210068
episodes_test                           977.0
episode_length_test                 10.229273
returns_test                        -4.970604
return_std_test                      3.548819
average_reward_test                 -0.485693
round_time_test        0 days 00:00:09.226307
round_time_total       0 days 00:05:56.211220
loss_total                 8780396502255141.0
loss_critic               10975495431022182.0
loss_actor                     -262385796.008
memory_size                        19818.8005 

=== epoch 7/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:20,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  969
episode_length                      10.303406
returns                             -4.551412
return_std                           3.510569
average_reward                      -0.441693
round_time             0 days 00:05:56.703305
episodes_test                           958.0
episode_length_test                  10.43215
returns_test                        -4.521558
return_std_test                      3.621407
average_reward_test                 -0.433473
round_time_test        0 days 00:00:09.048304
round_time_total       0 days 00:05:56.704450
loss_total                 8787608964868080.0
loss_critic               10984511011502948.0
loss_actor                     -264142611.504
memory_size                        19865.8415 

=== epoch 7/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  965
episode_length                      10.357513
returns                              -4.44961
return_std                           3.622796
average_reward                      -0.429426
round_time             0 days 00:05:56.676551
episodes_test                           950.0
episode_length_test                 10.526316
returns_test                        -4.728818
return_std_test                      3.617557
average_reward_test                 -0.449238
round_time_test        0 days 00:00:09.265020
round_time_total       0 days 00:05:56.677704
loss_total                 8921171536543482.0
loss_critic               11151464227137388.0
loss_actor                     -263374450.304
memory_size                        19907.7085 

=== epoch 7/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:23,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  961
episode_length                      10.381894
returns                             -4.447589
return_std                           3.551844
average_reward                      -0.429831
round_time             0 days 00:05:56.384608
episodes_test                           943.0
episode_length_test                 10.598091
returns_test                        -4.617049
return_std_test                      3.638804
average_reward_test                 -0.435314
round_time_test        0 days 00:00:09.383794
round_time_total       0 days 00:05:56.385770
loss_total                 8903010536573108.0
loss_critic               11128762980731192.0
loss_actor                      -266223286.44
memory_size                        19933.8845 

=== epoch 7/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:54,  5.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.63it/s]
episodes                                  962
episode_length                      10.382536
returns                             -4.439366
return_std                           3.582516
average_reward                      -0.427827
round_time             0 days 00:05:55.488273
episodes_test                           950.0
episode_length_test                 10.525263
returns_test                        -4.789804
return_std_test                      3.637861
average_reward_test                 -0.454933
round_time_test        0 days 00:00:09.358969
round_time_total       0 days 00:05:55.489452
loss_total                 9072607855944664.0
loss_critic               11340759618503574.0
loss_actor                     -265017867.976
memory_size                         19963.769 

=== epoch 7/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  968
episode_length                      10.308884
returns                             -4.376779
return_std                           3.509525
average_reward                      -0.423765
round_time             0 days 00:05:55.936077
episodes_test                           937.0
episode_length_test                 10.672359
returns_test                        -4.879885
return_std_test                      3.723845
average_reward_test                 -0.457245
round_time_test        0 days 00:00:09.255679
round_time_total       0 days 00:05:55.937177
loss_total                 8898377939213615.0
loss_critic               11122972216180868.0
loss_actor                     -263311335.584
memory_size                        19984.8155 

=== epoch 7/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  960
episode_length                      10.401042
returns                             -4.452709
return_std                           3.544553
average_reward                      -0.427985
round_time             0 days 00:05:54.884816
episodes_test                           936.0
episode_length_test                 10.680556
returns_test                        -4.404419
return_std_test                      3.777028
average_reward_test                 -0.412022
round_time_test        0 days 00:00:09.239109
round_time_total       0 days 00:05:54.885889
loss_total                 8749886157719863.0
loss_critic               10937357500655076.0
loss_actor                     -260990328.288
memory_size                        20022.0335 

=== epoch 7/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:58,  5.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  960
episode_length                       10.40625
returns                             -4.542543
return_std                           3.499281
average_reward                      -0.435944
round_time             0 days 00:05:56.144164
episodes_test                           959.0
episode_length_test                 10.419187
returns_test                        -5.145008
return_std_test                      3.544354
average_reward_test                 -0.493507
round_time_test        0 days 00:00:09.430128
round_time_total       0 days 00:05:56.145380
loss_total                 8910849192532902.0
loss_critic               11138561287997358.0
loss_actor                      -262993345.44
memory_size                        20073.4045 

=== epoch 7/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:42,  4.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  951
episode_length                      10.505783
returns                             -4.549117
return_std                           3.593727
average_reward                      -0.432838
round_time             0 days 00:05:56.861057
episodes_test                           971.0
episode_length_test                 10.297631
returns_test                        -4.774862
return_std_test                      3.351571
average_reward_test                 -0.463542
round_time_test        0 days 00:00:09.288538
round_time_total       0 days 00:05:56.862162
loss_total                 8973546207204868.0
loss_critic               11216932569859752.0
loss_actor                     -262118235.168
memory_size                        20115.5385 

=== epoch 7/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:34,  5.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  947
episode_length                       10.54699
returns                             -4.582886
return_std                           3.588876
average_reward                      -0.434759
round_time             0 days 00:05:54.831931
episodes_test                           965.0
episode_length_test                 10.361658
returns_test                        -5.029301
return_std_test                      3.741694
average_reward_test                 -0.485222
round_time_test        0 days 00:00:09.288662
round_time_total       0 days 00:05:54.833011
loss_total                 8815037103969665.0
loss_critic               11018796189372908.0
loss_actor                     -262016345.368
memory_size                         20137.675 

=== epoch 7/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:23,  5.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  940
episode_length                       10.62766
returns                              -4.67121
return_std                           3.539358
average_reward                      -0.439982
round_time             0 days 00:05:59.579567
episodes_test                           928.0
episode_length_test                 10.771552
returns_test                        -4.915269
return_std_test                      3.721443
average_reward_test                 -0.455881
round_time_test        0 days 00:00:09.628708
round_time_total       0 days 00:05:59.580660
loss_total                 8949544963855614.0
loss_critic               11186931009432060.0
loss_actor                     -262373223.184
memory_size                          20175.56 

=== epoch 7/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  941
episode_length                      10.621679
returns                             -4.712404
return_std                           3.573163
average_reward                      -0.443596
round_time             0 days 00:05:58.536897
episodes_test                           984.0
episode_length_test                 10.153455
returns_test                        -4.887018
return_std_test                      3.291038
average_reward_test                 -0.481312
round_time_test        0 days 00:00:09.625741
round_time_total       0 days 00:05:58.538201
loss_total                 8969728893019750.0
loss_critic               11212160912465068.0
loss_actor                     -261259641.984
memory_size                         20215.325 

=== epoch 7/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:19,  4.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  940
episode_length                      10.625532
returns                             -4.701876
return_std                           3.552525
average_reward                      -0.442123
round_time             0 days 00:05:58.013392
episodes_test                           958.0
episode_length_test                 10.438413
returns_test                         -4.91406
return_std_test                      3.754413
average_reward_test                 -0.470767
round_time_test        0 days 00:00:09.379703
round_time_total       0 days 00:05:58.014465
loss_total                 8805836012577096.0
loss_critic               11007294829863370.0
loss_actor                     -259668061.176
memory_size                        20270.1605 

=== epoch 7/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:00,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  948
episode_length                      10.545359
returns                             -4.734228
return_std                           3.560382
average_reward                      -0.448986
round_time             0 days 00:05:59.730964
episodes_test                           930.0
episode_length_test                 10.745161
returns_test                        -4.873097
return_std_test                      3.703748
average_reward_test                 -0.452703
round_time_test        0 days 00:00:09.380452
round_time_total       0 days 00:05:59.732138
loss_total                 8713106165149467.0
loss_critic               10891382519840636.0
loss_actor                     -257755056.096
memory_size                         20299.059 

=== epoch 7/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:03,  4.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  949
episode_length                      10.533193
returns                             -4.762783
return_std                           3.541156
average_reward                      -0.452278
round_time             0 days 00:05:58.344337
episodes_test                           965.0
episode_length_test                 10.353368
returns_test                        -4.519144
return_std_test                      3.600129
average_reward_test                 -0.436777
round_time_test        0 days 00:00:09.341809
round_time_total       0 days 00:05:58.345459
loss_total                 8670653212002353.0
loss_critic               10838316333272138.0
loss_actor                     -257097597.424
memory_size                        20351.9485 

=== epoch 7/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  947
episode_length                      10.545935
returns                             -4.734619
return_std                           3.614231
average_reward                      -0.448044
round_time             0 days 00:06:00.354863
episodes_test                           959.0
episode_length_test                  10.41293
returns_test                        -4.652215
return_std_test                      3.550721
average_reward_test                 -0.446973
round_time_test        0 days 00:00:09.257987
round_time_total       0 days 00:06:00.355970
loss_total                 8639582069292270.0
loss_critic               10799477393106928.0
loss_actor                     -257170369.808
memory_size                        20409.6435 

=== epoch 7/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:02,  4.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  956
episode_length                      10.451883
returns                             -4.639406
return_std                           3.604509
average_reward                      -0.444002
round_time             0 days 00:06:00.110457
episodes_test                           956.0
episode_length_test                 10.458159
returns_test                        -5.067961
return_std_test                      3.774371
average_reward_test                 -0.484331
round_time_test        0 days 00:00:09.392393
round_time_total       0 days 00:06:00.111536
loss_total                 8590391053071155.0
loss_critic               10737988624541810.0
loss_actor                      -256524090.28
memory_size                         20426.769 

=== epoch 7/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:32,  5.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  960
episode_length                       10.40625
returns                             -4.611504
return_std                           3.547426
average_reward                       -0.44265
round_time             0 days 00:05:58.330286
episodes_test                           954.0
episode_length_test                 10.474843
returns_test                        -4.639802
return_std_test                      3.437078
average_reward_test                 -0.442172
round_time_test        0 days 00:00:09.230547
round_time_total       0 days 00:05:58.331490
loss_total                 8504270013287367.0
loss_critic               10630337333402010.0
loss_actor                     -255056742.728
memory_size                          20448.21 

=== epoch 7/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  958
episode_length                      10.430063
returns                             -4.585212
return_std                           3.651159
average_reward                      -0.439662
round_time             0 days 00:05:58.110721
episodes_test                           955.0
episode_length_test                  10.46911
returns_test                        -4.667139
return_std_test                      3.552123
average_reward_test                 -0.445596
round_time_test        0 days 00:00:09.265102
round_time_total       0 days 00:05:58.111836
loss_total                 8383342981054202.0
loss_critic               10479178542171030.0
loss_actor                     -254162302.088
memory_size                        20464.8835 

=== epoch 7/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:32,  5.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  956
episode_length                      10.438285
returns                             -4.466007
return_std                           3.662681
average_reward                      -0.427593
round_time             0 days 00:05:59.745522
episodes_test                           941.0
episode_length_test                 10.623804
returns_test                        -4.758097
return_std_test                      3.678422
average_reward_test                 -0.447474
round_time_test        0 days 00:00:09.382377
round_time_total       0 days 00:05:59.746611
loss_total                 8340130455827251.0
loss_critic               10425162876980298.0
loss_actor                     -253976531.448
memory_size                        20496.5685 

=== epoch 7/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  963
episode_length                       10.36864
returns                             -4.441708
return_std                            3.62292
average_reward                      -0.427814
round_time             0 days 00:06:00.125923
episodes_test                           942.0
episode_length_test                  10.61465
returns_test                        -4.670138
return_std_test                      3.576772
average_reward_test                 -0.439866
round_time_test        0 days 00:00:09.259941
round_time_total       0 days 00:06:00.127048
loss_total                 8321052146002821.0
loss_critic               10401315003557740.0
loss_actor                     -253174520.848
memory_size                         20514.854 

=== epoch 7/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:29,  5.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  965
episode_length                      10.348187
returns                             -4.502989
return_std                           3.655529
average_reward                      -0.435449
round_time             0 days 00:05:57.732178
episodes_test                           932.0
episode_length_test                 10.726395
returns_test                        -4.572612
return_std_test                      3.581264
average_reward_test                 -0.425886
round_time_test        0 days 00:00:09.270003
round_time_total       0 days 00:05:57.733274
loss_total                 8356182276815454.0
loss_critic               10445227665228038.0
loss_actor                     -251513188.592
memory_size                        20537.2285 

=== epoch 7/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  957
episode_length                      10.429467
returns                             -4.494782
return_std                           3.673754
average_reward                      -0.431414
round_time             0 days 00:05:58.883266
episodes_test                           929.0
episode_length_test                 10.761033
returns_test                        -4.602005
return_std_test                      3.822968
average_reward_test                 -0.427248
round_time_test        0 days 00:00:09.241097
round_time_total       0 days 00:05:58.884475
loss_total                 8212739520496402.0
loss_critic               10265924228184276.0
loss_actor                      -251810278.16
memory_size                         20564.964 

=== epoch 7/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:46,  4.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.59it/s]
episodes                                  954
episode_length                      10.473795
returns                             -4.380516
return_std                           3.528837
average_reward                      -0.418449
round_time             0 days 00:05:58.529602
episodes_test                           977.0
episode_length_test                  10.23132
returns_test                        -4.398903
return_std_test                      3.322539
average_reward_test                 -0.429673
round_time_test        0 days 00:00:09.401993
round_time_total       0 days 00:05:58.530684
loss_total                 8184106940897427.0
loss_critic               10230133498786612.0
loss_actor                     -250787131.176
memory_size                        20590.7165 

=== epoch 7/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:48,  4.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  956
episode_length                      10.439331
returns                             -4.480441
return_std                           3.515097
average_reward                      -0.428773
round_time             0 days 00:05:59.089922
episodes_test                           960.0
episode_length_test                 10.408333
returns_test                        -4.513732
return_std_test                       3.49193
average_reward_test                 -0.433558
round_time_test        0 days 00:00:09.249262
round_time_total       0 days 00:05:59.091000
loss_total                 8193176120825217.0
loss_critic               10241469970206688.0
loss_actor                     -250740205.496
memory_size                        20626.6455 

=== epoch 7/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:13,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  954
episode_length                      10.474843
returns                             -4.580551
return_std                           3.554273
average_reward                      -0.437509
round_time             0 days 00:05:58.361745
episodes_test                           930.0
episode_length_test                 10.744086
returns_test                        -4.784975
return_std_test                      3.485485
average_reward_test                 -0.445425
round_time_test        0 days 00:00:09.168531
round_time_total       0 days 00:05:58.362864
loss_total                 8147291880053277.0
loss_critic               10184114672530096.0
loss_actor                     -250072430.888
memory_size                        20652.5715 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  1%|          | 11/2000 [00:02<06:16,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                  193
episode_length                      10.352332
returns                             -4.832745
return_std                            2.92661
average_reward                       -0.46596
round_time             0 days 00:05:50.695250
episodes_test                           946.0
episode_length_test                  10.55814
returns_test                        -4.456175
return_std_test                      3.614964
average_reward_test                 -0.421984
round_time_test        0 days 00:00:09.259824
round_time_total       0 days 00:05:50.696496
loss_total                 8098653765306941.0
loss_critic               10123317030238028.0
loss_actor                     -249901947.864
memory_size                        20676.2415 

=== epoch 8/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:34,  5.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  383
episode_length                      10.417755
returns                             -4.821049
return_std                           3.375421
average_reward                      -0.459926
round_time             0 days 00:05:51.909105
episodes_test                           957.0
episode_length_test                 10.443051
returns_test                        -4.579469
return_std_test                      3.389542
average_reward_test                  -0.43796
round_time_test        0 days 00:00:09.337964
round_time_total       0 days 00:05:51.910204
loss_total                 8068254058323902.0
loss_critic               10085317390301660.0
loss_actor                     -249074426.392
memory_size                          20703.95 

=== epoch 8/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.35it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.70it/s]
episodes                                  571
episode_length                      10.507881
returns                             -4.718455
return_std                           3.629423
average_reward                       -0.44904
round_time             0 days 00:05:51.474840
episodes_test                           938.0
episode_length_test                 10.657783
returns_test                         -4.62428
return_std_test                      3.659383
average_reward_test                 -0.433677
round_time_test        0 days 00:00:09.316299
round_time_total       0 days 00:05:51.475921
loss_total                 7967856898227569.0
loss_critic                9959820962494940.0
loss_actor                       -249282152.0
memory_size                          20728.82 

=== epoch 8/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  765
episode_length                       10.45098
returns                             -4.739942
return_std                            3.66622
average_reward                      -0.453355
round_time             0 days 00:05:54.407429
episodes_test                           959.0
episode_length_test                 10.416058
returns_test                        -4.472496
return_std_test                      3.521657
average_reward_test                 -0.428097
round_time_test        0 days 00:00:09.157694
round_time_total       0 days 00:05:54.408563
loss_total                 7868480866605859.0
loss_critic                9835600915619382.0
loss_actor                     -247949659.592
memory_size                         20749.347 

=== epoch 8/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  961
episode_length                      10.400624
returns                             -4.673567
return_std                           3.655234
average_reward                      -0.449173
round_time             0 days 00:05:53.680870
episodes_test                           953.0
episode_length_test                 10.483736
returns_test                        -4.760323
return_std_test                       3.46008
average_reward_test                  -0.45423
round_time_test        0 days 00:00:09.145112
round_time_total       0 days 00:05:53.682100
loss_total                 7943841740392235.0
loss_critic                9929802000604594.0
loss_actor                     -246475333.176
memory_size                         20758.702 

=== epoch 8/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:08,  5.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.68it/s]
episodes                                  952
episode_length                      10.491597
returns                             -4.651263
return_std                           3.812499
average_reward                      -0.443573
round_time             0 days 00:05:52.553137
episodes_test                           913.0
episode_length_test                 10.951807
returns_test                        -5.138674
return_std_test                      3.846106
average_reward_test                 -0.469064
round_time_test        0 days 00:00:09.193496
round_time_total       0 days 00:05:52.554318
loss_total                 7819464257866564.0
loss_critic                9774330164157612.0
loss_actor                     -247465505.952
memory_size                        20795.6155 

=== epoch 8/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:48,  5.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  954
episode_length                      10.474843
returns                             -4.621601
return_std                           3.765235
average_reward                      -0.441005
round_time             0 days 00:05:54.320214
episodes_test                           965.0
episode_length_test                 10.360622
returns_test                        -4.502754
return_std_test                      3.289368
average_reward_test                 -0.434352
round_time_test        0 days 00:00:09.128570
round_time_total       0 days 00:05:54.321703
loss_total                 7837740020571046.0
loss_critic                9797174857505440.0
loss_actor                     -247145355.896
memory_size                        20848.1375 

=== epoch 8/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  960
episode_length                       10.40625
returns                             -4.596922
return_std                           3.653853
average_reward                      -0.441278
round_time             0 days 00:05:55.288179
episodes_test                           928.0
episode_length_test                 10.772629
returns_test                        -4.682852
return_std_test                      3.798244
average_reward_test                 -0.434641
round_time_test        0 days 00:00:09.254290
round_time_total       0 days 00:05:55.289332
loss_total                 7919081214202348.0
loss_critic                9898851341793492.0
loss_actor                      -246898510.72
memory_size                        20892.5555 

=== epoch 8/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  953
episode_length                       10.47639
returns                             -4.531644
return_std                           3.560709
average_reward                      -0.432362
round_time             0 days 00:05:55.546097
episodes_test                           982.0
episode_length_test                 10.175153
returns_test                        -4.893518
return_std_test                      3.414112
average_reward_test                 -0.480752
round_time_test        0 days 00:00:09.334376
round_time_total       0 days 00:05:55.547186
loss_total                 7870839029808234.0
loss_critic                9838548621870498.0
loss_actor                     -246795559.984
memory_size                         20906.876 

=== epoch 8/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:56,  4.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  941
episode_length                      10.618491
returns                             -4.702354
return_std                           3.684285
average_reward                      -0.442734
round_time             0 days 00:05:54.741138
episodes_test                           937.0
episode_length_test                 10.669157
returns_test                         -4.91203
return_std_test                      3.723202
average_reward_test                 -0.460197
round_time_test        0 days 00:00:09.311673
round_time_total       0 days 00:05:54.742318
loss_total                 7833516913983488.0
loss_critic                9791895967023236.0
loss_actor                     -246946281.104
memory_size                        20919.9925 

=== epoch 8/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  943
episode_length                      10.601273
returns                             -4.755139
return_std                           3.623932
average_reward                       -0.44847
round_time             0 days 00:05:54.245162
episodes_test                           946.0
episode_length_test                 10.565539
returns_test                        -4.705307
return_std_test                      3.666075
average_reward_test                 -0.444885
round_time_test        0 days 00:00:09.203459
round_time_total       0 days 00:05:54.246357
loss_total                 7808731618582987.0
loss_critic                9760914355993444.0
loss_actor                      -246029988.12
memory_size                        20960.6465 

=== epoch 8/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:58,  5.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  945
episode_length                      10.567196
returns                             -4.857104
return_std                           3.616492
average_reward                      -0.459665
round_time             0 days 00:05:54.238711
episodes_test                           960.0
episode_length_test                  10.40625
returns_test                        -4.593878
return_std_test                      3.501978
average_reward_test                 -0.440861
round_time_test        0 days 00:00:09.320907
round_time_total       0 days 00:05:54.239817
loss_total                 7831723877477646.0
loss_critic                9789654693905956.0
loss_actor                     -246098859.888
memory_size                          20982.98 

=== epoch 8/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  945
episode_length                      10.571429
returns                             -4.869619
return_std                           3.552901
average_reward                      -0.460748
round_time             0 days 00:05:57.100866
episodes_test                           920.0
episode_length_test                  10.86413
returns_test                         -4.72122
return_std_test                      3.647546
average_reward_test                 -0.434002
round_time_test        0 days 00:00:09.217455
round_time_total       0 days 00:05:57.102148
loss_total                 7882617480340832.0
loss_critic                9853271680103744.0
loss_actor                     -246910036.016
memory_size                        20998.5695 

=== epoch 8/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  952
episode_length                      10.489496
returns                             -4.900082
return_std                           3.530823
average_reward                      -0.466177
round_time             0 days 00:05:53.978960
episodes_test                           958.0
episode_length_test                 10.431106
returns_test                        -4.377911
return_std_test                      3.341055
average_reward_test                 -0.419804
round_time_test        0 days 00:00:09.265136
round_time_total       0 days 00:05:53.980072
loss_total                 7849649954548089.0
loss_critic                9812062271923290.0
loss_actor                     -246335294.032
memory_size                          21017.72 

=== epoch 8/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:31,  5.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  968
episode_length                      10.322314
returns                             -4.831682
return_std                             3.4142
average_reward                      -0.468406
round_time             0 days 00:05:55.799201
episodes_test                           953.0
episode_length_test                 10.485834
returns_test                        -4.555196
return_std_test                      3.484512
average_reward_test                 -0.434313
round_time_test        0 days 00:00:09.223743
round_time_total       0 days 00:05:55.800309
loss_total                 7815966042571670.0
loss_critic                9769957382254756.0
loss_actor                     -245986428.256
memory_size                        21035.8665 

=== epoch 8/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  962
episode_length                      10.386694
returns                             -4.716472
return_std                           3.468167
average_reward                      -0.454352
round_time             0 days 00:05:56.177887
episodes_test                           919.0
episode_length_test                  10.87704
returns_test                        -4.735319
return_std_test                      3.586878
average_reward_test                  -0.43528
round_time_test        0 days 00:00:09.294431
round_time_total       0 days 00:05:56.179140
loss_total                 7745871483603255.0
loss_critic                9682339192570380.0
loss_actor                     -246224679.472
memory_size                         21067.328 

=== epoch 8/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:29,  5.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  956
episode_length                      10.453975
returns                             -4.614024
return_std                           3.468348
average_reward                      -0.441325
round_time             0 days 00:05:54.185960
episodes_test                           945.0
episode_length_test                 10.575661
returns_test                        -4.895461
return_std_test                      3.624979
average_reward_test                 -0.462966
round_time_test        0 days 00:00:09.208244
round_time_total       0 days 00:05:54.187064
loss_total                 7869629778007949.0
loss_critic                9837037052825174.0
loss_actor                     -245449370.368
memory_size                        21118.3975 

=== epoch 8/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:19,  5.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  953
episode_length                      10.481637
returns                             -4.694925
return_std                           3.519244
average_reward                      -0.447884
round_time             0 days 00:05:55.736645
episodes_test                           949.0
episode_length_test                 10.531085
returns_test                        -4.606405
return_std_test                      3.458354
average_reward_test                 -0.437158
round_time_test        0 days 00:00:09.147253
round_time_total       0 days 00:05:55.737930
loss_total                 7769466686381490.0
loss_critic                9711833191613988.0
loss_actor                     -245090101.976
memory_size                        21155.3265 

=== epoch 8/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  946
episode_length                      10.562368
returns                             -4.689833
return_std                           3.568826
average_reward                      -0.444774
round_time             0 days 00:05:55.601796
episodes_test                           968.0
episode_length_test                 10.327479
returns_test                        -4.895203
return_std_test                      3.417268
average_reward_test                 -0.473673
round_time_test        0 days 00:00:09.259802
round_time_total       0 days 00:05:55.602868
loss_total                 7836434412924305.0
loss_critic                9795542853550604.0
loss_actor                      -245461243.24
memory_size                        21189.8575 

=== epoch 8/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:49,  5.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  933
episode_length                      10.705252
returns                              -4.72229
return_std                           3.587879
average_reward                      -0.440932
round_time             0 days 00:05:55.805370
episodes_test                           962.0
episode_length_test                 10.393971
returns_test                        -4.502303
return_std_test                      3.415016
average_reward_test                 -0.433024
round_time_test        0 days 00:00:09.235446
round_time_total       0 days 00:05:55.806463
loss_total                 7813570691663397.0
loss_critic                9766963194122732.0
loss_actor                     -244598461.464
memory_size                        21226.7585 

=== epoch 8/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  929
episode_length                      10.754575
returns                             -4.715326
return_std                           3.567305
average_reward                      -0.438328
round_time             0 days 00:05:55.753509
episodes_test                           950.0
episode_length_test                 10.525263
returns_test                        -4.771303
return_std_test                      3.635264
average_reward_test                 -0.453187
round_time_test        0 days 00:00:09.337434
round_time_total       0 days 00:05:55.754773
loss_total                 7767220301447823.0
loss_critic                9709025210312688.0
loss_actor                       -243773746.0
memory_size                         21264.481 

=== epoch 8/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:42,  4.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  922
episode_length                      10.838395
returns                             -4.826957
return_std                           3.599996
average_reward                      -0.445695
round_time             0 days 00:05:58.351666
episodes_test                           926.0
episode_length_test                 10.787257
returns_test                        -4.594286
return_std_test                      3.641154
average_reward_test                 -0.425028
round_time_test        0 days 00:00:09.191494
round_time_total       0 days 00:05:58.352940
loss_total                 7683619059920798.0
loss_critic                9604523659477648.0
loss_actor                      -243125375.44
memory_size                         21320.046 

=== epoch 8/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:40,  4.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  914
episode_length                       10.92779
returns                              -4.79861
return_std                           3.706167
average_reward                      -0.437905
round_time             0 days 00:05:56.351853
episodes_test                           995.0
episode_length_test                 10.041206
returns_test                        -4.685882
return_std_test                      3.311496
average_reward_test                 -0.466046
round_time_test        0 days 00:00:09.284376
round_time_total       0 days 00:05:56.352953
loss_total                 7754454319553839.0
loss_critic                9693067737810600.0
loss_actor                     -241741956.592
memory_size                        21370.5375 

=== epoch 8/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<05:56,  5.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  913
episode_length                      10.947426
returns                             -4.867297
return_std                           3.681281
average_reward                      -0.443923
round_time             0 days 00:05:56.388063
episodes_test                           928.0
episode_length_test                 10.775862
returns_test                         -4.39558
return_std_test                      3.441375
average_reward_test                  -0.40791
round_time_test        0 days 00:00:09.161558
round_time_total       0 days 00:05:56.389332
loss_total                 7608819168187515.0
loss_critic                9511023793133322.0
loss_actor                      -241415635.12
memory_size                         21423.158 

=== epoch 8/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:49,  5.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  907
episode_length                      11.007718
returns                             -4.711035
return_std                           3.635554
average_reward                      -0.427048
round_time             0 days 00:05:56.797239
episodes_test                           939.0
episode_length_test                 10.644302
returns_test                        -4.940146
return_std_test                      3.560172
average_reward_test                 -0.463407
round_time_test        0 days 00:00:09.339254
round_time_total       0 days 00:05:56.798340
loss_total                 7622697015243178.0
loss_critic                9528371108462460.0
loss_actor                     -241549661.544
memory_size                          21459.08 

=== epoch 8/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:35,  5.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  907
episode_length                      11.005513
returns                             -4.739945
return_std                           3.737253
average_reward                      -0.430956
round_time             0 days 00:05:55.171971
episodes_test                           948.0
episode_length_test                 10.544304
returns_test                        -4.592239
return_std_test                      3.518863
average_reward_test                 -0.435288
round_time_test        0 days 00:00:09.298121
round_time_total       0 days 00:05:55.173055
loss_total                 7645340947592511.0
loss_critic                9556676023664246.0
loss_actor                      -239553781.52
memory_size                         21526.685 

=== epoch 8/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:40,  5.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  910
episode_length                      10.974725
returns                             -4.720914
return_std                           3.718333
average_reward                      -0.430919
round_time             0 days 00:05:56.078091
episodes_test                           922.0
episode_length_test                 10.844902
returns_test                        -4.739344
return_std_test                      3.454719
average_reward_test                 -0.436894
round_time_test        0 days 00:00:09.252645
round_time_total       0 days 00:05:56.079201
loss_total                 7536076342450717.0
loss_critic                9420095274316988.0
loss_actor                       -238673788.8
memory_size                         21577.475 

=== epoch 8/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  905
episode_length                      11.036464
returns                             -4.734182
return_std                           3.716986
average_reward                      -0.428893
round_time             0 days 00:05:56.801855
episodes_test                           877.0
episode_length_test                 11.391106
returns_test                        -4.800932
return_std_test                       3.73428
average_reward_test                 -0.421407
round_time_test        0 days 00:00:09.079723
round_time_total       0 days 00:05:56.803003
loss_total                 7516061826124087.0
loss_critic                9395077124982834.0
loss_actor                      -237821857.04
memory_size                        21614.1315 

=== epoch 8/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:42,  4.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  905
episode_length                      11.041989
returns                             -4.685821
return_std                            3.66487
average_reward                       -0.42471
round_time             0 days 00:05:56.339262
episodes_test                           966.0
episode_length_test                 10.341615
returns_test                        -4.679572
return_std_test                      3.198191
average_reward_test                 -0.451291
round_time_test        0 days 00:00:09.179689
round_time_total       0 days 00:05:56.340348
loss_total                 7448026663306134.0
loss_critic                9310033181795156.0
loss_actor                     -236368346.832
memory_size                        21668.4325 

=== epoch 8/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 10/2000 [00:01<05:51,  5.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  910
episode_length                      10.975824
returns                             -4.774293
return_std                           3.725391
average_reward                      -0.435177
round_time             0 days 00:05:59.322578
episodes_test                           930.0
episode_length_test                 10.746237
returns_test                        -4.690442
return_std_test                      3.494915
average_reward_test                 -0.436549
round_time_test        0 days 00:00:09.489945
round_time_total       0 days 00:05:59.324007
loss_total                 7331553026827616.0
loss_critic                9164441125862244.0
loss_actor                     -235305649.984
memory_size                        21700.1315 

=== epoch 8/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  917
episode_length                      10.889858
returns                             -4.743705
return_std                           3.673653
average_reward                      -0.435571
round_time             0 days 00:05:59.122534
episodes_test                           924.0
episode_length_test                   10.8171
returns_test                        -4.620633
return_std_test                      3.649544
average_reward_test                 -0.426841
round_time_test        0 days 00:00:09.240703
round_time_total       0 days 00:05:59.123675
loss_total                 7261160934805078.0
loss_critic                9076451020464194.0
loss_actor                      -233023977.16
memory_size                           21742.9 

=== epoch 8/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:04,  4.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  915
episode_length                      10.918033
returns                             -4.581537
return_std                           3.617926
average_reward                      -0.420343
round_time             0 days 00:05:58.429006
episodes_test                           934.0
episode_length_test                 10.699143
returns_test                        -4.558949
return_std_test                      3.520134
average_reward_test                 -0.426013
round_time_test        0 days 00:00:09.213517
round_time_total       0 days 00:05:58.430295
loss_total                 7177854299177746.0
loss_critic                8972317726701781.0
loss_actor                     -231951891.912
memory_size                        21775.9585 

=== epoch 8/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  912
episode_length                      10.945175
returns                             -4.583275
return_std                           3.509408
average_reward                      -0.418721
round_time             0 days 00:05:58.342091
episodes_test                           916.0
episode_length_test                 10.913755
returns_test                        -4.739275
return_std_test                      3.568094
average_reward_test                 -0.433944
round_time_test        0 days 00:00:09.291869
round_time_total       0 days 00:05:58.343208
loss_total                 7163043426177384.0
loss_critic                8953804130183283.0
loss_actor                     -231054780.112
memory_size                         21836.142 

=== epoch 8/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:23,  5.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  906
episode_length                      11.032009
returns                             -4.615628
return_std                           3.627199
average_reward                      -0.417862
round_time             0 days 00:05:57.333750
episodes_test                           914.0
episode_length_test                 10.939825
returns_test                        -4.276521
return_std_test                      3.608695
average_reward_test                 -0.390781
round_time_test        0 days 00:00:09.335194
round_time_total       0 days 00:05:57.334843
loss_total                 7080888164042146.0
loss_critic                8851110057010528.0
loss_actor                     -228208628.024
memory_size                         21892.723 

=== epoch 8/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:58,  5.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  901
episode_length                       11.09323
returns                             -4.596416
return_std                           3.611814
average_reward                      -0.414491
round_time             0 days 00:05:58.400085
episodes_test                           875.0
episode_length_test                 11.421714
returns_test                        -4.539598
return_std_test                      3.849359
average_reward_test                 -0.397158
round_time_test        0 days 00:00:09.090259
round_time_total       0 days 00:05:58.401168
loss_total                 6938238978361393.0
loss_critic                8672798580378960.0
loss_actor                     -227208530.576
memory_size                         21934.108 

=== epoch 8/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:31,  5.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  894
episode_length                      11.166667
returns                             -4.878358
return_std                           3.705782
average_reward                      -0.437221
round_time             0 days 00:05:59.372044
episodes_test                           891.0
episode_length_test                 11.203143
returns_test                        -4.691098
return_std_test                      3.600477
average_reward_test                 -0.418002
round_time_test        0 days 00:00:09.182040
round_time_total       0 days 00:05:59.373133
loss_total                 6982166794421142.0
loss_critic                8727708347567964.0
loss_actor                     -226699295.488
memory_size                         21973.487 

=== epoch 8/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:02,  4.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  895
episode_length                      11.160894
returns                             -5.000502
return_std                            3.82739
average_reward                      -0.447056
round_time             0 days 00:05:58.777633
episodes_test                           895.0
episode_length_test                 11.173184
returns_test                        -4.630288
return_std_test                      3.583242
average_reward_test                 -0.414411
round_time_test        0 days 00:00:09.377234
round_time_total       0 days 00:05:58.779251
loss_total                 6805194239752798.0
loss_critic                8506492662889578.0
loss_actor                      -223737180.04
memory_size                        22023.5245 

=== epoch 8/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:56,  5.59it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  901
episode_length                       11.08768
returns                             -4.901994
return_std                           3.932977
average_reward                      -0.441515
round_time             0 days 00:05:58.910228
episodes_test                           930.0
episode_length_test                 10.745161
returns_test                        -4.829339
return_std_test                      3.308352
average_reward_test                 -0.449242
round_time_test        0 days 00:00:09.334602
round_time_total       0 days 00:05:58.911299
loss_total                 6677909073762976.0
loss_critic                8347386193322705.0
loss_actor                     -222039374.288
memory_size                        22084.0405 

=== epoch 8/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  908
episode_length                      10.996696
returns                             -4.958281
return_std                           3.913533
average_reward                      -0.451292
round_time             0 days 00:06:00.389680
episodes_test                           889.0
episode_length_test                 11.247469
returns_test                        -4.877212
return_std_test                      3.583889
average_reward_test                 -0.433497
round_time_test        0 days 00:00:09.161283
round_time_total       0 days 00:06:00.390805
loss_total                 6643062672382231.0
loss_critic                8303828201461776.0
loss_actor                     -221410161.048
memory_size                         22144.264 

=== epoch 8/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  903
episode_length                      11.052049
returns                             -4.969141
return_std                           4.011771
average_reward                      -0.449611
round_time             0 days 00:06:00.410237
episodes_test                           911.0
episode_length_test                 10.973655
returns_test                        -5.229946
return_std_test                      3.750003
average_reward_test                 -0.476315
round_time_test        0 days 00:00:09.214318
round_time_total       0 days 00:06:00.411414
loss_total                 6559224073782559.0
loss_critic                8199029952138445.0
loss_actor                     -220109413.544
memory_size                        22177.9255 

=== epoch 8/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:52,  4.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  901
episode_length                      11.072142
returns                             -4.712125
return_std                            3.86543
average_reward                      -0.425637
round_time             0 days 00:06:00.113596
episodes_test                           881.0
episode_length_test                 11.349603
returns_test                        -4.875684
return_std_test                      3.538323
average_reward_test                 -0.429451
round_time_test        0 days 00:00:09.302804
round_time_total       0 days 00:06:00.114714
loss_total                 6491863474501583.0
loss_critic                8114829207936172.0
loss_actor                     -216610958.112
memory_size                         22228.935 

=== epoch 8/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:33,  5.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  888
episode_length                      11.233108
returns                             -4.782741
return_std                           3.846645
average_reward                      -0.424119
round_time             0 days 00:05:59.327249
episodes_test                           861.0
episode_length_test                 11.608595
returns_test                        -4.566217
return_std_test                      3.720695
average_reward_test                 -0.392894
round_time_test        0 days 00:00:09.201639
round_time_total       0 days 00:05:59.328349
loss_total                 6344251704157929.0
loss_critic                7930314499033137.0
loss_actor                     -214223045.296
memory_size                        22317.1785 

=== epoch 8/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:58,  4.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  882
episode_length                      11.325397
returns                              -4.82017
return_std                           3.801374
average_reward                      -0.425089
round_time             0 days 00:05:59.582577
episodes_test                           864.0
episode_length_test                 11.565972
returns_test                        -4.863348
return_std_test                      3.801494
average_reward_test                 -0.420093
round_time_test        0 days 00:00:09.108680
round_time_total       0 days 00:05:59.583670
loss_total                 6229226152477590.0
loss_critic                7786532551564198.0
loss_actor                     -212530574.712
memory_size                        22404.4165 

=== epoch 8/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  875
episode_length                         11.416
returns                             -4.749996
return_std                           3.791114
average_reward                      -0.416038
round_time             0 days 00:05:57.877307
episodes_test                           904.0
episode_length_test                 11.053097
returns_test                        -4.656246
return_std_test                      3.658362
average_reward_test                 -0.420346
round_time_test        0 days 00:00:09.252296
round_time_total       0 days 00:05:57.878616
loss_total                 6154720306620531.0
loss_critic                7693400250534330.0
loss_actor                     -210387479.592
memory_size                         22447.131 

=== epoch 8/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:01,  5.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  879
episode_length                      11.365188
returns                             -4.886202
return_std                           3.696505
average_reward                       -0.42986
round_time             0 days 00:05:59.827914
episodes_test                           844.0
episode_length_test                 11.830569
returns_test                        -4.860457
return_std_test                      3.818246
average_reward_test                 -0.409566
round_time_test        0 days 00:00:09.193994
round_time_total       0 days 00:05:59.829050
loss_total                 5946116776763851.0
loss_critic                7432645840092529.0
loss_actor                     -207182843.504
memory_size                         22495.531 

=== epoch 8/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  866
episode_length                      11.536952
returns                             -4.992532
return_std                           3.750307
average_reward                      -0.433549
round_time             0 days 00:06:00.886382
episodes_test                           885.0
episode_length_test                 11.292655
returns_test                        -4.607199
return_std_test                      3.843063
average_reward_test                 -0.407203
round_time_test        0 days 00:00:09.252763
round_time_total       0 days 00:06:00.887981
loss_total                 5835494591716917.0
loss_critic                7294368118967632.0
loss_actor                     -204923582.272
memory_size                        22551.8995 

=== epoch 8/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:28,  5.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.54it/s]
episodes                                  869
episode_length                      11.483314
returns                             -4.959078
return_std                           3.650425
average_reward                      -0.432547
round_time             0 days 00:06:01.539148
episodes_test                           878.0
episode_length_test                 11.382688
returns_test                        -4.829185
return_std_test                      3.799734
average_reward_test                 -0.423967
round_time_test        0 days 00:00:09.208486
round_time_total       0 days 00:06:01.540221
loss_total                 5699608868137468.0
loss_critic                7124510960919773.0
loss_actor                      -201054423.96
memory_size                        22613.1835 

=== epoch 8/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  867
episode_length                      11.510957
returns                             -4.986582
return_std                           3.613933
average_reward                      -0.432241
round_time             0 days 00:06:00.099380
episodes_test                           874.0
episode_length_test                 11.421053
returns_test                        -4.322955
return_std_test                      3.605038
average_reward_test                 -0.378279
round_time_test        0 days 00:00:09.121347
round_time_total       0 days 00:06:00.100458
loss_total                 5485515197214884.0
loss_critic                6856893871209578.0
loss_actor                     -197876524.128
memory_size                         22708.764 

=== epoch 8/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:16,  4.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  861
episode_length                      11.607433
returns                             -4.957415
return_std                           3.669847
average_reward                      -0.426956
round_time             0 days 00:05:59.919698
episodes_test                           891.0
episode_length_test                 11.223345
returns_test                        -4.302719
return_std_test                      3.568163
average_reward_test                 -0.383372
round_time_test        0 days 00:00:09.305377
round_time_total       0 days 00:05:59.920971
loss_total                 5373089429973893.0
loss_critic                6716361674237936.0
loss_actor                     -193692061.888
memory_size                        22772.9735 

=== epoch 8/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:25,  5.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  848
episode_length                       11.78184
returns                              -4.67356
return_std                           3.732954
average_reward                      -0.397082
round_time             0 days 00:06:00.804819
episodes_test                           899.0
episode_length_test                 11.112347
returns_test                        -4.546635
return_std_test                      3.291071
average_reward_test                 -0.409163
round_time_test        0 days 00:00:09.232329
round_time_total       0 days 00:06:00.805919
loss_total                 5124497114058981.0
loss_critic                6405621281877656.0
loss_actor                     -189653703.752
memory_size                        22834.0955 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  1%|          | 12/2000 [00:02<06:00,  5.51it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  173
episode_length                       11.50289
returns                              -4.32579
return_std                           4.017063
average_reward                      -0.373616
round_time             0 days 00:05:53.872249
episodes_test                           857.0
episode_length_test                 11.656943
returns_test                        -4.659637
return_std_test                       3.73199
average_reward_test                 -0.399087
round_time_test        0 days 00:00:09.324700
round_time_total       0 days 00:05:53.873435
loss_total                 4982860703611224.0
loss_critic                6228575766318154.0
loss_actor                     -187685558.528
memory_size                        22894.6855 

=== epoch 9/10 ===== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:39,  4.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  345
episode_length                      11.594203
returns                             -4.248202
return_std                           3.668782
average_reward                      -0.366407
round_time             0 days 00:05:53.210818
episodes_test                           876.0
episode_length_test                   11.4121
returns_test                         -3.98076
return_std_test                      3.330641
average_reward_test                 -0.348443
round_time_test        0 days 00:00:09.169434
round_time_total       0 days 00:05:53.212040
loss_total                 4847695914921361.0
loss_critic                6059619792736748.0
loss_actor                      -184233201.84
memory_size                         23010.926 

=== epoch 9/10 ===== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  520
episode_length                         11.525
returns                             -4.004653
return_std                           3.668331
average_reward                      -0.347571
round_time             0 days 00:05:53.227244
episodes_test                           846.0
episode_length_test                 11.806147
returns_test                         -4.55524
return_std_test                      3.757777
average_reward_test                 -0.385845
round_time_test        0 days 00:00:09.258275
round_time_total       0 days 00:05:53.228391
loss_total                 4712431076243407.0
loss_critic                5890538745815368.0
loss_actor                     -181121437.768
memory_size                        23092.0245 

=== epoch 9/10 ===== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:17,  5.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  695
episode_length                      11.492086
returns                             -4.152772
return_std                           3.717712
average_reward                       -0.36102
round_time             0 days 00:05:53.563866
episodes_test                           927.0
episode_length_test                 10.781014
returns_test                        -4.110809
return_std_test                      3.248725
average_reward_test                 -0.381236
round_time_test        0 days 00:00:09.276976
round_time_total       0 days 00:05:53.564972
loss_total                 4552572593000940.0
loss_critic                5690715644832514.0
loss_actor                     -177956054.392
memory_size                        23150.8415 

=== epoch 9/10 ===== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  877
episode_length                      11.394527
returns                             -4.261464
return_std                           3.665527
average_reward                      -0.373094
round_time             0 days 00:05:54.755359
episodes_test                           902.0
episode_length_test                 11.085366
returns_test                        -4.064504
return_std_test                      3.463643
average_reward_test                 -0.366539
round_time_test        0 days 00:00:09.138874
round_time_total       0 days 00:05:54.756892
loss_total                 4369711941971607.5
loss_critic                5462139832270586.0
loss_actor                     -175011336.688
memory_size                        23202.0455 

=== epoch 9/10 ===== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  875
episode_length                      11.419429
returns                             -4.283234
return_std                           3.543903
average_reward                      -0.375004
round_time             0 days 00:05:54.812315
episodes_test                           854.0
episode_length_test                 11.700234
returns_test                        -4.372216
return_std_test                      3.573222
average_reward_test                 -0.373554
round_time_test        0 days 00:00:09.178417
round_time_total       0 days 00:05:54.813436
loss_total                 4286331026159960.0
loss_critic                5357913689234080.0
loss_actor                       -171851569.0
memory_size                        23261.1195 

=== epoch 9/10 ===== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  878
episode_length                      11.387244
returns                             -4.417272
return_std                           3.610516
average_reward                       -0.38771
round_time             0 days 00:05:54.265477
episodes_test                           900.0
episode_length_test                 11.097778
returns_test                        -4.524089
return_std_test                      3.544618
average_reward_test                 -0.406659
round_time_test        0 days 00:00:09.339711
round_time_total       0 days 00:05:54.266949
loss_total                 4153223146700275.5
loss_critic                5191528842191176.0
loss_actor                     -168838915.696
memory_size                        23334.4015 

=== epoch 9/10 ===== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:42,  5.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  877
episode_length                      11.399088
returns                             -4.545242
return_std                           3.492048
average_reward                      -0.398692
round_time             0 days 00:05:55.318613
episodes_test                           891.0
episode_length_test                 11.218855
returns_test                        -4.506044
return_std_test                      3.777326
average_reward_test                 -0.401309
round_time_test        0 days 00:00:09.076885
round_time_total       0 days 00:05:55.319696
loss_total                 4010809610976887.0
loss_critic                5013511926295036.0
loss_actor                     -165618118.664
memory_size                        23389.1375 

=== epoch 9/10 ===== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:02,  5.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  888
episode_length                      11.251126
returns                               -4.5286
return_std                           3.526605
average_reward                      -0.402652
round_time             0 days 00:05:54.549247
episodes_test                           902.0
episode_length_test                 11.085366
returns_test                        -4.402472
return_std_test                      3.676281
average_reward_test                 -0.397019
round_time_test        0 days 00:00:09.213571
round_time_total       0 days 00:05:54.550356
loss_total                 3838721000657125.5
loss_critic                4798401175844028.0
loss_actor                      -162958157.32
memory_size                        23424.8245 

=== epoch 9/10 ===== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:48,  5.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  890
episode_length                      11.225843
returns                             -4.494979
return_std                            3.57148
average_reward                      -0.400879
round_time             0 days 00:05:55.929146
episodes_test                           899.0
episode_length_test                 11.106785
returns_test                        -4.730022
return_std_test                      3.352411
average_reward_test                 -0.426183
round_time_test        0 days 00:00:09.228240
round_time_total       0 days 00:05:55.930277
loss_total                 3687983955786596.5
loss_critic                4609979871257428.0
loss_actor                     -159639154.768
memory_size                        23469.1295 

=== epoch 9/10 ===== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  905
episode_length                      11.041989
returns                             -4.596104
return_std                           3.573483
average_reward                      -0.416271
round_time             0 days 00:05:56.247702
episodes_test                           897.0
episode_length_test                 11.148272
returns_test                        -4.246434
return_std_test                      3.556094
average_reward_test                 -0.380905
round_time_test        0 days 00:00:09.176403
round_time_total       0 days 00:05:56.248789
loss_total                 3582676887224189.0
loss_critic                4478346037055979.5
loss_actor                     -156059057.784
memory_size                        23499.8935 

=== epoch 9/10 ===== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:58,  5.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  916
episode_length                      10.902838
returns                              -4.66505
return_std                            3.54555
average_reward                      -0.427795
round_time             0 days 00:05:56.110937
episodes_test                           951.0
episode_length_test                 10.514196
returns_test                        -4.605115
return_std_test                      3.412828
average_reward_test                 -0.437852
round_time_test        0 days 00:00:09.288184
round_time_total       0 days 00:05:56.112030
loss_total                 3427108089776570.5
loss_critic                4283885043090194.5
loss_actor                     -153121160.388
memory_size                        23547.2295 

=== epoch 9/10 ===== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  920
episode_length                          10.85
returns                             -4.753532
return_std                           3.642375
average_reward                      -0.436763
round_time             0 days 00:05:55.379998
episodes_test                           896.0
episode_length_test                 11.157366
returns_test                         -4.38585
return_std_test                      3.747298
average_reward_test                 -0.392874
round_time_test        0 days 00:00:09.176682
round_time_total       0 days 00:05:55.381464
loss_total                 3337089672886092.0
loss_critic                4171362017724072.0
loss_actor                     -151118822.268
memory_size                        23596.5025 

=== epoch 9/10 ===== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  909
episode_length                      10.983498
returns                             -4.650464
return_std                           3.553379
average_reward                      -0.422395
round_time             0 days 00:05:56.177204
episodes_test                           943.0
episode_length_test                 10.602333
returns_test                        -4.486675
return_std_test                      3.507768
average_reward_test                 -0.422897
round_time_test        0 days 00:00:09.625922
round_time_total       0 days 00:05:56.178436
loss_total                 3196328577981219.0
loss_critic                3995410655216664.5
loss_actor                     -148541098.524
memory_size                         23647.308 

=== epoch 9/10 ===== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  907
episode_length                      11.024256
returns                             -4.684196
return_std                            3.70531
average_reward                       -0.42476
round_time             0 days 00:05:56.281548
episodes_test                           908.0
episode_length_test                 11.006608
returns_test                        -4.604578
return_std_test                       3.62616
average_reward_test                 -0.418125
round_time_test        0 days 00:00:09.190439
round_time_total       0 days 00:05:56.282687
loss_total                 3085476305160896.5
loss_critic                3856845316439408.5
loss_actor                     -146834120.964
memory_size                        23695.8235 

=== epoch 9/10 ===== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:20,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  897
episode_length                      11.125975
returns                             -4.638654
return_std                           3.693991
average_reward                       -0.41653
round_time             0 days 00:05:56.351970
episodes_test                           884.0
episode_length_test                 11.306561
returns_test                        -4.333497
return_std_test                      3.882603
average_reward_test                 -0.382738
round_time_test        0 days 00:00:09.292279
round_time_total       0 days 00:05:56.353109
loss_total                 3018782798623277.0
loss_critic                3773478434601173.0
loss_actor                     -144659030.456
memory_size                           23753.8 

=== epoch 9/10 ===== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:14,  5.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  891
episode_length                      11.198653
returns                             -4.504918
return_std                           3.705014
average_reward                      -0.402606
round_time             0 days 00:05:56.301374
episodes_test                           863.0
episode_length_test                 11.587486
returns_test                        -4.439827
return_std_test                       3.93316
average_reward_test                 -0.383157
round_time_test        0 days 00:00:09.150820
round_time_total       0 days 00:05:56.302477
loss_total                 2927387452228239.5
loss_critic                3659234256556654.5
loss_actor                     -141124969.136
memory_size                          23818.23 

=== epoch 9/10 ===== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:20,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  886
episode_length                      11.257336
returns                              -4.39181
return_std                           3.775036
average_reward                      -0.390217
round_time             0 days 00:05:55.979885
episodes_test                           826.0
episode_length_test                 12.101695
returns_test                        -4.607854
return_std_test                      3.942023
average_reward_test                 -0.380263
round_time_test        0 days 00:00:09.101266
round_time_total       0 days 00:05:55.981072
loss_total                 2785160071393313.0
loss_critic                3481450029841907.5
loss_actor                      -138037448.92
memory_size                        23901.6785 

=== epoch 9/10 ===== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<07:25,  4.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  885
episode_length                      11.275706
returns                             -4.418508
return_std                             3.8709
average_reward                      -0.392385
round_time             0 days 00:05:55.397983
episodes_test                           825.0
episode_length_test                     12.12
returns_test                        -4.421943
return_std_test                      4.257773
average_reward_test                 -0.364714
round_time_test        0 days 00:00:09.220569
round_time_total       0 days 00:05:55.399507
loss_total                 2700621423960916.0
loss_critic                3375776722606620.5
loss_actor                     -134854007.132
memory_size                         23999.801 

=== epoch 9/10 ===== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  882
episode_length                      11.327664
returns                             -4.268449
return_std                             3.7323
average_reward                      -0.377006
round_time             0 days 00:05:56.131359
episodes_test                           838.0
episode_length_test                 11.930788
returns_test                        -3.976113
return_std_test                      4.275228
average_reward_test                 -0.333038
round_time_test        0 days 00:00:09.071957
round_time_total       0 days 00:05:56.132497
loss_total                 2575338712183341.0
loss_critic                3219173334243606.5
loss_actor                      -131951344.44
memory_size                         24083.459 

=== epoch 9/10 ===== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:44,  4.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  876
episode_length                      11.408676
returns                             -4.252034
return_std                           3.809744
average_reward                      -0.372889
round_time             0 days 00:05:57.793860
episodes_test                           900.0
episode_length_test                 11.105556
returns_test                         -3.89866
return_std_test                      3.584534
average_reward_test                 -0.351178
round_time_test        0 days 00:00:09.246163
round_time_total       0 days 00:05:57.794922
loss_total                 2462545513174532.0
loss_critic                3078181836053020.5
loss_actor                      -129481292.16
memory_size                        24145.3685 

=== epoch 9/10 ===== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:53,  5.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  883
episode_length                      11.310306
returns                             -4.302206
return_std                           3.742938
average_reward                      -0.379985
round_time             0 days 00:05:56.277198
episodes_test                           887.0
episode_length_test                 11.266065
returns_test                        -4.182834
return_std_test                      3.669787
average_reward_test                 -0.371046
round_time_test        0 days 00:00:09.256735
round_time_total       0 days 00:05:56.278282
loss_total                 2370522001237344.5
loss_critic                2963152453530288.0
loss_actor                     -125883659.792
memory_size                         24199.417 

=== epoch 9/10 ===== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:20,  5.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.62it/s]
episodes                                  883
episode_length                      11.319366
returns                             -4.465329
return_std                           3.736765
average_reward                      -0.394364
round_time             0 days 00:05:56.716488
episodes_test                           896.0
episode_length_test                 11.151786
returns_test                        -4.708548
return_std_test                      3.804932
average_reward_test                 -0.421159
round_time_test        0 days 00:00:09.133839
round_time_total       0 days 00:05:56.717574
loss_total                 2246032782922875.0
loss_critic                2807540931123740.5
loss_actor                     -122456388.584
memory_size                          24257.93 

=== epoch 9/10 ===== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:10,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  888
episode_length                      11.246622
returns                             -4.506124
return_std                           3.839076
average_reward                      -0.400712
round_time             0 days 00:05:57.712298
episodes_test                           904.0
episode_length_test                 11.053097
returns_test                        -4.647668
return_std_test                       3.80907
average_reward_test                 -0.420185
round_time_test        0 days 00:00:09.245425
round_time_total       0 days 00:05:57.713397
loss_total                 2132231737491587.0
loss_critic                2665289625441927.0
loss_actor                     -119356397.684
memory_size                          24354.17 

=== epoch 9/10 ===== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  890
episode_length                      11.223596
returns                             -4.554133
return_std                           3.922601
average_reward                      -0.405212
round_time             0 days 00:05:56.318718
episodes_test                           915.0
episode_length_test                 10.926776
returns_test                        -4.320551
return_std_test                       3.71726
average_reward_test                 -0.395087
round_time_test        0 days 00:00:09.221198
round_time_total       0 days 00:05:56.319939
loss_total                2033653904167665.75
loss_critic                2542067337083748.5
loss_actor                     -116170810.072
memory_size                         24414.269 

=== epoch 9/10 ===== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:21,  5.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  892
episode_length                       11.19843
returns                             -4.494026
return_std                           3.901792
average_reward                      -0.400715
round_time             0 days 00:05:58.790733
episodes_test                           971.0
episode_length_test                 10.297631
returns_test                         -4.26219
return_std_test                      3.231792
average_reward_test                 -0.413786
round_time_test        0 days 00:00:09.299915
round_time_total       0 days 00:05:58.791878
loss_total                1934571315482591.25
loss_critic                2418214105413320.5
loss_actor                     -113303441.616
memory_size                         24490.304 

=== epoch 9/10 ===== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:04,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.60it/s]
episodes                                  867
episode_length                      11.529412
returns                             -4.398564
return_std                           4.044354
average_reward                       -0.38165
round_time             0 days 00:05:57.534198
episodes_test                           880.0
episode_length_test                 11.356818
returns_test                        -3.760604
return_std_test                      3.912332
average_reward_test                 -0.330476
round_time_test        0 days 00:00:09.231033
round_time_total       0 days 00:05:57.535280
loss_total                 1836772972686213.0
loss_critic                2295966177999978.5
loss_actor                     -110618569.436
memory_size                         24613.219 

=== epoch 9/10 ===== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:59,  4.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  868
episode_length                      11.501152
returns                             -4.373852
return_std                           4.040815
average_reward                      -0.380545
round_time             0 days 00:05:58.535513
episodes_test                           796.0
episode_length_test                  12.55402
returns_test                        -3.755621
return_std_test                      4.659701
average_reward_test                 -0.298453
round_time_test        0 days 00:00:09.127581
round_time_total       0 days 00:05:58.536654
loss_total                1751920517360648.25
loss_critic                2189900611502211.0
loss_actor                     -107290809.764
memory_size                        24744.6205 

=== epoch 9/10 ===== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:24,  4.49it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  857
episode_length                      11.654609
returns                             -4.250427
return_std                            4.00587
average_reward                       -0.36515
round_time             0 days 00:05:59.425008
episodes_test                           877.0
episode_length_test                 11.401368
returns_test                        -4.015968
return_std_test                      4.041812
average_reward_test                  -0.35212
round_time_test        0 days 00:00:09.127575
round_time_total       0 days 00:05:59.426486
loss_total                1666217256159281.25
loss_critic                2082771533893206.0
loss_actor                     -103589757.576
memory_size                         24814.255 

=== epoch 9/10 ===== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:07,  5.42it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  855
episode_length                      11.690058
returns                              -4.26017
return_std                           3.950213
average_reward                      -0.365164
round_time             0 days 00:05:57.782951
episodes_test                           863.0
episode_length_test                 11.582851
returns_test                         -4.03744
return_std_test                      4.189629
average_reward_test                 -0.348384
round_time_test        0 days 00:00:09.184932
round_time_total       0 days 00:05:57.784092
loss_total                 1569133575452754.0
loss_critic               1961416937531506.75
loss_actor                     -100346402.472
memory_size                        24901.2375 

=== epoch 9/10 ===== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:59,  4.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  856
episode_length                      11.654206
returns                             -4.159757
return_std                           3.993221
average_reward                      -0.355905
round_time             0 days 00:06:00.051410
episodes_test                           790.0
episode_length_test                 12.644304
returns_test                        -4.153122
return_std_test                      4.412364
average_reward_test                 -0.328622
round_time_test        0 days 00:00:09.252388
round_time_total       0 days 00:06:00.052531
loss_total                1471302802130075.75
loss_critic                1839128471364698.0
loss_actor                      -96258953.796
memory_size                         25002.048 

=== epoch 9/10 ===== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:33,  5.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  869
episode_length                      11.506329
returns                             -4.092234
return_std                           4.097008
average_reward                      -0.355537
round_time             0 days 00:06:00.125507
episodes_test                           910.0
episode_length_test                 10.985714
returns_test                        -4.107838
return_std_test                      3.632547
average_reward_test                 -0.373472
round_time_test        0 days 00:00:09.237885
round_time_total       0 days 00:06:00.126675
loss_total                1364602549675491.25
loss_critic               1705753156438196.25
loss_actor                      -92575201.196
memory_size                        25053.7125 

=== epoch 9/10 ===== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:14,  5.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  868
episode_length                      11.508065
returns                              -3.93051
return_std                           4.074835
average_reward                      -0.341785
round_time             0 days 00:05:58.648662
episodes_test                           822.0
episode_length_test                 12.148418
returns_test                        -4.779884
return_std_test                      4.285698
average_reward_test                 -0.392424
round_time_test        0 days 00:00:09.145981
round_time_total       0 days 00:05:58.649998
loss_total                 1286841297688068.0
loss_critic                1608551595555946.5
loss_actor                       -89170276.86
memory_size                         25158.168 

=== epoch 9/10 ===== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:16,  5.29it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  870
episode_length                      11.485057
returns                             -4.009503
return_std                           3.964979
average_reward                      -0.349386
round_time             0 days 00:06:00.980815
episodes_test                           879.0
episode_length_test                 11.374289
returns_test                        -3.977836
return_std_test                      4.002874
average_reward_test                 -0.349468
round_time_test        0 days 00:00:09.198175
round_time_total       0 days 00:06:00.981946
loss_total                1209700727799152.75
loss_critic                1512125883136082.0
loss_actor                       -86056739.76
memory_size                         25271.726 

=== epoch 9/10 ===== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:23,  4.50it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  863
episode_length                      11.573581
returns                             -4.165161
return_std                            4.05022
average_reward                      -0.359819
round_time             0 days 00:06:00.582487
episodes_test                           899.0
episode_length_test                 11.113459
returns_test                        -4.523032
return_std_test                      3.932658
average_reward_test                 -0.406406
round_time_test        0 days 00:00:09.117434
round_time_total       0 days 00:06:00.583592
loss_total                 1132135293588078.5
loss_critic               1415169092138041.25
loss_actor                      -83443822.152
memory_size                        25357.7575 

=== epoch 9/10 ===== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:45,  4.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.54it/s]
episodes                                  856
episode_length                       11.66472
returns                             -4.067841
return_std                           4.139175
average_reward                      -0.348331
round_time             0 days 00:06:01.665534
episodes_test                           885.0
episode_length_test                 11.298305
returns_test                        -4.331799
return_std_test                      4.069015
average_reward_test                 -0.383273
round_time_test        0 days 00:00:09.931842
round_time_total       0 days 00:06:01.666856
loss_total               1072000867999154.125
loss_critic                1340001061712167.0
loss_actor                      -80579582.032
memory_size                         25486.041 

=== epoch 9/10 ===== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:30,  5.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:02<00:00,  5.52it/s]
episodes                                  856
episode_length                      11.667056
returns                             -4.129365
return_std                           4.077873
average_reward                      -0.353905
round_time             0 days 00:06:02.604237
episodes_test                           920.0
episode_length_test                  10.86087
returns_test                        -3.958984
return_std_test                      3.907809
average_reward_test                 -0.364379
round_time_test        0 days 00:00:09.261001
round_time_total       0 days 00:06:02.605337
loss_total               1006606910083825.625
loss_critic               1258258615827955.75
loss_actor                      -78070029.632
memory_size                        25561.1475 

=== epoch 9/10 ===== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<08:22,  3.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  859
episode_length                       11.63213
returns                             -4.199774
return_std                           4.006491
average_reward                       -0.36075
round_time             0 days 00:05:59.808363
episodes_test                           836.0
episode_length_test                 11.949761
returns_test                        -3.283599
return_std_test                      4.848552
average_reward_test                 -0.274192
round_time_test        0 days 00:00:09.209407
round_time_total       0 days 00:05:59.809484
loss_total                  950379830623338.5
loss_critic               1187974768674996.25
loss_actor                      -75262515.554
memory_size                         25593.875 

=== epoch 9/10 ===== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.38it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.57it/s]
episodes                                  868
episode_length                      11.509217
returns                             -4.189997
return_std                            3.99519
average_reward                      -0.363929
round_time             0 days 00:05:59.514420
episodes_test                           995.0
episode_length_test                 10.045226
returns_test                        -4.362405
return_std_test                      3.122965
average_reward_test                 -0.434256
round_time_test        0 days 00:00:09.362218
round_time_total       0 days 00:05:59.515510
loss_total                892879025611997.125
loss_critic              1116098762146578.375
loss_actor                      -72946612.756
memory_size                        25659.1775 

=== epoch 9/10 ===== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:00,  5.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  885
episode_length                      11.285876
returns                             -4.111811
return_std                           3.805963
average_reward                      -0.364475
round_time             0 days 00:06:00.825979
episodes_test                           914.0
episode_length_test                 10.936543
returns_test                        -4.430022
return_std_test                      3.522103
average_reward_test                 -0.404567
round_time_test        0 days 00:00:09.114249
round_time_total       0 days 00:06:00.827249
loss_total                 846416222579851.25
loss_critic                1058020260646486.0
loss_actor                       -70706751.95
memory_size                         25701.022 

=== epoch 9/10 ===== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:17,  5.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  893
episode_length                      11.191489
returns                             -4.489246
return_std                           3.701425
average_reward                      -0.401609
round_time             0 days 00:06:00.126971
episodes_test                           914.0
episode_length_test                 10.939825
returns_test                         -5.28871
return_std_test                       3.82428
average_reward_test                 -0.483311
round_time_test        0 days 00:00:09.179453
round_time_total       0 days 00:06:00.128062
loss_total                  810375174787432.5
loss_critic              1012968952059396.125
loss_actor                      -68217609.524
memory_size                         25746.774 

=== epoch 9/10 ===== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:25,  4.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.54it/s]
episodes                                  900
episode_length                      11.101111
returns                             -4.733126
return_std                           3.628852
average_reward                      -0.426405
round_time             0 days 00:06:01.757570
episodes_test                           958.0
episode_length_test                 10.431106
returns_test                        -5.162288
return_std_test                      3.473422
average_reward_test                 -0.494036
round_time_test        0 days 00:00:09.316697
round_time_total       0 days 00:06:01.758667
loss_total                 766866253431177.25
loss_critic                 958582800053698.5
loss_actor                      -66333562.792
memory_size                         25792.665 

=== epoch 9/10 ===== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:49,  4.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  891
episode_length                      11.215488
returns                             -4.813056
return_std                           3.600479
average_reward                      -0.429123
round_time             0 days 00:05:59.874972
episodes_test                           862.0
episode_length_test                 11.595128
returns_test                        -4.859229
return_std_test                      3.937203
average_reward_test                  -0.41887
round_time_test        0 days 00:00:09.167929
round_time_total       0 days 00:05:59.876528
loss_total                 731223367914356.75
loss_critic               914029194021699.625
loss_actor                       -64830151.05
memory_size                         25837.178 

=== epoch 9/10 ===== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:26,  5.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  899
episode_length                      11.105673
returns                             -4.836822
return_std                           3.653805
average_reward                      -0.435945
round_time             0 days 00:06:00.798602
episodes_test                           856.0
episode_length_test                 11.682243
returns_test                        -4.706994
return_std_test                       3.78741
average_reward_test                 -0.402919
round_time_test        0 days 00:00:09.118905
round_time_total       0 days 00:06:00.800171
loss_total                705570035699023.875
loss_critic                 881962529062912.0
loss_actor                      -63370321.136
memory_size                         25886.572 

=== epoch 9/10 ===== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:44,  5.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:58<00:00,  5.58it/s]
episodes                                  894
episode_length                      11.182327
returns                             -5.093202
return_std                           3.842527
average_reward                      -0.455364
round_time             0 days 00:05:59.304336
episodes_test                           871.0
episode_length_test                 11.464983
returns_test                        -4.121789
return_std_test                      4.117385
average_reward_test                 -0.359682
round_time_test        0 days 00:00:09.256569
round_time_total       0 days 00:05:59.305538
loss_total                  667370185727934.5
loss_critic               834212718394212.375
loss_actor                      -61496361.922
memory_size                        25903.2695 

=== epoch 9/10 ===== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:21,  5.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.57it/s]
episodes                                  892
episode_length                      11.195067
returns                             -5.187116
return_std                           3.905224
average_reward                      -0.461923
round_time             0 days 00:05:59.939359
episodes_test                           901.0
episode_length_test                 11.098779
returns_test                        -4.721529
return_std_test                      3.730197
average_reward_test                  -0.42541
round_time_test        0 days 00:00:08.975391
round_time_total       0 days 00:05:59.940526
loss_total                 644850876854304.75
loss_critic               806063581689806.875
loss_actor                      -60656728.026
memory_size                         25935.843 

=== epoch 9/10 ===== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:00,  5.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:00<00:00,  5.55it/s]
episodes                                  880
episode_length                      11.356818
returns                             -5.204909
return_std                           4.012027
average_reward                      -0.458792
round_time             0 days 00:06:00.704624
episodes_test                           885.0
episode_length_test                 11.292655
returns_test                        -5.352799
return_std_test                      3.893148
average_reward_test                 -0.474119
round_time_test        0 days 00:00:09.132157
round_time_total       0 days 00:06:00.705906
loss_total                628643881969254.375
loss_critic                785804837584371.75
loss_actor                      -59735074.666
memory_size                         25998.542 

=== epoch 9/10 ===== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:46,  4.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  878
episode_length                      11.372437
returns                             -5.304255
return_std                           4.223719
average_reward                      -0.466133
round_time             0 days 00:06:00.359873
episodes_test                           820.0
episode_length_test                 12.184146
returns_test                         -5.95658
return_std_test                      4.275821
average_reward_test                 -0.488855
round_time_test        0 days 00:00:09.122976
round_time_total       0 days 00:06:00.360971
loss_total                 604748894827446.25
loss_critic                755936105764749.25
loss_actor                      -58292005.232
memory_size                        26068.6305 

=== epoch 9/10 ===== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:15,  5.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  853
episode_length                      11.716295
returns                             -5.677778
return_std                           4.233655
average_reward                      -0.484238
round_time             0 days 00:06:00.390306
episodes_test                           835.0
episode_length_test                 11.972455
returns_test                        -5.056517
return_std_test                      4.242037
average_reward_test                 -0.421951
round_time_test        0 days 00:00:09.176795
round_time_total       0 days 00:06:00.391394
loss_total                585438754378874.875
loss_critic               731798430011097.125
loss_actor                      -56967357.962
memory_size                        26163.2045 

=== epoch 9/10 ===== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:58,  4.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [06:01<00:00,  5.53it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  837
episode_length                       11.90681
returns                              -5.61387
return_std                           4.144738
average_reward                      -0.469655
round_time             0 days 00:06:02.131979
episodes_test                           866.0
episode_length_test                  11.54157
returns_test                        -6.724565
return_std_test                      3.896893
average_reward_test                 -0.582349
round_time_test        0 days 00:00:09.202239
round_time_total       0 days 00:06:02.133084
loss_total                570724519283523.625
loss_critic                 713405636343234.5
loss_actor                      -55690087.314
memory_size                          26245.23 


<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  1%|          | 11/2000 [00:02<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:51<00:00,  5.69it/s]
episodes                                  181
episode_length                      11.027624
returns                             -5.645456
return_std                           4.569639
average_reward                      -0.509045
round_time             0 days 00:05:51.689085
episodes_test                           931.0
episode_length_test                  10.73362
returns_test                        -5.982276
return_std_test                      3.572815
average_reward_test                 -0.557228
round_time_test        0 days 00:00:09.153376
round_time_total       0 days 00:05:51.690230
loss_total               541303819745624.0625
loss_critic               676629763200122.875
loss_actor                      -53965593.362
memory_size                          26296.78 

=== epoch 10/10 ==== round 2/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:05,  5.45it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:50<00:00,  5.71it/s]
episodes                                  346
episode_length                      11.554913
returns                             -5.795542
return_std                           4.463892
average_reward                      -0.501023
round_time             0 days 00:05:50.900122
episodes_test                           846.0
episode_length_test                 11.814421
returns_test                        -6.171137
return_std_test                      4.378707
average_reward_test                 -0.521969
round_time_test        0 days 00:00:09.032483
round_time_total       0 days 00:05:50.901309
loss_total                518548468898201.625
loss_critic                 648185574422741.0
loss_actor                      -53365602.656
memory_size                         26352.977 

=== epoch 10/10 ==== round 3/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<07:41,  4.31it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  514
episode_length                      11.657588
returns                             -5.546568
return_std                           4.642134
average_reward                       -0.47487
round_time             0 days 00:05:54.383734
episodes_test                           751.0
episode_length_test                 13.303595
returns_test                        -4.802844
return_std_test                      6.740695
average_reward_test                 -0.361503
round_time_test        0 days 00:00:08.958577
round_time_total       0 days 00:05:54.384985
loss_total               512451378270961.6875
loss_critic                640564211679756.25
loss_actor                      -52875868.766
memory_size                        26448.9605 

=== epoch 10/10 ==== round 4/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:49,  4.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.68it/s]
episodes                                  687
episode_length                      11.622999
returns                             -5.733047
return_std                           4.420291
average_reward                      -0.493423
round_time             0 days 00:05:52.766252
episodes_test                           870.0
episode_length_test                 11.487356
returns_test                        -5.784296
return_std_test                       4.17899
average_reward_test                 -0.503256
round_time_test        0 days 00:00:09.041272
round_time_total       0 days 00:05:52.767374
loss_total                501809119443812.375
loss_critic               627261389022429.125
loss_actor                      -52227698.652
memory_size                        26496.5775 

=== epoch 10/10 ==== round 5/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:37,  5.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.65it/s]
episodes                                  864
episode_length                      11.571759
returns                             -5.863963
return_std                           4.281811
average_reward                      -0.506473
round_time             0 days 00:05:54.451708
episodes_test                           821.0
episode_length_test                 12.174178
returns_test                         -6.07279
return_std_test                      4.227088
average_reward_test                 -0.498571
round_time_test        0 days 00:00:09.186995
round_time_total       0 days 00:05:54.452816
loss_total                  488189462887530.5
loss_critic                 610236818859753.5
loss_actor                      -51648862.448
memory_size                        26529.0945 

=== epoch 10/10 ==== round 6/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:27,  5.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:52<00:00,  5.67it/s]
episodes                                  839
episode_length                      11.907032
returns                             -6.083333
return_std                           4.351286
average_reward                       -0.51101
round_time             0 days 00:05:53.361832
episodes_test                           838.0
episode_length_test                 11.928401
returns_test                        -6.752987
return_std_test                      4.212981
average_reward_test                 -0.565538
round_time_test        0 days 00:00:09.135075
round_time_total       0 days 00:05:53.363336
loss_total                 473520010032054.25
loss_critic               591900002993831.875
loss_actor                       -50958117.71
memory_size                        26579.9625 

=== epoch 10/10 ==== round 7/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:59,  5.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.65it/s]
episodes                                  831
episode_length                      12.019254
returns                              -6.11091
return_std                           4.389082
average_reward                      -0.508832
round_time             0 days 00:05:54.681432
episodes_test                           789.0
episode_length_test                 12.662864
returns_test                         -5.47858
return_std_test                      5.101578
average_reward_test                 -0.431709
round_time_test        0 days 00:00:09.108973
round_time_total       0 days 00:05:54.682534
loss_total                  469748157824106.5
loss_critic               587185187645816.875
loss_actor                      -50088453.132
memory_size                        26687.1365 

=== epoch 10/10 ==== round 8/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:11,  5.36it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  812
episode_length                      12.304187
returns                             -6.121114
return_std                           4.661055
average_reward                      -0.497658
round_time             0 days 00:05:56.207588
episodes_test                           804.0
episode_length_test                 12.429104
returns_test                         -6.93289
return_std_test                      4.440989
average_reward_test                 -0.558052
round_time_test        0 days 00:00:09.252348
round_time_total       0 days 00:05:56.208841
loss_total               462052521708355.5625
loss_critic               577565643017027.625
loss_actor                      -49332397.398
memory_size                         26801.758 

=== epoch 10/10 ==== round 9/50 ======================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:09,  5.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  793
episode_length                      12.602774
returns                              -5.98121
return_std                           4.781565
average_reward                      -0.474462
round_time             0 days 00:05:55.176449
episodes_test                           780.0
episode_length_test                 12.819231
returns_test                        -4.809608
return_std_test                       4.87561
average_reward_test                 -0.375069
round_time_test        0 days 00:00:09.170158
round_time_total       0 days 00:05:55.177554
loss_total               441767934759534.5625
loss_critic              552209910184542.1875
loss_actor                      -48111494.684
memory_size                        26909.2555 

=== epoch 10/10 ==== round 10/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:47,  4.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  771
episode_length                      12.948119
returns                             -5.902068
return_std                           4.993403
average_reward                      -0.455497
round_time             0 days 00:05:56.953600
episodes_test                           836.0
episode_length_test                 11.952153
returns_test                        -5.803019
return_std_test                      4.787185
average_reward_test                 -0.485589
round_time_test        0 days 00:00:08.965955
round_time_total       0 days 00:05:56.954695
loss_total                423095347116507.125
loss_critic              528869175964205.0625
loss_actor                      -47173285.798
memory_size                         27009.599 

=== epoch 10/10 ==== round 11/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.63it/s]
episodes                                  763
episode_length                      13.093054
returns                             -5.777971
return_std                           4.990103
average_reward                      -0.441189
round_time             0 days 00:05:55.507830
episodes_test                           770.0
episode_length_test                 12.983117
returns_test                        -6.441065
return_std_test                      4.501762
average_reward_test                 -0.495736
round_time_test        0 days 00:00:09.105050
round_time_total       0 days 00:05:55.509041
loss_total                  416828111502442.5
loss_critic                521035131708768.25
loss_actor                      -46195067.876
memory_size                        27117.3785 

=== epoch 10/10 ==== round 12/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:54<00:00,  5.64it/s]
episodes                                  751
episode_length                      13.302264
returns                             -5.634863
return_std                           5.135475
average_reward                      -0.424233
round_time             0 days 00:05:55.123446
episodes_test                           761.0
episode_length_test                 13.140604
returns_test                        -5.970324
return_std_test                      4.565895
average_reward_test                 -0.454342
round_time_test        0 days 00:00:08.993325
round_time_total       0 days 00:05:55.124520
loss_total               398143787743838.1875
loss_critic              497679726744174.5625
loss_actor                      -45215815.558
memory_size                        27234.9045 

=== epoch 10/10 ==== round 13/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:29,  5.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  760
episode_length                      13.144737
returns                             -5.803767
return_std                           5.061705
average_reward                      -0.441322
round_time             0 days 00:05:56.509706
episodes_test                           751.0
episode_length_test                 13.314248
returns_test                        -4.764242
return_std_test                       6.17603
average_reward_test                 -0.357715
round_time_test        0 days 00:00:08.970614
round_time_total       0 days 00:05:56.510797
loss_total                 383436995776479.25
loss_critic              479296238139736.0625
loss_actor                      -43974403.726
memory_size                         27373.951 

=== epoch 10/10 ==== round 14/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:36,  5.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  746
episode_length                      13.399464
returns                             -5.783984
return_std                           5.261648
average_reward                      -0.432435
round_time             0 days 00:05:56.806154
episodes_test                           791.0
episode_length_test                 12.594185
returns_test                        -6.312245
return_std_test                      4.518922
average_reward_test                 -0.499206
round_time_test        0 days 00:00:08.953832
round_time_total       0 days 00:05:56.807283
loss_total                370110948695867.375
loss_critic              462638680012488.6875
loss_actor                      -42976933.868
memory_size                        27509.5265 

=== epoch 10/10 ==== round 15/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:53,  5.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:53<00:00,  5.66it/s]
episodes                                  742
episode_length                      13.433962
returns                             -5.714851
return_std                           5.169235
average_reward                      -0.423579
round_time             0 days 00:05:54.224497
episodes_test                           731.0
episode_length_test                 13.679891
returns_test                         -5.29325
return_std_test                       5.68039
average_reward_test                 -0.386937
round_time_test        0 days 00:00:09.031382
round_time_total       0 days 00:05:54.225606
loss_total                 358145841510219.75
loss_critic              447682296581980.1875
loss_actor                      -41999513.306
memory_size                         27656.671 

=== epoch 10/10 ==== round 16/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.62it/s]
episodes                                  754
episode_length                      13.233422
returns                             -5.403371
return_std                           5.180195
average_reward                      -0.408321
round_time             0 days 00:05:56.452810
episodes_test                           807.0
episode_length_test                 12.384139
returns_test                         -6.11443
return_std_test                      4.812432
average_reward_test                 -0.493588
round_time_test        0 days 00:00:09.093724
round_time_total       0 days 00:05:56.453983
loss_total                345117761938128.875
loss_critic               431397196013764.625
loss_actor                      -40295782.295
memory_size                        27748.6925 

=== epoch 10/10 ==== round 17/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:38,  5.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  762
episode_length                      13.106299
returns                             -5.058861
return_std                           5.360474
average_reward                      -0.385741
round_time             0 days 00:05:56.094468
episodes_test                           772.0
episode_length_test                 12.953368
returns_test                         -3.32201
return_std_test                      5.759609
average_reward_test                 -0.256459
round_time_test        0 days 00:00:09.136297
round_time_total       0 days 00:05:56.095556
loss_total                324317050052804.625
loss_critic                405396306060640.25
loss_actor                       -38599972.18
memory_size                        27892.3255 

=== epoch 10/10 ==== round 18/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:43,  4.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.60it/s]
episodes                                  744
episode_length                      13.418011
returns                             -4.672838
return_std                           5.522421
average_reward                      -0.347706
round_time             0 days 00:05:58.020441
episodes_test                           682.0
episode_length_test                 14.659824
returns_test                        -4.327119
return_std_test                      6.380182
average_reward_test                 -0.294954
round_time_test        0 days 00:00:09.154487
round_time_total       0 days 00:05:58.021545
loss_total               306886289028808.6875
loss_critic              383607854357020.6875
loss_actor                      -37264930.728
memory_size                        28157.1185 

=== epoch 10/10 ==== round 19/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:57,  5.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:55<00:00,  5.63it/s]
episodes                                  748
episode_length                      13.359626
returns                             -4.384895
return_std                           5.495354
average_reward                      -0.328549
round_time             0 days 00:05:56.111459
episodes_test                           695.0
episode_length_test                 14.376978
returns_test                        -3.049535
return_std_test                      6.324717
average_reward_test                 -0.211964
round_time_test        0 days 00:00:09.130987
round_time_total       0 days 00:05:56.112668
loss_total                293252975877423.125
loss_critic              366566213318344.6875
loss_actor                      -36209284.262
memory_size                         28382.005 

=== epoch 10/10 ==== round 20/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:24,  5.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:56<00:00,  5.61it/s]
episodes                                  734
episode_length                      13.595368
returns                             -4.092522
return_std                           5.771839
average_reward                      -0.300765
round_time             0 days 00:05:57.219959
episodes_test                           728.0
episode_length_test                 13.733516
returns_test                        -5.554472
return_std_test                      5.368337
average_reward_test                 -0.404243
round_time_test        0 days 00:00:09.153301
round_time_total       0 days 00:05:57.221244
loss_total              279487414341206.03125
loss_critic                 349359261496639.5
loss_actor                      -35230254.791
memory_size                        28632.1965 

=== epoch 10/10 ==== round 21/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:22,  5.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:59<00:00,  5.56it/s]
episodes                                  710
episode_length                      14.070423
returns                             -4.302453
return_std                           6.009507
average_reward                      -0.305384
round_time             0 days 00:06:00.364679
episodes_test                           624.0
episode_length_test                 16.011218
returns_test                        -3.799209
return_std_test                      7.942431
average_reward_test                 -0.237523
round_time_test        0 days 00:00:09.137977
round_time_total       0 days 00:06:00.365910
loss_total               268349403041890.3125
loss_critic                335436747930337.25
loss_actor                      -34138780.144
memory_size                         28845.132 

=== epoch 10/10 ==== round 22/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:48,  4.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:57<00:00,  5.59it/s]
episodes                                  699
episode_length                      14.283262
returns                             -4.420793
return_std                           6.070583
average_reward                      -0.309028
round_time             0 days 00:05:58.179609
episodes_test                           680.0
episode_length_test                 14.683824
returns_test                         -5.64271
return_std_test                      6.314863
average_reward_test                 -0.383917
round_time_test        0 days 00:00:09.014780
round_time_total       0 days 00:05:58.180735
loss_total              264424581542969.34375
loss_critic                 330530721435222.0
loss_actor                      -33038579.759
memory_size                        29070.5715 

=== epoch 10/10 ==== round 23/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:40,  5.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  700
episode_length                      14.231429
returns                             -4.708037
return_std                           5.691109
average_reward                      -0.329764
round_time             0 days 00:05:39.730133
episodes_test                           747.0
episode_length_test                 13.378849
returns_test                         -4.03809
return_std_test                      5.199619
average_reward_test                  -0.30172
round_time_test        0 days 00:00:08.728996
round_time_total       0 days 00:05:39.731250
loss_total              250268691747307.53125
loss_critic              312835859432865.8125
loss_actor                      -32266729.292
memory_size                        29364.1525 

=== epoch 10/10 ==== round 24/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:12,  5.34it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  710
episode_length                      14.069014
returns                             -5.160586
return_std                           5.637994
average_reward                      -0.367671
round_time             0 days 00:05:38.638343
episodes_test                           736.0
episode_length_test                 13.578804
returns_test                        -5.920588
return_std_test                      5.245771
average_reward_test                 -0.435358
round_time_test        0 days 00:00:08.710161
round_time_total       0 days 00:05:38.639429
loss_total               245023176629682.1875
loss_critic               306278965608185.875
loss_actor                      -31522263.704
memory_size                         29590.375 

=== epoch 10/10 ==== round 25/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:33,  5.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  738
episode_length                       13.53794
returns                             -5.746387
return_std                           5.255264
average_reward                      -0.423595
round_time             0 days 00:05:38.651379
episodes_test                           789.0
episode_length_test                 12.664132
returns_test                        -6.753182
return_std_test                      4.566691
average_reward_test                 -0.532415
round_time_test        0 days 00:00:08.761080
round_time_total       0 days 00:05:38.652519
loss_total              235998205014179.84375
loss_critic               294997751756750.875
loss_actor                      -30986491.855
memory_size                         29682.812 

=== epoch 10/10 ==== round 26/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<05:59,  5.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.90it/s]
episodes                                  780
episode_length                      12.812821
returns                              -6.04801
return_std                           4.867102
average_reward                      -0.472423
round_time             0 days 00:05:39.417470
episodes_test                           852.0
episode_length_test                 11.737089
returns_test                        -7.553259
return_std_test                      3.768146
average_reward_test                 -0.643538
round_time_test        0 days 00:00:08.757527
round_time_total       0 days 00:05:39.418578
loss_total               235515335745208.3125
loss_critic              294394165020590.0625
loss_actor                      -30593309.617
memory_size                        29734.6415 

=== epoch 10/10 ==== round 27/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:38,  5.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.93it/s]
episodes                                  800
episode_length                        12.4875
returns                              -6.87283
return_std                           4.469999
average_reward                      -0.550301
round_time             0 days 00:05:37.851735
episodes_test                           872.0
episode_length_test                  11.46445
returns_test                        -7.636375
return_std_test                        3.9596
average_reward_test                 -0.665703
round_time_test        0 days 00:00:08.794766
round_time_total       0 days 00:05:37.852847
loss_total                 234160883536756.75
loss_critic              292701099549261.8125
loss_actor                      -30376884.162
memory_size                         29784.264 

=== epoch 10/10 ==== round 28/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  821
episode_length                       12.17905
returns                              -7.61343
return_std                            4.43563
average_reward                       -0.62516
round_time             0 days 00:05:38.785820
episodes_test                           807.0
episode_length_test                 12.390335
returns_test                        -9.181626
return_std_test                      4.134485
average_reward_test                 -0.740855
round_time_test        0 days 00:00:08.739098
round_time_total       0 days 00:05:38.786880
loss_total               233248219847458.8125
loss_critic              291560270034108.4375
loss_actor                      -30314439.576
memory_size                        29849.9895 

=== epoch 10/10 ==== round 29/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:47,  4.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.93it/s]
episodes                                  830
episode_length                       12.03253
returns                             -8.246062
return_std                           4.210114
average_reward                      -0.686087
round_time             0 days 00:05:37.836738
episodes_test                           816.0
episode_length_test                 12.254902
returns_test                        -9.421329
return_std_test                      3.698088
average_reward_test                  -0.76878
round_time_test        0 days 00:00:08.768911
round_time_total       0 days 00:05:37.837852
loss_total                  234989095175061.5
loss_critic              293736363906301.9375
loss_actor                      -30701696.303
memory_size                        29895.1855 

=== epoch 10/10 ==== round 30/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:44,  5.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  829
episode_length                      12.047045
returns                             -8.691581
return_std                           4.219221
average_reward                       -0.72153
round_time             0 days 00:05:39.095300
episodes_test                           826.0
episode_length_test                 12.101695
returns_test                         -8.99083
return_std_test                      3.901452
average_reward_test                 -0.742527
round_time_test        0 days 00:00:08.781802
round_time_total       0 days 00:05:39.096421
loss_total                237605464697733.125
loss_critic              297006826099048.4375
loss_actor                       -30907616.88
memory_size                        29937.8935 

=== epoch 10/10 ==== round 31/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:54,  5.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.93it/s]
episodes                                  820
episode_length                       12.17439
returns                             -9.257126
return_std                           4.188585
average_reward                      -0.758924
round_time             0 days 00:05:37.975301
episodes_test                           855.0
episode_length_test                  11.68538
returns_test                        -9.542328
return_std_test                      3.822355
average_reward_test                 -0.816215
round_time_test        0 days 00:00:08.918963
round_time_total       0 days 00:05:37.976416
loss_total              241763066046316.53125
loss_critic                302203827619102.75
loss_actor                       -31134438.05
memory_size                         29960.483 

=== epoch 10/10 ==== round 32/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:19,  5.25it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  824
episode_length                      12.112864
returns                              -9.65698
return_std                           4.130023
average_reward                      -0.798063
round_time             0 days 00:05:38.681972
episodes_test                           825.0
episode_length_test                     12.12
returns_test                       -10.164668
return_std_test                      3.894637
average_reward_test                 -0.838483
round_time_test        0 days 00:00:08.777584
round_time_total       0 days 00:05:38.683051
loss_total               249585111345922.0625
loss_critic              311981384086323.1875
loss_actor                       -31858948.72
memory_size                         29974.506 

=== epoch 10/10 ==== round 33/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<05:58,  5.56it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  820
episode_length                      12.179268
returns                            -10.049497
return_std                           3.963727
average_reward                      -0.825173
round_time             0 days 00:05:39.163689
episodes_test                           821.0
episode_length_test                 12.176614
returns_test                       -10.413394
return_std_test                      3.650088
average_reward_test                 -0.854862
round_time_test        0 days 00:00:08.747279
round_time_total       0 days 00:05:39.164766
loss_total              263564417663238.15625
loss_critic               329455516604432.375
loss_actor                      -32907206.721
memory_size                        29981.5215 

=== epoch 10/10 ==== round 34/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:44,  5.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  825
episode_length                      12.111515
returns                            -10.353633
return_std                           3.989304
average_reward                      -0.855213
round_time             0 days 00:05:38.821509
episodes_test                           785.0
episode_length_test                 12.723567
returns_test                       -11.147137
return_std_test                      3.614896
average_reward_test                 -0.875749
round_time_test        0 days 00:00:08.709362
round_time_total       0 days 00:05:38.822578
loss_total              279759860076118.03125
loss_critic              349699819343708.1875
loss_actor                      -34133883.573
memory_size                         29995.143 

=== epoch 10/10 ==== round 35/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:36,  5.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  826
episode_length                      12.089588
returns                            -10.748595
return_std                           3.737872
average_reward                      -0.888843
round_time             0 days 00:05:39.712280
episodes_test                           855.0
episode_length_test                 11.684211
returns_test                       -10.887921
return_std_test                      3.554639
average_reward_test                 -0.931993
round_time_test        0 days 00:00:08.848132
round_time_total       0 days 00:05:39.713349
loss_total                  290322150139625.5
loss_critic              362902680965742.5625
loss_actor                      -35133939.618
memory_size                        29999.7405 

=== epoch 10/10 ==== round 36/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:51,  5.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.92it/s]
episodes                                  811
episode_length                      12.303329
returns                            -11.202845
return_std                           3.742517
average_reward                      -0.910442
round_time             0 days 00:05:38.076392
episodes_test                           782.0
episode_length_test                 12.786445
returns_test                       -11.900228
return_std_test                       3.53318
average_reward_test                 -0.930497
round_time_test        0 days 00:00:08.660395
round_time_total       0 days 00:05:38.077466
loss_total               316164255789350.9375
loss_critic              395205312904167.4375
loss_actor                      -36019912.643
memory_size                        30006.8545 

=== epoch 10/10 ==== round 37/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:41,  5.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  800
episode_length                       12.48875
returns                            -11.386743
return_std                           3.591049
average_reward                      -0.911746
round_time             0 days 00:05:39.745685
episodes_test                           776.0
episode_length_test                    12.875
returns_test                       -12.329193
return_std_test                       3.66788
average_reward_test                 -0.957448
round_time_test        0 days 00:00:08.710515
round_time_total       0 days 00:05:39.746779
loss_total                323199681041530.875
loss_critic                403999593936715.75
loss_actor                      -37125700.065
memory_size                         30017.694 

=== epoch 10/10 ==== round 38/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:43,  5.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.93it/s]
episodes                                  800
episode_length                       12.49625
returns                            -11.611511
return_std                            3.56202
average_reward                      -0.928834
round_time             0 days 00:05:37.942877
episodes_test                           776.0
episode_length_test                 12.876289
returns_test                       -12.456688
return_std_test                      3.508062
average_reward_test                 -0.966152
round_time_test        0 days 00:00:08.706155
round_time_total       0 days 00:05:37.943946
loss_total                  333133185783169.0
loss_critic               416416474849083.375
loss_actor                      -37930549.944
memory_size                         30029.993 

=== epoch 10/10 ==== round 39/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:40,  5.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.92it/s]
episodes                                  786
episode_length                      12.708651
returns                            -11.897804
return_std                           3.574247
average_reward                      -0.935782
round_time             0 days 00:05:38.568294
episodes_test                           778.0
episode_length_test                 12.849614
returns_test                       -12.509693
return_std_test                      3.886964
average_reward_test                 -0.973088
round_time_test        0 days 00:00:08.820112
round_time_total       0 days 00:05:38.569418
loss_total                 337318427620278.25
loss_critic                 421648027208384.5
loss_actor                      -38354615.073
memory_size                         30045.442 

=== epoch 10/10 ==== round 40/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:03,  5.47it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.92it/s]
episodes                                  781
episode_length                      12.793854
returns                            -12.062789
return_std                           3.626092
average_reward                      -0.942811
round_time             0 days 00:05:38.304996
episodes_test                           749.0
episode_length_test                 13.331108
returns_test                       -12.301831
return_std_test                      3.652442
average_reward_test                 -0.922235
round_time_test        0 days 00:00:09.102642
round_time_total       0 days 00:05:38.306081
loss_total               342684110902263.8125
loss_critic              428355131067596.8125
loss_actor                      -38872838.787
memory_size                        30050.4925 

=== epoch 10/10 ==== round 41/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<05:42,  5.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.92it/s]
episodes                                  779
episode_length                      12.815148
returns                             -12.06628
return_std                           3.658814
average_reward                      -0.942172
round_time             0 days 00:05:38.200986
episodes_test                           791.0
episode_length_test                 12.638432
returns_test                       -11.203431
return_std_test                       3.75056
average_reward_test                 -0.886043
round_time_test        0 days 00:00:08.767277
round_time_total       0 days 00:05:38.202071
loss_total               354011462261276.6875
loss_critic              442514320299917.3125
loss_actor                      -39278912.067
memory_size                          30058.73 

=== epoch 10/10 ==== round 42/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:06,  5.44it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.90it/s]
episodes                                  773
episode_length                      12.914618
returns                            -12.301574
return_std                           3.731053
average_reward                      -0.952037
round_time             0 days 00:05:39.236676
episodes_test                           750.0
episode_length_test                 13.321333
returns_test                       -12.231569
return_std_test                      3.883041
average_reward_test                 -0.917809
round_time_test        0 days 00:00:08.740972
round_time_total       0 days 00:05:39.237763
loss_total               355380409803997.1875
loss_critic                 444225505005142.0
loss_actor                      -39763815.238
memory_size                         30075.178 

=== epoch 10/10 ==== round 43/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:54,  5.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.92it/s]
episodes                                  761
episode_length                      13.107753
returns                            -12.328007
return_std                           3.746679
average_reward                      -0.939474
round_time             0 days 00:05:38.575964
episodes_test                           776.0
episode_length_test                 12.880155
returns_test                       -11.467325
return_std_test                      3.851837
average_reward_test                  -0.88959
round_time_test        0 days 00:00:08.700165
round_time_total       0 days 00:05:38.577070
loss_total                 360472320613023.75
loss_critic              450590393814220.8125
loss_actor                      -40343582.556
memory_size                         30098.117 

=== epoch 10/10 ==== round 44/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<05:43,  5.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.91it/s]
episodes                                  758
episode_length                      13.184697
returns                            -12.208121
return_std                           3.728154
average_reward                      -0.926278
round_time             0 days 00:05:38.695443
episodes_test                           718.0
episode_length_test                 13.917827
returns_test                       -11.945878
return_std_test                      3.474262
average_reward_test                 -0.858385
round_time_test        0 days 00:00:08.613085
round_time_total       0 days 00:05:38.696517
loss_total                355448425543630.875
loss_critic                 444310525333995.5
loss_actor                      -40719111.226
memory_size                        30112.5925 

=== epoch 10/10 ==== round 45/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:18,  5.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:38<00:00,  5.92it/s]
episodes                                  747
episode_length                      13.365462
returns                            -12.295017
return_std                           3.743874
average_reward                      -0.920338
round_time             0 days 00:05:38.595231
episodes_test                           739.0
episode_length_test                 13.516915
returns_test                        -12.27425
return_std_test                      3.744672
average_reward_test                 -0.907405
round_time_test        0 days 00:00:08.655271
round_time_total       0 days 00:05:38.596326
loss_total                 363189903850209.25
loss_critic               453987373055737.875
loss_actor                      -41186384.624
memory_size                         30140.113 

=== epoch 10/10 ==== round 46/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  745
episode_length                      13.414765
returns                            -12.331162
return_std                           3.747862
average_reward                      -0.919074
round_time             0 days 00:05:39.596256
episodes_test                           753.0
episode_length_test                 13.270916
returns_test                       -12.412868
return_std_test                      3.599601
average_reward_test                 -0.935015
round_time_test        0 days 00:00:08.796615
round_time_total       0 days 00:05:39.597311
loss_total                  371219009917419.5
loss_critic                 464023755492950.0
loss_actor                       -41324548.58
memory_size                        30158.7075 

=== epoch 10/10 ==== round 47/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:30,  5.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:40<00:00,  5.87it/s]
episodes                                  746
episode_length                       13.39008
returns                            -12.316806
return_std                           3.814004
average_reward                      -0.919889
round_time             0 days 00:05:41.129774
episodes_test                           758.0
episode_length_test                   13.1781
returns_test                       -12.442698
return_std_test                      3.834676
average_reward_test                 -0.944153
round_time_test        0 days 00:00:08.822444
round_time_total       0 days 00:05:41.130854
loss_total               372982529369571.3125
loss_critic              466228154520829.9375
loss_actor                       -41104844.01
memory_size                         30173.803 

=== epoch 10/10 ==== round 48/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:23,  5.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  754
episode_length                      13.241379
returns                             -12.31629
return_std                           3.900045
average_reward                      -0.929377
round_time             0 days 00:05:39.807379
episodes_test                           738.0
episode_length_test                 13.550136
returns_test                       -12.409229
return_std_test                      3.934396
average_reward_test                 -0.915801
round_time_test        0 days 00:00:08.828969
round_time_total       0 days 00:05:39.808486
loss_total               364456336798253.0625
loss_critic              455570413762641.9375
loss_actor                      -40751185.408
memory_size                        30189.0525 

=== epoch 10/10 ==== round 49/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 8/2000 [00:01<06:06,  5.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:39<00:00,  5.90it/s]
episodes                                  750
episode_length                      13.302667
returns                            -12.447523
return_std                           3.970248
average_reward                      -0.935226
round_time             0 days 00:05:39.693542
episodes_test                           764.0
episode_length_test                 13.074607
returns_test                       -12.309718
return_std_test                      3.655062
average_reward_test                 -0.940615
round_time_test        0 days 00:00:08.667317
round_time_total       0 days 00:05:39.694619
loss_total                 361698438548553.75
loss_critic              452123041451737.0625
loss_actor                      -40326733.234
memory_size                         30207.802 

=== epoch 10/10 ==== round 50/50 =====================================
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
  0%|          | 9/2000 [00:01<06:13,  5.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Walker2d-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [05:37<00:00,  5.93it/s]
<MM1_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<Walker2dEnv<Walker2d-v4>>>>>>>>>>
episodes                                  750
episode_length                      13.325333
returns                            -12.557783
return_std                           4.002516
average_reward                      -0.942523
round_time             0 days 00:05:37.549188
episodes_test                           750.0
episode_length_test                 13.333333
returns_test                         -12.2489
return_std_test                       3.93316
average_reward_test                 -0.918668
round_time_test        0 days 00:00:08.726822
round_time_total       0 days 00:05:37.550287
loss_total               350229231482961.9375
loss_critic              437786532649107.4375
loss_actor                      -39954317.371
memory_size                        30238.3905 


