/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: WARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Ant-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 4
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 902.41it/s]
episodes                                   13
episode_length                     146.384615
returns                            -71.264226
return_std                         139.297695
average_reward                      -0.484962
round_time             0 days 00:00:02.294149
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       951.819718
return_std_test                     14.015781
average_reward_test                   0.95182
round_time_test        0 days 00:00:11.340080
round_time_total       0 days 00:00:13.175016 

=== epoch 1/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 903.38it/s]
episodes                                   17
episode_length                          184.0
returns                            -98.861361
return_std                         170.717509
average_reward                      -0.530473
round_time             0 days 00:00:02.737546
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       951.266965
return_std_test                     23.228957
average_reward_test                  0.951267
round_time_test        0 days 00:00:11.188362
round_time_total       0 days 00:00:13.052454 

=== epoch 1/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 888.68it/s]
episodes                                   30
episode_length                          191.7
returns                           -103.293624
return_std                         178.594587
average_reward                      -0.535824
round_time             0 days 00:00:02.811923
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       953.005874
return_std_test                      9.609723
average_reward_test                  0.953006
round_time_test        0 days 00:00:11.236199
round_time_total       0 days 00:00:13.097301 

=== epoch 1/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 891.14it/s]
episodes                                   40
episode_length                          182.5
returns                           -100.743399
return_std                         173.970167
average_reward                      -0.549732
round_time             0 days 00:00:02.853560
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       945.223092
return_std_test                       13.3711
average_reward_test                  0.945223
round_time_test        0 days 00:00:11.369996
round_time_total       0 days 00:00:13.222604 

=== epoch 1/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [00:02<00:00, 876.69it/s]
episodes                                   54
episode_length                     183.574074
returns                           -100.328204
return_std                         176.941786
average_reward                      -0.546488
round_time             0 days 00:00:02.768515
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       950.459566
return_std_test                     15.031122
average_reward_test                   0.95046
round_time_test        0 days 00:00:11.640754
round_time_total       0 days 00:00:13.442622 

=== epoch 1/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
starting training
episodes                                   43
episode_length                     190.906977
returns                           -105.882298
return_std                         178.405116
average_reward                      -0.561963
round_time             0 days 00:16:29.738078
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       953.137793
return_std_test                     12.064425
average_reward_test                  0.953138
round_time_test        0 days 00:00:11.600679
round_time_total       0 days 00:16:29.739257
loss_total                         888.453799
loss_critic                        1140.85796
loss_actor                        -121.162926
memory_size                          9288.288 

=== epoch 1/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:37<00:00,  2.01it/s]
episodes                                   59
episode_length                     154.372881
returns                            -88.462089
return_std                         162.055002
average_reward                      -0.585665
round_time             0 days 00:16:37.659759
episodes_test                            36.0
episode_length_test                    251.25
returns_test                        61.178494
return_std_test                      77.22485
average_reward_test                  0.240751
round_time_test        0 days 00:00:11.298282
round_time_total       0 days 00:16:37.660875
loss_total                         496.921336
loss_critic                        677.912396
loss_actor                        -227.042951
memory_size                        10858.3945 

=== epoch 1/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   62
episode_length                     148.822581
returns                            -96.166147
return_std                         182.647896
average_reward                       -0.64202
round_time             0 days 00:16:39.293354
episodes_test                            16.0
episode_length_test                  577.6875
returns_test                        90.612767
return_std_test                      90.65716
average_reward_test                  0.154055
round_time_test        0 days 00:00:11.276887
round_time_total       0 days 00:16:39.294457
loss_total                         568.529552
loss_critic                         785.54323
loss_actor                        -299.525217
memory_size                         12395.973 

=== epoch 1/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:42<00:00,  2.00it/s]
episodes                                   68
episode_length                     128.073529
returns                             -84.78127
return_std                         165.956443
average_reward                      -0.679831
round_time             0 days 00:16:42.669983
episodes_test                            26.0
episode_length_test                355.961538
returns_test                        88.925789
return_std_test                    116.594946
average_reward_test                  0.251529
round_time_test        0 days 00:00:11.540391
round_time_total       0 days 00:16:42.671437
loss_total                          810.49287
loss_critic                       1094.467717
loss_actor                        -325.406593
memory_size                        13856.8205 

=== epoch 1/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:43<00:00,  1.99it/s]
episodes                                   79
episode_length                     114.898734
returns                            -93.288255
return_std                         192.725588
average_reward                      -0.783356
round_time             0 days 00:16:43.669291
episodes_test                            27.0
episode_length_test                337.518519
returns_test                       -87.136657
return_std_test                    116.583686
average_reward_test                 -0.270647
round_time_test        0 days 00:00:11.274078
round_time_total       0 days 00:16:43.670391
loss_total                        8728.498575
loss_critic                      11028.615576
loss_actor                        -471.970177
memory_size                        15408.9565 

=== epoch 1/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:44<00:00,  1.99it/s]
episodes                                   93
episode_length                     101.860215
returns                           -100.908917
return_std                         248.552002
average_reward                      -1.029753
round_time             0 days 00:16:44.915047
episodes_test                            36.0
episode_length_test                250.222222
returns_test                      -332.854764
return_std_test                    492.470627
average_reward_test                 -1.326509
round_time_test        0 days 00:00:11.344933
round_time_total       0 days 00:16:44.916596
loss_total                     2543256.052192
loss_critic                    3180203.481298
loss_actor                       -4533.892798
memory_size                        16868.0645 

=== epoch 1/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:46<00:00,  1.99it/s]
episodes                                   93
episode_length                      95.387097
returns                           -118.573438
return_std                         315.967112
average_reward                      -1.327615
round_time             0 days 00:16:47.528792
episodes_test                           186.0
episode_length_test                 53.704301
returns_test                      -104.935564
return_std_test                    293.749476
average_reward_test                 -1.952694
round_time_test        0 days 00:00:11.556219
round_time_total       0 days 00:16:47.529909
loss_total                     28708336.17625
loss_critic                    35891419.98975
loss_actor                      -24001.549312
memory_size                        18333.2325 

=== epoch 1/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                   97
episode_length                      94.402062
returns                           -147.289667
return_std                         386.274146
average_reward                       -1.61502
round_time             0 days 00:16:51.601907
episodes_test                           120.0
episode_length_test                 83.183333
returns_test                       -180.56875
return_std_test                    479.118101
average_reward_test                 -2.169614
round_time_test        0 days 00:00:11.351159
round_time_total       0 days 00:16:51.603046
loss_total                       74403511.025
loss_critic                      93017738.885
loss_actor                      -53407.006494
memory_size                        19773.8735 

=== epoch 1/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:56<00:00,  1.97it/s]
episodes                                  129
episode_length                      76.372093
returns                           -144.360481
return_std                         376.406332
average_reward                      -1.895925
round_time             0 days 00:16:57.069146
episodes_test                           155.0
episode_length_test                 63.058065
returns_test                      -139.603626
return_std_test                    402.942546
average_reward_test                 -2.211482
round_time_test        0 days 00:00:11.476326
round_time_total       0 days 00:16:57.070234
loss_total                       98378904.642
loss_critic                     122993202.022
loss_actor                      -78293.385008
memory_size                         21008.977 

=== epoch 1/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:02<00:00,  1.96it/s]
episodes                                  117
episode_length                      78.615385
returns                           -167.715064
return_std                         438.136612
average_reward                      -2.160416
round_time             0 days 00:17:03.435842
episodes_test                            87.0
episode_length_test                114.367816
returns_test                      -265.917811
return_std_test                      595.8089
average_reward_test                 -2.324843
round_time_test        0 days 00:00:11.523901
round_time_total       0 days 00:17:03.437332
loss_total                      155642385.032
loss_critic                     194578525.924
loss_actor                      -102192.35552
memory_size                        22265.7245 

=== epoch 1/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:14<00:00,  1.93it/s]
episodes                                  118
episode_length                      74.059322
returns                           -167.516421
return_std                         435.952973
average_reward                      -2.266288
round_time             0 days 00:17:14.878098
episodes_test                           243.0
episode_length_test                 41.106996
returns_test                       -88.956198
return_std_test                    219.747378
average_reward_test                 -2.163427
round_time_test        0 days 00:00:11.847450
round_time_total       0 days 00:17:14.879186
loss_total                       187156851.02
loss_critic                     233975217.276
loss_actor                     -116628.608992
memory_size                         23871.563 

=== epoch 1/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                  143
episode_length                      69.153846
returns                           -154.939373
return_std                          402.26229
average_reward                      -2.238958
round_time             0 days 00:17:30.064348
episodes_test                           102.0
episode_length_test                 90.529412
returns_test                      -209.315984
return_std_test                    500.997834
average_reward_test                 -2.325532
round_time_test        0 days 00:00:11.204418
round_time_total       0 days 00:17:30.065454
loss_total                      184204157.164
loss_critic                     230285272.532
loss_actor                     -120319.821219
memory_size                        25121.1595 

=== epoch 1/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:30<00:00,  1.90it/s]
episodes                                  161
episode_length                      61.484472
returns                           -137.726283
return_std                         365.765576
average_reward                      -2.243653
round_time             0 days 00:17:30.846292
episodes_test                           174.0
episode_length_test                 57.344828
returns_test                      -130.133136
return_std_test                    363.937811
average_reward_test                 -2.268745
round_time_test        0 days 00:00:11.516975
round_time_total       0 days 00:17:30.847386
loss_total                      209249717.148
loss_critic                      261593581.06
loss_actor                     -125757.419582
memory_size                        25892.7245 

=== epoch 1/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:24<00:00,  1.92it/s]
episodes                                  145
episode_length                      62.903448
returns                           -142.282743
return_std                         386.787459
average_reward                      -2.275179
round_time             0 days 00:17:24.850924
episodes_test                           187.0
episode_length_test                  52.55615
returns_test                      -118.360123
return_std_test                    314.004711
average_reward_test                 -2.249463
round_time_test        0 days 00:00:11.494060
round_time_total       0 days 00:17:24.852016
loss_total                      207097692.312
loss_critic                     258904294.016
loss_actor                     -128732.635137
memory_size                        26965.1855 

=== epoch 1/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:30<00:00,  1.90it/s]
episodes                                  164
episode_length                      59.621951
returns                           -135.260677
return_std                         365.152801
average_reward                      -2.267892
round_time             0 days 00:17:31.421463
episodes_test                           221.0
episode_length_test                 42.927602
returns_test                       -96.751958
return_std_test                    220.416573
average_reward_test                 -2.272588
round_time_test        0 days 00:00:11.688505
round_time_total       0 days 00:17:31.422568
loss_total                      208885112.192
loss_critic                     261138254.672
loss_actor                      -127476.28193
memory_size                         28157.321 

=== epoch 1/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.90it/s]
episodes                                  186
episode_length                           48.5
returns                           -108.576872
return_std                         286.330131
average_reward                      -2.262566
round_time             0 days 00:17:30.520490
episodes_test                           146.0
episode_length_test                  67.60274
returns_test                      -158.066158
return_std_test                    458.532375
average_reward_test                 -2.346133
round_time_test        0 days 00:00:11.518345
round_time_total       0 days 00:17:30.521715
loss_total                      207679846.748
loss_critic                     259630397.616
loss_actor                     -122373.962398
memory_size                          29190.94 

=== epoch 1/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:35<00:00,  1.89it/s]
episodes                                  158
episode_length                      62.689873
returns                           -145.149908
return_std                         404.784767
average_reward                      -2.312059
round_time             0 days 00:17:36.046172
episodes_test                           156.0
episode_length_test                  63.74359
returns_test                      -144.382404
return_std_test                     388.55699
average_reward_test                 -2.266082
round_time_test        0 days 00:00:11.600384
round_time_total       0 days 00:17:36.047283
loss_total                      192253168.316
loss_critic                     240344834.156
loss_actor                     -113512.060129
memory_size                        30635.7805 

=== epoch 1/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:28<00:00,  1.91it/s]
episodes                                  167
episode_length                      59.640719
returns                            -136.78374
return_std                         370.230674
average_reward                      -2.292757
round_time             0 days 00:17:29.272889
episodes_test                           112.0
episode_length_test                 89.258929
returns_test                      -209.370844
return_std_test                    529.468499
average_reward_test                 -2.345088
round_time_test        0 days 00:00:11.388757
round_time_total       0 days 00:17:29.273998
loss_total                       151617320.37
loss_critic                     189546163.472
loss_actor                      -98064.780402
memory_size                         31714.746 

=== epoch 1/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                  151
episode_length                      62.993377
returns                            -143.95985
return_std                         385.220738
average_reward                      -2.295004
round_time             0 days 00:17:32.814879
episodes_test                           160.0
episode_length_test                   62.4375
returns_test                       -145.01085
return_std_test                    396.254089
average_reward_test                 -2.322165
round_time_test        0 days 00:00:11.456439
round_time_total       0 days 00:17:32.816096
loss_total                      128796123.106
loss_critic                     161016576.048
loss_actor                      -85699.135691
memory_size                         32823.448 

=== epoch 1/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                  150
episode_length                      66.453333
returns                              -154.185
return_std                         394.876923
average_reward                      -2.317487
round_time             0 days 00:17:32.699374
episodes_test                           128.0
episode_length_test                 78.101562
returns_test                      -181.657776
return_std_test                    474.153026
average_reward_test                 -2.325272
round_time_test        0 days 00:00:11.366646
round_time_total       0 days 00:17:32.700487
loss_total                       98075499.626
loss_critic                     122612778.168
loss_actor                      -73622.976861
memory_size                         34443.226 

=== epoch 1/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:34<00:00,  1.90it/s]
episodes                                  166
episode_length                      57.584337
returns                           -130.375946
return_std                         328.742822
average_reward                      -2.272717
round_time             0 days 00:17:35.592662
episodes_test                           142.0
episode_length_test                 69.950704
returns_test                      -159.677944
return_std_test                    461.877453
average_reward_test                 -2.276659
round_time_test        0 days 00:00:11.438890
round_time_total       0 days 00:17:35.593924
loss_total                       66917075.474
loss_critic                      83661568.678
loss_actor                       -60902.91115
memory_size                         35169.718 

=== epoch 1/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                  156
episode_length                      58.262821
returns                           -133.726809
return_std                         342.895829
average_reward                      -2.318143
round_time             0 days 00:17:33.230277
episodes_test                           194.0
episode_length_test                  49.85567
returns_test                      -112.146017
return_std_test                    347.013998
average_reward_test                 -2.264702
round_time_test        0 days 00:00:11.499229
round_time_total       0 days 00:17:33.231432
loss_total                        53861394.23
loss_critic                       67339511.47
loss_actor                      -51079.098662
memory_size                         36341.449 

=== epoch 1/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:33<00:00,  1.90it/s]
episodes                                  138
episode_length                      72.115942
returns                           -169.182511
return_std                         459.208006
average_reward                      -2.341734
round_time             0 days 00:17:34.365960
episodes_test                           107.0
episode_length_test                 89.906542
returns_test                      -213.710931
return_std_test                    528.068758
average_reward_test                 -2.380526
round_time_test        0 days 00:00:11.631298
round_time_total       0 days 00:17:34.367072
loss_total                       46689239.977
loss_critic                       58372267.33
loss_actor                      -42873.397025
memory_size                        38023.5835 

=== epoch 1/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                  173
episode_length                      52.433526
returns                           -119.696646
return_std                         346.369235
average_reward                      -2.297774
round_time             0 days 00:17:30.161044
episodes_test                           178.0
episode_length_test                 56.157303
returns_test                      -124.076414
return_std_test                    323.578093
average_reward_test                 -2.208725
round_time_test        0 days 00:00:11.488471
round_time_total       0 days 00:17:30.162294
loss_total                       43249319.816
loss_critic                      54070776.503
loss_actor                      -36510.920992
memory_size                        38776.8245 

=== epoch 1/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:35<00:00,  1.89it/s]
episodes                                  179
episode_length                      52.497207
returns                           -118.491712
return_std                         351.202944
average_reward                      -2.264494
round_time             0 days 00:17:36.034210
episodes_test                           221.0
episode_length_test                 45.171946
returns_test                       -98.847664
return_std_test                    294.136182
average_reward_test                 -2.187729
round_time_test        0 days 00:00:11.629855
round_time_total       0 days 00:17:36.035389
loss_total                       43207008.829
loss_critic                      54016789.504
loss_actor                      -32117.708871
memory_size                        39803.6405 

=== epoch 1/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:34<00:00,  1.90it/s]
episodes                                  159
episode_length                      56.899371
returns                           -129.262624
return_std                         369.640566
average_reward                      -2.296315
round_time             0 days 00:17:35.316331
episodes_test                           176.0
episode_length_test                 51.272727
returns_test                      -113.167794
return_std_test                    283.074594
average_reward_test                 -2.229107
round_time_test        0 days 00:00:11.574717
round_time_total       0 days 00:17:35.317413
loss_total                       49836849.312
loss_critic                      62303274.527
loss_actor                      -28855.804635
memory_size                         41076.445 

=== epoch 1/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:39<00:00,  1.89it/s]
episodes                                  174
episode_length                       56.41954
returns                           -127.381484
return_std                         346.616676
average_reward                       -2.25708
round_time             0 days 00:17:39.714522
episodes_test                           148.0
episode_length_test                 65.668919
returns_test                      -141.920907
return_std_test                    387.787088
average_reward_test                 -2.161839
round_time_test        0 days 00:00:11.416776
round_time_total       0 days 00:17:39.715668
loss_total                       61326293.578
loss_critic                      76664831.285
loss_actor                      -27862.797043
memory_size                         42108.325 

=== epoch 1/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:39<00:00,  1.89it/s]
episodes                                  177
episode_length                      55.621469
returns                           -125.086844
return_std                         329.993196
average_reward                      -2.248837
round_time             0 days 00:17:40.388729
episodes_test                           175.0
episode_length_test                 57.074286
returns_test                      -122.781153
return_std_test                    316.902901
average_reward_test                 -2.150982
round_time_test        0 days 00:00:11.535098
round_time_total       0 days 00:17:40.389833
loss_total                        70118160.19
loss_critic                      87654550.646
loss_actor                      -27407.370272
memory_size                        43591.3715 

=== epoch 1/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                  146
episode_length                       65.39726
returns                            -148.65414
return_std                          376.92858
average_reward                      -2.272023
round_time             0 days 00:17:40.704516
episodes_test                           131.0
episode_length_test                 76.068702
returns_test                      -172.369086
return_std_test                     429.21979
average_reward_test                 -2.264664
round_time_test        0 days 00:00:11.476811
round_time_total       0 days 00:17:40.705616
loss_total                       80065065.865
loss_critic                      100088243.81
loss_actor                      -27653.217173
memory_size                         44727.259 

=== epoch 1/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                  149
episode_length                      64.127517
returns                           -144.081443
return_std                         351.896004
average_reward                      -2.251699
round_time             0 days 00:17:37.479831
episodes_test                           109.0
episode_length_test                 89.045872
returns_test                      -197.547979
return_std_test                    525.718974
average_reward_test                 -2.219926
round_time_test        0 days 00:00:11.417830
round_time_total       0 days 00:17:37.481317
loss_total                       86755659.884
loss_critic                     108451364.632
loss_actor                      -27166.218225
memory_size                        46142.6095 

=== epoch 1/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                  120
episode_length                      77.308333
returns                           -174.055257
return_std                         432.273571
average_reward                      -2.266324
round_time             0 days 00:17:39.367825
episodes_test                           126.0
episode_length_test                 75.825397
returns_test                      -168.756796
return_std_test                    449.406655
average_reward_test                 -2.241067
round_time_test        0 days 00:00:11.680426
round_time_total       0 days 00:17:39.369269
loss_total                        96371559.64
loss_critic                     120471105.734
loss_actor                      -26632.825805
memory_size                        47407.8985 

=== epoch 1/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                  141
episode_length                      65.297872
returns                           -142.509641
return_std                         367.988481
average_reward                      -2.196447
round_time             0 days 00:17:37.311999
episodes_test                           132.0
episode_length_test                      75.5
returns_test                      -166.397008
return_std_test                    436.410364
average_reward_test                 -2.203884
round_time_test        0 days 00:00:11.378514
round_time_total       0 days 00:17:37.313133
loss_total                       87657266.596
loss_critic                      109577660.37
loss_actor                      -24316.128074
memory_size                        48816.7645 

=== epoch 1/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:35<00:00,  1.90it/s]
episodes                                  146
episode_length                      62.130137
returns                           -133.825658
return_std                         355.878304
average_reward                      -2.175928
round_time             0 days 00:17:35.623627
episodes_test                           108.0
episode_length_test                 92.166667
returns_test                      -211.230571
return_std_test                    522.475445
average_reward_test                 -2.290694
round_time_test        0 days 00:00:11.428425
round_time_total       0 days 00:17:35.624755
loss_total                        84194209.64
loss_critic                     105248367.542
loss_actor                      -22429.173179
memory_size                         49511.316 

=== epoch 1/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                  177
episode_length                      52.943503
returns                           -112.503057
return_std                         299.207593
average_reward                      -2.149032
round_time             0 days 00:17:38.835304
episodes_test                           153.0
episode_length_test                 64.130719
returns_test                      -143.546052
return_std_test                    374.072165
average_reward_test                 -2.241704
round_time_test        0 days 00:00:11.372428
round_time_total       0 days 00:17:38.836645
loss_total                       83035560.313
loss_critic                     103799644.988
loss_actor                      -20785.374157
memory_size                        50671.2695 

=== epoch 1/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                  159
episode_length                      59.867925
returns                           -128.819719
return_std                         358.818284
average_reward                      -2.165779
round_time             0 days 00:17:38.604205
episodes_test                            83.0
episode_length_test                 113.46988
returns_test                      -261.422541
return_std_test                    596.677193
average_reward_test                 -2.309884
round_time_test        0 days 00:00:11.633125
round_time_total       0 days 00:17:38.605320
loss_total                       79864508.577
loss_critic                      99835369.408
loss_actor                      -18940.926207
memory_size                         51472.586 

=== epoch 1/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  162
episode_length                      59.679012
returns                           -129.196993
return_std                         360.541504
average_reward                      -2.164075
round_time             0 days 00:17:38.095947
episodes_test                           114.0
episode_length_test                 86.877193
returns_test                      -202.864202
return_std_test                    487.569451
average_reward_test                 -2.336722
round_time_test        0 days 00:00:11.623019
round_time_total       0 days 00:17:38.097041
loss_total                       72111440.376
loss_critic                      90143131.208
loss_actor                      -15328.631711
memory_size                        53212.8435 

=== epoch 1/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  153
episode_length                      60.398693
returns                           -133.179737
return_std                         370.484987
average_reward                      -2.213296
round_time             0 days 00:17:37.610118
episodes_test                           134.0
episode_length_test                 74.597015
returns_test                      -169.552584
return_std_test                    443.658095
average_reward_test                 -2.271899
round_time_test        0 days 00:00:11.403738
round_time_total       0 days 00:17:37.611284
loss_total                       65846404.815
loss_critic                      82311130.282
loss_actor                      -12503.280278
memory_size                         54407.869 

=== epoch 1/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:39<00:00,  1.89it/s]
episodes                                  172
episode_length                      55.732558
returns                           -123.011159
return_std                         348.986757
average_reward                      -2.213084
round_time             0 days 00:17:40.005964
episodes_test                           136.0
episode_length_test                 69.948529
returns_test                      -153.584136
return_std_test                    434.245011
average_reward_test                 -2.202636
round_time_test        0 days 00:00:11.646064
round_time_total       0 days 00:17:40.007231
loss_total                       67228564.643
loss_critic                      84038326.008
loss_actor                      -10486.523002
memory_size                         55521.905 

=== epoch 1/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                  166
episode_length                      59.993976
returns                           -133.391192
return_std                         360.537377
average_reward                       -2.22368
round_time             0 days 00:17:39.403665
episodes_test                            72.0
episode_length_test                132.916667
returns_test                      -312.898055
return_std_test                    683.155966
average_reward_test                 -2.355814
round_time_test        0 days 00:00:11.372228
round_time_total       0 days 00:17:39.405219
loss_total                        69703210.16
loss_critic                      87131371.578
loss_actor                       -9441.192146
memory_size                        56232.2915 

=== epoch 1/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:43<00:00,  1.88it/s]
episodes                                  182
episode_length                      49.681319
returns                           -108.469634
return_std                         252.877724
average_reward                      -2.201682
round_time             0 days 00:17:44.080531
episodes_test                            87.0
episode_length_test                108.804598
returns_test                      -246.766496
return_std_test                    588.476461
average_reward_test                 -2.272081
round_time_test        0 days 00:00:11.601132
round_time_total       0 days 00:17:44.081668
loss_total                       80372965.411
loss_critic                      100468485.95
loss_actor                       -9123.378238
memory_size                        56908.9445 

=== epoch 1/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  210
episode_length                      46.957143
returns                           -102.166553
return_std                         237.157372
average_reward                      -2.176276
round_time             0 days 00:17:49.443989
episodes_test                           102.0
episode_length_test                 93.411765
returns_test                      -207.506422
return_std_test                    524.988628
average_reward_test                 -2.218125
round_time_test        0 days 00:00:11.574185
round_time_total       0 days 00:17:49.445382
loss_total                       84028173.263
loss_critic                     105037436.436
loss_actor                        -8886.26227
memory_size                         58188.395 

=== epoch 1/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  206
episode_length                      42.533981
returns                            -91.690422
return_std                         181.262318
average_reward                      -2.173499
round_time             0 days 00:17:45.155211
episodes_test                           105.0
episode_length_test                 95.180952
returns_test                      -213.968817
return_std_test                    558.615331
average_reward_test                 -2.246608
round_time_test        0 days 00:00:11.559541
round_time_total       0 days 00:17:45.156493
loss_total                       76403635.733
loss_critic                      95506635.248
loss_actor                       -8369.009453
memory_size                         58919.748 

=== epoch 1/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:46<00:00,  1.87it/s]
episodes                                  181
episode_length                      54.911602
returns                           -120.829553
return_std                         305.405764
average_reward                      -2.201188
round_time             0 days 00:17:47.492224
episodes_test                           124.0
episode_length_test                 80.475806
returns_test                       -179.56801
return_std_test                    501.103227
average_reward_test                 -2.229567
round_time_test        0 days 00:00:11.707086
round_time_total       0 days 00:17:47.493304
loss_total                       78576948.814
loss_critic                      98222844.878
loss_actor                       -6641.848339
memory_size                         60198.959 

=== epoch 1/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  170
episode_length                      57.947059
returns                           -127.719025
return_std                         337.108498
average_reward                       -2.20309
round_time             0 days 00:17:48.603854
episodes_test                           123.0
episode_length_test                 79.292683
returns_test                      -176.823167
return_std_test                    458.336083
average_reward_test                 -2.225772
round_time_test        0 days 00:00:11.533355
round_time_total       0 days 00:17:48.605015
loss_total                       72552644.835
loss_critic                      90691883.971
loss_actor                       -4317.745475
memory_size                        61603.9395 

=== epoch 1/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [17:56<00:00,  1.86it/s]
episodes                                  140
episode_length                          66.65
returns                           -148.803637
return_std                         404.902694
average_reward                      -2.239643
round_time             0 days 00:17:57.010648
episodes_test                           176.0
episode_length_test                 54.659091
returns_test                      -120.653575
return_std_test                    350.148338
average_reward_test                 -2.215577
round_time_test        0 days 00:00:11.553007
round_time_total       0 days 00:17:57.012171
loss_total                        71825917.63
loss_critic                      89782901.277
loss_actor                       -2023.395263
memory_size                         63012.571 


=== epoch 2/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [16:37<00:00,  2.00it/s]
episodes                                   29
episode_length                      65.758621
returns                           -143.065066
return_std                         413.039585
average_reward                      -2.181298
round_time             0 days 00:16:37.839613
episodes_test                           184.0
episode_length_test                 49.755435
returns_test                      -104.840985
return_std_test                    286.648359
average_reward_test                 -2.131246
round_time_test        0 days 00:00:11.729166
round_time_total       0 days 00:16:37.840735
loss_total                       83042597.972
loss_critic                     103803429.698
loss_actor                        -736.437347
memory_size                         64547.851 

=== epoch 2/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [16:44<00:00,  1.99it/s]
episodes                                   64
episode_length                        62.1875
returns                           -138.176763
return_std                         399.233473
average_reward                      -2.220693
round_time             0 days 00:16:45.001212
episodes_test                           155.0
episode_length_test                 64.470968
returns_test                      -143.070955
return_std_test                    402.505791
average_reward_test                 -2.218101
round_time_test        0 days 00:00:11.472480
round_time_total       0 days 00:16:45.002422
loss_total                       79166971.451
loss_critic                      98958475.584
loss_actor                         947.891489
memory_size                        65965.7465 

=== epoch 2/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [16:47<00:00,  1.98it/s]
episodes                                   94
episode_length                      63.734043
returns                           -141.892173
return_std                         401.623666
average_reward                       -2.22452
round_time             0 days 00:16:48.349348
episodes_test                           108.0
episode_length_test                 85.351852
returns_test                      -186.103196
return_std_test                    506.929599
average_reward_test                 -2.179994
round_time_test        0 days 00:00:11.887336
round_time_total       0 days 00:16:48.350592
loss_total                       80382373.581
loss_critic                     100477542.868
loss_actor                        1689.445101
memory_size                        66891.5755 

=== epoch 2/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [16:46<00:00,  1.99it/s]
episodes                                  123
episode_length                      57.390244
returns                           -126.886543
return_std                         353.599867
average_reward                      -2.227203
round_time             0 days 00:16:47.536348
episodes_test                           143.0
episode_length_test                 69.678322
returns_test                      -152.889276
return_std_test                    430.667846
average_reward_test                  -2.19175
round_time_test        0 days 00:00:11.679532
round_time_total       0 days 00:16:47.537451
loss_total                       86394675.074
loss_critic                     107992746.323
loss_actor                        2383.112807
memory_size                        68103.3125 

=== epoch 2/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [16:46<00:00,  1.99it/s]
episodes                                  153
episode_length                      65.124183
returns                           -145.812566
return_std                         409.502361
average_reward                      -2.239826
round_time             0 days 00:16:47.160484
episodes_test                           134.0
episode_length_test                 69.261194
returns_test                      -149.043154
return_std_test                    424.001813
average_reward_test                 -2.156232
round_time_test        0 days 00:00:11.601392
round_time_total       0 days 00:16:47.161577
loss_total                       95810596.835
loss_critic                     119762376.042
loss_actor                        3472.278483
memory_size                         69676.066 

=== epoch 2/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                  167
episode_length                      49.682635
returns                           -109.956103
return_std                         309.193592
average_reward                      -2.241258
round_time             0 days 00:16:51.641368
episodes_test                           103.0
episode_length_test                 96.300971
returns_test                      -226.486456
return_std_test                    554.203451
average_reward_test                 -2.350284
round_time_test        0 days 00:00:11.705897
round_time_total       0 days 00:16:51.642680
loss_total                       87860043.347
loss_critic                      109823723.66
loss_actor                        5314.331298
memory_size                         70453.977 

=== epoch 2/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                  161
episode_length                      59.720497
returns                            -132.12312
return_std                         370.124803
average_reward                      -2.220795
round_time             0 days 00:16:51.840878
episodes_test                           120.0
episode_length_test                 83.016667
returns_test                      -182.456304
return_std_test                    482.228955
average_reward_test                 -2.197731
round_time_test        0 days 00:00:11.343220
round_time_total       0 days 00:16:51.842173
loss_total                        94685654.55
loss_critic                     118355622.455
loss_actor                        5774.869918
memory_size                        71707.9755 

=== epoch 2/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                  145
episode_length                      68.841379
returns                           -154.825853
return_std                         436.070624
average_reward                       -2.24861
round_time             0 days 00:16:52.543144
episodes_test                           162.0
episode_length_test                 61.722222
returns_test                      -140.082934
return_std_test                    414.954441
average_reward_test                 -2.269422
round_time_test        0 days 00:00:11.465121
round_time_total       0 days 00:16:52.544258
loss_total                      103282124.392
loss_critic                      129101237.77
loss_actor                        5661.240278
memory_size                         73245.816 

=== epoch 2/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [16:58<00:00,  1.96it/s]
episodes                                  156
episode_length                      63.326923
returns                           -142.893308
return_std                         425.536058
average_reward                      -2.256138
round_time             0 days 00:16:59.224652
episodes_test                           177.0
episode_length_test                 52.338983
returns_test                      -115.263831
return_std_test                    313.615528
average_reward_test                 -2.221758
round_time_test        0 days 00:00:11.550636
round_time_total       0 days 00:16:59.225792
loss_total                      100900578.824
loss_critic                     126124401.304
loss_actor                        5280.387096
memory_size                        74740.2415 

=== epoch 2/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [17:08<00:00,  1.94it/s]
episodes                                  157
episode_length                      63.146497
returns                            -142.95101
return_std                         426.333687
average_reward                      -2.270226
round_time             0 days 00:17:09.283598
episodes_test                           139.0
episode_length_test                 67.546763
returns_test                      -159.420803
return_std_test                    442.996913
average_reward_test                 -2.368504
round_time_test        0 days 00:00:11.856552
round_time_total       0 days 00:17:09.284695
loss_total                      102851848.609
loss_critic                     128563537.908
loss_actor                        5082.248216
memory_size                        75692.9605 

=== epoch 2/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [17:16<00:00,  1.93it/s]
episodes                                  121
episode_length                      78.586777
returns                           -183.221739
return_std                         503.637752
average_reward                      -2.334059
round_time             0 days 00:17:17.077544
episodes_test                           126.0
episode_length_test                 78.746032
returns_test                      -186.939002
return_std_test                    489.876354
average_reward_test                 -2.377541
round_time_test        0 days 00:00:11.642398
round_time_total       0 days 00:17:17.078640
loss_total                      104228886.984
loss_critic                      130284436.64
loss_actor                        6677.849661
memory_size                         77237.289 

=== epoch 2/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [17:30<00:00,  1.90it/s]
episodes                                  113
episode_length                      77.840708
returns                           -184.279003
return_std                         515.099669
average_reward                      -2.374848
round_time             0 days 00:17:30.979774
episodes_test                           115.0
episode_length_test                  85.66087
returns_test                       -204.66679
return_std_test                    539.883388
average_reward_test                 -2.385877
round_time_test        0 days 00:00:11.673077
round_time_total       0 days 00:17:30.980898
loss_total                      121820635.108
loss_critic                     152273860.244
loss_actor                        7724.537778
memory_size                        78880.4665 

=== epoch 2/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [18:15<00:00,  1.83it/s]
episodes                                  139
episode_length                      70.208633
returns                           -166.536168
return_std                         468.231035
average_reward                      -2.372868
round_time             0 days 00:18:15.553280
episodes_test                           219.0
episode_length_test                 41.324201
returns_test                       -91.614898
return_std_test                    237.474444
average_reward_test                  -2.24201
round_time_test        0 days 00:00:11.778021
round_time_total       0 days 00:18:15.554391
loss_total                       99834476.675
loss_critic                      124790759.07
loss_actor                        9338.510639
memory_size                         80110.884 

=== epoch 2/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  115
episode_length                      80.886957
returns                           -194.129587
return_std                         512.093003
average_reward                        -2.4113
round_time             0 days 00:17:38.456303
episodes_test                           162.0
episode_length_test                  61.67284
returns_test                      -140.926913
return_std_test                    354.361464
average_reward_test                 -2.284378
round_time_test        0 days 00:00:11.624245
round_time_total       0 days 00:17:38.457413
loss_total                       65235440.562
loss_critic                      81541311.201
loss_actor                        11952.40844
memory_size                        81377.7755 

=== epoch 2/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  106
episode_length                      89.084906
returns                            -214.92676
return_std                         565.874759
average_reward                       -2.41534
round_time             0 days 00:17:38.007392
episodes_test                           177.0
episode_length_test                  56.40113
returns_test                      -130.470451
return_std_test                    385.678407
average_reward_test                  -2.31233
round_time_test        0 days 00:00:11.562301
round_time_total       0 days 00:17:38.008635
loss_total                       44292463.077
loss_critic                      55362191.135
loss_actor                       13546.943672
memory_size                         82879.645 

=== epoch 2/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [17:31<00:00,  1.90it/s]
episodes                                  167
episode_length                      55.532934
returns                           -129.554408
return_std                         377.862908
average_reward                      -2.345893
round_time             0 days 00:17:32.481138
episodes_test                           123.0
episode_length_test                 81.154472
returns_test                      -191.753543
return_std_test                    531.717823
average_reward_test                 -2.361741
round_time_test        0 days 00:00:11.451277
round_time_total       0 days 00:17:32.482500
loss_total                      35847825.5815
loss_critic                       44806286.03
loss_actor                        13980.47913
memory_size                         83780.127 

=== epoch 2/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [17:34<00:00,  1.90it/s]
episodes                                  193
episode_length                       48.65285
returns                           -113.739045
return_std                         310.205452
average_reward                      -2.347788
round_time             0 days 00:17:35.223088
episodes_test                           193.0
episode_length_test                 51.108808
returns_test                      -120.490433
return_std_test                    332.790517
average_reward_test                 -2.354086
round_time_test        0 days 00:00:11.560856
round_time_total       0 days 00:17:35.224181
loss_total                      29810999.0615
loss_critic                      37260252.401
loss_actor                       13982.875238
memory_size                        84295.6025 

=== epoch 2/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [17:33<00:00,  1.90it/s]
episodes                                  164
episode_length                      50.896341
returns                           -118.711659
return_std                         306.734785
average_reward                      -2.364076
round_time             0 days 00:17:34.305819
episodes_test                           220.0
episode_length_test                 45.368182
returns_test                      -101.229613
return_std_test                    246.578839
average_reward_test                 -2.230989
round_time_test        0 days 00:00:11.535634
round_time_total       0 days 00:17:34.306953
loss_total                       21332857.051
loss_critic                      26662449.032
loss_actor                        14487.22933
memory_size                         85566.697 

=== epoch 2/10 ===== round 19/50 =====================================
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                  175
episode_length                      55.354286
returns                           -129.924975
return_std                         351.099844
average_reward                      -2.352409
round_time             0 days 00:17:32.846786
episodes_test                            85.0
episode_length_test                106.658824
returns_test                      -255.839638
return_std_test                    628.655014
average_reward_test                  -2.40608
round_time_test        0 days 00:00:11.482794
round_time_total       0 days 00:17:32.847892
loss_total                       14777791.066
loss_critic                    18468562.02175
loss_actor                        14706.13061
memory_size                         87049.345 

=== epoch 2/10 ===== round 20/50 =====================================
100%|██████████| 2000/2000 [17:33<00:00,  1.90it/s]
episodes                                  178
episode_length                      56.016854
returns                           -132.558895
return_std                         348.535936
average_reward                      -2.365005
round_time             0 days 00:17:33.918307
episodes_test                           180.0
episode_length_test                 55.483333
returns_test                       -127.55274
return_std_test                    345.295936
average_reward_test                 -2.297325
round_time_test        0 days 00:00:11.410258
round_time_total       0 days 00:17:33.919688
loss_total                     13167984.09525
loss_critic                    16456281.15025
loss_actor                       14794.816971
memory_size                        88586.8745 

=== epoch 2/10 ===== round 21/50 =====================================
100%|██████████| 2000/2000 [17:43<00:00,  1.88it/s]
episodes                                  120
episode_length                      75.891667
returns                            -183.47808
return_std                         441.310534
average_reward                      -2.420057
round_time             0 days 00:17:44.399248
episodes_test                           149.0
episode_length_test                 66.644295
returns_test                      -162.706078
return_std_test                    458.314386
average_reward_test                 -2.443031
round_time_test        0 days 00:00:11.582119
round_time_total       0 days 00:17:44.400578
loss_total                    13443733.968875
loss_critic                   16800904.618625
loss_actor                       15050.217656
memory_size                         89985.026 

=== epoch 2/10 ===== round 22/50 =====================================
100%|██████████| 2000/2000 [17:43<00:00,  1.88it/s]
episodes                                   89
episode_length                     107.404494
returns                           -263.511645
return_std                         604.386114
average_reward                      -2.455902
round_time             0 days 00:17:44.284890
episodes_test                           116.0
episode_length_test                 86.034483
returns_test                      -206.069971
return_std_test                     535.89226
average_reward_test                 -2.393738
round_time_test        0 days 00:00:11.645414
round_time_total       0 days 00:17:44.286018
loss_total                     10992548.73425
loss_critic                    13737047.85375
loss_actor                       14551.291542
memory_size                         91716.331 

=== epoch 2/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                   94
episode_length                     101.287234
returns                           -250.360337
return_std                         581.728493
average_reward                      -2.478614
round_time             0 days 00:17:46.809040
episodes_test                           160.0
episode_length_test                  59.93125
returns_test                      -142.475588
return_std_test                    369.667554
average_reward_test                 -2.387521
round_time_test        0 days 00:00:11.511967
round_time_total       0 days 00:17:46.810324
loss_total                        9331797.716
loss_critic                    11661322.12575
loss_actor                       13699.328279
memory_size                        93151.2995 

=== epoch 2/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                   89
episode_length                     112.258427
returns                           -278.156556
return_std                         614.362807
average_reward                      -2.476663
round_time             0 days 00:17:41.397756
episodes_test                            84.0
episode_length_test                107.857143
returns_test                      -268.550506
return_std_test                     653.47208
average_reward_test                 -2.503159
round_time_test        0 days 00:00:11.636858
round_time_total       0 days 00:17:41.399120
loss_total                     6874542.043937
loss_critic                    8589874.899125
loss_actor                       13209.984503
memory_size                         94763.885 

=== epoch 2/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:41<00:00,  1.88it/s]
episodes                                  116
episode_length                      85.853448
returns                           -210.920669
return_std                         499.568749
average_reward                      -2.453867
round_time             0 days 00:17:41.703071
episodes_test                           153.0
episode_length_test                  65.30719
returns_test                      -154.937625
return_std_test                    416.683305
average_reward_test                 -2.371626
round_time_test        0 days 00:00:11.498403
round_time_total       0 days 00:17:41.704171
loss_total                     5789880.739063
loss_critic                     7233968.41225
loss_actor                       13529.575231
memory_size                        95746.0675 

=== epoch 2/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                  153
episode_length                      62.444444
returns                           -151.871725
return_std                         378.015044
average_reward                      -2.443102
round_time             0 days 00:17:41.354107
episodes_test                           196.0
episode_length_test                 47.443878
returns_test                      -110.236758
return_std_test                    301.995104
average_reward_test                 -2.345437
round_time_test        0 days 00:00:11.581556
round_time_total       0 days 00:17:41.355236
loss_total                     4591992.519813
loss_critic                      5736720.3735
loss_actor                        13080.70725
memory_size                         96355.038 

=== epoch 2/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  161
episode_length                      61.583851
returns                           -148.632585
return_std                           375.7503
average_reward                      -2.414429
round_time             0 days 00:17:38.019103
episodes_test                           221.0
episode_length_test                 45.176471
returns_test                      -106.171317
return_std_test                    291.372375
average_reward_test                 -2.349142
round_time_test        0 days 00:00:11.582992
round_time_total       0 days 00:17:38.020358
loss_total                         3526025.41
loss_critic                    4404327.108437
loss_actor                       12818.299608
memory_size                        97450.2745 

=== epoch 2/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  198
episode_length                      47.994949
returns                           -114.316437
return_std                         278.095165
average_reward                      -2.387374
round_time             0 days 00:17:38.573335
episodes_test                           222.0
episode_length_test                 42.990991
returns_test                      -102.535807
return_std_test                    280.964354
average_reward_test                 -2.382123
round_time_test        0 days 00:00:11.664326
round_time_total       0 days 00:17:38.574876
loss_total                     2968270.044438
loss_critic                     3707166.36975
loss_actor                       12684.488808
memory_size                        98566.2775 

=== epoch 2/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                  186
episode_length                      49.026882
returns                            -118.30878
return_std                         291.475172
average_reward                      -2.426291
round_time             0 days 00:17:41.498479
episodes_test                           157.0
episode_length_test                 57.394904
returns_test                      -140.510894
return_std_test                    373.696193
average_reward_test                 -2.459767
round_time_test        0 days 00:00:11.588719
round_time_total       0 days 00:17:41.499621
loss_total                     2565562.412656
loss_critic                       3203846.216
loss_actor                       12426.970441
memory_size                        99928.5445 

=== epoch 2/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                  169
episode_length                      57.278107
returns                           -139.805357
return_std                         356.904929
average_reward                      -2.444717
round_time             0 days 00:17:47.236161
episodes_test                           201.0
episode_length_test                 49.656716
returns_test                      -117.208028
return_std_test                    336.849036
average_reward_test                 -2.358881
round_time_test        0 days 00:00:11.882392
round_time_total       0 days 00:17:47.237246
loss_total                     2406603.749375
loss_critic                      3005166.1915
loss_actor                       12353.781062
memory_size                       101340.9075 

=== epoch 2/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  154
episode_length                      62.675325
returns                           -152.701782
return_std                         398.112038
average_reward                       -2.43411
round_time             0 days 00:17:49.301853
episodes_test                           273.0
episode_length_test                 36.611722
returns_test                       -85.205096
return_std_test                    162.164583
average_reward_test                 -2.326505
round_time_test        0 days 00:00:11.730442
round_time_total       0 days 00:17:49.302950
loss_total                     2172363.374469
loss_critic                    2712427.492156
loss_actor                       12106.717159
memory_size                        102511.907 

=== epoch 2/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  134
episode_length                      67.246269
returns                           -164.636635
return_std                         423.112249
average_reward                      -2.454559
round_time             0 days 00:17:48.768701
episodes_test                            82.0
episode_length_test                 113.47561
returns_test                      -278.188938
return_std_test                    658.930836
average_reward_test                 -2.461564
round_time_test        0 days 00:00:11.341913
round_time_total       0 days 00:17:48.769817
loss_total                      1963519.08925
loss_critic                    2451435.085906
loss_actor                       11854.942077
memory_size                       103877.7535 

=== epoch 2/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:42<00:00,  1.88it/s]
episodes                                   99
episode_length                      87.686869
returns                           -215.467162
return_std                         533.663316
average_reward                      -2.473224
round_time             0 days 00:17:43.513191
episodes_test                           113.0
episode_length_test                 88.371681
returns_test                      -219.337987
return_std_test                    552.985947
average_reward_test                 -2.481582
round_time_test        0 days 00:00:11.642031
round_time_total       0 days 00:17:43.514336
loss_total                     1676655.616125
loss_critic                       2092833.602
loss_actor                       11943.514893
memory_size                       105463.7795 

=== epoch 2/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:41<00:00,  1.88it/s]
episodes                                  106
episode_length                      93.641509
returns                           -229.708435
return_std                         540.788939
average_reward                      -2.453039
round_time             0 days 00:17:42.314152
episodes_test                            77.0
episode_length_test                 127.61039
returns_test                      -323.852868
return_std_test                     723.17174
average_reward_test                 -2.541682
round_time_test        0 days 00:00:11.259351
round_time_total       0 days 00:17:42.315243
loss_total                     1538435.837594
loss_critic                     1920054.87875
loss_actor                        11959.53924
memory_size                          107079.5 

=== epoch 2/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  101
episode_length                      92.405941
returns                           -225.431867
return_std                         559.209114
average_reward                      -2.447189
round_time             0 days 00:17:45.239126
episodes_test                           139.0
episode_length_test                 71.194245
returns_test                      -176.035154
return_std_test                     430.35332
average_reward_test                 -2.464405
round_time_test        0 days 00:00:11.645026
round_time_total       0 days 00:17:45.240206
loss_total                     1421828.583937
loss_critic                    1774318.015219
loss_actor                       11870.742114
memory_size                        108685.142 

=== epoch 2/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:42<00:00,  1.88it/s]
episodes                                  104
episode_length                      79.740385
returns                           -194.418914
return_std                          504.70491
average_reward                       -2.45781
round_time             0 days 00:17:42.983213
episodes_test                           217.0
episode_length_test                 46.036866
returns_test                        -107.8233
return_std_test                    301.531269
average_reward_test                 -2.341102
round_time_test        0 days 00:00:11.580168
round_time_total       0 days 00:17:42.984333
loss_total                     1315580.665141
loss_critic                    1641508.445688
loss_actor                       11869.424843
memory_size                        109569.543 

=== epoch 2/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  130
episode_length                      76.623077
returns                           -187.169189
return_std                          490.41375
average_reward                      -2.443551
round_time             0 days 00:17:45.155244
episodes_test                           175.0
episode_length_test                 56.891429
returns_test                      -140.657046
return_std_test                    395.130279
average_reward_test                 -2.471188
round_time_test        0 days 00:00:11.722407
round_time_total       0 days 00:17:45.156382
loss_total                     1253187.809922
loss_critic                    1563531.335156
loss_actor                       11813.603018
memory_size                        110999.207 

=== epoch 2/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  162
episode_length                      61.450617
returns                           -147.476292
return_std                         405.935418
average_reward                       -2.39969
round_time             0 days 00:17:45.483793
episodes_test                           119.0
episode_length_test                  84.02521
returns_test                       -204.51133
return_std_test                    577.495147
average_reward_test                  -2.43377
round_time_test        0 days 00:00:11.492171
round_time_total       0 days 00:17:45.484916
loss_total                     1180848.605375
loss_critic                    1473140.107656
loss_actor                       11682.493434
memory_size                       112199.7255 

=== epoch 2/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  190
episode_length                      49.473684
returns                           -116.128797
return_std                         323.376213
average_reward                      -2.363497
round_time             0 days 00:17:45.080181
episodes_test                           107.0
episode_length_test                 86.616822
returns_test                      -211.728245
return_std_test                    573.752524
average_reward_test                 -2.456353
round_time_test        0 days 00:00:11.560205
round_time_total       0 days 00:17:45.081412
loss_total                     1069933.934937
loss_critic                    1334568.080406
loss_actor                       11397.269694
memory_size                       112771.5965 

=== epoch 2/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                  222
episode_length                      44.824324
returns                           -103.859046
return_std                         285.594652
average_reward                      -2.315867
round_time             0 days 00:17:47.161780
episodes_test                           277.0
episode_length_test                 34.093863
returns_test                       -76.697573
return_std_test                    169.907496
average_reward_test                 -2.262095
round_time_test        0 days 00:00:11.621180
round_time_total       0 days 00:17:47.163101
loss_total                      998196.723844
loss_critic                    1244963.961563
loss_actor                       11127.690922
memory_size                       113733.6045 

=== epoch 2/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:47<00:00,  1.87it/s]
episodes                                  227
episode_length                      42.255507
returns                            -97.297015
return_std                          261.43833
average_reward                      -2.313686
round_time             0 days 00:17:47.762744
episodes_test                           198.0
episode_length_test                 50.494949
returns_test                      -119.575525
return_std_test                    335.402237
average_reward_test                 -2.367416
round_time_test        0 days 00:00:11.886398
round_time_total       0 days 00:17:47.763984
loss_total                      918313.769984
loss_critic                      1145149.4975
loss_actor                       10970.781268
memory_size                        114426.908 

=== epoch 2/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                  227
episode_length                      41.418502
returns                             -94.78371
return_std                         220.801162
average_reward                      -2.304577
round_time             0 days 00:17:46.689602
episodes_test                            78.0
episode_length_test                116.884615
returns_test                      -292.352169
return_std_test                    680.706082
average_reward_test                 -2.506544
round_time_test        0 days 00:00:11.293308
round_time_total       0 days 00:17:46.690739
loss_total                       904630.19475
loss_critic                    1128064.241078
loss_actor                       10893.933383
memory_size                        115641.419 

=== epoch 2/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:50<00:00,  1.87it/s]
episodes                                  197
episode_length                      48.720812
returns                            -113.91054
return_std                         295.317538
average_reward                      -2.342287
round_time             0 days 00:17:51.028787
episodes_test                           221.0
episode_length_test                 45.081448
returns_test                      -106.258614
return_std_test                     232.55295
average_reward_test                 -2.360199
round_time_test        0 days 00:00:11.562599
round_time_total       0 days 00:17:51.029908
loss_total                      881275.387938
loss_critic                    1098888.078641
loss_actor                       10824.547926
memory_size                        116933.763 

=== epoch 2/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:47<00:00,  1.87it/s]
episodes                                  191
episode_length                      50.963351
returns                           -119.093192
return_std                         290.335011
average_reward                      -2.340359
round_time             0 days 00:17:47.910575
episodes_test                           127.0
episode_length_test                 70.952756
returns_test                        -172.5084
return_std_test                    441.090965
average_reward_test                 -2.441698
round_time_test        0 days 00:00:11.552418
round_time_total       0 days 00:17:47.911767
loss_total                      837744.188031
loss_critic                    1044457.759266
loss_actor                       10889.834397
memory_size                       118223.1655 

=== epoch 2/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  168
episode_length                      58.184524
returns                           -138.330335
return_std                         307.838931
average_reward                      -2.382126
round_time             0 days 00:17:45.173143
episodes_test                            68.0
episode_length_test                137.529412
returns_test                      -343.815186
return_std_test                    734.168359
average_reward_test                 -2.502781
round_time_test        0 days 00:00:11.556001
round_time_total       0 days 00:17:45.174250
loss_total                      814685.714914
loss_critic                    1015600.988125
loss_actor                       11024.546039
memory_size                        119008.775 

=== epoch 2/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:45<00:00,  1.88it/s]
episodes                                  142
episode_length                      65.323944
returns                           -157.055684
return_std                         351.095138
average_reward                      -2.415966
round_time             0 days 00:17:46.158975
episodes_test                            86.0
episode_length_test                112.523256
returns_test                      -280.009681
return_std_test                    640.087796
average_reward_test                 -2.495101
round_time_test        0 days 00:00:11.482692
round_time_total       0 days 00:17:46.160094
loss_total                      805346.724352
loss_critic                    1003889.951078
loss_actor                       11173.757659
memory_size                       120330.4945 

=== epoch 2/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  139
episode_length                      71.661871
returns                           -174.234028
return_std                         414.451863
average_reward                      -2.435032
round_time             0 days 00:17:49.477836
episodes_test                            53.0
episode_length_test                185.056604
returns_test                      -469.971285
return_std_test                    883.209333
average_reward_test                 -2.534276
round_time_test        0 days 00:00:11.618231
round_time_total       0 days 00:17:49.478914
loss_total                      849759.913766
loss_critic                    1059403.782797
loss_actor                       11184.361027
memory_size                         121777.21 

=== epoch 2/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                  157
episode_length                      61.318471
returns                            -147.36388
return_std                         332.025432
average_reward                      -2.408785
round_time             0 days 00:17:48.600849
episodes_test                           111.0
episode_length_test                  89.72973
returns_test                      -219.873977
return_std_test                    570.375933
average_reward_test                 -2.444759
round_time_test        0 days 00:00:11.488279
round_time_total       0 days 00:17:48.601945
loss_total                      850267.626336
loss_critic                    1060082.118375
loss_actor                       11009.591905
memory_size                       122884.4015 

=== epoch 2/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:50<00:00,  1.87it/s]
episodes                                  165
episode_length                      60.484848
returns                           -145.935816
return_std                         336.117767
average_reward                      -2.412562
round_time             0 days 00:17:51.021198
episodes_test                           161.0
episode_length_test                  62.10559
returns_test                      -146.756372
return_std_test                    437.484723
average_reward_test                 -2.362863
round_time_test        0 days 00:00:11.454236
round_time_total       0 days 00:17:51.022309
loss_total                      727726.505883
loss_critic                     906977.251953
loss_actor                       10723.459506
memory_size                       123807.5115 

=== epoch 2/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  186
episode_length                      49.596774
returns                           -118.007296
return_std                         263.991544
average_reward                      -2.392785
round_time             0 days 00:17:45.017301
episodes_test                           145.0
episode_length_test                 68.848276
returns_test                      -165.499484
return_std_test                    457.483009
average_reward_test                 -2.403528
round_time_test        0 days 00:00:11.700434
round_time_total       0 days 00:17:45.018613
loss_total                      683453.016406
loss_critic                     851672.523797
loss_actor                       10574.927546
memory_size                       124403.2695 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [16:40<00:00,  2.00it/s]
episodes                                   23
episode_length                      64.869565
returns                              -153.246
return_std                         341.084936
average_reward                       -2.41933
round_time             0 days 00:16:40.288916
episodes_test                           155.0
episode_length_test                 64.348387
returns_test                      -154.431378
return_std_test                     401.19897
average_reward_test                 -2.400884
round_time_test        0 days 00:00:11.476764
round_time_total       0 days 00:16:40.290051
loss_total                      709983.479688
loss_critic                     884810.994969
loss_actor                        10673.35706
memory_size                       125208.3145 

=== epoch 3/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:46<00:00,  1.99it/s]
episodes                                   66
episode_length                      59.833333
returns                           -144.614912
return_std                         369.938917
average_reward                      -2.412849
round_time             0 days 00:16:47.353128
episodes_test                            99.0
episode_length_test                 95.818182
returns_test                      -234.889587
return_std_test                    579.780969
average_reward_test                 -2.459113
round_time_test        0 days 00:00:11.383242
round_time_total       0 days 00:16:47.354364
loss_total                      645929.424859
loss_critic                     804717.101766
loss_actor                       10778.662684
memory_size                       126580.9905 

=== epoch 3/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:47<00:00,  1.98it/s]
episodes                                   95
episode_length                      62.515789
returns                           -153.369888
return_std                         385.630164
average_reward                      -2.446527
round_time             0 days 00:16:48.254093
episodes_test                           152.0
episode_length_test                 65.690789
returns_test                      -157.802485
return_std_test                    424.395064
average_reward_test                 -2.401859
round_time_test        0 days 00:00:11.501139
round_time_total       0 days 00:16:48.255338
loss_total                      687544.344156
loss_critic                     856782.853742
loss_actor                       10590.254583
memory_size                       127444.5155 

=== epoch 3/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:50<00:00,  1.98it/s]
episodes                                   98
episode_length                      71.571429
returns                           -177.478125
return_std                         454.425433
average_reward                      -2.484809
round_time             0 days 00:16:50.871415
episodes_test                            68.0
episode_length_test                136.308824
returns_test                      -342.811567
return_std_test                    764.859111
average_reward_test                 -2.525091
round_time_test        0 days 00:00:11.497345
round_time_total       0 days 00:16:50.872710
loss_total                      620702.102578
loss_critic                     773244.844602
loss_actor                       10531.075794
memory_size                       129042.2895 

=== epoch 3/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                  122
episode_length                       80.57377
returns                            -198.40186
return_std                         512.424876
average_reward                      -2.468853
round_time             0 days 00:16:52.467448
episodes_test                           107.0
episode_length_test                 91.046729
returns_test                      -224.971656
return_std_test                    516.986023
average_reward_test                 -2.475296
round_time_test        0 days 00:00:11.516796
round_time_total       0 days 00:16:52.468523
loss_total                      574292.440461
loss_critic                     715203.870523
loss_actor                        10646.67154
memory_size                       130783.5695 

=== epoch 3/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:49<00:00,  1.98it/s]
episodes                                  128
episode_length                      74.171875
returns                           -181.931952
return_std                         486.003322
average_reward                      -2.459623
round_time             0 days 00:16:49.983465
episodes_test                            63.0
episode_length_test                158.412698
returns_test                      -392.870094
return_std_test                    799.308768
average_reward_test                 -2.478991
round_time_test        0 days 00:00:11.449398
round_time_total       0 days 00:16:49.984671
loss_total                      611663.891781
loss_critic                     761879.547219
loss_actor                       10801.218091
memory_size                        132118.567 

=== epoch 3/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:54<00:00,  1.97it/s]
episodes                                   99
episode_length                      96.080808
returns                            -236.12252
return_std                         593.247093
average_reward                      -2.466211
round_time             0 days 00:16:54.864208
episodes_test                            96.0
episode_length_test                     104.0
returns_test                      -251.338181
return_std_test                    607.572854
average_reward_test                 -2.415752
round_time_test        0 days 00:00:11.365111
round_time_total       0 days 00:16:54.865295
loss_total                      543935.453672
loss_critic                     677225.472164
loss_actor                       10775.326692
memory_size                       133277.1705 

=== epoch 3/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:59<00:00,  1.96it/s]
episodes                                   85
episode_length                          106.4
returns                           -258.807146
return_std                         634.869762
average_reward                      -2.456268
round_time             0 days 00:17:00.289060
episodes_test                           131.0
episode_length_test                 76.068702
returns_test                      -185.877761
return_std_test                    489.795502
average_reward_test                 -2.443232
round_time_test        0 days 00:00:11.467055
round_time_total       0 days 00:17:00.290161
loss_total                      470983.362422
loss_critic                     586072.806516
loss_actor                       10625.542504
memory_size                       135015.6115 

=== epoch 3/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:08<00:00,  1.94it/s]
episodes                                  103
episode_length                      94.951456
returns                           -230.813978
return_std                         582.760892
average_reward                      -2.434742
round_time             0 days 00:17:08.855136
episodes_test                           102.0
episode_length_test                 97.921569
returns_test                      -242.071737
return_std_test                    593.880168
average_reward_test                  -2.47036
round_time_test        0 days 00:00:11.737618
round_time_total       0 days 00:17:08.856227
loss_total                      534947.083219
loss_critic                     666092.807688
loss_actor                       10364.143812
memory_size                        136380.595 

=== epoch 3/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:17<00:00,  1.93it/s]
episodes                                   88
episode_length                      92.977273
returns                           -225.356196
return_std                         568.665467
average_reward                      -2.454299
round_time             0 days 00:17:17.578907
episodes_test                           111.0
episode_length_test                 90.054054
returns_test                      -222.879457
return_std_test                    585.826904
average_reward_test                 -2.474134
round_time_test        0 days 00:00:11.537095
round_time_total       0 days 00:17:17.579996
loss_total                      494601.380027
loss_critic                     615691.194773
loss_actor                       10242.081804
memory_size                       137993.6855 

=== epoch 3/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:23<00:00,  1.92it/s]
episodes                                   93
episode_length                     106.516129
returns                           -263.163609
return_std                         651.091133
average_reward                      -2.466689
round_time             0 days 00:17:24.250859
episodes_test                            89.0
episode_length_test                 108.58427
returns_test                      -269.571602
return_std_test                     636.93768
average_reward_test                   -2.4875
round_time_test        0 days 00:00:11.428963
round_time_total       0 days 00:17:24.251989
loss_total                      479179.845277
loss_critic                     596417.170359
loss_actor                       10230.503618
memory_size                        139347.532 

=== epoch 3/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                   92
episode_length                      97.108696
returns                           -238.867509
return_std                          612.63668
average_reward                      -2.472864
round_time             0 days 00:17:37.192724
episodes_test                           126.0
episode_length_test                 79.357143
returns_test                      -193.225514
return_std_test                    498.621727
average_reward_test                 -2.434706
round_time_test        0 days 00:00:11.431748
round_time_total       0 days 00:17:37.193851
loss_total                       491766.60443
loss_critic                     612172.937031
loss_actor                       10141.234982
memory_size                        140759.973 

=== epoch 3/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:41<00:00,  1.88it/s]
episodes                                   89
episode_length                     110.898876
returns                           -275.197868
return_std                         665.054069
average_reward                      -2.482749
round_time             0 days 00:17:42.229235
episodes_test                           133.0
episode_length_test                 75.165414
returns_test                      -187.965571
return_std_test                    488.629352
average_reward_test                 -2.499943
round_time_test        0 days 00:00:11.704947
round_time_total       0 days 00:17:42.230789
loss_total                      472482.732094
loss_critic                      588085.62918
loss_actor                       10071.099658
memory_size                       142406.1235 

=== epoch 3/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                  100
episode_length                          91.85
returns                           -226.267454
return_std                         586.540014
average_reward                      -2.475572
round_time             0 days 00:17:37.264064
episodes_test                            95.0
episode_length_test                105.073684
returns_test                      -260.070462
return_std_test                    652.152747
average_reward_test                 -2.474148
round_time_test        0 days 00:00:11.535657
round_time_total       0 days 00:17:37.265299
loss_total                        376865.3995
loss_critic                     468549.929621
loss_actor                       10127.247858
memory_size                        144024.414 

=== epoch 3/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  101
episode_length                      90.069307
returns                           -223.376872
return_std                         555.195723
average_reward                      -2.501535
round_time             0 days 00:17:37.889267
episodes_test                            77.0
episode_length_test                123.402597
returns_test                      -306.261497
return_std_test                    702.473342
average_reward_test                 -2.488532
round_time_test        0 days 00:00:11.543051
round_time_total       0 days 00:17:37.890437
loss_total                       411045.59227
loss_critic                     511269.607633
loss_actor                        10149.49688
memory_size                        145197.972 

=== epoch 3/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                   88
episode_length                     111.454545
returns                           -281.154713
return_std                         648.745386
average_reward                       -2.52691
round_time             0 days 00:17:39.029722
episodes_test                           110.0
episode_length_test                 90.654545
returns_test                      -222.346568
return_std_test                    548.201661
average_reward_test                 -2.451219
round_time_test        0 days 00:00:11.428593
round_time_total       0 days 00:17:39.030819
loss_total                      380697.085543
loss_critic                     473326.860734
loss_actor                       10177.954655
memory_size                       146764.1085 

=== epoch 3/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                   86
episode_length                          101.0
returns                           -256.008426
return_std                         620.331507
average_reward                      -2.543827
round_time             0 days 00:17:40.613959
episodes_test                            92.0
episode_length_test                108.641304
returns_test                      -268.875591
return_std_test                    639.777843
average_reward_test                 -2.473361
round_time_test        0 days 00:00:11.496064
round_time_total       0 days 00:17:40.615204
loss_total                      382989.876992
loss_critic                     476189.323352
loss_actor                       10192.056375
memory_size                       148398.3725 

=== epoch 3/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:42<00:00,  1.88it/s]
episodes                                  109
episode_length                      83.541284
returns                           -210.523292
return_std                         560.087157
average_reward                      -2.525989
round_time             0 days 00:17:43.220411
episodes_test                           139.0
episode_length_test                 68.251799
returns_test                      -164.154778
return_std_test                     450.49356
average_reward_test                 -2.412381
round_time_test        0 days 00:00:11.674711
round_time_total       0 days 00:17:43.221522
loss_total                       337188.76557
loss_critic                     418952.129148
loss_actor                       10135.283049
memory_size                       150015.9315 

=== epoch 3/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:39<00:00,  1.89it/s]
episodes                                   94
episode_length                      99.670213
returns                           -252.266406
return_std                         648.713534
average_reward                      -2.534855
round_time             0 days 00:17:40.485831
episodes_test                           144.0
episode_length_test                 69.368056
returns_test                      -171.841086
return_std_test                    441.366472
average_reward_test                 -2.476104
round_time_test        0 days 00:00:11.452066
round_time_total       0 days 00:17:40.487126
loss_total                      351990.680105
loss_critic                      437488.85716
loss_actor                        9997.939181
memory_size                       151261.1165 

=== epoch 3/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:40<00:00,  1.89it/s]
episodes                                   95
episode_length                     101.442105
returns                           -255.593617
return_std                         670.108269
average_reward                      -2.522724
round_time             0 days 00:17:41.006737
episodes_test                            89.0
episode_length_test                102.831461
returns_test                      -253.944593
return_std_test                     625.04421
average_reward_test                 -2.480441
round_time_test        0 days 00:00:11.434856
round_time_total       0 days 00:17:41.008197
loss_total                      365357.359469
loss_critic                     454234.140164
loss_actor                        9850.205872
memory_size                        152884.682 

=== epoch 3/10 ===== round 21/50 =====================================
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                   93
episode_length                      95.516129
returns                            -237.03656
return_std                         626.808449
average_reward                       -2.49432
round_time             0 days 00:17:38.026124
episodes_test                            95.0
episode_length_test                104.694737
returns_test                      -256.913936
return_std_test                    606.793762
average_reward_test                 -2.456092
round_time_test        0 days 00:00:11.463753
round_time_total       0 days 00:17:38.027217
loss_total                      297286.775541
loss_critic                      369164.51184
loss_actor                        9775.800855
memory_size                       154536.8015 

=== epoch 3/10 ===== round 22/50 =====================================
100%|██████████| 2000/2000 [17:38<00:00,  1.89it/s]
episodes                                  102
episode_length                      90.156863
returns                           -222.144934
return_std                           598.8561
average_reward                      -2.475219
round_time             0 days 00:17:38.908690
episodes_test                           141.0
episode_length_test                 70.921986
returns_test                      -170.062844
return_std_test                    429.302976
average_reward_test                 -2.397886
round_time_test        0 days 00:00:11.608815
round_time_total       0 days 00:17:38.910009
loss_total                      310463.560867
loss_critic                     385649.622062
loss_actor                        9719.289108
memory_size                        155933.456 

=== epoch 3/10 ===== round 23/50 =====================================
100%|██████████| 2000/2000 [17:45<00:00,  1.88it/s]
episodes                                  118
episode_length                      76.347458
returns                           -184.504356
return_std                         510.998202
average_reward                      -2.436556
round_time             0 days 00:17:45.684683
episodes_test                           162.0
episode_length_test                 61.197531
returns_test                      -142.647269
return_std_test                    399.694851
average_reward_test                 -2.333549
round_time_test        0 days 00:00:11.364892
round_time_total       0 days 00:17:45.685812
loss_total                      314859.701859
loss_critic                     391159.967023
loss_actor                        9658.614619
memory_size                        157097.819 

=== epoch 3/10 ===== round 24/50 =====================================
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                  133
episode_length                      71.947368
returns                           -172.401814
return_std                         455.637499
average_reward                       -2.40145
round_time             0 days 00:17:38.237989
episodes_test                            89.0
episode_length_test                109.348315
returns_test                      -267.876456
return_std_test                    633.888525
average_reward_test                 -2.452011
round_time_test        0 days 00:00:11.429906
round_time_total       0 days 00:17:38.239305
loss_total                      315937.071719
loss_critic                      392514.48809
loss_actor                        9627.379099
memory_size                        157913.613 

=== epoch 3/10 ===== round 25/50 =====================================
100%|██████████| 2000/2000 [17:44<00:00,  1.88it/s]
episodes                                  152
episode_length                      60.848684
returns                           -142.565944
return_std                         376.648249
average_reward                      -2.358766
round_time             0 days 00:17:44.736603
episodes_test                           143.0
episode_length_test                 69.286713
returns_test                      -162.539227
return_std_test                    461.849194
average_reward_test                 -2.350724
round_time_test        0 days 00:00:11.670867
round_time_total       0 days 00:17:44.737743
loss_total                      305032.545262
loss_critic                     378896.736088
loss_actor                        9575.754464
memory_size                         159338.17 

=== epoch 3/10 ===== round 26/50 =====================================
100%|██████████| 2000/2000 [17:41<00:00,  1.88it/s]
episodes                                  163
episode_length                      56.699387
returns                           -131.741572
return_std                         360.649357
average_reward                      -2.339165
round_time             0 days 00:17:42.219538
episodes_test                           123.0
episode_length_test                 81.081301
returns_test                      -192.949998
return_std_test                    473.526164
average_reward_test                 -2.379308
round_time_test        0 days 00:00:11.462428
round_time_total       0 days 00:17:42.220627
loss_total                       303577.36859
loss_critic                     377100.822664
loss_actor                        9483.524778
memory_size                         160375.36 

=== epoch 3/10 ===== round 27/50 =====================================
100%|██████████| 2000/2000 [17:47<00:00,  1.87it/s]
episodes                                  157
episode_length                      56.853503
returns                           -131.671996
return_std                         346.218316
average_reward                      -2.338633
round_time             0 days 00:17:48.334624
episodes_test                            96.0
episode_length_test                103.260417
returns_test                      -250.836664
return_std_test                    599.263004
average_reward_test                 -2.433367
round_time_test        0 days 00:00:11.419612
round_time_total       0 days 00:17:48.335708
loss_total                        253187.5841
loss_critic                     314161.031619
loss_actor                        9293.769276
memory_size                        161824.065 

=== epoch 3/10 ===== round 28/50 =====================================
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                  124
episode_length                      78.443548
returns                           -186.505765
return_std                          487.60665
average_reward                      -2.386213
round_time             0 days 00:17:46.922786
episodes_test                           112.0
episode_length_test                 86.330357
returns_test                      -208.931398
return_std_test                    525.507402
average_reward_test                 -2.420433
round_time_test        0 days 00:00:11.527970
round_time_total       0 days 00:17:46.923882
loss_total                      248012.357273
loss_critic                     307751.914689
loss_actor                        9054.105721
memory_size                       163501.7765 

=== epoch 3/10 ===== round 29/50 =====================================
100%|██████████| 2000/2000 [17:42<00:00,  1.88it/s]
episodes                                  102
episode_length                      97.215686
returns                           -233.174195
return_std                         584.220385
average_reward                      -2.396446
round_time             0 days 00:17:42.940629
episodes_test                            68.0
episode_length_test                143.176471
returns_test                      -339.468309
return_std_test                    722.605349
average_reward_test                 -2.373804
round_time_test        0 days 00:00:11.627255
round_time_total       0 days 00:17:42.941839
loss_total                      247191.232461
loss_critic                     306808.100939
loss_actor                        8723.738119
memory_size                       165091.2135 

=== epoch 3/10 ===== round 30/50 =====================================
100%|██████████| 2000/2000 [17:46<00:00,  1.88it/s]
episodes                                   89
episode_length                     111.494382
returns                           -258.992613
return_std                         612.647707
average_reward                      -2.314567
round_time             0 days 00:17:46.793450
episodes_test                           101.0
episode_length_test                 97.772277
returns_test                      -214.745185
return_std_test                    517.363027
average_reward_test                  -2.19877
round_time_test        0 days 00:00:11.456161
round_time_total       0 days 00:17:46.794546
loss_total                      228729.569012
loss_critic                     283920.443824
loss_actor                        7966.049686
memory_size                       166763.7925 

=== epoch 3/10 ===== round 31/50 =====================================
100%|██████████| 2000/2000 [17:42<00:00,  1.88it/s]
episodes                                   60
episode_length                          164.8
returns                            -335.68602
return_std                         681.622912
average_reward                      -2.028126
round_time             0 days 00:17:42.984275
episodes_test                            23.0
episode_length_test                411.130435
returns_test                      -206.061435
return_std_test                    197.015579
average_reward_test                 -0.494233
round_time_test        0 days 00:00:11.579455
round_time_total       0 days 00:17:42.985393
loss_total                      191255.151998
loss_critic                       237684.7835
loss_actor                        5536.610377
memory_size                        168361.579 

=== epoch 3/10 ===== round 32/50 =====================================
100%|██████████| 2000/2000 [17:45<00:00,  1.88it/s]
episodes                                   49
episode_length                     189.816327
returns                           -319.260418
return_std                         627.710323
average_reward                      -1.628961
round_time             0 days 00:17:46.299096
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       -33.781952
return_std_test                     43.155131
average_reward_test                 -0.033782
round_time_test        0 days 00:00:11.964587
round_time_total       0 days 00:17:46.300406
loss_total                       89179.390005
loss_critic                     110589.031027
loss_actor                         3540.81821
memory_size                       170262.1525 

=== epoch 3/10 ===== round 33/50 =====================================
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                   35
episode_length                     265.685714
returns                           -316.259575
return_std                         561.236377
average_reward                      -1.249306
round_time             0 days 00:17:49.361516
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       221.156136
return_std_test                     41.685239
average_reward_test                  0.221156
round_time_test        0 days 00:00:12.275728
round_time_total       0 days 00:17:49.362762
loss_total                       46574.307365
loss_critic                      57601.740631
loss_actor                        2464.570144
memory_size                        172170.444 

=== epoch 3/10 ===== round 34/50 =====================================
100%|██████████| 2000/2000 [17:51<00:00,  1.87it/s]
episodes                                   29
episode_length                     308.655172
returns                           -232.606568
return_std                         254.304718
average_reward                      -0.908425
round_time             0 days 00:17:52.016110
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       -35.037744
return_std_test                     19.761589
average_reward_test                 -0.035038
round_time_test        0 days 00:00:12.182350
round_time_total       0 days 00:17:52.017248
loss_total                       23950.705541
loss_critic                      29496.202814
loss_actor                        1768.714495
memory_size                        174012.414 

=== epoch 3/10 ===== round 35/50 =====================================
100%|██████████| 2000/2000 [17:54<00:00,  1.86it/s]
episodes                                   16
episode_length                       543.5625
returns                           -318.673698
return_std                         218.892309
average_reward                      -0.632864
round_time             0 days 00:17:54.770240
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -273.977217
return_std_test                       48.5326
average_reward_test                 -0.273977
round_time_test        0 days 00:00:12.394295
round_time_total       0 days 00:17:54.771337
loss_total                       12962.086873
loss_critic                      15877.332917
loss_actor                        1301.101654
memory_size                        175893.447 

=== epoch 3/10 ===== round 36/50 =====================================
100%|██████████| 2000/2000 [17:53<00:00,  1.86it/s]
episodes                                   17
episode_length                     563.294118
returns                           -324.808447
return_std                         206.315235
average_reward                      -0.574067
round_time             0 days 00:17:53.851874
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        -49.68999
return_std_test                     12.561937
average_reward_test                  -0.04969
round_time_test        0 days 00:00:12.149822
round_time_total       0 days 00:17:53.852963
loss_total                        8356.453147
loss_critic                      10201.723854
loss_actor                         975.369597
memory_size                        177766.917 

=== epoch 3/10 ===== round 37/50 =====================================
100%|██████████| 2000/2000 [17:59<00:00,  1.85it/s]
episodes                                   16
episode_length                       557.3125
returns                           -328.457436
return_std                         202.989761
average_reward                      -0.590867
round_time             0 days 00:17:59.891101
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       -14.754701
return_std_test                      9.752451
average_reward_test                 -0.014755
round_time_test        0 days 00:00:11.988194
round_time_total       0 days 00:17:59.892180
loss_total                        5702.829145
loss_critic                       6938.809942
loss_actor                         758.905467
memory_size                        179674.791 

=== epoch 3/10 ===== round 38/50 =====================================
100%|██████████| 2000/2000 [18:03<00:00,  1.85it/s]
episodes                                   17
episode_length                     559.294118
returns                           -321.384185
return_std                         217.415421
average_reward                      -0.581507
round_time             0 days 00:18:03.561640
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -120.982062
return_std_test                     16.537251
average_reward_test                 -0.120982
round_time_test        0 days 00:00:12.174775
round_time_total       0 days 00:18:03.562747
loss_total                        4642.463371
loss_critic                        5650.21175
loss_actor                         611.469472
memory_size                       181591.1275 

=== epoch 3/10 ===== round 39/50 =====================================
100%|██████████| 2000/2000 [17:55<00:00,  1.86it/s]
episodes                                   15
episode_length                     596.533333
returns                           -328.867067
return_std                         209.404599
average_reward                      -0.548093
round_time             0 days 00:17:55.935119
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         2.657665
return_std_test                     87.727503
average_reward_test                  0.002658
round_time_test        0 days 00:00:12.263960
round_time_total       0 days 00:17:55.936494
loss_total                        4041.736617
loss_critic                       4923.714641
loss_actor                         513.824171
memory_size                        183461.451 

=== epoch 3/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [17:53<00:00,  1.86it/s]
episodes                                   13
episode_length                     708.538462
returns                           -381.727429
return_std                          204.76572
average_reward                      -0.538555
round_time             0 days 00:17:54.465445
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         1.642402
return_std_test                     11.111439
average_reward_test                  0.001642
round_time_test        0 days 00:00:12.025615
round_time_total       0 days 00:17:54.466588
loss_total                        3613.297125
loss_critic                       4406.725348
loss_actor                         439.583928
memory_size                        185393.451 

=== epoch 3/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [17:58<00:00,  1.86it/s]
episodes                                   12
episode_length                     727.416667
returns                           -383.692283
return_std                         199.306073
average_reward                      -0.524401
round_time             0 days 00:17:58.647951
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        98.402384
return_std_test                     14.105698
average_reward_test                  0.098402
round_time_test        0 days 00:00:11.885167
round_time_total       0 days 00:17:58.649102
loss_total                        3315.873383
loss_critic                       4049.085125
loss_actor                         383.026178
memory_size                       187301.7125 

=== epoch 3/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [17:55<00:00,  1.86it/s]
episodes                                   12
episode_length                     727.416667
returns                           -373.070811
return_std                         191.854126
average_reward                      -0.506359
round_time             0 days 00:17:56.241955
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       141.561719
return_std_test                        7.5552
average_reward_test                  0.141562
round_time_test        0 days 00:00:11.930518
round_time_total       0 days 00:17:56.243067
loss_total                        3116.276915
loss_critic                       3812.728688
loss_actor                         330.469566
memory_size                        189233.767 

=== epoch 3/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [17:57<00:00,  1.86it/s]
episodes                                   11
episode_length                     807.909091
returns                           -400.756261
return_std                         169.948739
average_reward                       -0.49925
round_time             0 days 00:17:57.766346
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       277.844221
return_std_test                     10.847219
average_reward_test                  0.277844
round_time_test        0 days 00:00:11.913693
round_time_total       0 days 00:17:57.767893
loss_total                        2834.183093
loss_critic                       3471.367395
loss_actor                         285.445649
memory_size                        191157.198 

=== epoch 3/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [18:04<00:00,  1.84it/s]
episodes                                   13
episode_length                     695.538462
returns                           -335.223527
return_std                         194.873914
average_reward                       -0.49088
round_time             0 days 00:18:05.070294
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       393.619355
return_std_test                     28.574204
average_reward_test                  0.393619
round_time_test        0 days 00:00:12.261357
round_time_total       0 days 00:18:05.071393
loss_total                        2659.711774
loss_critic                       3263.689765
loss_actor                         243.799564
memory_size                        193047.038 

=== epoch 3/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:08<00:00,  1.84it/s]
episodes                                   13
episode_length                     751.384615
returns                           -363.505429
return_std                         194.642747
average_reward                      -0.483757
round_time             0 days 00:18:09.096636
episodes_test                            11.0
episode_length_test                820.090909
returns_test                        413.41649
return_std_test                    197.320074
average_reward_test                  0.504808
round_time_test        0 days 00:00:11.918364
round_time_total       0 days 00:18:09.097741
loss_total                         2486.85437
loss_critic                       3056.479841
loss_actor                         208.352269
memory_size                        194958.879 

=== epoch 3/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:02<00:00,  1.85it/s]
episodes                                   15
episode_length                     605.066667
returns                           -283.357869
return_std                         214.237062
average_reward                      -0.463696
round_time             0 days 00:18:03.030677
episodes_test                            10.0
episode_length_test                     901.3
returns_test                       438.813477
return_std_test                      145.9282
average_reward_test                  0.486106
round_time_test        0 days 00:00:12.341009
round_time_total       0 days 00:18:03.031812
loss_total                        2590.380641
loss_critic                       3193.703638
loss_actor                         177.088469
memory_size                       196806.5155 

=== epoch 3/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:07<00:00,  1.84it/s]
episodes                                   17
episode_length                     564.823529
returns                           -260.535275
return_std                         207.280896
average_reward                      -0.461557
round_time             0 days 00:18:08.168326
episodes_test                            11.0
episode_length_test                820.272727
returns_test                       436.216669
return_std_test                    209.203538
average_reward_test                  0.532135
round_time_test        0 days 00:00:11.999776
round_time_total       0 days 00:18:08.169676
loss_total                        2189.352178
loss_critic                       2698.985182
loss_actor                         150.819961
memory_size                        198696.598 

=== epoch 3/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:02<00:00,  1.85it/s]
episodes                                   18
episode_length                     496.944444
returns                            -213.22115
return_std                         197.470857
average_reward                      -0.436394
round_time             0 days 00:18:02.799118
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       579.060856
return_std_test                      6.083638
average_reward_test                  0.579061
round_time_test        0 days 00:00:12.048191
round_time_total       0 days 00:18:02.800241
loss_total                        2081.752755
loss_critic                       2569.856519
loss_actor                         129.337528
memory_size                       200588.0095 

=== epoch 3/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:59<00:00,  1.85it/s]
episodes                                   20
episode_length                          427.4
returns                            -184.77082
return_std                         186.375663
average_reward                      -0.436313
round_time             0 days 00:18:00.133472
episodes_test                            10.0
episode_length_test                     901.2
returns_test                       590.062244
return_std_test                    198.527014
average_reward_test                   0.65914
round_time_test        0 days 00:00:11.992246
round_time_total       0 days 00:18:00.134569
loss_total                        1989.350577
loss_critic                        2459.17689
loss_actor                         110.045138
memory_size                       202381.7045 

=== epoch 3/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [18:01<00:00,  1.85it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   24
episode_length                        406.875
returns                           -170.586052
return_std                         177.156172
average_reward                      -0.415572
round_time             0 days 00:18:02.160696
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       753.911439
return_std_test                     15.597693
average_reward_test                  0.753911
round_time_test        0 days 00:00:12.108568
round_time_total       0 days 00:18:02.161788
loss_total                        1914.132014
loss_critic                       2368.431326
loss_actor                          96.934618
memory_size                       204209.7295 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [16:43<00:00,  1.99it/s]
episodes                                    3
episode_length                          468.0
returns                           -142.344687
return_std                         139.924785
average_reward                      -0.329638
round_time             0 days 00:16:43.309761
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       771.789644
return_std_test                      8.518089
average_reward_test                   0.77179
round_time_test        0 days 00:00:11.911227
round_time_total       0 days 00:16:43.310865
loss_total                        1879.715997
loss_critic                       2327.215141
loss_actor                          89.719273
memory_size                        205993.215 

=== epoch 4/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:47<00:00,  1.98it/s]
episodes                                    9
episode_length                     398.333333
returns                           -137.602819
return_std                           153.2422
average_reward                      -0.358226
round_time             0 days 00:16:48.241264
episodes_test                            10.0
episode_length_test                     901.1
returns_test                       723.427164
return_std_test                    243.135713
average_reward_test                  0.802257
round_time_test        0 days 00:00:11.605407
round_time_total       0 days 00:16:48.242347
loss_total                        1834.828123
loss_critic                       2272.890575
loss_actor                          82.578163
memory_size                        207870.323 

=== epoch 4/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:52<00:00,  1.97it/s]
episodes                                   12
episode_length                     468.166667
returns                           -165.613772
return_std                         168.592147
average_reward                      -0.331553
round_time             0 days 00:16:53.231018
episodes_test                            10.0
episode_length_test                     901.2
returns_test                       728.993088
return_std_test                    243.697158
average_reward_test                  0.808773
round_time_test        0 days 00:00:11.968434
round_time_total       0 days 00:16:53.232101
loss_total                        1776.463285
loss_critic                       2201.098081
loss_actor                          77.923962
memory_size                       209755.9285 

=== epoch 4/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:50<00:00,  1.98it/s]
episodes                                   17
episode_length                     462.941176
returns                           -155.194694
return_std                         158.758504
average_reward                      -0.335797
round_time             0 days 00:16:50.881097
episodes_test                            10.0
episode_length_test                     901.1
returns_test                       736.867954
return_std_test                    246.916143
average_reward_test                  0.816944
round_time_test        0 days 00:00:11.608707
round_time_total       0 days 00:16:50.882196
loss_total                        1748.329429
loss_critic                       2166.969806
loss_actor                          73.767778
memory_size                       211625.1555 

=== epoch 4/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:52<00:00,  1.98it/s]
episodes                                   23
episode_length                      400.73913
returns                           -131.911474
return_std                          155.25329
average_reward                      -0.318837
round_time             0 days 00:16:52.762846
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       798.469715
return_std_test                      10.38368
average_reward_test                   0.79847
round_time_test        0 days 00:00:11.703141
round_time_total       0 days 00:16:52.764289
loss_total                        1704.126765
loss_critic                       2112.789556
loss_actor                          69.475472
memory_size                       213431.6175 

=== epoch 4/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:58<00:00,  1.96it/s]
episodes                                   28
episode_length                     322.071429
returns                            -96.644752
return_std                         132.821106
average_reward                      -0.305378
round_time             0 days 00:16:58.911693
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       824.179334
return_std_test                     11.663129
average_reward_test                  0.824179
round_time_test        0 days 00:00:12.158850
round_time_total       0 days 00:16:58.912808
loss_total                        1669.862995
loss_critic                       2071.544531
loss_actor                          63.136709
memory_size                        215135.832 

=== epoch 4/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:12<00:00,  1.94it/s]
episodes                                   28
episode_length                     335.678571
returns                            -93.574842
return_std                         121.838285
average_reward                      -0.284807
round_time             0 days 00:17:12.957973
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       838.993478
return_std_test                     12.688303
average_reward_test                  0.838993
round_time_test        0 days 00:00:11.996665
round_time_total       0 days 00:17:12.959485
loss_total                        1644.198634
loss_critic                       2040.472882
loss_actor                          59.101514
memory_size                        216942.018 

=== epoch 4/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:23<00:00,  1.92it/s]
episodes                                   35
episode_length                          266.0
returns                            -74.193734
return_std                         105.985734
average_reward                      -0.279764
round_time             0 days 00:17:24.165520
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       843.414231
return_std_test                      9.260405
average_reward_test                  0.843414
round_time_test        0 days 00:00:11.784669
round_time_total       0 days 00:17:24.166642
loss_total                        1619.160857
loss_critic                       2010.335138
loss_actor                          54.463603
memory_size                       218695.2615 

=== epoch 4/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:26<00:00,  1.91it/s]
episodes                                   36
episode_length                     248.722222
returns                            -63.280743
return_std                          87.899969
average_reward                      -0.261578
round_time             0 days 00:17:26.523366
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       846.226291
return_std_test                      9.339641
average_reward_test                  0.846226
round_time_test        0 days 00:00:11.912207
round_time_total       0 days 00:17:26.524466
loss_total                        1589.294745
loss_critic                        1974.19012
loss_actor                          49.713118
memory_size                       220464.2105 

=== epoch 4/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                   40
episode_length                        244.125
returns                            -61.811104
return_std                          84.636198
average_reward                      -0.253927
round_time             0 days 00:17:37.362732
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       855.235344
return_std_test                      9.929917
average_reward_test                  0.855235
round_time_test        0 days 00:00:12.059178
round_time_total       0 days 00:17:37.363871
loss_total                        1567.230779
loss_critic                       1947.202805
loss_actor                          47.342535
memory_size                       222115.9795 

=== epoch 4/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:50<00:00,  1.87it/s]
episodes                                   36
episode_length                     257.861111
returns                            -60.281383
return_std                          84.892592
average_reward                      -0.234164
round_time             0 days 00:17:51.529235
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       855.929966
return_std_test                      7.946096
average_reward_test                   0.85593
round_time_test        0 days 00:00:11.951934
round_time_total       0 days 00:17:51.530589
loss_total                          1583.0327
loss_critic                       1967.973737
loss_actor                          43.268423
memory_size                       223886.7925 

=== epoch 4/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:48<00:00,  1.87it/s]
episodes                                   35
episode_length                     283.228571
returns                            -65.215429
return_std                          84.449618
average_reward                      -0.227857
round_time             0 days 00:17:48.734853
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       862.938173
return_std_test                     13.440378
average_reward_test                  0.862938
round_time_test        0 days 00:00:11.763899
round_time_total       0 days 00:17:48.736202
loss_total                        1537.672345
loss_critic                       1912.141553
loss_actor                          39.795393
memory_size                       225763.3825 

=== epoch 4/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:41<00:00,  1.88it/s]
episodes                                   31
episode_length                     280.548387
returns                            -64.783515
return_std                          79.780149
average_reward                      -0.235513
round_time             0 days 00:17:42.078668
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       868.196672
return_std_test                      13.04896
average_reward_test                  0.868197
round_time_test        0 days 00:00:11.697885
round_time_total       0 days 00:17:42.079942
loss_total                        1517.934082
loss_critic                       1888.493432
loss_actor                          35.696553
memory_size                       227563.2055 

=== epoch 4/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                   28
episode_length                     342.785714
returns                            -78.060571
return_std                           92.50316
average_reward                      -0.227182
round_time             0 days 00:17:32.820651
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       878.361203
return_std_test                     12.341707
average_reward_test                  0.878361
round_time_test        0 days 00:00:11.832848
round_time_total       0 days 00:17:32.821755
loss_total                         1482.53768
loss_critic                       1845.011566
loss_actor                          32.642017
memory_size                        229415.177 

=== epoch 4/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:34<00:00,  1.90it/s]
episodes                                   23
episode_length                     423.565217
returns                           -100.470917
return_std                         101.974836
average_reward                      -0.236226
round_time             0 days 00:17:34.597061
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        873.23706
return_std_test                     12.693545
average_reward_test                  0.873237
round_time_test        0 days 00:00:11.631738
round_time_total       0 days 00:17:34.598176
loss_total                        1456.755373
loss_critic                       1813.720004
loss_actor                          28.896714
memory_size                       231330.8195 

=== epoch 4/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:31<00:00,  1.90it/s]
episodes                                   27
episode_length                     340.888889
returns                            -86.568674
return_std                          90.605384
average_reward                      -0.255796
round_time             0 days 00:17:31.951326
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       874.335488
return_std_test                     11.478837
average_reward_test                  0.874335
round_time_test        0 days 00:00:11.800914
round_time_total       0 days 00:17:31.952602
loss_total                        1422.967613
loss_critic                       1772.075951
loss_actor                          26.534142
memory_size                       233092.8635 

=== epoch 4/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   37
episode_length                     253.027027
returns                            -65.010229
return_std                          82.362949
average_reward                      -0.257126
round_time             0 days 00:17:30.375570
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       904.287904
return_std_test                     23.831258
average_reward_test                  0.904288
round_time_test        0 days 00:00:11.666109
round_time_total       0 days 00:17:30.376677
loss_total                         1413.39845
loss_critic                       1760.564018
loss_actor                          24.736057
memory_size                        234667.557 

=== epoch 4/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                   39
episode_length                     240.076923
returns                            -60.265267
return_std                           83.71175
average_reward                       -0.25094
round_time             0 days 00:17:33.460860
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       892.224438
return_std_test                      8.371282
average_reward_test                  0.892224
round_time_test        0 days 00:00:11.531142
round_time_total       0 days 00:17:33.462086
loss_total                        1391.426826
loss_critic                       1733.877087
loss_actor                          21.625668
memory_size                       236436.6195 

=== epoch 4/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:33<00:00,  1.90it/s]
episodes                                   45
episode_length                     192.666667
returns                            -52.152603
return_std                          73.849319
average_reward                      -0.263537
round_time             0 days 00:17:33.731750
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.269829
return_std_test                     13.993264
average_reward_test                   0.89827
round_time_test        0 days 00:00:11.769873
round_time_total       0 days 00:17:33.732888
loss_total                         1370.17378
loss_critic                       1707.958755
loss_actor                          19.033759
memory_size                       238042.4875 

=== epoch 4/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                   48
episode_length                         204.75
returns                            -54.502714
return_std                          75.115753
average_reward                      -0.267931
round_time             0 days 00:17:33.281274
episodes_test                            10.0
episode_length_test                     901.1
returns_test                       817.360354
return_std_test                    270.753312
average_reward_test                  0.908213
round_time_test        0 days 00:00:11.722868
round_time_total       0 days 00:17:33.282373
loss_total                        1344.776005
loss_critic                       1676.890373
loss_actor                          16.318426
memory_size                       239802.2035 

=== epoch 4/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:36<00:00,  1.89it/s]
episodes                                   46
episode_length                     201.086957
returns                            -54.715802
return_std                          77.195154
average_reward                      -0.268066
round_time             0 days 00:17:37.034245
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       902.993146
return_std_test                     16.141399
average_reward_test                  0.902993
round_time_test        0 days 00:00:11.537541
round_time_total       0 days 00:17:37.035349
loss_total                        1321.398984
loss_critic                       1648.300874
loss_actor                          13.791307
memory_size                        241618.104 

=== epoch 4/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:37<00:00,  1.89it/s]
episodes                                   35
episode_length                     245.457143
returns                            -69.387951
return_std                          86.077759
average_reward                      -0.278275
round_time             0 days 00:17:38.448410
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       905.770945
return_std_test                     10.755742
average_reward_test                  0.905771
round_time_test        0 days 00:00:11.533551
round_time_total       0 days 00:17:38.449523
loss_total                        1323.953778
loss_critic                       1651.865644
loss_actor                          12.306201
memory_size                        243417.199 

=== epoch 4/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:28<00:00,  1.91it/s]
episodes                                   32
episode_length                      295.65625
returns                            -83.593384
return_std                          92.406238
average_reward                       -0.28326
round_time             0 days 00:17:28.611629
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       906.786218
return_std_test                      6.859383
average_reward_test                  0.906786
round_time_test        0 days 00:00:11.497384
round_time_total       0 days 00:17:28.612738
loss_total                        1304.591254
loss_critic                       1628.143231
loss_actor                          10.383235
memory_size                          245275.2 

=== epoch 4/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   29
episode_length                     326.586207
returns                            -92.753936
return_std                         105.870861
average_reward                      -0.285644
round_time             0 days 00:17:29.748843
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.597317
return_std_test                     13.613185
average_reward_test                  0.898597
round_time_test        0 days 00:00:11.558875
round_time_total       0 days 00:17:29.749942
loss_total                        1283.040584
loss_critic                       1601.473545
loss_actor                           9.308628
memory_size                        247117.842 

=== epoch 4/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:26<00:00,  1.91it/s]
episodes                                   23
episode_length                     389.478261
returns                           -112.403462
return_std                         117.473202
average_reward                      -0.286025
round_time             0 days 00:17:27.097970
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       903.709007
return_std_test                     14.119235
average_reward_test                  0.903709
round_time_test        0 days 00:00:11.484406
round_time_total       0 days 00:17:27.099117
loss_total                        1288.582439
loss_critic                       1608.966978
loss_actor                            7.04416
memory_size                        248994.397 

=== epoch 4/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:22<00:00,  1.92it/s]
episodes                                   25
episode_length                         371.48
returns                           -107.821347
return_std                         116.383524
average_reward                      -0.292193
round_time             0 days 00:17:23.364916
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.427564
return_std_test                     10.272348
average_reward_test                  0.912428
round_time_test        0 days 00:00:11.587903
round_time_total       0 days 00:17:23.366011
loss_total                        1262.528185
loss_critic                       1576.915516
loss_actor                            4.97876
memory_size                       250791.0275 

=== epoch 4/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:27<00:00,  1.91it/s]
episodes                                   32
episode_length                       295.5625
returns                            -87.973883
return_std                         110.602819
average_reward                      -0.296092
round_time             0 days 00:17:27.649330
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       927.113391
return_std_test                     16.156596
average_reward_test                  0.927113
round_time_test        0 days 00:00:11.674909
round_time_total       0 days 00:17:27.650430
loss_total                        1259.096988
loss_critic                       1572.982313
loss_actor                           3.555576
memory_size                       252501.2755 

=== epoch 4/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:30<00:00,  1.90it/s]
episodes                                   33
episode_length                     280.727273
returns                            -81.333118
return_std                         105.562642
average_reward                      -0.287078
round_time             0 days 00:17:31.256395
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       905.806127
return_std_test                      6.928906
average_reward_test                  0.905806
round_time_test        0 days 00:00:11.619441
round_time_total       0 days 00:17:31.257801
loss_total                         1257.69857
loss_critic                       1571.603831
loss_actor                           2.077412
memory_size                       254251.3225 

=== epoch 4/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   31
episode_length                     268.741935
returns                             -81.44129
return_std                         103.936422
average_reward                      -0.298742
round_time             0 days 00:17:29.688951
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.174427
return_std_test                      8.686483
average_reward_test                  0.929174
round_time_test        0 days 00:00:11.465616
round_time_total       0 days 00:17:29.690031
loss_total                         1246.74099
loss_critic                        1558.33177
loss_actor                            0.37776
memory_size                       256050.0755 

=== epoch 4/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.90it/s]
episodes                                   40
episode_length                        249.375
returns                            -72.363907
return_std                         100.803836
average_reward                      -0.290315
round_time             0 days 00:17:30.485960
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.918863
return_std_test                     12.101417
average_reward_test                  0.940919
round_time_test        0 days 00:00:11.764899
round_time_total       0 days 00:17:30.487107
loss_total                         1225.80796
loss_critic                       1532.627718
loss_actor                          -1.471178
memory_size                       257831.5795 

=== epoch 4/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:23<00:00,  1.92it/s]
episodes                                   39
episode_length                     235.358974
returns                             -67.02226
return_std                          99.172016
average_reward                      -0.279452
round_time             0 days 00:17:23.977462
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       918.099222
return_std_test                      9.380675
average_reward_test                  0.918099
round_time_test        0 days 00:00:11.808372
round_time_total       0 days 00:17:23.978549
loss_total                        1212.958234
loss_critic                        1516.95336
loss_actor                          -3.022382
memory_size                        259474.474 

=== epoch 4/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   39
episode_length                     242.923077
returns                            -62.862566
return_std                          96.033685
average_reward                       -0.26238
round_time             0 days 00:17:29.851600
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       927.322652
return_std_test                     10.825415
average_reward_test                  0.927323
round_time_test        0 days 00:00:11.682793
round_time_total       0 days 00:17:29.852716
loss_total                        1221.477121
loss_critic                       1528.084396
loss_actor                          -4.952082
memory_size                       261287.2715 

=== epoch 4/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   46
episode_length                     197.847826
returns                            -52.517567
return_std                          82.766481
average_reward                       -0.27831
round_time             0 days 00:17:30.115332
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.874381
return_std_test                      7.915538
average_reward_test                  0.915874
round_time_test        0 days 00:00:11.610459
round_time_total       0 days 00:17:30.116430
loss_total                        1205.547671
loss_critic                       1508.557656
loss_actor                           -6.49238
memory_size                       262868.9395 

=== epoch 4/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:27<00:00,  1.91it/s]
episodes                                   47
episode_length                     190.212766
returns                            -50.111025
return_std                          83.997397
average_reward                      -0.267829
round_time             0 days 00:17:28.316826
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       931.748754
return_std_test                     10.231983
average_reward_test                  0.931749
round_time_test        0 days 00:00:11.678318
round_time_total       0 days 00:17:28.318065
loss_total                        1182.975381
loss_critic                       1480.649461
loss_actor                          -7.721043
memory_size                       264531.9465 

=== epoch 4/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:32<00:00,  1.90it/s]
episodes                                   43
episode_length                     218.023256
returns                            -65.035711
return_std                          98.085179
average_reward                       -0.29405
round_time             0 days 00:17:32.811644
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       925.447338
return_std_test                      8.349273
average_reward_test                  0.925447
round_time_test        0 days 00:00:11.521021
round_time_total       0 days 00:17:32.813045
loss_total                         1176.71991
loss_critic                       1473.053526
loss_actor                          -8.614661
memory_size                       266335.0205 

=== epoch 4/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:30<00:00,  1.90it/s]
episodes                                   44
episode_length                     208.636364
returns                            -64.143658
return_std                          99.759232
average_reward                      -0.302119
round_time             0 days 00:17:30.729980
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       925.163146
return_std_test                      8.001698
average_reward_test                  0.925163
round_time_test        0 days 00:00:11.441954
round_time_total       0 days 00:17:30.731101
loss_total                        1161.917113
loss_critic                       1454.812636
loss_actor                           -9.66509
memory_size                        268092.795 

=== epoch 4/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:29<00:00,  1.91it/s]
episodes                                   48
episode_length                        190.625
returns                            -56.823348
return_std                          99.469003
average_reward                       -0.30806
round_time             0 days 00:17:30.200408
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       928.330823
return_std_test                       8.31363
average_reward_test                  0.928331
round_time_test        0 days 00:00:11.479102
round_time_total       0 days 00:17:30.201492
loss_total                        1171.246469
loss_critic                       1466.813521
loss_actor                         -11.021842
memory_size                       269734.2475 

=== epoch 4/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:31<00:00,  1.90it/s]
episodes                                   45
episode_length                     184.666667
returns                            -56.858755
return_std                          102.83524
average_reward                      -0.313586
round_time             0 days 00:17:31.965534
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       942.430436
return_std_test                      8.089424
average_reward_test                   0.94243
round_time_test        0 days 00:00:11.582233
round_time_total       0 days 00:17:31.966616
loss_total                        1145.760868
loss_critic                       1435.250922
loss_actor                          -12.19946
memory_size                        271308.319 

=== epoch 4/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [17:15<00:00,  1.93it/s]
episodes                                   61
episode_length                     162.360656
returns                            -50.024765
return_std                          92.504852
average_reward                      -0.310471
round_time             0 days 00:17:16.086581
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       928.263889
return_std_test                     12.446028
average_reward_test                  0.928264
round_time_test        0 days 00:00:11.830961
round_time_total       0 days 00:17:16.087806
loss_total                        1137.555611
loss_critic                       1425.158972
loss_actor                         -12.857932
memory_size                       272857.4385 

=== epoch 4/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   60
episode_length                     151.983333
returns                            -45.197817
return_std                           86.54129
average_reward                      -0.296638
round_time             0 days 00:16:38.749018
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        930.85588
return_std_test                     10.016288
average_reward_test                  0.930856
round_time_test        0 days 00:00:11.504460
round_time_total       0 days 00:16:38.750150
loss_total                        1138.424267
loss_critic                       1426.601255
loss_actor                          -14.28379
memory_size                        274403.466 

=== epoch 4/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   59
episode_length                     156.084746
returns                            -47.280349
return_std                          85.253786
average_reward                      -0.304329
round_time             0 days 00:16:36.682977
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       932.061459
return_std_test                     14.190676
average_reward_test                  0.932061
round_time_test        0 days 00:00:11.574139
round_time_total       0 days 00:16:36.684069
loss_total                        1136.528304
loss_critic                       1424.506962
loss_actor                         -15.386425
memory_size                        276179.455 

=== epoch 4/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:37<00:00,  2.01it/s]
episodes                                   57
episode_length                     151.280702
returns                             -45.70637
return_std                          83.692925
average_reward                      -0.303342
round_time             0 days 00:16:37.965303
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       939.656989
return_std_test                      8.537719
average_reward_test                  0.939657
round_time_test        0 days 00:00:11.700072
round_time_total       0 days 00:16:37.966765
loss_total                        1132.225072
loss_critic                       1419.427813
loss_actor                         -16.585988
memory_size                        277820.153 

=== epoch 4/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                   57
episode_length                     170.684211
returns                            -51.556967
return_std                          88.314775
average_reward                      -0.300096
round_time             0 days 00:16:35.340135
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       923.629973
return_std_test                      6.755011
average_reward_test                   0.92363
round_time_test        0 days 00:00:11.615076
round_time_total       0 days 00:16:35.341296
loss_total                        1127.842157
loss_critic                       1414.238125
loss_actor                         -17.741817
memory_size                        279454.413 

=== epoch 4/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   39
episode_length                     237.205128
returns                            -75.592092
return_std                         114.720484
average_reward                      -0.306219
round_time             0 days 00:16:38.838932
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       934.886695
return_std_test                      6.598393
average_reward_test                  0.934887
round_time_test        0 days 00:00:11.361632
round_time_total       0 days 00:16:38.840035
loss_total                        1117.485714
loss_critic                       1401.504966
loss_actor                          -18.59139
memory_size                       281270.9835 

=== epoch 4/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:51<00:00,  1.98it/s]
episodes                                   37
episode_length                      227.72973
returns                            -69.879638
return_std                         107.768539
average_reward                      -0.309637
round_time             0 days 00:16:51.694462
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       944.114491
return_std_test                     17.733789
average_reward_test                  0.944114
round_time_test        0 days 00:00:11.488039
round_time_total       0 days 00:16:51.695775
loss_total                        1130.409931
loss_critic                        1418.06918
loss_actor                         -20.227163
memory_size                        283150.631 

=== epoch 4/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   34
episode_length                     253.558824
returns                            -78.136591
return_std                          118.23172
average_reward                      -0.312142
round_time             0 days 00:16:38.580249
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       933.359515
return_std_test                     10.561446
average_reward_test                   0.93336
round_time_test        0 days 00:00:11.150820
round_time_total       0 days 00:16:38.581475
loss_total                        1109.575331
loss_critic                       1392.235525
loss_actor                         -21.065552
memory_size                       284967.0245 

=== epoch 4/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:50<00:00,  1.98it/s]
episodes                                   26
episode_length                     355.038462
returns                           -112.636793
return_std                         133.919663
average_reward                      -0.318308
round_time             0 days 00:16:51.351720
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       934.567076
return_std_test                     11.441129
average_reward_test                  0.934567
round_time_test        0 days 00:00:11.414859
round_time_total       0 days 00:16:51.352822
loss_total                        1123.058679
loss_critic                       1409.424855
loss_actor                         -22.406111
memory_size                        286817.813 

=== epoch 4/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:44<00:00,  1.99it/s]
episodes                                   21
episode_length                     437.952381
returns                           -139.727259
return_std                         152.860333
average_reward                       -0.31267
round_time             0 days 00:16:44.767734
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.443651
return_std_test                      8.590751
average_reward_test                  0.929444
round_time_test        0 days 00:00:11.407667
round_time_total       0 days 00:16:44.768841
loss_total                        1112.954457
loss_critic                       1396.887305
loss_actor                          -22.77704
memory_size                       288673.9215 

=== epoch 4/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:37<00:00,  2.01it/s]
episodes                                   31
episode_length                      304.16129
returns                            -93.782615
return_std                         129.729985
average_reward                      -0.308123
round_time             0 days 00:16:37.842269
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       908.108409
return_std_test                      9.619533
average_reward_test                  0.908108
round_time_test        0 days 00:00:11.425950
round_time_total       0 days 00:16:37.843365
loss_total                        1100.453428
loss_critic                       1381.680699
loss_actor                         -24.455753
memory_size                       290505.6645 

=== epoch 4/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   33
episode_length                     277.545455
returns                            -86.955269
return_std                         119.668543
average_reward                      -0.318261
round_time             0 days 00:16:39.138646
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       925.834417
return_std_test                     12.931095
average_reward_test                  0.925834
round_time_test        0 days 00:00:11.207932
round_time_total       0 days 00:16:39.139736
loss_total                         1115.66288
loss_critic                       1401.104869
loss_actor                         -26.105165
memory_size                        292065.043 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:30<00:00,  2.15it/s]
episodes                                    8
episode_length                         235.25
returns                            -68.654314
return_std                         118.041016
average_reward                      -0.289093
round_time             0 days 00:15:30.981755
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       916.705935
return_std_test                     11.033396
average_reward_test                  0.916706
round_time_test        0 days 00:00:11.340157
round_time_total       0 days 00:15:30.982855
loss_total                        1091.992916
loss_critic                       1371.841202
loss_actor                         -27.400326
memory_size                       293878.1605 

=== epoch 5/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:32<00:00,  2.14it/s]
episodes                                   19
episode_length                     194.894737
returns                            -56.473007
return_std                          97.948239
average_reward                       -0.28305
round_time             0 days 00:15:33.028322
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        944.79959
return_std_test                     14.498058
average_reward_test                    0.9448
round_time_test        0 days 00:00:11.260983
round_time_total       0 days 00:15:33.029393
loss_total                        1098.779392
loss_critic                       1380.626515
loss_actor                         -28.609195
memory_size                        295600.134 

=== epoch 5/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:35<00:00,  2.14it/s]
episodes                                   33
episode_length                     178.757576
returns                            -56.218761
return_std                          91.320853
average_reward                      -0.318469
round_time             0 days 00:15:35.584884
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.567724
return_std_test                     14.893497
average_reward_test                  0.940568
round_time_test        0 days 00:00:11.459227
round_time_total       0 days 00:15:35.585996
loss_total                        1102.956767
loss_critic                       1386.093689
loss_actor                         -29.591019
memory_size                        297267.641 

=== epoch 5/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:36<00:00,  2.14it/s]
episodes                                   44
episode_length                     179.795455
returns                            -58.765555
return_std                          97.317816
average_reward                      -0.331492
round_time             0 days 00:15:36.510550
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       943.279055
return_std_test                     17.856141
average_reward_test                  0.943279
round_time_test        0 days 00:00:11.543207
round_time_total       0 days 00:15:36.511661
loss_total                        1097.554672
loss_critic                       1379.554043
loss_actor                         -30.442914
memory_size                        298858.848 

=== epoch 5/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:38<00:00,  2.13it/s]
episodes                                   54
episode_length                     184.907407
returns                            -57.291596
return_std                           99.86535
average_reward                      -0.310223
round_time             0 days 00:15:39.160899
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        943.96479
return_std_test                      5.999696
average_reward_test                  0.943965
round_time_test        0 days 00:00:11.663971
round_time_total       0 days 00:15:39.161989
loss_total                        1106.605512
loss_critic                       1391.117021
loss_actor                         -31.440618
memory_size                       300530.8605 

=== epoch 5/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:37<00:00,  2.13it/s]
episodes                                   56
episode_length                     162.339286
returns                             -54.44973
return_std                          91.991197
average_reward                      -0.330924
round_time             0 days 00:15:38.251774
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       947.556554
return_std_test                      7.708325
average_reward_test                  0.947557
round_time_test        0 days 00:00:11.519758
round_time_total       0 days 00:15:38.253121
loss_total                        1110.068932
loss_critic                       1395.769166
loss_actor                         -32.732103
memory_size                        302175.967 

=== epoch 5/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:47<00:00,  2.11it/s]
episodes                                   51
episode_length                     174.862745
returns                            -58.452085
return_std                         101.356981
average_reward                      -0.329652
round_time             0 days 00:15:48.212397
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       935.434391
return_std_test                     10.299942
average_reward_test                  0.935434
round_time_test        0 days 00:00:11.343767
round_time_total       0 days 00:15:48.213620
loss_total                        1110.287993
loss_critic                       1396.160198
loss_actor                         -33.200917
memory_size                        303860.418 

=== epoch 5/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [15:53<00:00,  2.10it/s]
episodes                                   45
episode_length                     200.488889
returns                            -62.868513
return_std                         101.152308
average_reward                      -0.319121
round_time             0 days 00:15:54.269516
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.886363
return_std_test                     14.502328
average_reward_test                  0.940886
round_time_test        0 days 00:00:11.295523
round_time_total       0 days 00:15:54.270617
loss_total                        1118.353697
loss_critic                       1406.554865
loss_actor                         -34.451081
memory_size                        305659.505 

=== epoch 5/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:03<00:00,  2.08it/s]
episodes                                   46
episode_length                     214.869565
returns                            -65.920451
return_std                         108.531563
average_reward                      -0.308892
round_time             0 days 00:16:03.560087
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        930.08037
return_std_test                     10.011966
average_reward_test                   0.93008
round_time_test        0 days 00:00:11.555054
round_time_total       0 days 00:16:03.561172
loss_total                        1099.075416
loss_critic                       1382.617216
loss_actor                         -35.091882
memory_size                       307372.0705 

=== epoch 5/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:10<00:00,  2.06it/s]
episodes                                   50
episode_length                         196.58
returns                             -62.34378
return_std                          103.37953
average_reward                      -0.312741
round_time             0 days 00:16:10.809564
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       933.067105
return_std_test                     13.489167
average_reward_test                  0.933067
round_time_test        0 days 00:00:11.355576
round_time_total       0 days 00:16:10.810655
loss_total                        1099.633717
loss_critic                       1383.543731
loss_actor                         -36.006433
memory_size                        308962.309 

=== epoch 5/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   48
episode_length                         196.75
returns                            -58.794111
return_std                         105.026426
average_reward                      -0.297634
round_time             0 days 00:16:18.519667
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        942.82632
return_std_test                     17.507736
average_reward_test                  0.942826
round_time_test        0 days 00:00:11.363658
round_time_total       0 days 00:16:18.520773
loss_total                        1089.788152
loss_critic                       1371.360373
loss_actor                         -36.500833
memory_size                        310610.916 

=== epoch 5/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                   50
episode_length                         184.92
returns                            -57.212182
return_std                         102.712493
average_reward                      -0.314087
round_time             0 days 00:16:25.158864
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       938.578835
return_std_test                     15.569754
average_reward_test                  0.938579
round_time_test        0 days 00:00:11.324214
round_time_total       0 days 00:16:25.160213
loss_total                        1094.585796
loss_critic                       1377.640985
loss_actor                         -37.635059
memory_size                       312334.9865 

=== epoch 5/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   51
episode_length                     194.862745
returns                            -60.673587
return_std                          111.12478
average_reward                      -0.306065
round_time             0 days 00:16:24.468127
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        930.88388
return_std_test                     17.701334
average_reward_test                  0.930884
round_time_test        0 days 00:00:11.466324
round_time_total       0 days 00:16:24.469354
loss_total                         1091.88399
loss_critic                       1374.606754
loss_actor                         -39.007155
memory_size                        314079.757 

=== epoch 5/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                   50
episode_length                          198.1
returns                            -61.165419
return_std                         108.124101
average_reward                      -0.308957
round_time             0 days 00:16:20.921575
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.992325
return_std_test                     19.485433
average_reward_test                  0.940992
round_time_test        0 days 00:00:11.613132
round_time_total       0 days 00:16:20.922849
loss_total                        1085.143151
loss_critic                       1366.316734
loss_actor                         -39.551278
memory_size                       315826.7825 

=== epoch 5/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:21<00:00,  2.04it/s]
episodes                                   45
episode_length                          218.4
returns                            -67.927912
return_std                         111.214009
average_reward                      -0.306664
round_time             0 days 00:16:22.471505
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       937.941605
return_std_test                     13.219052
average_reward_test                  0.937942
round_time_test        0 days 00:00:11.142824
round_time_total       0 days 00:16:22.472584
loss_total                        1084.001923
loss_critic                       1365.086758
loss_actor                         -40.337512
memory_size                       317413.4905 

=== epoch 5/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   38
episode_length                     223.078947
returns                            -68.091214
return_std                         111.629303
average_reward                      -0.314612
round_time             0 days 00:16:23.947942
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       935.770608
return_std_test                     11.968662
average_reward_test                  0.935771
round_time_test        0 days 00:00:11.326109
round_time_total       0 days 00:16:23.949021
loss_total                        1078.242042
loss_critic                       1358.073603
loss_actor                         -41.084299
memory_size                       319264.2285 

=== epoch 5/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:21<00:00,  2.04it/s]
episodes                                   34
episode_length                     264.647059
returns                             -86.97091
return_std                         129.430302
average_reward                      -0.323951
round_time             0 days 00:16:22.021949
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.460011
return_std_test                     18.756759
average_reward_test                   0.94046
round_time_test        0 days 00:00:11.254609
round_time_total       0 days 00:16:22.023411
loss_total                         1083.79041
loss_critic                       1365.233845
loss_actor                         -41.983424
memory_size                        321168.808 

=== epoch 5/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                   37
episode_length                      265.72973
returns                            -88.404175
return_std                         126.800756
average_reward                      -0.331519
round_time             0 days 00:16:19.801566
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       930.759171
return_std_test                     18.303923
average_reward_test                  0.930759
round_time_test        0 days 00:00:11.342209
round_time_total       0 days 00:16:19.802653
loss_total                        1077.597725
loss_critic                       1357.645282
loss_actor                         -42.592594
memory_size                       323006.8045 

=== epoch 5/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   39
episode_length                     239.538462
returns                            -82.663783
return_std                         121.458916
average_reward                      -0.350838
round_time             0 days 00:16:23.026777
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       938.580538
return_std_test                     16.442462
average_reward_test                  0.938581
round_time_test        0 days 00:00:11.202912
round_time_total       0 days 00:16:23.027854
loss_total                        1086.901964
loss_critic                       1369.519856
loss_actor                           -43.5697
memory_size                        324486.865 

=== epoch 5/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   40
episode_length                        233.375
returns                            -82.855448
return_std                         123.002847
average_reward                      -0.351203
round_time             0 days 00:16:23.158788
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       923.246635
return_std_test                     14.101316
average_reward_test                  0.923247
round_time_test        0 days 00:00:11.303283
round_time_total       0 days 00:16:23.159873
loss_total                         1072.85317
loss_critic                       1352.139822
loss_actor                         -44.293535
memory_size                       326152.7955 

=== epoch 5/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                   61
episode_length                     162.442623
returns                            -56.029286
return_std                          97.571033
average_reward                      -0.343434
round_time             0 days 00:16:20.454070
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       928.017276
return_std_test                      16.07867
average_reward_test                  0.928017
round_time_test        0 days 00:00:11.454452
round_time_total       0 days 00:16:20.455152
loss_total                        1085.837982
loss_critic                       1368.557899
loss_actor                          -45.04178
memory_size                       327756.0405 

=== epoch 5/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   63
episode_length                     151.349206
returns                            -50.690086
return_std                           86.05361
average_reward                      -0.333618
round_time             0 days 00:16:23.825011
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       939.479489
return_std_test                     16.762333
average_reward_test                  0.939479
round_time_test        0 days 00:00:11.220354
round_time_total       0 days 00:16:23.826330
loss_total                        1068.440141
loss_critic                       1347.168839
loss_actor                         -46.474743
memory_size                       329272.8285 

=== epoch 5/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                   56
episode_length                     175.982143
returns                            -58.892086
return_std                          99.809626
average_reward                      -0.333308
round_time             0 days 00:16:25.123514
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.427735
return_std_test                     12.877863
average_reward_test                  0.929428
round_time_test        0 days 00:00:11.364597
round_time_total       0 days 00:16:25.124648
loss_total                        1093.307495
loss_critic                       1378.530277
loss_actor                         -47.583725
memory_size                       331108.6305 

=== epoch 5/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                   59
episode_length                     161.169492
returns                            -50.881244
return_std                          95.328626
average_reward                      -0.315296
round_time             0 days 00:16:30.420406
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       925.422814
return_std_test                     12.250791
average_reward_test                  0.925423
round_time_test        0 days 00:00:11.539353
round_time_total       0 days 00:16:30.421536
loss_total                        1150.672464
loss_critic                       1450.765338
loss_actor                         -49.699137
memory_size                       332728.8635 

=== epoch 5/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                   58
episode_length                     165.034483
returns                            -53.035744
return_std                          97.192395
average_reward                      -0.325022
round_time             0 days 00:16:25.505396
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       903.821237
return_std_test                     15.369966
average_reward_test                  0.903821
round_time_test        0 days 00:00:11.543928
round_time_total       0 days 00:16:25.506483
loss_total                        1314.977322
loss_critic                       1657.973513
loss_actor                         -57.007557
memory_size                       334382.2175 

=== epoch 5/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                   43
episode_length                     205.883721
returns                            -65.659875
return_std                         107.933622
average_reward                      -0.324096
round_time             0 days 00:16:30.217653
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       815.124383
return_std_test                     13.952043
average_reward_test                  0.815124
round_time_test        0 days 00:00:11.651676
round_time_total       0 days 00:16:30.218744
loss_total                        1434.529952
loss_critic                       1812.743893
loss_actor                         -78.325938
memory_size                        336096.881 

=== epoch 5/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:25<00:00,  2.03it/s]
episodes                                   39
episode_length                     214.384615
returns                            -65.182879
return_std                         108.612503
average_reward                      -0.299807
round_time             0 days 00:16:26.529193
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       742.076724
return_std_test                     10.505837
average_reward_test                  0.742077
round_time_test        0 days 00:00:11.613624
round_time_total       0 days 00:16:26.530279
loss_total                          1384.5384
loss_critic                       1765.304201
loss_actor                         -138.52492
memory_size                        337911.052 

=== epoch 5/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:28<00:00,  2.02it/s]
episodes                                   38
episode_length                     245.236842
returns                            -71.649885
return_std                         108.273038
average_reward                       -0.27998
round_time             0 days 00:16:28.685032
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       719.053748
return_std_test                     18.140456
average_reward_test                  0.719054
round_time_test        0 days 00:00:11.468444
round_time_total       0 days 00:16:28.686291
loss_total                        1519.493579
loss_critic                       1949.836897
loss_actor                        -201.879813
memory_size                        339808.326 

=== epoch 5/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                   29
episode_length                     310.793103
returns                            -78.669223
return_std                         110.578583
average_reward                      -0.255982
round_time             0 days 00:16:36.218357
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       698.807272
return_std_test                     13.088404
average_reward_test                  0.698807
round_time_test        0 days 00:00:11.831210
round_time_total       0 days 00:16:36.219462
loss_total                        1616.162314
loss_critic                       2083.529387
loss_actor                        -253.306122
memory_size                        341596.575 

=== epoch 5/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                   24
episode_length                         390.25
returns                            -92.847432
return_std                         115.209616
average_reward                        -0.2283
round_time             0 days 00:16:33.979246
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       724.990259
return_std_test                     15.044118
average_reward_test                   0.72499
round_time_test        0 days 00:00:11.520862
round_time_total       0 days 00:16:33.980345
loss_total                        1666.611122
loss_critic                       2155.622488
loss_actor                        -289.434495
memory_size                       343428.4445 

=== epoch 5/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   20
episode_length                          455.8
returns                            -91.467753
return_std                         102.146438
average_reward                      -0.207451
round_time             0 days 00:16:31.350980
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       735.091097
return_std_test                     17.357108
average_reward_test                  0.735091
round_time_test        0 days 00:00:11.512824
round_time_total       0 days 00:16:31.352071
loss_total                        1757.396131
loss_critic                       2275.188444
loss_actor                        -313.773282
memory_size                        345344.537 

=== epoch 5/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                   23
episode_length                     419.043478
returns                            -86.115044
return_std                          95.154064
average_reward                      -0.200794
round_time             0 days 00:16:32.461312
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       745.190292
return_std_test                     18.668876
average_reward_test                   0.74519
round_time_test        0 days 00:00:11.713610
round_time_total       0 days 00:16:32.462595
loss_total                        1791.397377
loss_critic                       2321.935365
loss_actor                        -330.754747
memory_size                       347149.3385 

=== epoch 5/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:35<00:00,  2.01it/s]
episodes                                   23
episode_length                     388.086957
returns                            -73.992495
return_std                          84.291524
average_reward                      -0.193364
round_time             0 days 00:16:35.718868
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        753.42919
return_std_test                     22.116221
average_reward_test                  0.753429
round_time_test        0 days 00:00:11.681860
round_time_total       0 days 00:16:35.719971
loss_total                        1803.994723
loss_critic                       2340.030925
loss_actor                        -340.150239
memory_size                       349022.1945 

=== epoch 5/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:37<00:00,  2.00it/s]
episodes                                   26
episode_length                     371.538462
returns                            -75.494888
return_std                          81.372161
average_reward                      -0.208176
round_time             0 days 00:16:38.583988
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       750.070951
return_std_test                      9.846598
average_reward_test                  0.750071
round_time_test        0 days 00:00:11.358888
round_time_total       0 days 00:16:38.585071
loss_total                        1761.235378
loss_critic                       2288.336142
loss_actor                        -347.167828
memory_size                        350822.275 

=== epoch 5/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   25
episode_length                         373.48
returns                            -84.280749
return_std                          88.638666
average_reward                      -0.229662
round_time             0 days 00:16:37.502167
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       751.898864
return_std_test                      20.79665
average_reward_test                  0.751899
round_time_test        0 days 00:00:11.600525
round_time_total       0 days 00:16:37.503263
loss_total                        1770.629121
loss_critic                       2300.974622
loss_actor                        -350.753027
memory_size                       352601.9945 

=== epoch 5/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                   32
episode_length                      310.65625
returns                            -69.578646
return_std                          90.675568
average_reward                      -0.224081
round_time             0 days 00:16:35.199064
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       818.527406
return_std_test                     23.174788
average_reward_test                  0.818527
round_time_test        0 days 00:00:11.503455
round_time_total       0 days 00:16:35.200153
loss_total                        1728.153021
loss_critic                       2248.054683
loss_actor                        -351.453796
memory_size                       354414.3055 

=== epoch 5/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   37
episode_length                     242.675676
returns                            -51.541133
return_std                          84.037988
average_reward                      -0.208284
round_time             0 days 00:16:36.955710
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       810.101358
return_std_test                     10.192753
average_reward_test                  0.810101
round_time_test        0 days 00:00:11.553280
round_time_total       0 days 00:16:36.956831
loss_total                        1715.537993
loss_critic                       2232.267984
loss_actor                        -351.382126
memory_size                        355999.345 

=== epoch 5/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:37<00:00,  2.01it/s]
episodes                                   40
episode_length                          232.5
returns                             -46.44565
return_std                          78.954963
average_reward                      -0.203355
round_time             0 days 00:16:37.927749
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       795.002843
return_std_test                      9.109362
average_reward_test                  0.795003
round_time_test        0 days 00:00:11.651409
round_time_total       0 days 00:16:37.928844
loss_total                        1670.545546
loss_critic                       2176.351158
loss_actor                         -352.67704
memory_size                       357794.0495 

=== epoch 5/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   35
episode_length                          251.2
returns                             -41.89592
return_std                           73.99043
average_reward                       -0.17693
round_time             0 days 00:16:38.997071
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       819.557155
return_std_test                     23.720239
average_reward_test                  0.819557
round_time_test        0 days 00:00:11.674035
round_time_total       0 days 00:16:38.998164
loss_total                        1684.413768
loss_critic                       2193.645877
loss_actor                        -352.514817
memory_size                       359583.5755 

=== epoch 5/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:45<00:00,  1.99it/s]
episodes                                   37
episode_length                     268.297297
returns                            -47.061687
return_std                          80.282809
average_reward                      -0.177349
round_time             0 days 00:16:45.794091
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        804.00098
return_std_test                     13.566296
average_reward_test                  0.804001
round_time_test        0 days 00:00:11.697706
round_time_total       0 days 00:16:45.795596
loss_total                        1649.918827
loss_critic                       2150.738047
loss_actor                        -353.358199
memory_size                       361461.3255 

=== epoch 5/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [16:42<00:00,  2.00it/s]
episodes                                   31
episode_length                     293.645161
returns                            -54.150667
return_std                          79.749799
average_reward                      -0.179341
round_time             0 days 00:16:42.571304
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       817.386026
return_std_test                      9.765378
average_reward_test                  0.817386
round_time_test        0 days 00:00:11.602517
round_time_total       0 days 00:16:42.572614
loss_total                        1622.102497
loss_critic                       2116.360254
loss_actor                        -354.928687
memory_size                        363225.372 

=== epoch 5/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [16:39<00:00,  2.00it/s]
episodes                                   26
episode_length                     351.115385
returns                            -69.386692
return_std                          90.884667
average_reward                      -0.194314
round_time             0 days 00:16:39.606773
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       793.708578
return_std_test                     16.555873
average_reward_test                  0.793709
round_time_test        0 days 00:00:11.581719
round_time_total       0 days 00:16:39.607864
loss_total                        1639.464667
loss_critic                       2138.243768
loss_actor                        -355.651888
memory_size                       365110.3045 

=== epoch 5/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [16:46<00:00,  1.99it/s]
episodes                                   24
episode_length                     391.916667
returns                            -79.974062
return_std                          84.776536
average_reward                      -0.205064
round_time             0 days 00:16:46.820185
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       832.088195
return_std_test                      7.903005
average_reward_test                  0.832088
round_time_test        0 days 00:00:11.493351
round_time_total       0 days 00:16:46.821281
loss_total                         1581.90787
loss_critic                       2066.738802
loss_actor                         -357.41601
memory_size                       366920.5395 

=== epoch 5/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [16:43<00:00,  1.99it/s]
episodes                                   21
episode_length                     404.619048
returns                            -82.434425
return_std                          78.696541
average_reward                      -0.205481
round_time             0 days 00:16:44.132547
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       835.698219
return_std_test                     14.311303
average_reward_test                  0.835698
round_time_test        0 days 00:00:11.516640
round_time_total       0 days 00:16:44.133631
loss_total                        1592.694775
loss_critic                       2080.336969
loss_actor                        -357.874147
memory_size                       368799.8985 

=== epoch 5/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   21
episode_length                     471.857143
returns                            -90.157707
return_std                          81.837203
average_reward                      -0.189105
round_time             0 days 00:16:37.155624
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       830.785416
return_std_test                     13.238652
average_reward_test                  0.830785
round_time_test        0 days 00:00:11.365254
round_time_total       0 days 00:16:37.156716
loss_total                        1563.352172
loss_critic                       2043.848439
loss_actor                        -358.633039
memory_size                       370658.6755 

=== epoch 5/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [16:39<00:00,  2.00it/s]
episodes                                   20
episode_length                         487.15
returns                            -92.194538
return_std                          77.238933
average_reward                      -0.191959
round_time             0 days 00:16:40.282420
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        852.49671
return_std_test                     15.227732
average_reward_test                  0.852497
round_time_test        0 days 00:00:11.650084
round_time_total       0 days 00:16:40.283700
loss_total                        1533.462632
loss_critic                       2006.829238
loss_actor                        -360.003941
memory_size                       372556.4295 

=== epoch 5/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                   27
episode_length                      355.37037
returns                            -67.288898
return_std                          71.052216
average_reward                       -0.18814
round_time             0 days 00:16:35.234352
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       828.155926
return_std_test                      9.479356
average_reward_test                  0.828156
round_time_test        0 days 00:00:11.528018
round_time_total       0 days 00:16:35.235448
loss_total                        1592.411831
loss_critic                       2080.593904
loss_actor                        -360.316609
memory_size                       374268.5935 

=== epoch 5/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [16:40<00:00,  2.00it/s]
episodes                                   28
episode_length                     298.214286
returns                            -56.449584
return_std                          73.386612
average_reward                      -0.184112
round_time             0 days 00:16:40.640034
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       836.543245
return_std_test                     11.127619
average_reward_test                  0.836543
round_time_test        0 days 00:00:11.572115
round_time_total       0 days 00:16:40.641124
loss_total                        1548.533126
loss_critic                       2025.838305
loss_actor                        -360.687722
memory_size                       375999.7455 

=== epoch 5/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [16:43<00:00,  1.99it/s]
episodes                                   30
episode_length                          313.7
returns                            -61.446828
return_std                          81.936521
average_reward                      -0.192641
round_time             0 days 00:16:43.598810
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       845.714423
return_std_test                     20.774602
average_reward_test                  0.845714
round_time_test        0 days 00:00:11.649732
round_time_total       0 days 00:16:43.599892
loss_total                        1559.626747
loss_critic                       2039.640352
loss_actor                        -360.427824
memory_size                        377828.788 

=== epoch 5/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   30
episode_length                     323.566667
returns                            -59.469065
return_std                          79.263904
average_reward                      -0.182557
round_time             0 days 00:16:36.883266
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       844.348743
return_std_test                     13.320399
average_reward_test                  0.844349
round_time_test        0 days 00:00:11.192007
round_time_total       0 days 00:16:36.884357
loss_total                        1531.907892
loss_critic                        2004.93549
loss_actor                        -360.202633
memory_size                       379729.0105 


=== epoch 6/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:28<00:00,  2.15it/s]
episodes                                   15
episode_length                           97.6
returns                            -18.173188
return_std                          24.858203
average_reward                      -0.157628
round_time             0 days 00:15:29.038890
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        874.56684
return_std_test                     10.158192
average_reward_test                  0.874567
round_time_test        0 days 00:00:11.331543
round_time_total       0 days 00:15:29.040000
loss_total                        1515.005564
loss_critic                       1983.847343
loss_actor                        -360.361696
memory_size                       381266.9695 

=== epoch 6/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [15:33<00:00,  2.14it/s]
episodes                                   20
episode_length                         187.65
returns                            -36.211229
return_std                          61.044943
average_reward                      -0.202827
round_time             0 days 00:15:34.138584
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       883.248774
return_std_test                     16.852415
average_reward_test                  0.883249
round_time_test        0 days 00:00:11.522539
round_time_total       0 days 00:15:34.139664
loss_total                        1526.286061
loss_critic                       1998.142385
loss_actor                         -361.13937
memory_size                        383077.426 

=== epoch 6/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [15:37<00:00,  2.13it/s]
episodes                                   22
episode_length                          261.5
returns                            -54.113901
return_std                          81.595807
average_reward                      -0.205913
round_time             0 days 00:15:37.782176
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.611909
return_std_test                      10.13518
average_reward_test                  0.880612
round_time_test        0 days 00:00:11.770011
round_time_total       0 days 00:15:37.783276
loss_total                        1512.864751
loss_critic                       1981.555467
loss_actor                        -361.898245
memory_size                        384935.663 

=== epoch 6/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [15:33<00:00,  2.14it/s]
episodes                                   32
episode_length                      247.96875
returns                            -53.320608
return_std                          77.779763
average_reward                      -0.216544
round_time             0 days 00:15:34.504455
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       867.792873
return_std_test                     19.545153
average_reward_test                  0.867793
round_time_test        0 days 00:00:11.423418
round_time_total       0 days 00:15:34.505546
loss_total                        1491.124126
loss_critic                       1954.495055
loss_actor                         -362.35972
memory_size                       386749.4395 

=== epoch 6/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [15:37<00:00,  2.13it/s]
episodes                                   43
episode_length                     231.837209
returns                            -48.325827
return_std                          76.465477
average_reward                      -0.207683
round_time             0 days 00:15:38.386506
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        868.11939
return_std_test                     19.647751
average_reward_test                  0.868119
round_time_test        0 days 00:00:11.486952
round_time_total       0 days 00:15:38.387585
loss_total                        1503.371064
loss_critic                       1969.895358
loss_actor                        -362.726255
memory_size                        388487.379 

=== epoch 6/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [15:41<00:00,  2.12it/s]
episodes                                   37
episode_length                     257.675676
returns                            -53.872274
return_std                          83.067459
average_reward                      -0.210007
round_time             0 days 00:15:41.918500
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       867.555614
return_std_test                     15.596702
average_reward_test                  0.867556
round_time_test        0 days 00:00:11.492307
round_time_total       0 days 00:15:41.919741
loss_total                         1439.15168
loss_critic                       1889.754481
loss_actor                        -363.259649
memory_size                        390147.756 

=== epoch 6/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [15:52<00:00,  2.10it/s]
episodes                                   40
episode_length                        224.125
returns                            -45.728078
return_std                          73.715014
average_reward                      -0.205083
round_time             0 days 00:15:52.785684
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       845.240683
return_std_test                     12.409149
average_reward_test                  0.845241
round_time_test        0 days 00:00:11.461989
round_time_total       0 days 00:15:52.786776
loss_total                        1463.527812
loss_critic                       1920.166986
loss_actor                         -363.02902
memory_size                        391773.084 

=== epoch 6/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [15:59<00:00,  2.08it/s]
episodes                                   43
episode_length                     200.023256
returns                            -38.563082
return_std                          66.643437
average_reward                       -0.19624
round_time             0 days 00:15:59.745652
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       866.490112
return_std_test                      12.19468
average_reward_test                   0.86649
round_time_test        0 days 00:00:11.326498
round_time_total       0 days 00:15:59.746723
loss_total                        1460.887008
loss_critic                       1916.970521
loss_actor                        -363.447182
memory_size                         393604.07 

=== epoch 6/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [16:06<00:00,  2.07it/s]
episodes                                   43
episode_length                      190.44186
returns                            -30.851079
return_std                          65.746926
average_reward                      -0.170059
round_time             0 days 00:16:06.597837
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       844.067228
return_std_test                      9.326156
average_reward_test                  0.844067
round_time_test        0 days 00:00:11.531459
round_time_total       0 days 00:16:06.598925
loss_total                        1461.676093
loss_critic                       1918.105748
loss_actor                        -364.042664
memory_size                         395318.97 

=== epoch 6/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                   41
episode_length                      227.04878
returns                            -38.378932
return_std                          77.524958
average_reward                      -0.174689
round_time             0 days 00:16:20.207244
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       875.460048
return_std_test                     11.789384
average_reward_test                   0.87546
round_time_test        0 days 00:00:11.503797
round_time_total       0 days 00:16:20.208326
loss_total                        1444.765566
loss_critic                       1897.099383
loss_actor                         -364.56983
memory_size                       397001.8005 

=== epoch 6/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [16:24<00:00,  2.03it/s]
episodes                                   38
episode_length                     261.131579
returns                            -44.858829
return_std                          89.083842
average_reward                      -0.174681
round_time             0 days 00:16:25.462918
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       887.201974
return_std_test                     20.652095
average_reward_test                  0.887202
round_time_test        0 days 00:00:11.546146
round_time_total       0 days 00:16:25.464106
loss_total                        1410.498249
loss_critic                       1854.422156
loss_actor                        -365.197522
memory_size                        398860.879 

=== epoch 6/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   36
episode_length                     241.111111
returns                             -37.96308
return_std                          83.004873
average_reward                      -0.164506
round_time             0 days 00:16:24.256477
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       888.439879
return_std_test                     16.027211
average_reward_test                   0.88844
round_time_test        0 days 00:00:11.404993
round_time_total       0 days 00:16:24.257567
loss_total                        1448.614275
loss_critic                       1902.077838
loss_actor                        -365.240113
memory_size                         400639.05 

=== epoch 6/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   44
episode_length                     217.636364
returns                            -35.791388
return_std                           77.70198
average_reward                      -0.166197
round_time             0 days 00:16:23.264140
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       872.680709
return_std_test                     18.266677
average_reward_test                  0.872681
round_time_test        0 days 00:00:11.349077
round_time_total       0 days 00:16:23.265386
loss_total                        1426.196576
loss_critic                       1874.194336
loss_actor                        -365.794596
memory_size                       402361.3185 

=== epoch 6/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   42
episode_length                     233.928571
returns                            -45.010843
return_std                          76.599764
average_reward                      -0.192552
round_time             0 days 00:16:24.016391
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       882.283287
return_std_test                     13.542109
average_reward_test                  0.882283
round_time_test        0 days 00:00:11.480979
round_time_total       0 days 00:16:24.017849
loss_total                        1410.708437
loss_critic                       1854.939179
loss_actor                        -366.214671
memory_size                        404033.976 

=== epoch 6/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:14,  1.82it/s]
100%|██████████| 2000/2000 [16:22<00:00,  2.03it/s]
episodes                                   37
episode_length                     242.216216
returns                            -42.596242
return_std                          71.868103
average_reward                      -0.176938
round_time             0 days 00:16:23.478976
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.725864
return_std_test                     12.211657
average_reward_test                  0.885726
round_time_test        0 days 00:00:11.527151
round_time_total       0 days 00:16:23.480227
loss_total                        1417.245143
loss_critic                       1863.222491
loss_actor                        -366.664366
memory_size                       405779.7105 

=== epoch 6/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:54,  1.86it/s]
100%|██████████| 2000/2000 [16:19<00:00,  2.04it/s]
episodes                                   37
episode_length                     241.108108
returns                            -42.594939
return_std                          68.273277
average_reward                      -0.176147
round_time             0 days 00:16:20.408232
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       869.679276
return_std_test                     15.775279
average_reward_test                  0.869679
round_time_test        0 days 00:00:11.458120
round_time_total       0 days 00:16:20.409304
loss_total                        1393.705574
loss_critic                       1833.799691
loss_actor                        -366.671014
memory_size                       407635.7625 

=== epoch 6/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:13,  1.93it/s]
100%|██████████| 2000/2000 [16:20<00:00,  2.04it/s]
episodes                                   35
episode_length                     256.085714
returns                            -48.344735
return_std                           69.04991
average_reward                      -0.193835
round_time             0 days 00:16:21.048418
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       864.961955
return_std_test                     18.240385
average_reward_test                  0.864962
round_time_test        0 days 00:00:11.367954
round_time_total       0 days 00:16:21.049650
loss_total                        1409.455304
loss_critic                       1853.678443
loss_actor                        -367.437385
memory_size                        409452.313 

=== epoch 6/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:56,  1.86it/s]
100%|██████████| 2000/2000 [16:21<00:00,  2.04it/s]
episodes                                   29
episode_length                          310.0
returns                             -62.87679
return_std                          82.912809
average_reward                      -0.202505
round_time             0 days 00:16:22.200991
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        847.66464
return_std_test                     19.886646
average_reward_test                  0.847665
round_time_test        0 days 00:00:11.579046
round_time_total       0 days 00:16:22.202065
loss_total                         1430.06132
loss_critic                       1879.550561
loss_actor                        -367.895784
memory_size                       411299.4785 

=== epoch 6/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:49,  1.87it/s]
100%|██████████| 2000/2000 [16:22<00:00,  2.03it/s]
episodes                                   25
episode_length                         371.84
returns                            -64.932562
return_std                          89.419057
average_reward                      -0.176553
round_time             0 days 00:16:23.342755
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       887.082063
return_std_test                     11.916483
average_reward_test                  0.887082
round_time_test        0 days 00:00:11.486257
round_time_total       0 days 00:16:23.343831
loss_total                         1436.80383
loss_critic                       1887.699001
loss_actor                        -366.776986
memory_size                         413097.41 

=== epoch 6/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:48,  1.87it/s]
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   29
episode_length                      315.62069
returns                            -62.715553
return_std                           78.54407
average_reward                      -0.200685
round_time             0 days 00:16:24.290847
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       897.205581
return_std_test                     14.217326
average_reward_test                  0.897206
round_time_test        0 days 00:00:11.488651
round_time_total       0 days 00:16:24.291934
loss_total                        1406.056135
loss_critic                       1849.290233
loss_actor                        -366.880392
memory_size                       414889.1055 

=== epoch 6/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:59,  1.96it/s]
100%|██████████| 2000/2000 [16:25<00:00,  2.03it/s]
episodes                                   31
episode_length                     286.064516
returns                            -60.019563
return_std                          73.653267
average_reward                      -0.211064
round_time             0 days 00:16:25.715836
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.868019
return_std_test                     22.883129
average_reward_test                  0.889868
round_time_test        0 days 00:00:11.568232
round_time_total       0 days 00:16:25.717077
loss_total                        1371.552079
loss_critic                       1806.416259
loss_actor                        -367.904773
memory_size                        416651.094 

=== epoch 6/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:37,  2.00it/s]
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                   37
episode_length                     248.891892
returns                            -48.317422
return_std                          74.803616
average_reward                      -0.197681
round_time             0 days 00:16:30.306014
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        886.56327
return_std_test                     15.691561
average_reward_test                  0.886563
round_time_test        0 days 00:00:11.553460
round_time_total       0 days 00:16:30.307089
loss_total                        1395.343085
loss_critic                       1836.450962
loss_actor                        -369.088547
memory_size                       418342.8815 

=== epoch 6/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:14,  1.93it/s]
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   35
episode_length                     253.228571
returns                            -51.326949
return_std                          75.836895
average_reward                      -0.189734
round_time             0 days 00:16:30.917452
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       892.837494
return_std_test                     10.143986
average_reward_test                  0.892837
round_time_test        0 days 00:00:11.516398
round_time_total       0 days 00:16:30.918535
loss_total                        1387.526786
loss_critic                       1826.431581
loss_actor                        -368.092522
memory_size                        420104.703 

=== epoch 6/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:36,  1.89it/s]
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                   36
episode_length                     260.944444
returns                            -49.443109
return_std                          73.216841
average_reward                      -0.185024
round_time             0 days 00:16:33.485061
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.368094
return_std_test                     14.071587
average_reward_test                  0.880368
round_time_test        0 days 00:00:11.392071
round_time_total       0 days 00:16:33.486266
loss_total                        1358.660613
loss_critic                       1790.446897
loss_actor                        -368.484651
memory_size                        421962.391 

=== epoch 6/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:50,  1.87it/s]
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   36
episode_length                     264.638889
returns                            -48.981984
return_std                          77.861538
average_reward                      -0.173328
round_time             0 days 00:16:30.554790
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       895.373975
return_std_test                     12.961799
average_reward_test                  0.895374
round_time_test        0 days 00:00:11.285694
round_time_total       0 days 00:16:30.555875
loss_total                         1378.19553
loss_critic                       1815.066504
loss_actor                        -369.288494
memory_size                        423704.112 

=== epoch 6/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:08,  1.94it/s]
100%|██████████| 2000/2000 [16:32<00:00,  2.02it/s]
episodes                                   29
episode_length                     283.965517
returns                            -43.607438
return_std                          68.272911
average_reward                      -0.173341
round_time             0 days 00:16:32.656420
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       881.132633
return_std_test                     19.545929
average_reward_test                  0.881133
round_time_test        0 days 00:00:11.622345
round_time_total       0 days 00:16:32.657519
loss_total                        1377.458301
loss_critic                       1814.215466
loss_actor                          -369.5705
memory_size                       425565.7645 

=== epoch 6/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:53,  1.86it/s]
100%|██████████| 2000/2000 [16:32<00:00,  2.02it/s]
episodes                                   29
episode_length                     321.896552
returns                            -50.973659
return_std                          73.445515
average_reward                      -0.161765
round_time             0 days 00:16:33.089061
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.330934
return_std_test                     16.782967
average_reward_test                  0.876331
round_time_test        0 days 00:00:11.319059
round_time_total       0 days 00:16:33.090156
loss_total                        1387.624758
loss_critic                       1827.022368
loss_actor                        -369.965811
memory_size                       427336.5575 

=== epoch 6/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<19:38,  1.69it/s]
100%|██████████| 2000/2000 [16:37<00:00,  2.00it/s]
episodes                                   34
episode_length                     265.558824
returns                            -42.370913
return_std                           67.49553
average_reward                      -0.160946
round_time             0 days 00:16:38.151224
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       873.112011
return_std_test                     15.981213
average_reward_test                  0.873112
round_time_test        0 days 00:00:11.388406
round_time_total       0 days 00:16:38.152637
loss_total                        1380.060642
loss_critic                       1817.631456
loss_actor                        -370.222733
memory_size                         429037.12 

=== epoch 6/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:24,  1.91it/s]
100%|██████████| 2000/2000 [16:28<00:00,  2.02it/s]
episodes                                   38
episode_length                     257.684211
returns                            -46.095111
return_std                          71.329961
average_reward                      -0.177379
round_time             0 days 00:16:29.513584
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       887.180121
return_std_test                     11.753839
average_reward_test                   0.88718
round_time_test        0 days 00:00:11.431893
round_time_total       0 days 00:16:29.514716
loss_total                         1380.04067
loss_critic                       1817.651726
loss_actor                        -370.403675
memory_size                        430718.227 

=== epoch 6/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:47,  1.87it/s]
100%|██████████| 2000/2000 [16:30<00:00,  2.02it/s]
episodes                                   35
episode_length                     283.171429
returns                            -46.739784
return_std                          71.429606
average_reward                      -0.164881
round_time             0 days 00:16:30.502007
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.092787
return_std_test                     11.431757
average_reward_test                  0.871093
round_time_test        0 days 00:00:11.402884
round_time_total       0 days 00:16:30.503094
loss_total                        1376.523465
loss_critic                       1813.449831
loss_actor                         -371.18212
memory_size                        432559.342 

=== epoch 6/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:18,  1.82it/s]
100%|██████████| 2000/2000 [16:33<00:00,  2.01it/s]
episodes                                   41
episode_length                     234.243902
returns                            -33.081741
return_std                          60.655378
average_reward                       -0.13979
round_time             0 days 00:16:33.914638
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       867.599716
return_std_test                      6.916238
average_reward_test                    0.8676
round_time_test        0 days 00:00:11.606127
round_time_total       0 days 00:16:33.915735
loss_total                        1357.787968
loss_critic                       1790.453471
loss_actor                        -372.874174
memory_size                       434355.2895 

=== epoch 6/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<16:48,  1.98it/s]
100%|██████████| 2000/2000 [16:32<00:00,  2.02it/s]
episodes                                   41
episode_length                     236.853659
returns                            -32.983629
return_std                          60.315604
average_reward                      -0.136402
round_time             0 days 00:16:32.898299
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       862.327485
return_std_test                     12.186405
average_reward_test                  0.862327
round_time_test        0 days 00:00:11.308699
round_time_total       0 days 00:16:32.899594
loss_total                        1371.542786
loss_critic                       1807.762734
loss_actor                        -373.337139
memory_size                        436068.521 

=== epoch 6/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<18:09,  1.83it/s]
100%|██████████| 2000/2000 [16:31<00:00,  2.02it/s]
episodes                                   38
episode_length                     245.263158
returns                            -34.441067
return_std                           63.09416
average_reward                      -0.146067
round_time             0 days 00:16:31.718749
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       870.205813
return_std_test                     13.777849
average_reward_test                  0.870206
round_time_test        0 days 00:00:11.477970
round_time_total       0 days 00:16:31.719855
loss_total                        1344.530652
loss_critic                       1774.086914
loss_actor                        -373.694536
memory_size                       437754.7665 

=== epoch 6/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<17:38,  1.89it/s]
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                   37
episode_length                     247.135135
returns                            -36.067696
return_std                          68.509411
average_reward                      -0.150353
round_time             0 days 00:16:33.211508
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       864.308636
return_std_test                     11.510453
average_reward_test                  0.864309
round_time_test        0 days 00:00:11.437088
round_time_total       0 days 00:16:33.212736
loss_total                        1380.564225
loss_critic                       1819.259734
loss_actor                         -374.21795
memory_size                       439487.2275 

=== epoch 6/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<19:17,  1.72it/s]
100%|██████████| 2000/2000 [16:26<00:00,  2.03it/s]
episodes                                   44
episode_length                     190.909091
returns                            -29.784669
return_std                          59.598648
average_reward                      -0.159341
round_time             0 days 00:16:27.401117
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       853.524704
return_std_test                     18.537712
average_reward_test                  0.853525
round_time_test        0 days 00:00:11.415653
round_time_total       0 days 00:16:27.402226
loss_total                        1368.424708
loss_critic                       1803.939482
loss_actor                        -373.634513
memory_size                        441115.971 

=== epoch 6/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:16,  1.93it/s]
100%|██████████| 2000/2000 [16:34<00:00,  2.01it/s]
episodes                                   43
episode_length                     196.860465
returns                            -38.563829
return_std                          63.433506
average_reward                       -0.19368
round_time             0 days 00:16:35.347902
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.304013
return_std_test                     17.522662
average_reward_test                  0.871304
round_time_test        0 days 00:00:11.570396
round_time_total       0 days 00:16:35.349417
loss_total                        1371.725355
loss_critic                       1807.938989
loss_actor                        -373.129299
memory_size                       442838.0885 

=== epoch 6/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<17:35,  1.89it/s]
100%|██████████| 2000/2000 [16:44<00:00,  1.99it/s]
episodes                                   44
episode_length                     208.954545
returns                            -39.352568
return_std                          65.115844
average_reward                      -0.191725
round_time             0 days 00:16:44.765385
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       874.448275
return_std_test                     20.586883
average_reward_test                  0.874448
round_time_test        0 days 00:00:11.544292
round_time_total       0 days 00:16:44.766723
loss_total                        1332.061446
loss_critic                       1758.647657
loss_actor                         -374.28352
memory_size                        444633.444 

=== epoch 6/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<20:15,  1.64it/s]
100%|██████████| 2000/2000 [16:45<00:00,  1.99it/s]
episodes                                   42
episode_length                     219.642857
returns                            -40.128025
return_std                          66.311337
average_reward                      -0.189004
round_time             0 days 00:16:45.590079
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.657407
return_std_test                     17.757726
average_reward_test                  0.871657
round_time_test        0 days 00:00:11.379634
round_time_total       0 days 00:16:45.591147
loss_total                        1362.993249
loss_critic                       1797.239217
loss_actor                        -373.990747
memory_size                        446295.321 

=== epoch 6/10 ===== round 39/50 =====================================
100%|██████████| 2000/2000 [16:36<00:00,  2.01it/s]
episodes                                   40
episode_length                         225.05
returns                            -39.688882
return_std                          67.695555
average_reward                      -0.181321
round_time             0 days 00:16:37.154425
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       888.449625
return_std_test                      18.56123
average_reward_test                   0.88845
round_time_test        0 days 00:00:11.639085
round_time_total       0 days 00:16:37.155593
loss_total                        1325.373984
loss_critic                        1750.28897
loss_actor                        -374.286086
memory_size                        448119.618 

=== epoch 6/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [16:37<00:00,  2.01it/s]
episodes                                   33
episode_length                     273.545455
returns                            -49.710423
return_std                          81.854959
average_reward                      -0.183182
round_time             0 days 00:16:37.944064
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       867.420308
return_std_test                      5.352903
average_reward_test                   0.86742
round_time_test        0 days 00:00:11.435242
round_time_total       0 days 00:16:37.945302
loss_total                        1373.639124
loss_critic                       1810.733662
loss_actor                         -374.73917
memory_size                        449925.035 

=== epoch 6/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   30
episode_length                     315.066667
returns                            -54.095981
return_std                          91.734128
average_reward                      -0.171876
round_time             0 days 00:16:38.965165
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        877.98665
return_std_test                     15.030789
average_reward_test                  0.877987
round_time_test        0 days 00:00:11.504727
round_time_total       0 days 00:16:38.966246
loss_total                        1347.339299
loss_critic                       1778.018512
loss_actor                        -375.377694
memory_size                        451823.087 

=== epoch 6/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [16:37<00:00,  2.00it/s]
episodes                                   24
episode_length                     379.166667
returns                             -69.51662
return_std                          98.428144
average_reward                      -0.187886
round_time             0 days 00:16:38.008563
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.364019
return_std_test                     10.076823
average_reward_test                  0.885364
round_time_test        0 days 00:00:11.559226
round_time_total       0 days 00:16:38.009669
loss_total                        1353.274778
loss_critic                       1785.370071
loss_actor                        -375.106518
memory_size                        453646.137 

=== epoch 6/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [16:38<00:00,  2.00it/s]
episodes                                   24
episode_length                     385.291667
returns                            -67.182445
return_std                          98.205666
average_reward                      -0.178089
round_time             0 days 00:16:38.882406
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       882.421687
return_std_test                     12.668378
average_reward_test                  0.882422
round_time_test        0 days 00:00:11.487487
round_time_total       0 days 00:16:38.883489
loss_total                        1337.598248
loss_critic                       1765.964533
loss_actor                        -375.867015
memory_size                        455487.071 

=== epoch 6/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [16:32<00:00,  2.01it/s]
episodes                                   30
episode_length                          304.0
returns                            -52.397768
return_std                           84.19103
average_reward                      -0.178677
round_time             0 days 00:16:33.226495
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       865.861717
return_std_test                     22.186147
average_reward_test                  0.865862
round_time_test        0 days 00:00:11.584192
round_time_total       0 days 00:16:33.227712
loss_total                        1332.659605
loss_critic                       1759.936557
loss_actor                        -376.448326
memory_size                         457153.28 

=== epoch 6/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   39
episode_length                     246.512821
returns                            -38.689428
return_std                          75.826858
average_reward                      -0.158845
round_time             0 days 00:16:23.152067
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       892.699356
return_std_test                     16.973455
average_reward_test                  0.892699
round_time_test        0 days 00:00:11.431068
round_time_total       0 days 00:16:23.153148
loss_total                        1313.247023
loss_critic                       1735.755865
loss_actor                        -376.788468
memory_size                       458824.1955 

=== epoch 6/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [16:29<00:00,  2.02it/s]
episodes                                   44
episode_length                     211.636364
returns                            -29.917862
return_std                          62.338937
average_reward                       -0.14672
round_time             0 days 00:16:30.170959
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.057216
return_std_test                     14.843937
average_reward_test                  0.876057
round_time_test        0 days 00:00:11.444982
round_time_total       0 days 00:16:30.172220
loss_total                        1336.264416
loss_critic                        1764.57584
loss_actor                        -376.981416
memory_size                       460402.0675 

=== epoch 6/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [16:22<00:00,  2.04it/s]
episodes                                   46
episode_length                          207.0
returns                            -29.284024
return_std                          58.472787
average_reward                      -0.144113
round_time             0 days 00:16:22.639420
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       873.092903
return_std_test                     17.038008
average_reward_test                  0.873093
round_time_test        0 days 00:00:11.561342
round_time_total       0 days 00:16:22.640504
loss_total                          1352.6784
loss_critic                        1785.34842
loss_actor                        -378.001815
memory_size                        462140.315 

=== epoch 6/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [16:12<00:00,  2.06it/s]
episodes                                   48
episode_length                     207.895833
returns                            -31.552821
return_std                          58.604128
average_reward                      -0.152172
round_time             0 days 00:16:13.242494
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       867.758736
return_std_test                     14.144925
average_reward_test                  0.867759
round_time_test        0 days 00:00:11.281655
round_time_total       0 days 00:16:13.243567
loss_total                        1351.593403
loss_critic                       1783.945135
loss_actor                        -377.813653
memory_size                        464005.533 

=== epoch 6/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                   46
episode_length                     213.152174
returns                            -31.234786
return_std                          58.303674
average_reward                      -0.145115
round_time             0 days 00:16:15.648023
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       899.011943
return_std_test                     14.582337
average_reward_test                  0.899012
round_time_test        0 days 00:00:11.465729
round_time_total       0 days 00:16:15.649484
loss_total                        1321.665574
loss_critic                       1746.579495
loss_actor                        -377.990227
memory_size                        465741.082 

=== epoch 6/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [16:17<00:00,  2.05it/s]
episodes                                   43
episode_length                     228.348837
returns                            -33.602893
return_std                          60.350017
average_reward                      -0.148895
round_time             0 days 00:16:17.870453
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.762665
return_std_test                     10.542469
average_reward_test                  0.871763
round_time_test        0 days 00:00:11.478688
round_time_total       0 days 00:16:17.871534
loss_total                         1367.30004
loss_critic                       1803.750664
loss_actor                        -378.502592
memory_size                       467365.3885 


=== epoch 7/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [15:09<00:00,  2.20it/s]
episodes                                    4
episode_length                         297.75
returns                            -71.868976
return_std                          79.913626
average_reward                      -0.236743
round_time             0 days 00:15:09.370934
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.400814
return_std_test                      10.94676
average_reward_test                  0.889401
round_time_test        0 days 00:00:11.359269
round_time_total       0 days 00:15:09.372026
loss_total                        1357.955496
loss_critic                       1791.958459
loss_actor                        -378.056486
memory_size                       469054.0685 

=== epoch 7/10 ===== round 2/50 ======================================
100%|██████████| 2000/2000 [15:13<00:00,  2.19it/s]
episodes                                   12
episode_length                     304.083333
returns                            -62.158775
return_std                          75.621397
average_reward                      -0.191602
round_time             0 days 00:15:13.723734
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       879.726583
return_std_test                     19.326292
average_reward_test                  0.879727
round_time_test        0 days 00:00:11.259357
round_time_total       0 days 00:15:13.724813
loss_total                        1344.816573
loss_critic                       1775.554004
loss_actor                        -378.133273
memory_size                        470867.283 

=== epoch 7/10 ===== round 3/50 ======================================
100%|██████████| 2000/2000 [15:15<00:00,  2.18it/s]
episodes                                   18
episode_length                     333.055556
returns                            -58.129822
return_std                           76.30687
average_reward                      -0.175023
round_time             0 days 00:15:15.935502
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       878.354995
return_std_test                     15.541749
average_reward_test                  0.878355
round_time_test        0 days 00:00:11.436271
round_time_total       0 days 00:15:15.936600
loss_total                        1320.493249
loss_critic                        1745.28041
loss_actor                        -378.655512
memory_size                       472670.1145 

=== epoch 7/10 ===== round 4/50 ======================================
100%|██████████| 2000/2000 [15:21<00:00,  2.17it/s]
episodes                                   23
episode_length                     336.391304
returns                            -57.722793
return_std                          75.179857
average_reward                      -0.168126
round_time             0 days 00:15:21.915190
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       860.650513
return_std_test                      7.234934
average_reward_test                  0.860651
round_time_test        0 days 00:00:11.526747
round_time_total       0 days 00:15:21.916451
loss_total                        1342.106977
loss_critic                       1772.453686
loss_actor                        -379.279969
memory_size                        474471.495 

=== epoch 7/10 ===== round 5/50 ======================================
100%|██████████| 2000/2000 [16:42<00:00,  2.00it/s]
episodes                                   33
episode_length                     300.242424
returns                            -51.053706
return_std                          68.970383
average_reward                      -0.169249
round_time             0 days 00:16:42.865791
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       869.911899
return_std_test                     10.214439
average_reward_test                  0.869912
round_time_test        0 days 00:00:12.284319
round_time_total       0 days 00:16:42.866872
loss_total                        1336.895896
loss_critic                       1765.851846
loss_actor                        -378.928022
memory_size                        476235.161 

=== epoch 7/10 ===== round 6/50 ======================================
100%|██████████| 2000/2000 [16:44<00:00,  1.99it/s]
episodes                                   36
episode_length                     258.833333
returns                            -39.915112
return_std                          62.382765
average_reward                       -0.15809
round_time             0 days 00:16:45.380370
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       891.854323
return_std_test                     10.424653
average_reward_test                  0.891854
round_time_test        0 days 00:00:11.717295
round_time_total       0 days 00:16:45.381663
loss_total                         1337.47703
loss_critic                       1766.580528
loss_actor                        -378.937083
memory_size                        477943.224 

=== epoch 7/10 ===== round 7/50 ======================================
100%|██████████| 2000/2000 [18:55<00:00,  1.76it/s]
episodes                                   31
episode_length                     290.741935
returns                            -49.524223
return_std                            80.0809
average_reward                      -0.164681
round_time             0 days 00:18:55.756560
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        887.47091
return_std_test                     17.683995
average_reward_test                  0.887471
round_time_test        0 days 00:00:11.320539
round_time_total       0 days 00:18:55.757795
loss_total                         1312.13512
loss_critic                       1734.826185
loss_actor                        -378.629259
memory_size                       479751.1715 

=== epoch 7/10 ===== round 8/50 ======================================
100%|██████████| 2000/2000 [16:07<00:00,  2.07it/s]
episodes                                   39
episode_length                     245.051282
returns                            -44.138061
return_std                          70.370445
average_reward                       -0.17913
round_time             0 days 00:16:08.339404
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       887.956818
return_std_test                      8.189285
average_reward_test                  0.887957
round_time_test        0 days 00:00:12.505537
round_time_total       0 days 00:16:08.340500
loss_total                        1339.815515
loss_critic                        1769.52382
loss_actor                        -379.017828
memory_size                        481506.207 

=== epoch 7/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [15:44<00:00,  2.12it/s]
episodes                                   39
episode_length                     228.410256
returns                            -43.654406
return_std                          69.842404
average_reward                      -0.191538
round_time             0 days 00:15:44.999074
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       891.587659
return_std_test                     13.221533
average_reward_test                  0.891588
round_time_test        0 days 00:00:11.544939
round_time_total       0 days 00:15:45.000160
loss_total                        1306.350295
loss_critic                       1727.755509
loss_actor                        -379.270688
memory_size                       483137.9465 

=== epoch 7/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [15:58<00:00,  2.09it/s]
episodes                                   38
episode_length                     252.526316
returns                            -46.108093
return_std                          76.893468
average_reward                      -0.180535
round_time             0 days 00:15:59.235493
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         879.2135
return_std_test                     11.706167
average_reward_test                  0.879214
round_time_test        0 days 00:00:11.497627
round_time_total       0 days 00:15:59.236592
loss_total                        1318.614949
loss_critic                       1743.297933
loss_actor                        -380.117103
memory_size                        484949.386 

=== epoch 7/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                   34
episode_length                     273.264706
returns                            -48.614378
return_std                          72.973555
average_reward                      -0.188078
round_time             0 days 00:16:06.074577
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       878.567113
return_std_test                     13.955262
average_reward_test                  0.878567
round_time_test        0 days 00:00:11.283515
round_time_total       0 days 00:16:06.075651
loss_total                        1324.835804
loss_critic                       1751.220426
loss_actor                        -380.702809
memory_size                       486729.7985 

=== epoch 7/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [16:15<00:00,  2.05it/s]
episodes                                   33
episode_length                     257.484848
returns                            -48.316256
return_std                          69.244188
average_reward                      -0.200898
round_time             0 days 00:16:15.654729
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.716732
return_std_test                     17.046343
average_reward_test                  0.876717
round_time_test        0 days 00:00:11.400000
round_time_total       0 days 00:16:15.656094
loss_total                        1320.172479
loss_critic                       1745.634301
loss_actor                        -381.674928
memory_size                       488603.3875 

=== epoch 7/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<28:38,  1.16it/s]
100%|██████████| 2000/2000 [22:46<00:00,  1.46it/s]
episodes                                   26
episode_length                          360.0
returns                            -72.957834
return_std                          92.523796
average_reward                      -0.194589
round_time             0 days 00:22:46.951247
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        872.07268
return_std_test                     15.908399
average_reward_test                  0.872073
round_time_test        0 days 00:00:14.134396
round_time_total       0 days 00:22:46.952547
loss_total                        1343.347732
loss_critic                       1774.774559
loss_actor                        -382.359701
memory_size                       490469.0245 

=== epoch 7/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:49,  1.46it/s]
100%|██████████| 2000/2000 [23:19<00:00,  1.43it/s]
episodes                                   32
episode_length                       291.0625
returns                            -54.201289
return_std                          82.268725
average_reward                      -0.187019
round_time             0 days 00:23:20.008492
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.090158
return_std_test                     18.020277
average_reward_test                   0.88909
round_time_test        0 days 00:00:12.682411
round_time_total       0 days 00:23:20.009812
loss_total                        1293.969618
loss_critic                       1712.939885
loss_actor                        -381.911574
memory_size                       492227.1445 

=== epoch 7/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:49,  1.40it/s]
100%|██████████| 2000/2000 [23:17<00:00,  1.43it/s]
episodes                                   31
episode_length                     285.806452
returns                            -56.838139
return_std                          84.261388
average_reward                      -0.196576
round_time             0 days 00:23:17.886621
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       879.370166
return_std_test                      8.397187
average_reward_test                   0.87937
round_time_test        0 days 00:00:12.670048
round_time_total       0 days 00:23:17.887808
loss_total                        1315.337216
loss_critic                       1740.000996
loss_actor                         -383.31802
memory_size                        493897.493 

=== epoch 7/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:37,  1.41it/s]
100%|██████████| 2000/2000 [23:17<00:00,  1.43it/s]
episodes                                   33
episode_length                     294.606061
returns                            -55.446213
return_std                          87.478683
average_reward                      -0.190569
round_time             0 days 00:23:18.589661
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.540464
return_std_test                     12.957533
average_reward_test                   0.87654
round_time_test        0 days 00:00:12.658527
round_time_total       0 days 00:23:18.590980
loss_total                        1330.574201
loss_critic                       1759.125418
loss_actor                        -383.630791
memory_size                        495736.256 

=== epoch 7/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:39,  1.41it/s]
100%|██████████| 2000/2000 [22:47<00:00,  1.46it/s]
episodes                                   38
episode_length                          253.5
returns                             -45.26505
return_std                          75.512768
average_reward                      -0.175092
round_time             0 days 00:22:48.315784
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.424093
return_std_test                     14.600046
average_reward_test                  0.880424
round_time_test        0 days 00:00:12.601260
round_time_total       0 days 00:22:48.317073
loss_total                         1319.61445
loss_critic                       1745.306092
loss_actor                        -383.152261
memory_size                       497464.0885 

=== epoch 7/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:10,  1.44it/s]
100%|██████████| 2000/2000 [22:47<00:00,  1.46it/s]
episodes                                   39
episode_length                     241.333333
returns                            -38.640893
return_std                          72.056094
average_reward                      -0.163247
round_time             0 days 00:22:48.294210
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       864.327712
return_std_test                      6.718579
average_reward_test                  0.864328
round_time_test        0 days 00:00:12.564797
round_time_total       0 days 00:22:48.295396
loss_total                        1312.525285
loss_critic                       1736.572515
loss_actor                        -383.663763
memory_size                       499279.4585 

=== epoch 7/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<22:18,  1.49it/s]
100%|██████████| 2000/2000 [22:42<00:00,  1.47it/s]
episodes                                   33
episode_length                     297.212121
returns                            -50.177652
return_std                          84.832739
average_reward                      -0.167906
round_time             0 days 00:22:43.120424
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       873.543571
return_std_test                      7.335064
average_reward_test                  0.873544
round_time_test        0 days 00:00:12.614752
round_time_total       0 days 00:22:43.121836
loss_total                        1323.547424
loss_critic                       1750.444868
loss_actor                        -384.042473
memory_size                        501080.197 

=== epoch 7/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 2/2000 [00:01<23:28,  1.42it/s]
100%|██████████| 2000/2000 [18:26<00:00,  1.81it/s]
episodes                                   28
episode_length                     336.321429
returns                            -57.492367
return_std                          89.079324
average_reward                      -0.174446
round_time             0 days 00:18:27.222719
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.871724
return_std_test                     10.672684
average_reward_test                  0.876872
round_time_test        0 days 00:00:12.614409
round_time_total       0 days 00:18:27.224012
loss_total                        1321.367622
loss_critic                       1747.777124
loss_actor                        -384.270511
memory_size                       502945.7885 

=== epoch 7/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:05,  2.07it/s]
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   30
episode_length                     325.433333
returns                            -58.933727
return_std                          89.582104
average_reward                      -0.176208
round_time             0 days 00:14:10.991571
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       899.975828
return_std_test                     10.679754
average_reward_test                  0.899976
round_time_test        0 days 00:00:11.315874
round_time_total       0 days 00:14:10.992640
loss_total                        1347.713749
loss_critic                       1780.630996
loss_actor                        -383.955364
memory_size                       504710.3395 

=== epoch 7/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:21,  2.17it/s]
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   35
episode_length                     281.314286
returns                            -55.762085
return_std                          86.146498
average_reward                      -0.197278
round_time             0 days 00:14:08.517860
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       902.389121
return_std_test                     12.551826
average_reward_test                  0.902389
round_time_test        0 days 00:00:11.530502
round_time_total       0 days 00:14:08.519231
loss_total                        1353.903892
loss_critic                       1788.207128
loss_actor                        -383.309183
memory_size                       506536.3145 

=== epoch 7/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:35,  2.13it/s]
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                   36
episode_length                          234.0
returns                            -49.207542
return_std                          76.577297
average_reward                      -0.214666
round_time             0 days 00:14:11.610231
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       881.698112
return_std_test                     15.403335
average_reward_test                  0.881698
round_time_test        0 days 00:00:11.457613
round_time_total       0 days 00:14:11.611306
loss_total                        1306.288419
loss_critic                       1728.886038
loss_actor                        -384.102169
memory_size                       508024.7235 

=== epoch 7/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:34,  2.28it/s]
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   36
episode_length                     236.916667
returns                            -54.613422
return_std                          84.279047
average_reward                      -0.228646
round_time             0 days 00:14:10.572101
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       884.508621
return_std_test                     10.510326
average_reward_test                  0.884509
round_time_test        0 days 00:00:11.495997
round_time_total       0 days 00:14:10.573176
loss_total                        1345.461545
loss_critic                        1777.73215
loss_actor                        -383.621003
memory_size                        509830.803 

=== epoch 7/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:00,  2.22it/s]
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                   39
episode_length                     234.794872
returns                            -51.665379
return_std                            85.0873
average_reward                      -0.217855
round_time             0 days 00:14:12.133336
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.398031
return_std_test                     10.158299
average_reward_test                  0.885398
round_time_test        0 days 00:00:11.643066
round_time_total       0 days 00:14:12.134410
loss_total                         1306.37494
loss_critic                       1729.076874
loss_actor                        -384.432911
memory_size                       511647.8155 

=== epoch 7/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:09,  2.20it/s]
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                   37
episode_length                     240.540541
returns                            -54.173648
return_std                          84.324273
average_reward                      -0.229847
round_time             0 days 00:14:14.616276
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.850058
return_std_test                     12.732827
average_reward_test                   0.88585
round_time_test        0 days 00:00:11.451166
round_time_total       0 days 00:14:14.617368
loss_total                        1319.901955
loss_critic                       1746.033384
loss_actor                        -384.623894
memory_size                       513468.5515 

=== epoch 7/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:01,  2.22it/s]
100%|██████████| 2000/2000 [14:23<00:00,  2.32it/s]
episodes                                   38
episode_length                     256.710526
returns                            -53.728616
return_std                          88.554812
average_reward                      -0.198108
round_time             0 days 00:14:24.130722
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       856.964328
return_std_test                      9.125495
average_reward_test                  0.856964
round_time_test        0 days 00:00:11.486915
round_time_total       0 days 00:14:24.131795
loss_total                        1318.975485
loss_critic                       1744.769886
loss_actor                        -384.202239
memory_size                       515219.7615 

=== epoch 7/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:50,  1.98it/s]
100%|██████████| 2000/2000 [14:39<00:00,  2.27it/s]
episodes                                   36
episode_length                     252.861111
returns                            -46.551243
return_std                          84.678257
average_reward                      -0.190119
round_time             0 days 00:14:39.705970
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       861.452454
return_std_test                     12.164973
average_reward_test                  0.861452
round_time_test        0 days 00:00:11.978797
round_time_total       0 days 00:14:39.707031
loss_total                        1327.104591
loss_critic                       1755.005089
loss_actor                        -384.497515
memory_size                       516796.0395 

=== epoch 7/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]
100%|██████████| 2000/2000 [14:24<00:00,  2.31it/s]
episodes                                   36
episode_length                     255.777778
returns                             -41.52616
return_std                          77.397628
average_reward                       -0.16084
round_time             0 days 00:14:25.283558
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       882.083726
return_std_test                     15.216444
average_reward_test                  0.882084
round_time_test        0 days 00:00:11.430755
round_time_total       0 days 00:14:25.284810
loss_total                        1325.004461
loss_critic                       1752.314427
loss_actor                        -384.235525
memory_size                        518613.521 

=== epoch 7/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]
100%|██████████| 2000/2000 [14:18<00:00,  2.33it/s]
episodes                                   35
episode_length                     262.571429
returns                            -45.089002
return_std                          75.971413
average_reward                      -0.172939
round_time             0 days 00:14:18.849198
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        869.12758
return_std_test                     19.807846
average_reward_test                  0.869128
round_time_test        0 days 00:00:11.240561
round_time_total       0 days 00:14:18.850306
loss_total                         1300.28037
loss_critic                        1721.55401
loss_actor                        -384.814307
memory_size                        520458.098 

=== epoch 7/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<16:09,  2.06it/s]
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                   35
episode_length                     255.285714
returns                            -36.727567
return_std                          61.754778
average_reward                      -0.152871
round_time             0 days 00:14:12.439015
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       890.679169
return_std_test                     13.201904
average_reward_test                  0.890679
round_time_test        0 days 00:00:11.171917
round_time_total       0 days 00:14:12.440091
loss_total                        1318.511741
loss_critic                       1744.438976
loss_actor                        -385.197316
memory_size                       522303.1885 

=== epoch 7/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:38,  2.27it/s]
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   27
episode_length                          355.0
returns                            -62.673542
return_std                          77.327707
average_reward                      -0.171277
round_time             0 days 00:14:12.933147
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       888.703904
return_std_test                     12.413993
average_reward_test                  0.888704
round_time_test        0 days 00:00:11.447824
round_time_total       0 days 00:14:12.934224
loss_total                        1294.055401
loss_critic                       1713.968577
loss_actor                        -385.597429
memory_size                        524127.395 

=== epoch 7/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:30,  2.29it/s]
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                   32
episode_length                         307.25
returns                            -50.974272
return_std                          74.853169
average_reward                      -0.166371
round_time             0 days 00:14:09.561288
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       895.624133
return_std_test                     14.197165
average_reward_test                  0.895624
round_time_test        0 days 00:00:11.256922
round_time_total       0 days 00:14:09.562371
loss_total                         1302.64768
loss_critic                       1724.670598
loss_actor                        -385.444122
memory_size                       525809.1745 

=== epoch 7/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:21,  2.17it/s]
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   39
episode_length                     235.717949
returns                             -42.56094
return_std                          70.892812
average_reward                      -0.180345
round_time             0 days 00:14:09.834945
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       897.892067
return_std_test                     13.690038
average_reward_test                  0.897892
round_time_test        0 days 00:00:11.332009
round_time_total       0 days 00:14:09.836026
loss_total                        1284.588257
loss_critic                       1702.148618
loss_actor                        -385.653308
memory_size                        527490.946 

=== epoch 7/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:14,  2.34it/s]
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                   43
episode_length                     214.627907
returns                            -41.612465
return_std                          70.650607
average_reward                      -0.193466
round_time             0 days 00:14:06.864598
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.118494
return_std_test                     13.459237
average_reward_test                  0.898118
round_time_test        0 days 00:00:11.265466
round_time_total       0 days 00:14:06.865785
loss_total                        1338.278531
loss_critic                       1769.273456
loss_actor                        -385.701292
memory_size                       529179.1135 

=== epoch 7/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:16,  2.18it/s]
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   54
episode_length                     171.648148
returns                            -33.315962
return_std                          65.265732
average_reward                      -0.193748
round_time             0 days 00:14:07.637444
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       879.778031
return_std_test                     14.020087
average_reward_test                  0.879778
round_time_test        0 days 00:00:11.324430
round_time_total       0 days 00:14:07.638812
loss_total                        1282.528511
loss_critic                       1699.708716
loss_actor                        -386.192434
memory_size                       530836.7415 

=== epoch 7/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                   52
episode_length                     174.096154
returns                            -32.847315
return_std                          60.446901
average_reward                      -0.194255
round_time             0 days 00:14:05.204111
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       886.107119
return_std_test                     22.646879
average_reward_test                  0.886107
round_time_test        0 days 00:00:11.225934
round_time_total       0 days 00:14:05.205376
loss_total                        1298.900761
loss_critic                       1720.334852
loss_actor                         -386.83572
memory_size                        532453.176 

=== epoch 7/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                   48
episode_length                       202.6875
returns                            -36.860136
return_std                          66.924298
average_reward                      -0.192133
round_time             0 days 00:14:14.909197
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       891.817426
return_std_test                      13.07661
average_reward_test                  0.891817
round_time_test        0 days 00:00:11.347151
round_time_total       0 days 00:14:14.910273
loss_total                        1327.443683
loss_critic                       1756.068142
loss_actor                        -387.054278
memory_size                        534259.151 

=== epoch 7/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   43
episode_length                     208.302326
returns                            -32.485887
return_std                          67.682402
average_reward                      -0.164564
round_time             0 days 00:14:12.894035
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       893.211966
return_std_test                     11.702076
average_reward_test                  0.893212
round_time_test        0 days 00:00:11.390959
round_time_total       0 days 00:14:12.895111
loss_total                         1320.68015
loss_critic                       1747.711068
loss_actor                        -387.443646
memory_size                        536072.548 

=== epoch 7/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   45
episode_length                     203.466667
returns                            -29.638989
return_std                          67.532794
average_reward                       -0.15258
round_time             0 days 00:14:10.219039
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       901.267083
return_std_test                      14.73683
average_reward_test                  0.901267
round_time_test        0 days 00:00:11.144091
round_time_total       0 days 00:14:10.220123
loss_total                        1326.772409
loss_critic                       1755.371309
loss_actor                        -387.623317
memory_size                       537796.7835 

=== epoch 7/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   40
episode_length                          205.5
returns                            -31.352377
return_std                          68.268863
average_reward                      -0.154641
round_time             0 days 00:14:05.990260
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       864.620307
return_std_test                      7.333156
average_reward_test                   0.86462
round_time_test        0 days 00:00:11.468428
round_time_total       0 days 00:14:05.991475
loss_total                        1291.605263
loss_critic                       1711.365046
loss_actor                           -387.434
memory_size                       539297.2515 

=== epoch 7/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   46
episode_length                     213.826087
returns                            -31.974868
return_std                          63.262093
average_reward                      -0.149353
round_time             0 days 00:14:10.974951
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       872.888536
return_std_test                      6.953301
average_reward_test                  0.872889
round_time_test        0 days 00:00:11.181465
round_time_total       0 days 00:14:10.976012
loss_total                         1346.74438
loss_critic                       1780.289816
loss_actor                         -387.43748
memory_size                        541079.508 

=== epoch 7/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   50
episode_length                         166.44
returns                            -23.454684
return_std                          52.438159
average_reward                      -0.151089
round_time             0 days 00:14:09.954251
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        861.11924
return_std_test                     10.890382
average_reward_test                  0.861119
round_time_test        0 days 00:00:11.371099
round_time_total       0 days 00:14:09.955438
loss_total                        1333.163425
loss_critic                       1763.267128
loss_actor                        -387.251506
memory_size                        542638.226 

=== epoch 7/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   50
episode_length                          184.0
returns                            -32.269291
return_std                          55.074581
average_reward                       -0.18227
round_time             0 days 00:14:09.050215
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       869.432901
return_std_test                     12.885207
average_reward_test                  0.869433
round_time_test        0 days 00:00:11.536103
round_time_total       0 days 00:14:09.051289
loss_total                         1311.72992
loss_critic                        1736.46529
loss_actor                        -387.211677
memory_size                        544416.622 

=== epoch 7/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   49
episode_length                     203.653061
returns                            -38.611765
return_std                          65.362108
average_reward                      -0.188249
round_time             0 days 00:14:09.021158
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        868.17346
return_std_test                     11.541334
average_reward_test                  0.868173
round_time_test        0 days 00:00:11.230123
round_time_total       0 days 00:14:09.022226
loss_total                        1338.189407
loss_critic                       1769.822032
loss_actor                        -388.341223
memory_size                        546090.899 

=== epoch 7/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   47
episode_length                     197.957447
returns                            -36.867739
return_std                          61.685125
average_reward                      -0.187321
round_time             0 days 00:14:11.509391
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       859.499041
return_std_test                     19.694293
average_reward_test                  0.859499
round_time_test        0 days 00:00:11.255473
round_time_total       0 days 00:14:11.510810
loss_total                        1338.050063
loss_critic                       1769.626912
loss_actor                         -388.25746
memory_size                       547779.4105 

=== epoch 7/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   46
episode_length                     213.282609
returns                            -39.547679
return_std                          62.266371
average_reward                      -0.186419
round_time             0 days 00:14:13.195263
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       879.017266
return_std_test                     13.026734
average_reward_test                  0.879017
round_time_test        0 days 00:00:11.286730
round_time_total       0 days 00:14:13.196344
loss_total                        1315.310162
loss_critic                       1741.114867
loss_actor                         -387.90878
memory_size                        549577.262 

=== epoch 7/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                   43
episode_length                     228.372093
returns                            -41.474013
return_std                          57.837479
average_reward                      -0.182103
round_time             0 days 00:14:09.568611
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.799521
return_std_test                     11.701962
average_reward_test                    0.8808
round_time_test        0 days 00:00:11.507796
round_time_total       0 days 00:14:09.569687
loss_total                        1349.813095
loss_critic                       1784.089848
loss_actor                         -387.29404
memory_size                       551311.7495 

=== epoch 7/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.36it/s]
episodes                                   38
episode_length                     236.368421
returns                            -43.443731
return_std                          63.548681
average_reward                       -0.18305
round_time             0 days 00:14:09.684103
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       875.320163
return_std_test                     11.714218
average_reward_test                   0.87532
round_time_test        0 days 00:00:11.271562
round_time_total       0 days 00:14:09.685179
loss_total                        1314.112197
loss_critic                       1739.673873
loss_actor                        -388.134633
memory_size                       553072.5955 

=== epoch 7/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                   37
episode_length                     263.135135
returns                            -49.580528
return_std                          68.178172
average_reward                      -0.186647
round_time             0 days 00:14:11.648530
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.289046
return_std_test                      8.300036
average_reward_test                  0.898289
round_time_test        0 days 00:00:11.292053
round_time_total       0 days 00:14:11.649772
loss_total                        1332.248446
loss_critic                       1762.518192
loss_actor                        -388.830666
memory_size                       554915.4685 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [13:18<00:00,  2.50it/s]
episodes                                    2
episode_length                          523.5
returns                           -117.571698
return_std                           80.26755
average_reward                      -0.238919
round_time             0 days 00:13:18.640437
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       884.124296
return_std_test                     17.971548
average_reward_test                  0.884124
round_time_test        0 days 00:00:11.433037
round_time_total       0 days 00:13:18.641529
loss_total                        1333.854939
loss_critic                       1764.273915
loss_actor                        -387.821093
memory_size                        556724.617 

=== epoch 8/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:21<00:00,  2.49it/s]
episodes                                    6
episode_length                     554.333333
returns                           -128.118507
return_std                         102.081099
average_reward                      -0.231572
round_time             0 days 00:13:22.184979
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       868.589459
return_std_test                       14.3224
average_reward_test                  0.868589
round_time_test        0 days 00:00:11.623400
round_time_total       0 days 00:13:22.186041
loss_total                        1318.307301
loss_critic                       1744.885032
loss_actor                        -388.003747
memory_size                       558601.2265 

=== epoch 8/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:23<00:00,  2.49it/s]
episodes                                   16
episode_length                       340.9375
returns                             -64.66037
return_std                          86.735189
average_reward                      -0.189948
round_time             0 days 00:13:23.605061
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.931287
return_std_test                     16.938957
average_reward_test                  0.880931
round_time_test        0 days 00:00:11.478236
round_time_total       0 days 00:13:23.606390
loss_total                        1308.412819
loss_critic                       1732.655108
loss_actor                        -388.556455
memory_size                         560371.78 

=== epoch 8/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:30<00:00,  2.47it/s]
episodes                                   27
episode_length                     277.444444
returns                            -49.472333
return_std                          80.535213
average_reward                      -0.171688
round_time             0 days 00:13:30.877458
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       865.785624
return_std_test                     11.641108
average_reward_test                  0.865786
round_time_test        0 days 00:00:11.433982
round_time_total       0 days 00:13:30.878542
loss_total                        1311.625998
loss_critic                       1737.089226
loss_actor                        -390.227032
memory_size                       562097.8495 

=== epoch 8/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:27<00:00,  2.48it/s]
episodes                                   33
episode_length                     301.393939
returns                            -54.969702
return_std                          82.185065
average_reward                      -0.182465
round_time             0 days 00:13:28.228903
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        867.68974
return_std_test                      9.050193
average_reward_test                   0.86769
round_time_test        0 days 00:00:11.472583
round_time_total       0 days 00:13:28.229980
loss_total                         1314.81285
loss_critic                       1741.084646
loss_actor                        -390.274449
memory_size                        563874.271 

=== epoch 8/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:25<00:00,  2.48it/s]
episodes                                   35
episode_length                     256.571429
returns                            -39.514277
return_std                          67.847749
average_reward                      -0.156903
round_time             0 days 00:13:26.216148
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       883.070817
return_std_test                     10.078962
average_reward_test                  0.883071
round_time_test        0 days 00:00:11.648687
round_time_total       0 days 00:13:26.217217
loss_total                        1330.962278
loss_critic                       1761.365961
loss_actor                        -390.652581
memory_size                       565598.7695 

=== epoch 8/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:30<00:00,  2.47it/s]
episodes                                   40
episode_length                         247.85
returns                            -36.902313
return_std                          65.910935
average_reward                      -0.146546
round_time             0 days 00:13:30.962805
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.939722
return_std_test                     12.729019
average_reward_test                   0.88094
round_time_test        0 days 00:00:11.387233
round_time_total       0 days 00:13:30.963864
loss_total                        1350.610266
loss_critic                       1785.992793
loss_actor                        -390.919973
memory_size                        567430.117 

=== epoch 8/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:39<00:00,  2.44it/s]
episodes                                   32
episode_length                      297.40625
returns                            -49.468416
return_std                          78.795111
average_reward                      -0.169963
round_time             0 days 00:13:39.589654
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.368435
return_std_test                      14.75238
average_reward_test                  0.876368
round_time_test        0 days 00:00:11.531556
round_time_total       0 days 00:13:39.590740
loss_total                        1340.103387
loss_critic                       1772.922066
loss_actor                        -391.171442
memory_size                        569255.109 

=== epoch 8/10 ===== round 9/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:46<00:00,  2.42it/s]
episodes                                   25
episode_length                         352.04
returns                            -65.588696
return_std                          85.554103
average_reward                      -0.188263
round_time             0 days 00:13:46.531265
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       883.267003
return_std_test                     13.638801
average_reward_test                  0.883267
round_time_test        0 days 00:00:11.346817
round_time_total       0 days 00:13:46.532345
loss_total                        1334.772072
loss_critic                       1766.142687
loss_actor                        -390.710497
memory_size                       571147.7985 

=== epoch 8/10 ===== round 10/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:56<00:00,  2.39it/s]
episodes                                   27
episode_length                     363.814815
returns                            -58.992338
return_std                          91.299354
average_reward                      -0.161346
round_time             0 days 00:13:56.588838
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        869.83311
return_std_test                     15.430158
average_reward_test                  0.869833
round_time_test        0 days 00:00:11.425384
round_time_total       0 days 00:13:56.589922
loss_total                        1336.189818
loss_critic                       1767.794227
loss_actor                        -390.227943
memory_size                        572964.919 

=== epoch 8/10 ===== round 11/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                   25
episode_length                         388.28
returns                             -69.77823
return_std                          99.656611
average_reward                      -0.182908
round_time             0 days 00:14:00.387058
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       881.075656
return_std_test                     13.118646
average_reward_test                  0.881076
round_time_test        0 days 00:00:11.716666
round_time_total       0 days 00:14:00.388366
loss_total                          1293.7172
loss_critic                       1714.744215
loss_actor                        -390.390995
memory_size                       574763.2995 

=== epoch 8/10 ===== round 12/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:09<00:00,  2.06it/s]
episodes                                   19
episode_length                     475.157895
returns                            -88.190321
return_std                         109.088951
average_reward                      -0.186648
round_time             0 days 00:16:09.656136
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       872.442462
return_std_test                     13.700793
average_reward_test                  0.872442
round_time_test        0 days 00:00:11.532971
round_time_total       0 days 00:16:09.657325
loss_total                        1295.003956
loss_critic                        1716.67212
loss_actor                        -391.668817
memory_size                        576677.039 

=== epoch 8/10 ===== round 13/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:53<00:00,  1.97it/s]
episodes                                   25
episode_length                          357.0
returns                            -58.161781
return_std                          95.636223
average_reward                      -0.166074
round_time             0 days 00:16:54.381163
episodes_test                            10.0
episode_length_test                     901.2
returns_test                       807.114411
return_std_test                    267.787519
average_reward_test                  0.894048
round_time_test        0 days 00:00:12.032742
round_time_total       0 days 00:16:54.382379
loss_total                        1333.092173
loss_critic                       1764.114495
loss_actor                        -390.997237
memory_size                        578539.044 

=== epoch 8/10 ===== round 14/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:42<00:00,  2.00it/s]
episodes                                   31
episode_length                      313.16129
returns                            -51.276021
return_std                           91.17414
average_reward                      -0.166471
round_time             0 days 00:16:42.987339
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       891.552079
return_std_test                      4.768718
average_reward_test                  0.891552
round_time_test        0 days 00:00:12.119289
round_time_total       0 days 00:16:42.988521
loss_total                        1352.896078
loss_critic                       1788.882793
loss_actor                        -391.050911
memory_size                       580205.9135 

=== epoch 8/10 ===== round 15/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [16:05<00:00,  2.07it/s]
episodes                                   36
episode_length                     264.722222
returns                            -48.220208
return_std                          79.778696
average_reward                      -0.179677
round_time             0 days 00:16:05.697763
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       902.237827
return_std_test                      7.530802
average_reward_test                  0.902238
round_time_test        0 days 00:00:12.155584
round_time_total       0 days 00:16:05.698838
loss_total                        1307.499373
loss_critic                       1732.059179
loss_actor                        -390.739978
memory_size                        581887.164 

=== epoch 8/10 ===== round 16/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                   36
episode_length                     238.722222
returns                            -41.378275
return_std                          65.289368
average_reward                      -0.185994
round_time             0 days 00:14:01.608877
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       886.416476
return_std_test                      9.105836
average_reward_test                  0.886416
round_time_test        0 days 00:00:11.536850
round_time_total       0 days 00:14:01.609946
loss_total                        1329.187891
loss_critic                       1759.269694
loss_actor                        -391.139444
memory_size                       583569.4945 

=== epoch 8/10 ===== round 17/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                   46
episode_length                     214.369565
returns                            -41.058216
return_std                          71.247382
average_reward                      -0.190683
round_time             0 days 00:14:00.512870
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       861.034736
return_std_test                     12.650813
average_reward_test                  0.861035
round_time_test        0 days 00:00:11.469663
round_time_total       0 days 00:14:00.513934
loss_total                        1322.563768
loss_critic                         1751.0081
loss_actor                        -391.213675
memory_size                        585375.426 

=== epoch 8/10 ===== round 18/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                   41
episode_length                     221.341463
returns                            -46.821456
return_std                          76.861776
average_reward                      -0.210537
round_time             0 days 00:14:00.296775
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.623401
return_std_test                      9.004455
average_reward_test                  0.880623
round_time_test        0 days 00:00:11.673251
round_time_total       0 days 00:14:00.297851
loss_total                        1313.997448
loss_critic                       1740.314959
loss_actor                        -391.272718
memory_size                        587060.567 

=== epoch 8/10 ===== round 19/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:58<00:00,  2.39it/s]
episodes                                   43
episode_length                      224.44186
returns                            -46.128657
return_std                          76.502421
average_reward                      -0.199321
round_time             0 days 00:13:58.747795
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       899.235967
return_std_test                      5.589423
average_reward_test                  0.899236
round_time_test        0 days 00:00:11.530218
round_time_total       0 days 00:13:58.748862
loss_total                        1340.972514
loss_critic                       1773.989021
loss_actor                        -391.093638
memory_size                       588857.9925 

=== epoch 8/10 ===== round 20/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                   34
episode_length                     290.147059
returns                            -59.964164
return_std                          90.347911
average_reward                      -0.209243
round_time             0 days 00:14:01.727361
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         913.6047
return_std_test                       8.00612
average_reward_test                  0.913605
round_time_test        0 days 00:00:11.419525
round_time_total       0 days 00:14:01.728435
loss_total                        1327.390155
loss_critic                       1757.075779
loss_actor                        -391.352472
memory_size                        590603.181 

=== epoch 8/10 ===== round 21/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                   38
episode_length                     252.473684
returns                            -48.545417
return_std                          76.073216
average_reward                      -0.193503
round_time             0 days 00:14:01.486569
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       872.964497
return_std_test                     13.989698
average_reward_test                  0.872964
round_time_test        0 days 00:00:11.364281
round_time_total       0 days 00:14:01.487639
loss_total                        1294.653102
loss_critic                       1716.185614
loss_actor                        -391.477062
memory_size                       592371.5885 

=== epoch 8/10 ===== round 22/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                   32
episode_length                      301.84375
returns                            -52.308413
return_std                          86.377207
average_reward                       -0.17307
round_time             0 days 00:14:03.200445
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       899.883876
return_std_test                     15.261979
average_reward_test                  0.899884
round_time_test        0 days 00:00:11.427038
round_time_total       0 days 00:14:03.201524
loss_total                        1305.286556
loss_critic                       1729.450439
loss_actor                        -391.369101
memory_size                       594165.3625 

=== epoch 8/10 ===== round 23/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                   38
episode_length                     242.736842
returns                             -37.65166
return_std                          70.553773
average_reward                      -0.160722
round_time             0 days 00:14:03.840398
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        915.21868
return_std_test                     10.242308
average_reward_test                  0.915219
round_time_test        0 days 00:00:11.228433
round_time_total       0 days 00:14:03.841462
loss_total                        1315.482667
loss_critic                       1742.324831
loss_actor                        -391.886121
memory_size                       595981.2685 

=== epoch 8/10 ===== round 24/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   34
episode_length                     262.852941
returns                            -38.105595
return_std                          71.705268
average_reward                      -0.157508
round_time             0 days 00:14:08.390622
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       904.693255
return_std_test                      12.73807
average_reward_test                  0.904693
round_time_test        0 days 00:00:11.525007
round_time_total       0 days 00:14:08.391925
loss_total                        1314.389091
loss_critic                        1740.99993
loss_actor                        -392.054398
memory_size                        597626.231 

=== epoch 8/10 ===== round 25/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                   43
episode_length                     230.837209
returns                            -33.354354
return_std                           64.97194
average_reward                      -0.145667
round_time             0 days 00:14:07.248417
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.530702
return_std_test                      9.729946
average_reward_test                  0.913531
round_time_test        0 days 00:00:11.440020
round_time_total       0 days 00:14:07.249702
loss_total                        1328.995223
loss_critic                       1759.301082
loss_actor                        -392.228331
memory_size                       599419.5195 

=== epoch 8/10 ===== round 26/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                   42
episode_length                     223.738095
returns                            -36.478269
return_std                          70.105639
average_reward                      -0.144637
round_time             0 days 00:14:02.960198
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       893.818898
return_std_test                      17.44817
average_reward_test                  0.893819
round_time_test        0 days 00:00:11.537782
round_time_total       0 days 00:14:02.961264
loss_total                         1314.39118
loss_critic                       1741.142188
loss_actor                        -392.612971
memory_size                        601023.686 

=== epoch 8/10 ===== round 27/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                   44
episode_length                     210.681818
returns                            -35.390103
return_std                          69.103943
average_reward                      -0.172704
round_time             0 days 00:14:01.630830
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.558635
return_std_test                      8.238815
average_reward_test                  0.889559
round_time_test        0 days 00:00:11.534320
round_time_total       0 days 00:14:01.632087
loss_total                         1321.89989
loss_critic                       1750.338805
loss_actor                        -391.855883
memory_size                       602875.5175 

=== epoch 8/10 ===== round 28/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   43
episode_length                     228.023256
returns                            -39.507549
return_std                          73.395374
average_reward                      -0.171373
round_time             0 days 00:14:06.005691
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       894.985036
return_std_test                     10.395146
average_reward_test                  0.894985
round_time_test        0 days 00:00:11.442512
round_time_total       0 days 00:14:06.006909
loss_total                        1314.471888
loss_critic                       1741.098835
loss_actor                        -392.036025
memory_size                       604512.7865 

=== epoch 8/10 ===== round 29/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   42
episode_length                     217.428571
returns                            -39.369554
return_std                          73.698179
average_reward                        -0.1815
round_time             0 days 00:14:05.898556
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        855.74802
return_std_test                      13.11679
average_reward_test                  0.855748
round_time_test        0 days 00:00:11.604294
round_time_total       0 days 00:14:05.899664
loss_total                        1319.334397
loss_critic                       1747.214607
loss_actor                        -392.186553
memory_size                       606303.9455 

=== epoch 8/10 ===== round 30/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                   35
episode_length                     257.628571
returns                            -54.257512
return_std                          83.726909
average_reward                       -0.20562
round_time             0 days 00:14:06.485922
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.054609
return_std_test                     12.544027
average_reward_test                  0.889055
round_time_test        0 days 00:00:11.593083
round_time_total       0 days 00:14:06.486993
loss_total                        1306.180634
loss_critic                       1730.999243
loss_actor                        -393.093919
memory_size                        608126.852 

=== epoch 8/10 ===== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                   33
episode_length                     280.151515
returns                             -57.62997
return_std                          85.942831
average_reward                      -0.204467
round_time             0 days 00:14:06.471477
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       892.193705
return_std_test                     15.570135
average_reward_test                  0.892194
round_time_test        0 days 00:00:11.467003
round_time_total       0 days 00:14:06.472546
loss_total                        1267.911544
loss_critic                       1683.232167
loss_actor                        -393.371065
memory_size                        609934.356 

=== epoch 8/10 ===== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   33
episode_length                     273.424242
returns                            -47.439496
return_std                          79.103681
average_reward                      -0.182423
round_time             0 days 00:14:05.766888
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        915.69734
return_std_test                     16.594371
average_reward_test                  0.915697
round_time_test        0 days 00:00:11.360753
round_time_total       0 days 00:14:05.767968
loss_total                        1336.565253
loss_critic                       1769.074589
loss_actor                        -393.472206
memory_size                       611674.5615 

=== epoch 8/10 ===== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   32
episode_length                      296.46875
returns                            -54.368305
return_std                          85.262616
average_reward                      -0.185138
round_time             0 days 00:14:08.753172
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       894.910457
return_std_test                     13.595367
average_reward_test                   0.89491
round_time_test        0 days 00:00:11.640126
round_time_total       0 days 00:14:08.754237
loss_total                         1310.38394
loss_critic                       1736.510346
loss_actor                        -394.121803
memory_size                       613492.0135 

=== epoch 8/10 ===== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   28
episode_length                     298.821429
returns                            -52.373489
return_std                          77.826278
average_reward                      -0.180483
round_time             0 days 00:14:05.651700
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.632482
return_std_test                     11.041687
average_reward_test                  0.889632
round_time_test        0 days 00:00:11.407322
round_time_total       0 days 00:14:05.652795
loss_total                        1286.591515
loss_critic                       1707.163437
loss_actor                        -395.696283
memory_size                       615283.9755 

=== epoch 8/10 ===== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                   33
episode_length                     275.242424
returns                            -42.697177
return_std                          70.303791
average_reward                      -0.164094
round_time             0 days 00:14:06.332062
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.925558
return_std_test                     15.268742
average_reward_test                  0.913926
round_time_test        0 days 00:00:11.363588
round_time_total       0 days 00:14:06.333120
loss_total                        1297.507829
loss_critic                        1720.94175
loss_actor                        -396.227976
memory_size                         617026.55 

=== epoch 8/10 ===== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   33
episode_length                     281.878788
returns                            -46.156046
return_std                          71.522718
average_reward                      -0.157622
round_time             0 days 00:14:09.240393
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       879.702654
return_std_test                     10.489431
average_reward_test                  0.879703
round_time_test        0 days 00:00:11.428267
round_time_total       0 days 00:14:09.241458
loss_total                         1289.22925
loss_critic                       1710.528237
loss_actor                        -395.966815
memory_size                       618806.7845 

=== epoch 8/10 ===== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   32
episode_length                      279.84375
returns                            -47.292264
return_std                            58.1448
average_reward                      -0.172357
round_time             0 days 00:14:13.331814
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       882.724528
return_std_test                     15.377848
average_reward_test                  0.882725
round_time_test        0 days 00:00:11.425297
round_time_total       0 days 00:14:13.332894
loss_total                        1322.771426
loss_critic                       1752.521196
loss_actor                        -396.227776
memory_size                        620598.188 

=== epoch 8/10 ===== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   30
episode_length                     314.333333
returns                            -54.829298
return_std                          71.967815
average_reward                       -0.17319
round_time             0 days 00:14:11.449343
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.137084
return_std_test                     13.852123
average_reward_test                  0.871137
round_time_test        0 days 00:00:11.693928
round_time_total       0 days 00:14:11.450551
loss_total                        1358.783285
loss_critic                       1797.463626
loss_actor                        -395.938195
memory_size                        622430.399 

=== epoch 8/10 ===== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   33
episode_length                     299.727273
returns                            -55.472975
return_std                          72.217959
average_reward                      -0.184633
round_time             0 days 00:14:08.391359
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       871.973706
return_std_test                     21.255864
average_reward_test                  0.871974
round_time_test        0 days 00:00:11.561489
round_time_total       0 days 00:14:08.392435
loss_total                        1349.086801
loss_critic                       1785.182265
loss_actor                        -395.295186
memory_size                        624272.023 

=== epoch 8/10 ===== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:08<00:00,  2.36it/s]
episodes                                   29
episode_length                          327.0
returns                            -58.318788
return_std                          75.128731
average_reward                      -0.180759
round_time             0 days 00:14:08.572340
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.682149
return_std_test                     18.996497
average_reward_test                  0.900682
round_time_test        0 days 00:00:11.561011
round_time_total       0 days 00:14:08.573605
loss_total                        1309.778577
loss_critic                       1735.962977
loss_actor                        -394.959146
memory_size                       626047.7355 

=== epoch 8/10 ===== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:06<00:00,  2.36it/s]
episodes                                   33
episode_length                     291.181818
returns                            -54.933967
return_std                          76.532798
average_reward                      -0.189348
round_time             0 days 00:14:06.904445
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       899.080614
return_std_test                     18.533223
average_reward_test                  0.899081
round_time_test        0 days 00:00:11.430780
round_time_total       0 days 00:14:06.905521
loss_total                        1347.141954
loss_critic                       1782.733599
loss_actor                        -395.224759
memory_size                       627738.2825 

=== epoch 8/10 ===== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   28
episode_length                     319.285714
returns                            -60.846733
return_std                          84.788289
average_reward                      -0.195477
round_time             0 days 00:14:09.770330
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       902.872472
return_std_test                      9.004043
average_reward_test                  0.902872
round_time_test        0 days 00:00:11.621601
round_time_total       0 days 00:14:09.771399
loss_total                        1353.549739
loss_critic                       1790.720432
loss_actor                        -395.133169
memory_size                        629619.707 

=== epoch 8/10 ===== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:16<00:00,  2.34it/s]
episodes                                   30
episode_length                          312.3
returns                            -56.356727
return_std                          78.299175
average_reward                      -0.187464
round_time             0 days 00:14:16.942283
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.619233
return_std_test                     11.424481
average_reward_test                  0.912619
round_time_test        0 days 00:00:11.424085
round_time_total       0 days 00:14:16.943355
loss_total                        1357.477058
loss_critic                        1795.48346
loss_actor                        -394.548667
memory_size                       631439.1645 

=== epoch 8/10 ===== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:21<00:00,  2.32it/s]
episodes                                   38
episode_length                     259.447368
returns                            -38.262181
return_std                           72.03386
average_reward                      -0.148861
round_time             0 days 00:14:21.788827
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        899.94727
return_std_test                     12.265892
average_reward_test                  0.899947
round_time_test        0 days 00:00:11.399333
round_time_total       0 days 00:14:21.789890
loss_total                        1305.590883
loss_critic                       1730.644746
loss_actor                        -394.624689
memory_size                        633162.999 

=== epoch 8/10 ===== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:20<00:00,  2.32it/s]
episodes                                   49
episode_length                     202.959184
returns                            -32.273336
return_std                          64.885153
average_reward                      -0.160007
round_time             0 days 00:14:21.093064
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       893.918119
return_std_test                     21.385065
average_reward_test                  0.893918
round_time_test        0 days 00:00:11.515762
round_time_total       0 days 00:14:21.094135
loss_total                        1339.323594
loss_critic                       1773.046957
loss_actor                        -395.569987
memory_size                       634801.2855 

=== epoch 8/10 ===== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   51
episode_length                     195.215686
returns                            -27.452837
return_std                          60.037976
average_reward                      -0.140046
round_time             0 days 00:14:10.186531
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.775664
return_std_test                      11.83155
average_reward_test                  0.900776
round_time_test        0 days 00:00:11.607512
round_time_total       0 days 00:14:10.187785
loss_total                        1342.369979
loss_critic                       1776.838259
loss_actor                        -395.503256
memory_size                       636184.6575 

=== epoch 8/10 ===== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   52
episode_length                     178.134615
returns                            -22.879573
return_std                          47.185538
average_reward                      -0.123597
round_time             0 days 00:14:11.352942
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       880.425723
return_std_test                       8.54936
average_reward_test                  0.880426
round_time_test        0 days 00:00:11.685468
round_time_total       0 days 00:14:11.354175
loss_total                        1350.475775
loss_critic                       1786.963262
loss_actor                        -395.474289
memory_size                        638021.842 

=== epoch 8/10 ===== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   51
episode_length                     193.411765
returns                             -29.76866
return_std                          62.089519
average_reward                      -0.151491
round_time             0 days 00:14:11.089091
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.561768
return_std_test                      12.13119
average_reward_test                  0.898562
round_time_test        0 days 00:00:11.514882
round_time_total       0 days 00:14:11.090168
loss_total                        1324.915465
loss_critic                       1755.075615
loss_actor                        -395.725263
memory_size                        639881.503 

=== epoch 8/10 ===== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:13<00:00,  2.34it/s]
episodes                                   48
episode_length                        172.875
returns                            -27.101285
return_std                           55.31974
average_reward                      -0.168502
round_time             0 days 00:14:14.274475
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.478691
return_std_test                      7.476066
average_reward_test                  0.889479
round_time_test        0 days 00:00:11.435934
round_time_total       0 days 00:14:14.275554
loss_total                        1374.816152
loss_critic                       1817.465907
loss_actor                        -395.783009
memory_size                       641508.3705 

=== epoch 8/10 ===== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   39
episode_length                     251.512821
returns                            -35.774245
return_std                          69.146125
average_reward                      -0.133014
round_time             0 days 00:14:12.975447
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.788679
return_std_test                     19.508704
average_reward_test                  0.885789
round_time_test        0 days 00:00:11.362463
round_time_total       0 days 00:14:12.976652
loss_total                        1334.722245
loss_critic                       1767.348474
loss_actor                        -395.782789
memory_size                       643258.7045 


<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [13:17<00:00,  2.51it/s]
episodes                                    6
episode_length                     277.833333
returns                            -30.146978
return_std                          79.865007
average_reward                      -0.114581
round_time             0 days 00:13:17.592843
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       890.424332
return_std_test                     17.915417
average_reward_test                  0.890424
round_time_test        0 days 00:00:11.740886
round_time_total       0 days 00:13:17.594054
loss_total                        1344.282863
loss_critic                       1779.350002
loss_actor                        -395.985813
memory_size                       644937.8175 

=== epoch 9/10 ===== round 2/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:21<00:00,  2.50it/s]
episodes                                    8
episode_length                        458.375
returns                            -77.889845
return_std                         108.101648
average_reward                      -0.171123
round_time             0 days 00:13:21.895107
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       909.407416
return_std_test                     14.551529
average_reward_test                  0.909407
round_time_test        0 days 00:00:11.618832
round_time_total       0 days 00:13:21.896188
loss_total                        1346.643513
loss_critic                       1782.181793
loss_actor                        -395.509728
memory_size                        646832.739 

=== epoch 9/10 ===== round 3/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:22<00:00,  2.49it/s]
episodes                                   14
episode_length                     377.214286
returns                            -63.160125
return_std                          98.186351
average_reward                      -0.169731
round_time             0 days 00:13:23.196526
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.258876
return_std_test                     11.430069
average_reward_test                  0.900259
round_time_test        0 days 00:00:11.544342
round_time_total       0 days 00:13:23.197599
loss_total                        1329.025377
loss_critic                       1760.499754
loss_actor                        -396.872257
memory_size                       648686.4525 

=== epoch 9/10 ===== round 4/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:29<00:00,  2.47it/s]
episodes                                   20
episode_length                         392.25
returns                            -70.401256
return_std                         102.961861
average_reward                      -0.178979
round_time             0 days 00:13:29.676936
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       918.410516
return_std_test                      7.626298
average_reward_test                  0.918411
round_time_test        0 days 00:00:11.345290
round_time_total       0 days 00:13:29.678003
loss_total                        1306.955848
loss_critic                       1732.911884
loss_actor                        -396.868417
memory_size                       650532.1225 

=== epoch 9/10 ===== round 5/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:23<00:00,  2.49it/s]
episodes                                   30
episode_length                     333.233333
returns                            -53.729813
return_std                          97.104065
average_reward                      -0.160909
round_time             0 days 00:13:23.925613
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       885.909996
return_std_test                      6.190323
average_reward_test                   0.88591
round_time_test        0 days 00:00:11.550371
round_time_total       0 days 00:13:23.926826
loss_total                        1345.464238
loss_critic                       1781.070964
loss_actor                        -396.962784
memory_size                        652253.212 

=== epoch 9/10 ===== round 6/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:29<00:00,  2.47it/s]
episodes                                   30
episode_length                          288.4
returns                            -49.629354
return_std                           91.84914
average_reward                      -0.180664
round_time             0 days 00:13:29.648889
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       894.384647
return_std_test                      8.419715
average_reward_test                  0.894385
round_time_test        0 days 00:00:11.428632
round_time_total       0 days 00:13:29.649945
loss_total                        1334.015235
loss_critic                       1766.614701
loss_actor                         -396.38274
memory_size                       653957.8665 

=== epoch 9/10 ===== round 7/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:46<00:00,  2.42it/s]
episodes                                   40
episode_length                          232.5
returns                            -36.435502
return_std                           78.42862
average_reward                      -0.160495
round_time             0 days 00:13:46.975343
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       906.558523
return_std_test                     11.539042
average_reward_test                  0.906559
round_time_test        0 days 00:00:11.435196
round_time_total       0 days 00:13:46.976408
loss_total                        1333.384681
loss_critic                       1765.891859
loss_actor                        -396.644157
memory_size                       655734.6715 

=== epoch 9/10 ===== round 8/50 ======================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
100%|██████████| 2000/2000 [13:54<00:00,  2.40it/s]
episodes                                   43
episode_length                     224.860465
returns                            -37.431748
return_std                          78.906605
average_reward                      -0.170553
round_time             0 days 00:13:54.784498
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.097823
return_std_test                      8.160331
average_reward_test                  0.915098
round_time_test        0 days 00:00:11.386486
round_time_total       0 days 00:13:54.785578
loss_total                        1343.421771
loss_critic                       1778.422279
loss_actor                        -396.580382
memory_size                        657386.296 

=== epoch 9/10 ===== round 9/50 ======================================
100%|██████████| 2000/2000 [15:46<00:00,  2.11it/s]
episodes                                   49
episode_length                     194.061224
returns                            -30.811228
return_std                          66.517915
average_reward                       -0.15664
round_time             0 days 00:15:46.881170
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        940.58479
return_std_test                     10.108604
average_reward_test                  0.940585
round_time_test        0 days 00:00:11.324571
round_time_total       0 days 00:15:46.882505
loss_total                         1330.08882
loss_critic                       1761.855107
loss_actor                        -396.976454
memory_size                       659036.2745 

=== epoch 9/10 ===== round 10/50 =====================================
100%|██████████| 2000/2000 [17:01<00:00,  1.96it/s]
episodes                                   44
episode_length                     210.340909
returns                            -38.613973
return_std                          67.639866
average_reward                      -0.182714
round_time             0 days 00:17:02.404710
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       928.311542
return_std_test                      12.95805
average_reward_test                  0.928312
round_time_test        0 days 00:00:11.990164
round_time_total       0 days 00:17:02.406080
loss_total                         1324.99433
loss_critic                       1755.562407
loss_actor                        -397.278095
memory_size                        660770.016 

=== epoch 9/10 ===== round 11/50 =====================================
100%|██████████| 2000/2000 [16:53<00:00,  1.97it/s]
episodes                                   48
episode_length                     189.666667
returns                            -30.176074
return_std                          60.439537
average_reward                      -0.160599
round_time             0 days 00:16:54.499219
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       921.093704
return_std_test                      9.367153
average_reward_test                  0.921094
round_time_test        0 days 00:00:11.804490
round_time_total       0 days 00:16:54.500386
loss_total                        1330.849742
loss_critic                        1762.94534
loss_actor                         -397.53278
memory_size                       662494.0285 

=== epoch 9/10 ===== round 12/50 =====================================
100%|██████████| 2000/2000 [14:53<00:00,  2.24it/s]
episodes                                   46
episode_length                          202.0
returns                             -34.00602
return_std                          60.560985
average_reward                        -0.1718
round_time             0 days 00:14:54.399657
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       941.179061
return_std_test                     15.244783
average_reward_test                  0.941179
round_time_test        0 days 00:00:11.974177
round_time_total       0 days 00:14:54.400738
loss_total                        1353.688904
loss_critic                       1791.382116
loss_actor                        -397.084082
memory_size                       664157.4945 

=== epoch 9/10 ===== round 13/50 =====================================
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                   51
episode_length                      188.27451
returns                            -29.244483
return_std                          56.802418
average_reward                      -0.146863
round_time             0 days 00:14:04.855959
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       920.385314
return_std_test                     23.470061
average_reward_test                  0.920385
round_time_test        0 days 00:00:11.538505
round_time_total       0 days 00:14:04.857034
loss_total                        1338.174165
loss_critic                       1771.883542
loss_actor                        -396.663478
memory_size                        665803.147 

=== epoch 9/10 ===== round 14/50 =====================================
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                   52
episode_length                     171.057692
returns                            -24.649631
return_std                          53.675212
average_reward                      -0.150622
round_time             0 days 00:14:02.292636
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.820476
return_std_test                     15.846295
average_reward_test                   0.92982
round_time_test        0 days 00:00:11.311083
round_time_total       0 days 00:14:02.293707
loss_total                        1341.151636
loss_critic                       1775.554407
loss_actor                        -396.459574
memory_size                       667347.1015 

=== epoch 9/10 ===== round 15/50 =====================================
100%|██████████| 2000/2000 [15:37<00:00,  2.13it/s]
episodes                                   56
episode_length                     158.267857
returns                            -21.571256
return_std                          53.677154
average_reward                      -0.148051
round_time             0 days 00:15:38.187575
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       906.668739
return_std_test                      12.76773
average_reward_test                  0.906669
round_time_test        0 days 00:00:11.491820
round_time_total       0 days 00:15:38.188892
loss_total                        1334.675079
loss_critic                       1767.618184
loss_actor                        -397.097461
memory_size                        669005.744 

=== epoch 9/10 ===== round 16/50 =====================================
100%|██████████| 2000/2000 [16:55<00:00,  1.97it/s]
episodes                                   48
episode_length                     183.916667
returns                            -30.342006
return_std                          64.436196
average_reward                      -0.171641
round_time             0 days 00:16:56.175323
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.712093
return_std_test                     15.252269
average_reward_test                  0.929712
round_time_test        0 days 00:00:11.794551
round_time_total       0 days 00:16:56.176537
loss_total                        1323.394092
loss_critic                       1753.724881
loss_actor                        -397.929183
memory_size                        670846.437 

=== epoch 9/10 ===== round 17/50 =====================================
100%|██████████| 2000/2000 [16:50<00:00,  1.98it/s]
episodes                                   40
episode_length                          223.7
returns                            -36.229511
return_std                          71.935722
average_reward                      -0.166556
round_time             0 days 00:16:50.801416
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       936.442284
return_std_test                      10.57823
average_reward_test                  0.936442
round_time_test        0 days 00:00:11.900962
round_time_total       0 days 00:16:50.802538
loss_total                        1347.301848
loss_critic                       1783.705006
loss_actor                        -398.310921
memory_size                        672761.595 

=== epoch 9/10 ===== round 18/50 =====================================
100%|██████████| 2000/2000 [16:23<00:00,  2.03it/s]
episodes                                   30
episode_length                          309.5
returns                            -62.894179
return_std                          93.125066
average_reward                      -0.209927
round_time             0 days 00:16:23.639256
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       937.097065
return_std_test                      8.285254
average_reward_test                  0.937097
round_time_test        0 days 00:00:12.255807
round_time_total       0 days 00:16:23.640572
loss_total                        1335.790832
loss_critic                       1769.387012
loss_actor                        -398.594017
memory_size                        674662.992 

=== epoch 9/10 ===== round 19/50 =====================================
100%|██████████| 2000/2000 [23:36<00:00,  1.41it/s]
episodes                                   29
episode_length                     328.551724
returns                            -62.996748
return_std                          98.565915
average_reward                      -0.195916
round_time             0 days 00:23:37.204621
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.308549
return_std_test                     17.580763
average_reward_test                  0.915309
round_time_test        0 days 00:00:12.514458
round_time_total       0 days 00:23:37.205923
loss_total                        1307.677402
loss_critic                       1734.469255
loss_actor                        -399.490131
memory_size                        676460.592 

=== epoch 9/10 ===== round 20/50 =====================================
100%|██████████| 2000/2000 [28:20<00:00,  1.18it/s]
episodes                                   31
episode_length                     316.709677
returns                            -58.768597
return_std                           93.81327
average_reward                      -0.185705
round_time             0 days 00:28:21.522707
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       921.255914
return_std_test                     16.708663
average_reward_test                  0.921256
round_time_test        0 days 00:00:14.137591
round_time_total       0 days 00:28:21.524183
loss_total                        1326.656283
loss_critic                       1758.328121
loss_actor                        -400.031186
memory_size                        678005.589 

=== epoch 9/10 ===== round 21/50 =====================================
100%|██████████| 2000/2000 [27:44<00:00,  1.20it/s]
episodes                                   36
episode_length                     267.277778
returns                            -45.367879
return_std                           87.60806
average_reward                      -0.172787
round_time             0 days 00:27:45.670119
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       898.390929
return_std_test                     14.996763
average_reward_test                  0.898391
round_time_test        0 days 00:00:13.833963
round_time_total       0 days 00:27:45.671464
loss_total                        1319.397653
loss_critic                       1749.126075
loss_actor                        -399.516148
memory_size                        679732.489 

=== epoch 9/10 ===== round 22/50 =====================================
100%|██████████| 2000/2000 [25:07<00:00,  1.33it/s]
episodes                                   37
episode_length                     261.675676
returns                            -50.084834
return_std                          96.610154
average_reward                      -0.195498
round_time             0 days 00:25:07.816999
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       874.497097
return_std_test                      11.24597
average_reward_test                  0.874497
round_time_test        0 days 00:00:13.911979
round_time_total       0 days 00:25:07.818246
loss_total                        1309.700531
loss_critic                       1737.134222
loss_actor                        -400.034361
memory_size                        681609.438 

=== epoch 9/10 ===== round 23/50 =====================================
100%|██████████| 2000/2000 [23:43<00:00,  1.41it/s]
episodes                                   36
episode_length                     242.027778
returns                            -39.900131
return_std                           81.67148
average_reward                      -0.176115
round_time             0 days 00:23:43.883605
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       904.496858
return_std_test                      9.282993
average_reward_test                  0.904497
round_time_test        0 days 00:00:12.980507
round_time_total       0 days 00:23:43.885092
loss_total                         1336.34173
loss_critic                       1770.359434
loss_actor                        -399.729214
memory_size                        683503.925 

=== epoch 9/10 ===== round 24/50 =====================================
100%|██████████| 2000/2000 [27:49<00:00,  1.20it/s]
episodes                                   31
episode_length                     312.032258
returns                            -65.988861
return_std                           97.07789
average_reward                       -0.20932
round_time             0 days 00:27:50.630240
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.797746
return_std_test                     12.672577
average_reward_test                  0.913798
round_time_test        0 days 00:00:14.322908
round_time_total       0 days 00:27:50.631725
loss_total                        1323.293775
loss_critic                       1754.317428
loss_actor                        -400.800961
memory_size                        685317.658 

=== epoch 9/10 ===== round 25/50 =====================================
100%|██████████| 2000/2000 [18:29<00:00,  1.80it/s]
episodes                                   30
episode_length                     328.133333
returns                            -71.050522
return_std                         103.327777
average_reward                      -0.211758
round_time             0 days 00:18:29.928886
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       911.192258
return_std_test                       9.77484
average_reward_test                  0.911192
round_time_test        0 days 00:00:13.876720
round_time_total       0 days 00:18:29.930111
loss_total                        1313.786072
loss_critic                       1742.615472
loss_actor                        -401.531652
memory_size                        687142.239 

=== epoch 9/10 ===== round 26/50 =====================================
100%|██████████| 2000/2000 [14:41<00:00,  2.27it/s]
episodes                                   26
episode_length                     325.230769
returns                            -67.417432
return_std                         101.178486
average_reward                      -0.207395
round_time             0 days 00:14:42.056169
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        922.57833
return_std_test                      3.146166
average_reward_test                  0.922578
round_time_test        0 days 00:00:12.098506
round_time_total       0 days 00:14:42.057246
loss_total                        1312.008574
loss_critic                       1740.274829
loss_actor                        -401.056575
memory_size                       688859.9705 

=== epoch 9/10 ===== round 27/50 =====================================
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   27
episode_length                     315.740741
returns                            -61.200428
return_std                          95.738033
average_reward                      -0.187748
round_time             0 days 00:14:10.602831
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       918.140079
return_std_test                     23.612678
average_reward_test                   0.91814
round_time_test        0 days 00:00:11.196689
round_time_total       0 days 00:14:10.603906
loss_total                        1343.523407
loss_critic                       1779.618033
loss_actor                        -400.855221
memory_size                        690727.229 

=== epoch 9/10 ===== round 28/50 =====================================
100%|██████████| 2000/2000 [14:13<00:00,  2.34it/s]
episodes                                   28
episode_length                     350.607143
returns                            -67.977482
return_std                          96.684016
average_reward                       -0.19305
round_time             0 days 00:14:13.535099
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       923.807831
return_std_test                     10.391993
average_reward_test                  0.923808
round_time_test        0 days 00:00:11.272676
round_time_total       0 days 00:14:13.536327
loss_total                        1342.884194
loss_critic                       1778.752371
loss_actor                        -400.588645
memory_size                       692579.9145 

=== epoch 9/10 ===== round 29/50 =====================================
100%|██████████| 2000/2000 [14:13<00:00,  2.34it/s]
episodes                                   28
episode_length                     316.285714
returns                            -48.615734
return_std                            81.2842
average_reward                      -0.163061
round_time             0 days 00:14:14.251310
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       920.528615
return_std_test                     17.560848
average_reward_test                  0.920529
round_time_test        0 days 00:00:11.649822
round_time_total       0 days 00:14:14.252383
loss_total                        1353.778177
loss_critic                       1792.495009
loss_actor                        -401.089272
memory_size                       694381.5335 

=== epoch 9/10 ===== round 30/50 =====================================
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   22
episode_length                     381.727273
returns                            -61.399768
return_std                           88.76801
average_reward                      -0.174765
round_time             0 days 00:14:10.246488
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       905.424172
return_std_test                      6.399261
average_reward_test                  0.905424
round_time_test        0 days 00:00:11.491507
round_time_total       0 days 00:14:10.247760
loss_total                        1368.990446
loss_critic                       1811.430814
loss_actor                        -400.771149
memory_size                       696182.8175 

=== epoch 9/10 ===== round 31/50 =====================================
100%|██████████| 2000/2000 [14:10<00:00,  2.35it/s]
episodes                                   27
episode_length                     341.333333
returns                            -51.479983
return_std                          91.108841
average_reward                      -0.156722
round_time             0 days 00:14:11.221859
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       903.186231
return_std_test                       9.26885
average_reward_test                  0.903186
round_time_test        0 days 00:00:11.489923
round_time_total       0 days 00:14:11.223020
loss_total                        1329.458558
loss_critic                       1761.881897
loss_actor                        -400.234917
memory_size                        698004.744 

=== epoch 9/10 ===== round 32/50 =====================================
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   29
episode_length                     299.241379
returns                            -46.714276
return_std                          82.225183
average_reward                      -0.170252
round_time             0 days 00:14:08.514038
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       887.031964
return_std_test                     18.028955
average_reward_test                  0.887032
round_time_test        0 days 00:00:11.559178
round_time_total       0 days 00:14:08.515153
loss_total                        1354.636135
loss_critic                       1793.510384
loss_actor                        -400.860992
memory_size                       699754.5965 

=== epoch 9/10 ===== round 33/50 =====================================
100%|██████████| 2000/2000 [14:05<00:00,  2.36it/s]
episodes                                   32
episode_length                       289.0625
returns                            -43.259976
return_std                           78.65453
average_reward                      -0.156079
round_time             0 days 00:14:06.437180
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       920.701452
return_std_test                      9.233217
average_reward_test                  0.920701
round_time_test        0 days 00:00:11.491121
round_time_total       0 days 00:14:06.438237
loss_total                        1308.880928
loss_critic                       1736.554851
loss_actor                        -401.814886
memory_size                        701513.683 

=== epoch 9/10 ===== round 34/50 =====================================
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   37
episode_length                     263.756757
returns                            -48.141924
return_std                          76.926729
average_reward                      -0.186677
round_time             0 days 00:14:08.020287
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.273205
return_std_test                      9.939024
average_reward_test                  0.912273
round_time_test        0 days 00:00:11.417477
round_time_total       0 days 00:14:08.021362
loss_total                        1324.948697
loss_critic                       1756.668338
loss_actor                        -401.929994
memory_size                        703261.427 

=== epoch 9/10 ===== round 35/50 =====================================
100%|██████████| 2000/2000 [14:09<00:00,  2.35it/s]
episodes                                   39
episode_length                     231.461538
returns                             -36.36327
return_std                          67.418473
average_reward                      -0.160458
round_time             0 days 00:14:09.838024
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.187602
return_std_test                     12.834732
average_reward_test                  0.915188
round_time_test        0 days 00:00:11.528129
round_time_total       0 days 00:14:09.839361
loss_total                        1326.758073
loss_critic                       1758.951581
loss_actor                        -402.016083
memory_size                        704965.431 

=== epoch 9/10 ===== round 36/50 =====================================
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   36
episode_length                          243.5
returns                            -42.795174
return_std                           62.35156
average_reward                      -0.181619
round_time             0 days 00:14:08.143724
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        899.46197
return_std_test                       5.34839
average_reward_test                  0.899462
round_time_test        0 days 00:00:11.321273
round_time_total       0 days 00:14:08.144966
loss_total                        1333.318806
loss_critic                       1767.127782
loss_actor                        -401.917227
memory_size                       706748.1905 

=== epoch 9/10 ===== round 37/50 =====================================
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                   32
episode_length                      286.71875
returns                            -46.563938
return_std                          66.474358
average_reward                      -0.162198
round_time             0 days 00:14:04.327633
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.740963
return_std_test                      9.369564
average_reward_test                  0.913741
round_time_test        0 days 00:00:11.372850
round_time_total       0 days 00:14:04.328700
loss_total                        1321.212915
loss_critic                        1751.96193
loss_actor                        -401.783271
memory_size                        708625.635 

=== epoch 9/10 ===== round 38/50 =====================================
100%|██████████| 2000/2000 [14:14<00:00,  2.34it/s]
episodes                                   29
episode_length                     321.793103
returns                            -55.136693
return_std                          71.553876
average_reward                       -0.17442
round_time             0 days 00:14:14.516373
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       921.962316
return_std_test                      6.842708
average_reward_test                  0.921962
round_time_test        0 days 00:00:11.374468
round_time_total       0 days 00:14:14.517663
loss_total                        1323.705192
loss_critic                       1755.211606
loss_actor                        -402.320597
memory_size                        710513.483 

=== epoch 9/10 ===== round 39/50 =====================================
100%|██████████| 2000/2000 [14:01<00:00,  2.38it/s]
episodes                                   25
episode_length                         332.04
returns                            -53.428235
return_std                          76.810068
average_reward                      -0.170102
round_time             0 days 00:14:01.971273
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       943.144174
return_std_test                      9.798376
average_reward_test                  0.943144
round_time_test        0 days 00:00:11.149992
round_time_total       0 days 00:14:01.972538
loss_total                        1361.052514
loss_critic                       1802.026586
loss_actor                         -402.84391
memory_size                       712311.8845 

=== epoch 9/10 ===== round 40/50 =====================================
100%|██████████| 2000/2000 [15:23<00:00,  2.17it/s]
episodes                                   26
episode_length                     361.961538
returns                            -70.888793
return_std                          87.309296
average_reward                      -0.185825
round_time             0 days 00:15:23.891029
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       874.929588
return_std_test                      8.998864
average_reward_test                   0.87493
round_time_test        0 days 00:00:11.401074
round_time_total       0 days 00:15:23.892296
loss_total                        1328.498408
loss_critic                       1761.394798
loss_actor                        -403.087292
memory_size                       714123.2905 

=== epoch 9/10 ===== round 41/50 =====================================
100%|██████████| 2000/2000 [15:36<00:00,  2.13it/s]
episodes                                   36
episode_length                     261.777778
returns                            -40.786613
return_std                          77.893417
average_reward                      -0.159053
round_time             0 days 00:15:37.468163
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       918.070784
return_std_test                     16.708999
average_reward_test                  0.918071
round_time_test        0 days 00:00:11.979366
round_time_total       0 days 00:15:37.469493
loss_total                        1353.298124
loss_critic                       1792.349631
loss_actor                        -402.908029
memory_size                        715685.665 

=== epoch 9/10 ===== round 42/50 =====================================
100%|██████████| 2000/2000 [15:33<00:00,  2.14it/s]
episodes                                   42
episode_length                      228.02381
returns                            -36.559743
return_std                          74.472085
average_reward                      -0.165896
round_time             0 days 00:15:33.917957
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       924.895664
return_std_test                     18.113932
average_reward_test                  0.924896
round_time_test        0 days 00:00:12.123056
round_time_total       0 days 00:15:33.919087
loss_total                        1375.925135
loss_critic                       1820.483452
loss_actor                        -402.308262
memory_size                        717405.856 

=== epoch 9/10 ===== round 43/50 =====================================
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   48
episode_length                     200.854167
returns                            -26.993806
return_std                          66.864818
average_reward                       -0.13529
round_time             0 days 00:14:12.821817
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       888.075796
return_std_test                     16.795581
average_reward_test                  0.888076
round_time_test        0 days 00:00:11.568291
round_time_total       0 days 00:14:12.822896
loss_total                        1328.476541
loss_critic                       1761.170673
loss_actor                        -402.300114
memory_size                        719131.022 

=== epoch 9/10 ===== round 44/50 =====================================
100%|██████████| 2000/2000 [14:11<00:00,  2.35it/s]
episodes                                   54
episode_length                     174.851852
returns                            -18.125299
return_std                           56.08268
average_reward                       -0.10792
round_time             0 days 00:14:11.827062
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       928.307491
return_std_test                     16.912162
average_reward_test                  0.928307
round_time_test        0 days 00:00:11.380217
round_time_total       0 days 00:14:11.828178
loss_total                        1351.368194
loss_critic                       1789.815481
loss_actor                         -402.42109
memory_size                       720625.6365 

=== epoch 9/10 ===== round 45/50 =====================================
100%|██████████| 2000/2000 [14:17<00:00,  2.33it/s]
episodes                                   51
episode_length                     176.509804
returns                            -19.108119
return_std                          56.269478
average_reward                      -0.120609
round_time             0 days 00:14:18.269190
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       921.275671
return_std_test                      9.614971
average_reward_test                  0.921276
round_time_test        0 days 00:00:11.363163
round_time_total       0 days 00:14:18.270268
loss_total                        1341.827955
loss_critic                       1778.046885
loss_actor                        -403.047888
memory_size                        722409.543 

=== epoch 9/10 ===== round 46/50 =====================================
100%|██████████| 2000/2000 [14:12<00:00,  2.35it/s]
episodes                                   48
episode_length                        193.125
returns                            -31.596699
return_std                          70.034908
average_reward                      -0.154559
round_time             0 days 00:14:12.986313
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.756313
return_std_test                      9.910708
average_reward_test                  0.912756
round_time_test        0 days 00:00:11.539213
round_time_total       0 days 00:14:12.987381
loss_total                        1367.971199
loss_critic                       1810.718423
loss_actor                         -403.01783
memory_size                        724067.626 

=== epoch 9/10 ===== round 47/50 =====================================
100%|██████████| 2000/2000 [14:13<00:00,  2.34it/s]
episodes                                   49
episode_length                     197.142857
returns                            -32.026722
return_std                          69.488323
average_reward                      -0.159533
round_time             0 days 00:14:13.527232
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       907.003443
return_std_test                     10.574262
average_reward_test                  0.907003
round_time_test        0 days 00:00:11.478354
round_time_total       0 days 00:14:13.528302
loss_total                        1364.976464
loss_critic                       1806.949787
loss_actor                        -402.916954
memory_size                       725861.2075 

=== epoch 9/10 ===== round 48/50 =====================================
100%|██████████| 2000/2000 [14:16<00:00,  2.34it/s]
episodes                                   42
episode_length                     213.904762
returns                            -38.732555
return_std                          70.117441
average_reward                      -0.183271
round_time             0 days 00:14:16.822239
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       919.235547
return_std_test                     14.883046
average_reward_test                  0.919236
round_time_test        0 days 00:00:11.390531
round_time_total       0 days 00:14:16.823560
loss_total                        1312.326742
loss_critic                       1741.108813
loss_actor                        -402.801665
memory_size                       727615.3365 

=== epoch 9/10 ===== round 49/50 =====================================
100%|██████████| 2000/2000 [14:17<00:00,  2.33it/s]
episodes                                   36
episode_length                     258.194444
returns                            -50.235784
return_std                          79.598889
average_reward                      -0.193131
round_time             0 days 00:14:18.135198
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        915.82398
return_std_test                     23.290876
average_reward_test                  0.915824
round_time_test        0 days 00:00:11.526733
round_time_total       0 days 00:14:18.136359
loss_total                        1361.442855
loss_critic                       1802.458154
loss_actor                        -402.618469
memory_size                       729465.4385 

=== epoch 9/10 ===== round 50/50 =====================================
100%|██████████| 2000/2000 [14:25<00:00,  2.31it/s]
episodes                                   37
episode_length                     259.243243
returns                            -45.731087
return_std                          78.562294
average_reward                      -0.176901
round_time             0 days 00:14:25.933853
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       896.786128
return_std_test                     22.219882
average_reward_test                  0.896786
round_time_test        0 days 00:00:11.428931
round_time_total       0 days 00:14:25.935141
loss_total                        1342.017626
loss_critic                       1778.371222
loss_actor                        -403.396881
memory_size                        731218.935 


=== epoch 10/10 ==== round 1/50 ======================================
100%|██████████| 2000/2000 [14:20<00:00,  2.32it/s]
episodes                                    8
episode_length                        240.375
returns                            -29.289423
return_std                          62.731549
average_reward                      -0.111932
round_time             0 days 00:14:20.647910
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       919.765566
return_std_test                      7.013197
average_reward_test                  0.919766
round_time_test        0 days 00:00:11.860409
round_time_total       0 days 00:14:20.649105
loss_total                        1367.017728
loss_critic                       1809.857504
loss_actor                        -404.341505
memory_size                        732972.192 

=== epoch 10/10 ==== round 2/50 ======================================
100%|██████████| 2000/2000 [14:21<00:00,  2.32it/s]
episodes                                   14
episode_length                     269.785714
returns                            -33.978549
return_std                          73.841921
average_reward                      -0.140509
round_time             0 days 00:14:22.070228
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       917.391952
return_std_test                     11.613098
average_reward_test                  0.917392
round_time_test        0 days 00:00:11.917664
round_time_total       0 days 00:14:22.071378
loss_total                        1351.738654
loss_critic                       1790.660893
loss_actor                        -403.950427
memory_size                        734782.337 

=== epoch 10/10 ==== round 3/50 ======================================
100%|██████████| 2000/2000 [13:58<00:00,  2.39it/s]
episodes                                   34
episode_length                     174.441176
returns                            -26.848886
return_std                          51.968436
average_reward                        -0.1485
round_time             0 days 00:13:58.877383
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       939.417836
return_std_test                     13.755743
average_reward_test                  0.939418
round_time_test        0 days 00:00:11.973636
round_time_total       0 days 00:13:58.878682
loss_total                        1407.232072
loss_critic                         1859.8634
loss_actor                        -403.293374
memory_size                       736270.9395 

=== epoch 10/10 ==== round 4/50 ======================================
100%|██████████| 2000/2000 [13:20<00:00,  2.50it/s]
episodes                                   39
episode_length                     185.102564
returns                            -27.264006
return_std                          53.002558
average_reward                      -0.145153
round_time             0 days 00:13:20.902746
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       905.623779
return_std_test                      6.194875
average_reward_test                  0.905624
round_time_test        0 days 00:00:11.468221
round_time_total       0 days 00:13:20.903881
loss_total                        1367.204121
loss_critic                       1809.682176
loss_actor                        -402.708228
memory_size                        737899.133 

=== epoch 10/10 ==== round 5/50 ======================================
100%|██████████| 2000/2000 [14:34<00:00,  2.29it/s]
episodes                                   46
episode_length                     209.217391
returns                              -30.0673
return_std                          52.414954
average_reward                      -0.145952
round_time             0 days 00:14:35.476014
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        919.99003
return_std_test                     15.277648
average_reward_test                   0.91999
round_time_test        0 days 00:00:11.919334
round_time_total       0 days 00:14:35.477289
loss_total                        1379.373547
loss_critic                       1824.891973
loss_actor                        -402.700279
memory_size                        739738.276 

=== epoch 10/10 ==== round 6/50 ======================================
100%|██████████| 2000/2000 [14:34<00:00,  2.29it/s]
episodes                                   43
episode_length                      208.27907
returns                            -33.397916
return_std                          57.010664
average_reward                      -0.169386
round_time             0 days 00:14:34.690822
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.057644
return_std_test                     11.614651
average_reward_test                  0.912058
round_time_test        0 days 00:00:12.043125
round_time_total       0 days 00:14:34.691934
loss_total                        1375.625948
loss_critic                       1820.185089
loss_actor                        -402.610747
memory_size                       741564.7455 

=== epoch 10/10 ==== round 7/50 ======================================
100%|██████████| 2000/2000 [14:41<00:00,  2.27it/s]
episodes                                   49
episode_length                     202.142857
returns                            -29.053006
return_std                          61.782276
average_reward                      -0.140895
round_time             0 days 00:14:42.129036
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       904.913813
return_std_test                     13.850895
average_reward_test                  0.904914
round_time_test        0 days 00:00:12.059913
round_time_total       0 days 00:14:42.130119
loss_total                        1375.425164
loss_critic                       1819.919131
loss_actor                        -402.550824
memory_size                       743321.9385 

=== epoch 10/10 ==== round 8/50 ======================================
100%|██████████| 2000/2000 [13:44<00:00,  2.43it/s]
episodes                                   36
episode_length                     265.277778
returns                            -36.324078
return_std                          79.232319
average_reward                      -0.142926
round_time             0 days 00:13:45.250046
episodes_test                            10.0
episode_length_test                     901.2
returns_test                       820.089371
return_std_test                     273.51122
average_reward_test                   0.91024
round_time_test        0 days 00:00:11.325094
round_time_total       0 days 00:13:45.251115
loss_total                         1356.15462
loss_critic                       1795.821499
loss_actor                        -402.513023
memory_size                        744961.625 

=== epoch 10/10 ==== round 9/50 ======================================
100%|██████████| 2000/2000 [13:47<00:00,  2.42it/s]
episodes                                   35
episode_length                     271.714286
returns                            -44.083612
return_std                          91.427002
average_reward                      -0.163693
round_time             0 days 00:13:47.671953
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       940.029676
return_std_test                     16.020135
average_reward_test                   0.94003
round_time_test        0 days 00:00:11.378725
round_time_total       0 days 00:13:47.673030
loss_total                        1391.119486
loss_critic                       1839.617853
loss_actor                         -402.87411
memory_size                       746787.5675 

=== epoch 10/10 ==== round 10/50 =====================================
100%|██████████| 2000/2000 [13:53<00:00,  2.40it/s]
episodes                                   37
episode_length                     248.216216
returns                            -39.685864
return_std                          87.580508
average_reward                      -0.158809
round_time             0 days 00:13:54.189491
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        923.46009
return_std_test                     13.758804
average_reward_test                   0.92346
round_time_test        0 days 00:00:11.388780
round_time_total       0 days 00:13:54.190594
loss_total                         1362.39256
loss_critic                       1803.657048
loss_actor                        -402.665529
memory_size                       748590.3535 

=== epoch 10/10 ==== round 11/50 =====================================
100%|██████████| 2000/2000 [13:58<00:00,  2.39it/s]
episodes                                   39
episode_length                     245.923077
returns                            -37.095211
return_std                          87.213066
average_reward                      -0.156481
round_time             0 days 00:13:58.625971
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       917.890175
return_std_test                      4.850151
average_reward_test                   0.91789
round_time_test        0 days 00:00:11.452886
round_time_total       0 days 00:13:58.627296
loss_total                        1373.270406
loss_critic                       1817.347812
loss_actor                        -403.039346
memory_size                        750251.813 

=== epoch 10/10 ==== round 12/50 =====================================
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                   38
episode_length                     255.052632
returns                            -48.553521
return_std                          88.927068
average_reward                       -0.18551
round_time             0 days 00:14:03.602698
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.899399
return_std_test                     18.517893
average_reward_test                  0.929899
round_time_test        0 days 00:00:11.302012
round_time_total       0 days 00:14:03.603760
loss_total                        1381.721413
loss_critic                       1827.938262
loss_actor                        -403.146115
memory_size                        751998.986 

=== epoch 10/10 ==== round 13/50 =====================================
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                   34
episode_length                     248.941176
returns                            -46.510834
return_std                          79.546261
average_reward                      -0.190105
round_time             0 days 00:14:02.704647
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        934.17808
return_std_test                     25.595147
average_reward_test                  0.934178
round_time_test        0 days 00:00:11.316508
round_time_total       0 days 00:14:02.705722
loss_total                         1362.22461
loss_critic                       1803.503123
loss_actor                        -402.889557
memory_size                       753747.6765 

=== epoch 10/10 ==== round 14/50 =====================================
100%|██████████| 2000/2000 [13:57<00:00,  2.39it/s]
episodes                                   36
episode_length                     275.027778
returns                            -49.022883
return_std                          81.003437
average_reward                      -0.173954
round_time             0 days 00:13:58.433790
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.490453
return_std_test                     13.217354
average_reward_test                   0.90049
round_time_test        0 days 00:00:11.541605
round_time_total       0 days 00:13:58.435042
loss_total                        1382.931773
loss_critic                       1829.237078
loss_actor                        -402.289563
memory_size                       755580.6505 

=== epoch 10/10 ==== round 15/50 =====================================
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                   34
episode_length                     287.823529
returns                            -50.890133
return_std                          78.351157
average_reward                      -0.171723
round_time             0 days 00:14:00.873728
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.462783
return_std_test                     10.258952
average_reward_test                  0.912463
round_time_test        0 days 00:00:11.343213
round_time_total       0 days 00:14:00.874803
loss_total                        1386.564834
loss_critic                         1833.7065
loss_actor                        -402.001962
memory_size                       757432.8645 

=== epoch 10/10 ==== round 16/50 =====================================
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                   35
episode_length                     264.885714
returns                             -34.33873
return_std                           62.82136
average_reward                       -0.13584
round_time             0 days 00:13:59.623084
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       895.417345
return_std_test                      8.606071
average_reward_test                  0.895417
round_time_test        0 days 00:00:11.505383
round_time_total       0 days 00:13:59.624183
loss_total                        1380.587105
loss_critic                        1826.43061
loss_actor                        -402.787038
memory_size                        759195.171 

=== epoch 10/10 ==== round 17/50 =====================================
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                   28
episode_length                     305.857143
returns                            -42.828892
return_std                          80.371917
average_reward                      -0.147903
round_time             0 days 00:14:00.731228
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       876.984467
return_std_test                     12.161943
average_reward_test                  0.876984
round_time_test        0 days 00:00:11.621347
round_time_total       0 days 00:14:00.732305
loss_total                        1327.258983
loss_critic                       1760.127913
loss_actor                         -404.21686
memory_size                       760926.4915 

=== epoch 10/10 ==== round 18/50 =====================================
100%|██████████| 2000/2000 [13:57<00:00,  2.39it/s]
episodes                                   32
episode_length                       309.9375
returns                            -42.153194
return_std                          81.026794
average_reward                      -0.135726
round_time             0 days 00:13:58.239569
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.065606
return_std_test                     10.522093
average_reward_test                  0.913066
round_time_test        0 days 00:00:11.370960
round_time_total       0 days 00:13:58.240639
loss_total                        1408.610516
loss_critic                       1861.874291
loss_actor                        -404.444717
memory_size                       762777.6575 

=== epoch 10/10 ==== round 19/50 =====================================
100%|██████████| 2000/2000 [13:55<00:00,  2.39it/s]
episodes                                   36
episode_length                     273.277778
returns                            -33.951084
return_std                          69.840119
average_reward                      -0.127702
round_time             0 days 00:13:56.141122
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       895.523663
return_std_test                      6.813714
average_reward_test                  0.895524
round_time_test        0 days 00:00:11.542909
round_time_total       0 days 00:13:56.142196
loss_total                        1366.958443
loss_critic                       1809.786787
loss_actor                        -404.355061
memory_size                        764590.097 

=== epoch 10/10 ==== round 20/50 =====================================
100%|██████████| 2000/2000 [13:59<00:00,  2.38it/s]
episodes                                   41
episode_length                     231.878049
returns                            -34.957569
return_std                          66.377954
average_reward                      -0.139591
round_time             0 days 00:13:59.733424
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       907.666014
return_std_test                     16.457156
average_reward_test                  0.907666
round_time_test        0 days 00:00:11.585539
round_time_total       0 days 00:13:59.734506
loss_total                        1356.506781
loss_critic                       1796.660625
loss_actor                        -404.108728
memory_size                       766138.3485 

=== epoch 10/10 ==== round 21/50 =====================================
100%|██████████| 2000/2000 [14:00<00:00,  2.38it/s]
episodes                                   36
episode_length                          275.0
returns                            -50.801016
return_std                          82.488252
average_reward                       -0.17669
round_time             0 days 00:14:01.444693
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       897.704738
return_std_test                     14.948057
average_reward_test                  0.897705
round_time_test        0 days 00:00:11.579553
round_time_total       0 days 00:14:01.445768
loss_total                        1352.540065
loss_critic                       1791.869559
loss_actor                        -404.778028
memory_size                       767961.9835 

=== epoch 10/10 ==== round 22/50 =====================================
100%|██████████| 2000/2000 [14:03<00:00,  2.37it/s]
episodes                                   44
episode_length                     218.227273
returns                            -37.523288
return_std                          74.212455
average_reward                      -0.172743
round_time             0 days 00:14:03.679503
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       903.419306
return_std_test                     15.283502
average_reward_test                  0.903419
round_time_test        0 days 00:00:11.458336
round_time_total       0 days 00:14:03.680566
loss_total                        1353.101609
loss_critic                       1792.682147
loss_actor                         -405.22066
memory_size                        769739.568 

=== epoch 10/10 ==== round 23/50 =====================================
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                   41
episode_length                     219.414634
returns                            -45.358309
return_std                          87.341107
average_reward                      -0.196246
round_time             0 days 00:14:04.616587
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       932.041999
return_std_test                     14.354114
average_reward_test                  0.932042
round_time_test        0 days 00:00:11.455956
round_time_total       0 days 00:14:04.617664
loss_total                         1353.64931
loss_critic                       1793.518109
loss_actor                        -405.826018
memory_size                       771487.4645 

=== epoch 10/10 ==== round 24/50 =====================================
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                   37
episode_length                     256.486486
returns                             -54.00721
return_std                          93.606889
average_reward                      -0.209745
round_time             0 days 00:14:04.770495
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       923.183013
return_std_test                     14.629368
average_reward_test                  0.923183
round_time_test        0 days 00:00:11.213219
round_time_total       0 days 00:14:04.771795
loss_total                        1361.261118
loss_critic                        1803.09008
loss_actor                        -406.054849
memory_size                        773344.762 

=== epoch 10/10 ==== round 25/50 =====================================
100%|██████████| 2000/2000 [14:05<00:00,  2.37it/s]
episodes                                   41
episode_length                     220.439024
returns                            -42.342129
return_std                          88.544714
average_reward                      -0.199821
round_time             0 days 00:14:06.125531
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        913.47642
return_std_test                     10.442856
average_reward_test                  0.913476
round_time_test        0 days 00:00:11.653942
round_time_total       0 days 00:14:06.126804
loss_total                        1363.628013
loss_critic                       1806.010919
loss_actor                        -405.903738
memory_size                       774966.3295 

=== epoch 10/10 ==== round 26/50 =====================================
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                   40
episode_length                          221.1
returns                             -42.61383
return_std                          89.950505
average_reward                      -0.192558
round_time             0 days 00:14:03.249004
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.009326
return_std_test                     10.906048
average_reward_test                  0.915009
round_time_test        0 days 00:00:11.489371
round_time_total       0 days 00:14:03.250209
loss_total                        1374.977781
loss_critic                       1820.128891
loss_actor                        -405.626781
memory_size                        776687.241 

=== epoch 10/10 ==== round 27/50 =====================================
100%|██████████| 2000/2000 [14:02<00:00,  2.37it/s]
episodes                                   32
episode_length                        259.625
returns                            -43.524304
return_std                          84.270669
average_reward                      -0.184315
round_time             0 days 00:14:03.142261
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       897.936174
return_std_test                     16.546495
average_reward_test                  0.897936
round_time_test        0 days 00:00:11.610789
round_time_total       0 days 00:14:03.143345
loss_total                        1345.332477
loss_critic                       1783.061695
loss_actor                        -405.584528
memory_size                        778548.266 

=== epoch 10/10 ==== round 28/50 =====================================
100%|██████████| 2000/2000 [14:04<00:00,  2.37it/s]
episodes                                   43
episode_length                     206.627907
returns                            -32.685328
return_std                           73.38431
average_reward                      -0.158001
round_time             0 days 00:14:04.924613
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       909.122911
return_std_test                      8.168083
average_reward_test                  0.909123
round_time_test        0 days 00:00:11.547662
round_time_total       0 days 00:14:04.925679
loss_total                        1387.496073
loss_critic                       1835.677692
loss_actor                        -405.230537
memory_size                         780274.23 

=== epoch 10/10 ==== round 29/50 =====================================
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   48
episode_length                         202.75
returns                            -30.928316
return_std                          74.391046
average_reward                      -0.152779
round_time             0 days 00:14:07.927399
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       913.070614
return_std_test                     16.184328
average_reward_test                  0.913071
round_time_test        0 days 00:00:11.577464
round_time_total       0 days 00:14:07.928598
loss_total                        1370.959444
loss_critic                       1814.918604
loss_actor                        -404.877326
memory_size                        781887.623 

=== epoch 10/10 ==== round 30/50 =====================================
100%|██████████| 2000/2000 [13:42<00:00,  2.43it/s]
episodes                                   36
episode_length                          236.5
returns                            -35.246492
return_std                          74.175589
average_reward                      -0.166101
round_time             0 days 00:13:43.121197
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       897.160524
return_std_test                      9.556088
average_reward_test                  0.897161
round_time_test        0 days 00:00:11.418738
round_time_total       0 days 00:13:43.122276
loss_total                        1364.807263
loss_critic                       1807.431114
loss_actor                        -405.688283
memory_size                        783641.641 

=== epoch 10/10 ==== round 31/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:24,  2.31it/s]
100%|██████████| 2000/2000 [13:21<00:00,  2.49it/s]
episodes                                   42
episode_length                     235.952381
returns                            -37.717485
return_std                           82.85184
average_reward                      -0.160494
round_time             0 days 00:13:22.086921
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       923.069107
return_std_test                     12.076734
average_reward_test                  0.923069
round_time_test        0 days 00:00:10.862029
round_time_total       0 days 00:13:22.087992
loss_total                        1378.934532
loss_critic                       1824.943195
loss_actor                         -405.10025
memory_size                        785416.555 

=== epoch 10/10 ==== round 32/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:16,  2.33it/s]
100%|██████████| 2000/2000 [13:24<00:00,  2.48it/s]
episodes                                   49
episode_length                     191.306122
returns                            -28.874384
return_std                          69.415711
average_reward                      -0.151951
round_time             0 days 00:13:25.333041
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        914.75178
return_std_test                      13.39208
average_reward_test                  0.914752
round_time_test        0 days 00:00:11.113704
round_time_total       0 days 00:13:25.334101
loss_total                        1357.865784
loss_critic                        1798.70049
loss_actor                        -405.473166
memory_size                        787100.092 

=== epoch 10/10 ==== round 33/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:03,  2.37it/s]
100%|██████████| 2000/2000 [13:25<00:00,  2.48it/s]
episodes                                   39
episode_length                     230.230769
returns                              -35.6782
return_std                          79.205744
average_reward                      -0.157668
round_time             0 days 00:13:26.250220
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       889.711758
return_std_test                      8.956444
average_reward_test                  0.889712
round_time_test        0 days 00:00:11.314893
round_time_total       0 days 00:13:26.251292
loss_total                         1389.29174
loss_critic                       1837.883023
loss_actor                        -405.073516
memory_size                       788840.7415 

=== epoch 10/10 ==== round 34/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:07,  2.36it/s]
100%|██████████| 2000/2000 [13:29<00:00,  2.47it/s]
episodes                                   45
episode_length                     202.977778
returns                            -24.946209
return_std                          69.911854
average_reward                      -0.126155
round_time             0 days 00:13:29.698940
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       920.617038
return_std_test                     11.191972
average_reward_test                  0.920617
round_time_test        0 days 00:00:11.058106
round_time_total       0 days 00:13:29.700018
loss_total                        1374.580287
loss_critic                       1819.720163
loss_actor                        -405.979346
memory_size                       790475.9865 

=== epoch 10/10 ==== round 35/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:00,  2.38it/s]
100%|██████████| 2000/2000 [13:26<00:00,  2.48it/s]
episodes                                   49
episode_length                     194.959184
returns                            -20.357164
return_std                          61.594947
average_reward                      -0.114201
round_time             0 days 00:13:27.096489
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.261082
return_std_test                      13.90469
average_reward_test                  0.900261
round_time_test        0 days 00:00:10.973730
round_time_total       0 days 00:13:27.097592
loss_total                        1391.546015
loss_critic                       1840.983515
loss_actor                         -406.20411
memory_size                       792092.6865 

=== epoch 10/10 ==== round 36/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]
100%|██████████| 2000/2000 [13:25<00:00,  2.48it/s]
episodes                                   52
episode_length                     186.461538
returns                            -17.043868
return_std                          61.691907
average_reward                      -0.094386
round_time             0 days 00:13:26.431984
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       917.913187
return_std_test                     18.016897
average_reward_test                  0.917913
round_time_test        0 days 00:00:11.211636
round_time_total       0 days 00:13:26.433052
loss_total                        1374.867961
loss_critic                       1820.196682
loss_actor                         -406.44706
memory_size                         793783.64 

=== epoch 10/10 ==== round 37/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]
100%|██████████| 2000/2000 [14:07<00:00,  2.36it/s]
episodes                                   48
episode_length                     189.020833
returns                            -16.510154
return_std                          65.478363
average_reward                      -0.093606
round_time             0 days 00:14:07.887000
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       900.561086
return_std_test                     18.975098
average_reward_test                  0.900561
round_time_test        0 days 00:00:11.222303
round_time_total       0 days 00:14:07.888205
loss_total                        1353.564792
loss_critic                       1793.698897
loss_actor                        -406.971758
memory_size                        795472.572 

=== epoch 10/10 ==== round 38/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:07,  2.36it/s]
100%|██████████| 2000/2000 [13:27<00:00,  2.48it/s]
episodes                                   50
episode_length                         196.48
returns                            -21.444173
return_std                           71.07381
average_reward                      -0.111272
round_time             0 days 00:13:28.443308
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        904.92224
return_std_test                     12.694233
average_reward_test                  0.904922
round_time_test        0 days 00:00:11.146630
round_time_total       0 days 00:13:28.444366
loss_total                        1381.965044
loss_critic                       1829.276419
loss_actor                        -407.280593
memory_size                       797293.0445 

=== epoch 10/10 ==== round 39/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:33,  2.29it/s]
100%|██████████| 2000/2000 [13:32<00:00,  2.46it/s]
episodes                                   43
episode_length                     229.581395
returns                            -30.879875
return_std                          83.702147
average_reward                      -0.132272
round_time             0 days 00:13:32.643509
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       901.076158
return_std_test                     10.522446
average_reward_test                  0.901076
round_time_test        0 days 00:00:11.149479
round_time_total       0 days 00:13:32.644574
loss_total                        1448.151609
loss_critic                       1911.947761
loss_actor                        -407.033145
memory_size                       799052.3185 

=== epoch 10/10 ==== round 40/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:00,  2.38it/s]
100%|██████████| 2000/2000 [13:31<00:00,  2.46it/s]
episodes                                   44
episode_length                     222.772727
returns                            -27.702868
return_std                          79.641315
average_reward                      -0.116791
round_time             0 days 00:13:32.377105
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       916.325046
return_std_test                      8.799286
average_reward_test                  0.916325
round_time_test        0 days 00:00:10.907299
round_time_total       0 days 00:13:32.378168
loss_total                        1398.349881
loss_critic                       1849.651882
loss_actor                        -406.858257
memory_size                       800723.7885 

=== epoch 10/10 ==== round 41/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:41,  2.12it/s]
100%|██████████| 2000/2000 [13:26<00:00,  2.48it/s]
episodes                                   38
episode_length                          251.0
returns                            -35.234544
return_std                          80.866445
average_reward                      -0.136628
round_time             0 days 00:13:26.620854
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       926.599131
return_std_test                      6.098339
average_reward_test                  0.926599
round_time_test        0 days 00:00:11.185718
round_time_total       0 days 00:13:26.621921
loss_total                        1388.572307
loss_critic                       1837.579639
loss_actor                        -407.457148
memory_size                        802443.766 

=== epoch 10/10 ==== round 42/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:12,  2.34it/s]
100%|██████████| 2000/2000 [13:35<00:00,  2.45it/s]
episodes                                   35
episode_length                     249.114286
returns                            -33.288131
return_std                          76.971036
average_reward                      -0.142246
round_time             0 days 00:13:36.183842
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       924.441792
return_std_test                     15.999407
average_reward_test                  0.924442
round_time_test        0 days 00:00:11.065368
round_time_total       0 days 00:13:36.184917
loss_total                        1381.547499
loss_critic                       1828.743702
loss_actor                        -407.237432
memory_size                       804305.0245 

=== epoch 10/10 ==== round 43/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:10,  2.35it/s]
100%|██████████| 2000/2000 [13:26<00:00,  2.48it/s]
episodes                                   37
episode_length                     257.621622
returns                            -30.920987
return_std                          74.480276
average_reward                      -0.117788
round_time             0 days 00:13:26.925922
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       929.490887
return_std_test                     12.863593
average_reward_test                  0.929491
round_time_test        0 days 00:00:11.235660
round_time_total       0 days 00:13:26.926992
loss_total                        1364.998527
loss_critic                       1808.077836
loss_actor                        -407.318838
memory_size                       806115.9325 

=== epoch 10/10 ==== round 44/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:18,  2.33it/s]
100%|██████████| 2000/2000 [13:28<00:00,  2.47it/s]
episodes                                   36
episode_length                     276.333333
returns                            -32.356988
return_std                          66.199015
average_reward                      -0.115155
round_time             0 days 00:13:29.011237
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       917.596502
return_std_test                      8.814013
average_reward_test                  0.917597
round_time_test        0 days 00:00:11.160867
round_time_total       0 days 00:13:29.012305
loss_total                        1405.915572
loss_critic                       1859.410483
loss_actor                        -408.064203
memory_size                       807834.2235 

=== epoch 10/10 ==== round 45/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:02,  2.37it/s]
100%|██████████| 2000/2000 [13:28<00:00,  2.47it/s]
episodes                                   29
episode_length                      319.62069
returns                            -44.069244
return_std                          77.824862
average_reward                      -0.138157
round_time             0 days 00:13:29.419886
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       915.547547
return_std_test                     10.466007
average_reward_test                  0.915548
round_time_test        0 days 00:00:10.991379
round_time_total       0 days 00:13:29.420953
loss_total                        1386.536816
loss_critic                       1835.043186
loss_actor                        -407.488797
memory_size                        809631.982 

=== epoch 10/10 ==== round 46/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:51,  2.40it/s]
100%|██████████| 2000/2000 [13:29<00:00,  2.47it/s]
episodes                                   28
episode_length                     351.214286
returns                            -49.816275
return_std                          86.809866
average_reward                      -0.138587
round_time             0 days 00:13:29.986672
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       912.015643
return_std_test                      8.287019
average_reward_test                  0.912016
round_time_test        0 days 00:00:11.143442
round_time_total       0 days 00:13:29.987792
loss_total                        1364.493171
loss_critic                       1807.601744
loss_actor                        -407.941247
memory_size                       811547.6435 

=== epoch 10/10 ==== round 47/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:35,  2.28it/s]
100%|██████████| 2000/2000 [13:36<00:00,  2.45it/s]
episodes                                   41
episode_length                     234.146341
returns                            -27.457883
return_std                          68.644315
average_reward                      -0.122163
round_time             0 days 00:13:36.742637
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       924.262203
return_std_test                     10.562734
average_reward_test                  0.924262
round_time_test        0 days 00:00:11.025730
round_time_total       0 days 00:13:36.743749
loss_total                        1379.876742
loss_critic                       1826.860786
loss_actor                        -408.059563
memory_size                        813157.111 

=== epoch 10/10 ==== round 48/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:19,  2.32it/s]
100%|██████████| 2000/2000 [13:32<00:00,  2.46it/s]
episodes                                   38
episode_length                     258.526316
returns                            -30.304451
return_std                          67.467071
average_reward                      -0.114339
round_time             0 days 00:13:32.921731
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       919.594295
return_std_test                      5.634491
average_reward_test                  0.919594
round_time_test        0 days 00:00:10.998562
round_time_total       0 days 00:13:32.922796
loss_total                        1404.445326
loss_critic                        1857.60509
loss_actor                        -408.193855
memory_size                        814800.562 

=== epoch 10/10 ==== round 49/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<15:14,  2.18it/s]
100%|██████████| 2000/2000 [13:30<00:00,  2.47it/s]
episodes                                   41
episode_length                     235.341463
returns                            -31.074904
return_std                          66.644805
average_reward                      -0.128154
round_time             0 days 00:13:31.033239
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        942.09293
return_std_test                      14.63504
average_reward_test                  0.942093
round_time_test        0 days 00:00:10.958979
round_time_total       0 days 00:13:31.034311
loss_total                        1375.437663
loss_critic                       1821.552332
loss_actor                        -409.021132
memory_size                       816607.1915 

=== epoch 10/10 ==== round 50/50 =====================================
<GE432_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<13:55,  2.39it/s]
100%|██████████| 2000/2000 [13:30<00:00,  2.47it/s]
episodes                                   43
episode_length                     205.139535
returns                            -25.300342
return_std                          59.988174
average_reward                       -0.13849
round_time             0 days 00:13:30.588841
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       937.176192
return_std_test                       7.94409
average_reward_test                  0.937176
round_time_test        0 days 00:00:11.039557
round_time_total       0 days 00:13:30.589907
loss_total                        1352.716767
loss_critic                       1793.214823
loss_actor                         -409.27559
memory_size                       818245.5755 


