/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
=== specification ====================================================
+: rlrd.training:Training
epochs: 10
rounds: 50
steps: 2000
stats_window: 10000
seed: 0
tag: ''
Env:
   +: rlrd.envs:RandomDelayEnv
   seed_val: 0
   id: Ant-v4
   frame_skip: 0
   min_observation_delay: 0
   sup_observation_delay: 1
   min_action_delay: 0
   sup_action_delay: 1
   real_world_sampler: 3
   action_noise: 0.05
Test:
   +: rlrd.testing:Test
   workers: 1
   number: 1
   device: cpu
Agent:
   +: rlrd.dcac:Agent
   batchsize: 128
   memory_size: 1000000
   lr: 0.0003
   discount: 0.99
   target_update: 0.005
   reward_scale: 5.0
   entropy_scale: 1.0
   start_training: 10000
   device: cpu
   training_steps: 1.0
   loss_alpha: 0.2
   rtac: false
   Model:
      +: rlrd.dcac_models:Mlp
      hidden_units: 256
      num_critics: 2
      act_delay: true
      obs_delay: true
   OutputNorm:
      +: rlrd.nn:PopArt
      beta: 0.0003
      zero_debias: true
      start_pop: 8
__format_version__: '3'
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>

<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 1/10 ===== round 1/50 ======================================
100%|██████████| 2000/2000 [00:02<00:00, 986.55it/s]
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                    6
episode_length                          196.0
returns                           -129.779351
return_std                         237.669386
average_reward                      -0.668597
round_time             0 days 00:00:02.037611
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        936.39992
return_std_test                     14.224904
average_reward_test                    0.9364
round_time_test        0 days 00:00:10.848673
round_time_total       0 days 00:00:12.584652 

=== epoch 1/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
 79%|███████▉  | 1575/2000 [00:01<00:00, 971.33it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 973.71it/s]
episodes                                   21
episode_length                     189.904762
returns                           -114.573266
return_std                         215.070182
average_reward                       -0.60305
round_time             0 days 00:00:02.570986
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       949.967506
return_std_test                     11.871715
average_reward_test                  0.949968
round_time_test        0 days 00:00:10.932900
round_time_total       0 days 00:00:12.710521 

=== epoch 1/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
 79%|███████▉  | 1587/2000 [00:01<00:00, 861.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 941.96it/s]
episodes                                   25
episode_length                         211.04
returns                           -130.915197
return_std                         222.172078
average_reward                      -0.618617
round_time             0 days 00:00:02.650409
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       937.873818
return_std_test                       6.68141
average_reward_test                  0.937874
round_time_test        0 days 00:00:10.962550
round_time_total       0 days 00:00:12.796108 

=== epoch 1/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
 80%|███████▉  | 1593/2000 [00:01<00:00, 968.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 975.81it/s]
episodes                                   40
episode_length                        176.175
returns                           -108.555405
return_std                         198.644904
average_reward                      -0.612714
round_time             0 days 00:00:02.644094
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       941.009679
return_std_test                     10.396684
average_reward_test                   0.94101
round_time_test        0 days 00:00:10.901101
round_time_total       0 days 00:00:12.698399 

=== epoch 1/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
 80%|███████▉  | 1596/2000 [00:01<00:00, 891.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [00:02<00:00, 952.68it/s]
episodes                                   56
episode_length                     161.232143
returns                           -100.555489
return_std                         183.440452
average_reward                      -0.621428
round_time             0 days 00:00:02.627303
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        932.84982
return_std_test                     14.108907
average_reward_test                   0.93285
round_time_test        0 days 00:00:10.975848
round_time_total       0 days 00:00:12.820127 

=== epoch 1/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 0/2000 [00:00<?, ?it/s]/<ANONYMIZED PATH>/rmst-rlrd/rlrd/nn.py:41: UserWarning: TypedStorage is deprecated. It will be removed in the future and UntypedStorage will be the only storage class. This should only matter to you if you are using storages directly.  To access UntypedStorage directly, use tensor.untyped_storage() instead of tensor.storage()
  assert b.storage().data_ptr() == a.storage().data_ptr()
  0%|          | 4/2000 [00:01<12:34,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:45<00:00,  3.10it/s]
starting training
episodes                                   59
episode_length                     159.355932
returns                            -93.702971
return_std                         166.610656
average_reward                      -0.581826
round_time             0 days 00:10:45.814090
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       937.439018
return_std_test                      9.153366
average_reward_test                  0.937439
round_time_test        0 days 00:00:10.959688
round_time_total       0 days 00:10:45.815535
loss_total                         489.817228
loss_critic                        629.103989
loss_actor                         -67.329861
memory_size                         9519.4625 

=== epoch 1/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:52<00:00,  3.06it/s]
episodes                                   53
episode_length                     185.320755
returns                           -115.022372
return_std                         199.135927
average_reward                      -0.626755
round_time             0 days 00:10:53.101394
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       375.517767
return_std_test                      29.39581
average_reward_test                  0.375518
round_time_test        0 days 00:00:11.145011
round_time_total       0 days 00:10:53.102507
loss_total                         469.645432
loss_critic                        621.299637
loss_actor                        -136.971433
memory_size                        11420.9985 

=== epoch 1/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.05it/s]
episodes                                   58
episode_length                     156.172414
returns                            -100.59344
return_std                         191.576817
average_reward                      -0.646832
round_time             0 days 00:10:55.237380
episodes_test                            18.0
episode_length_test                501.333333
returns_test                       110.735651
return_std_test                     85.501242
average_reward_test                  0.226395
round_time_test        0 days 00:00:10.901976
round_time_total       0 days 00:10:55.238470
loss_total                         381.737313
loss_critic                        520.626568
loss_actor                         -173.81974
memory_size                        13153.4555 

=== epoch 1/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                   54
episode_length                     182.981481
returns                           -125.988517
return_std                         225.799193
average_reward                       -0.68562
round_time             0 days 00:10:54.862857
episodes_test                            12.0
episode_length_test                    832.25
returns_test                       223.629274
return_std_test                    106.123561
average_reward_test                  0.269664
round_time_test        0 days 00:00:11.121734
round_time_total       0 days 00:10:54.864043
loss_total                         555.164417
loss_critic                        742.660506
loss_actor                        -194.819987
memory_size                          14941.16 

=== epoch 1/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   49
episode_length                     195.102041
returns                           -142.449486
return_std                         246.330748
average_reward                      -0.736347
round_time             0 days 00:10:57.593685
episodes_test                            12.0
episode_length_test                769.833333
returns_test                       127.197651
return_std_test                     59.576425
average_reward_test                  0.165746
round_time_test        0 days 00:00:10.759584
round_time_total       0 days 00:10:57.594828
loss_total                        1231.940105
loss_critic                       1593.594593
loss_actor                        -214.677938
memory_size                        16584.4765 

=== epoch 1/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   49
episode_length                     191.265306
returns                           -161.216516
return_std                         285.078033
average_reward                      -0.836909
round_time             0 days 00:10:57.621377
episodes_test                            25.0
episode_length_test                    399.68
returns_test                       -52.983802
return_std_test                      55.15918
average_reward_test                  -0.13318
round_time_test        0 days 00:00:10.910159
round_time_total       0 days 00:10:57.622464
loss_total                        3335.028307
loss_critic                       4232.632949
loss_actor                        -255.390547
memory_size                        18471.5765 

=== epoch 1/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   56
episode_length                     162.142857
returns                           -152.890462
return_std                         265.318349
average_reward                      -0.932851
round_time             0 days 00:11:00.619749
episodes_test                            41.0
episode_length_test                230.414634
returns_test                      -173.515702
return_std_test                    274.097548
average_reward_test                 -0.753973
round_time_test        0 days 00:00:10.671575
round_time_total       0 days 00:11:00.621019
loss_total                       26858.865342
loss_critic                       33681.50758
loss_actor                        -431.705864
memory_size                        20092.3005 

=== epoch 1/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:25,  2.48it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   69
episode_length                     130.173913
returns                           -137.302964
return_std                         234.099254
average_reward                      -1.085447
round_time             0 days 00:11:05.558598
episodes_test                            59.0
episode_length_test                167.847458
returns_test                      -237.072618
return_std_test                    426.357175
average_reward_test                 -1.407534
round_time_test        0 days 00:00:10.853193
round_time_total       0 days 00:11:05.559671
loss_total                      150354.093882
loss_critic                     188227.001047
loss_actor                       -1137.548196
memory_size                         21596.168 

=== epoch 1/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:47,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   80
episode_length                          124.5
returns                           -161.290331
return_std                         334.071108
average_reward                      -1.298697
round_time             0 days 00:11:03.846982
episodes_test                            62.0
episode_length_test                149.419355
returns_test                       -272.14376
return_std_test                    557.396014
average_reward_test                 -1.818554
round_time_test        0 days 00:00:10.575289
round_time_total       0 days 00:11:03.848083
loss_total                      843722.062125
loss_critic                     1055366.79375
loss_actor                       -2856.933848
memory_size                        23203.0835 

=== epoch 1/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  100
episode_length                          94.05
returns                           -143.767247
return_std                           331.7192
average_reward                      -1.496196
round_time             0 days 00:11:05.031579
episodes_test                            77.0
episode_length_test                123.909091
returns_test                      -242.203628
return_std_test                    506.881495
average_reward_test                 -1.949056
round_time_test        0 days 00:00:10.883935
round_time_total       0 days 00:11:05.032910
loss_total                     1959830.611719
loss_critic                    2451128.535031
loss_actor                       -5361.246766
memory_size                        24768.5445 

=== epoch 1/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  101
episode_length                      90.584158
returns                           -152.818864
return_std                         372.139577
average_reward                      -1.705558
round_time             0 days 00:11:05.885712
episodes_test                           155.0
episode_length_test                 60.974194
returns_test                      -120.316058
return_std_test                    318.640003
average_reward_test                 -1.967731
round_time_test        0 days 00:00:10.787682
round_time_total       0 days 00:11:05.887080
loss_total                     3837254.597312
loss_critic                    4798707.837063
loss_actor                       -8558.700193
memory_size                         26169.217 

=== epoch 1/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  108
episode_length                      85.388889
returns                            -160.48057
return_std                         398.522132
average_reward                      -1.902024
round_time             0 days 00:11:04.087708
episodes_test                           145.0
episode_length_test                 68.682759
returns_test                      -140.099661
return_std_test                    373.241614
average_reward_test                 -2.040007
round_time_test        0 days 00:00:10.738273
round_time_total       0 days 00:11:04.089024
loss_total                      8891661.78825
loss_critic                     11117677.9025
loss_actor                       -12403.42955
memory_size                        27803.5495 

=== epoch 1/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:59,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   96
episode_length                      95.947917
returns                            -194.08225
return_std                          488.39424
average_reward                       -2.02351
round_time             0 days 00:11:04.261967
episodes_test                           157.0
episode_length_test                 62.057325
returns_test                      -129.697205
return_std_test                    340.148802
average_reward_test                 -2.093064
round_time_test        0 days 00:00:10.685745
round_time_total       0 days 00:11:04.263058
loss_total                     19021474.71975
loss_critic                    23781085.10075
loss_actor                       -16968.55048
memory_size                         29475.664 

=== epoch 1/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   89
episode_length                      90.438202
returns                           -191.091694
return_std                         485.574618
average_reward                      -2.103781
round_time             0 days 00:11:04.584250
episodes_test                           125.0
episode_length_test                     76.56
returns_test                      -164.643141
return_std_test                    393.376883
average_reward_test                 -2.149617
round_time_test        0 days 00:00:10.902885
round_time_total       0 days 00:11:04.585349
loss_total                        37147293.66
loss_critic                     46439621.0485
loss_actor                      -22019.124906
memory_size                         31207.048 

=== epoch 1/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:22,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   65
episode_length                          141.0
returns                           -305.320269
return_std                         660.081728
average_reward                      -2.171748
round_time             0 days 00:11:04.119105
episodes_test                           131.0
episode_length_test                 73.961832
returns_test                      -160.596747
return_std_test                    422.739658
average_reward_test                 -2.173834
round_time_test        0 days 00:00:10.715516
round_time_total       0 days 00:11:04.120181
loss_total                       63924532.616
loss_critic                       79912717.41
loss_actor                      -28212.254595
memory_size                        32949.3365 

=== epoch 1/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   70
episode_length                     133.942857
returns                           -297.837587
return_std                         655.183882
average_reward                      -2.222658
round_time             0 days 00:11:01.801643
episodes_test                           194.0
episode_length_test                 51.252577
returns_test                      -112.578971
return_std_test                    319.493674
average_reward_test                 -2.198218
round_time_test        0 days 00:00:10.857461
round_time_total       0 days 00:11:01.802965
loss_total                       75583169.995
loss_critic                      94487360.374
loss_actor                      -33597.701143
memory_size                         34765.552 

=== epoch 1/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   63
episode_length                     139.380952
returns                           -309.489513
return_std                         693.049775
average_reward                       -2.23141
round_time             0 days 00:11:01.177972
episodes_test                           131.0
episode_length_test                 71.938931
returns_test                      -157.837706
return_std_test                     403.30331
average_reward_test                 -2.201027
round_time_test        0 days 00:00:10.815500
round_time_total       0 days 00:11:01.179380
loss_total                       72757818.675
loss_critic                      90956547.318
loss_actor                      -37102.667408
memory_size                         36428.776 

=== epoch 1/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   74
episode_length                      123.22973
returns                           -277.230496
return_std                         655.994271
average_reward                      -2.246741
round_time             0 days 00:10:59.053428
episodes_test                           144.0
episode_length_test                 66.006944
returns_test                      -150.326368
return_std_test                    405.868056
average_reward_test                 -2.278405
round_time_test        0 days 00:00:10.744312
round_time_total       0 days 00:10:59.054507
loss_total                       90880214.154
loss_critic                     113610179.568
loss_actor                       -39655.69393
memory_size                         38045.685 

=== epoch 1/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   70
episode_length                          140.9
returns                           -316.963299
return_std                         713.795372
average_reward                      -2.253299
round_time             0 days 00:10:59.445909
episodes_test                           101.0
episode_length_test                 98.029703
returns_test                      -216.047906
return_std_test                    544.544995
average_reward_test                 -2.209316
round_time_test        0 days 00:00:10.849373
round_time_total       0 days 00:10:59.447155
loss_total                      125280744.876
loss_critic                     156611417.902
loss_actor                      -41958.517457
memory_size                        39754.2195 

=== epoch 1/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   79
episode_length                     124.303797
returns                           -279.031044
return_std                         647.443774
average_reward                      -2.245914
round_time             0 days 00:11:02.828862
episodes_test                           107.0
episode_length_test                 85.766355
returns_test                      -195.242974
return_std_test                    477.545738
average_reward_test                 -2.275929
round_time_test        0 days 00:00:10.884121
round_time_total       0 days 00:11:02.829947
loss_total                      157033262.514
loss_critic                     196302401.684
loss_actor                      -43307.434025
memory_size                        41512.7405 

=== epoch 1/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:12,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  112
episode_length                      81.696429
returns                           -181.006167
return_std                         475.806208
average_reward                      -2.228772
round_time             0 days 00:11:03.250981
episodes_test                           192.0
episode_length_test                 51.979167
returns_test                      -115.583284
return_std_test                    250.655291
average_reward_test                 -2.223662
round_time_test        0 days 00:00:10.969893
round_time_total       0 days 00:11:03.252071
loss_total                      164488523.736
loss_critic                     205621522.008
loss_actor                      -43484.494248
memory_size                          42705.82 

=== epoch 1/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  107
episode_length                      84.345794
returns                           -189.003085
return_std                          485.36567
average_reward                      -2.246907
round_time             0 days 00:11:01.461986
episodes_test                           109.0
episode_length_test                 86.779817
returns_test                      -196.161013
return_std_test                     483.71061
average_reward_test                 -2.263816
round_time_test        0 days 00:00:10.901082
round_time_total       0 days 00:11:01.463105
loss_total                      164801217.332
loss_critic                     206012465.332
loss_actor                      -43788.970318
memory_size                        44076.2385 

=== epoch 1/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  106
episode_length                      84.764151
returns                           -192.355011
return_std                         491.965553
average_reward                      -2.273077
round_time             0 days 00:11:02.293409
episodes_test                           153.0
episode_length_test                 60.411765
returns_test                      -138.831221
return_std_test                    351.410453
average_reward_test                 -2.307009
round_time_test        0 days 00:00:10.668757
round_time_total       0 days 00:11:02.294516
loss_total                      139086099.264
loss_critic                     173868133.928
loss_actor                      -42051.719572
memory_size                         45761.602 

=== epoch 1/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:27,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  115
episode_length                      75.008696
returns                           -169.965989
return_std                         461.779623
average_reward                      -2.265852
round_time             0 days 00:11:03.727522
episodes_test                           167.0
episode_length_test                 57.461078
returns_test                      -129.307932
return_std_test                    350.625265
average_reward_test                 -2.246909
round_time_test        0 days 00:00:10.848003
round_time_total       0 days 00:11:03.728622
loss_total                      139698154.274
loss_critic                     174632641.608
loss_actor                      -39807.139096
memory_size                        47338.9465 

=== epoch 1/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  130
episode_length                      76.623077
returns                           -174.514857
return_std                         475.520795
average_reward                      -2.279631
round_time             0 days 00:11:01.530604
episodes_test                           157.0
episode_length_test                 60.452229
returns_test                      -138.720595
return_std_test                    368.044896
average_reward_test                 -2.301096
round_time_test        0 days 00:00:10.719787
round_time_total       0 days 00:11:01.531720
loss_total                       142169123.11
loss_critic                     177720611.472
loss_actor                      -36843.262281
memory_size                        48735.5695 

=== epoch 1/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:54,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  101
episode_length                      83.207921
returns                           -190.843812
return_std                         507.513228
average_reward                      -2.302938
round_time             0 days 00:11:01.567603
episodes_test                           151.0
episode_length_test                 65.781457
returns_test                      -149.679628
return_std_test                    406.082381
average_reward_test                 -2.277417
round_time_test        0 days 00:00:11.001935
round_time_total       0 days 00:11:01.568683
loss_total                      122295445.774
loss_critic                     152877665.664
loss_actor                      -33442.937686
memory_size                         50253.177 

=== epoch 1/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  112
episode_length                      86.669643
returns                           -199.746947
return_std                         525.613353
average_reward                      -2.307409
round_time             0 days 00:11:02.157874
episodes_test                            79.0
episode_length_test                115.303797
returns_test                      -266.894615
return_std_test                    624.695228
average_reward_test                 -2.317374
round_time_test        0 days 00:00:10.758151
round_time_total       0 days 00:11:02.159134
loss_total                      112462150.394
loss_critic                     140585184.036
loss_actor                      -29993.671848
memory_size                        51933.7515 

=== epoch 1/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  107
episode_length                      83.411215
returns                           -192.739627
return_std                         504.455836
average_reward                      -2.309783
round_time             0 days 00:11:03.265828
episodes_test                           105.0
episode_length_test                 94.866667
returns_test                      -215.869119
return_std_test                    526.973994
average_reward_test                 -2.275933
round_time_test        0 days 00:00:10.835055
round_time_total       0 days 00:11:03.266934
loss_total                      111331555.144
loss_critic                     139171042.956
loss_actor                      -26405.785591
memory_size                         53413.534 

=== epoch 1/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                  126
episode_length                      77.793651
returns                           -176.800845
return_std                         466.302345
average_reward                      -2.274078
round_time             0 days 00:10:57.959079
episodes_test                           103.0
episode_length_test                 92.621359
returns_test                      -211.363286
return_std_test                    514.235157
average_reward_test                 -2.297424
round_time_test        0 days 00:00:10.744204
round_time_total       0 days 00:10:57.960172
loss_total                      108555011.328
loss_critic                     135699498.116
loss_actor                      -22945.142469
memory_size                         54964.365 

=== epoch 1/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:16,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  108
episode_length                      86.601852
returns                           -197.431533
return_std                         499.393591
average_reward                      -2.282043
round_time             0 days 00:11:01.022379
episodes_test                           114.0
episode_length_test                 83.359649
returns_test                      -186.227129
return_std_test                    463.914421
average_reward_test                 -2.239165
round_time_test        0 days 00:00:10.751418
round_time_total       0 days 00:11:01.023652
loss_total                       107127706.58
loss_critic                      133914631.26
loss_actor                      -20001.174073
memory_size                         56248.923 

=== epoch 1/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                  101
episode_length                      93.316832
returns                           -212.892219
return_std                         513.617615
average_reward                      -2.290676
round_time             0 days 00:11:01.293638
episodes_test                           157.0
episode_length_test                 63.681529
returns_test                      -142.316406
return_std_test                     331.68231
average_reward_test                 -2.234651
round_time_test        0 days 00:00:10.448392
round_time_total       0 days 00:11:01.294872
loss_total                      106770464.718
loss_critic                     133467324.672
loss_actor                      -16984.035275
memory_size                        58059.0205 

=== epoch 1/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   94
episode_length                      94.734043
returns                            -213.54838
return_std                         526.414487
average_reward                      -2.261413
round_time             0 days 00:11:02.356746
episodes_test                           101.0
episode_length_test                 95.475248
returns_test                      -217.386109
return_std_test                    493.831332
average_reward_test                   -2.2801
round_time_test        0 days 00:00:10.715985
round_time_total       0 days 00:11:02.357954
loss_total                      112737640.624
loss_critic                     140925570.904
loss_actor                      -14089.516941
memory_size                        59717.6985 

=== epoch 1/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:55,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   87
episode_length                      99.666667
returns                           -222.937233
return_std                         538.627782
average_reward                      -2.238049
round_time             0 days 00:11:03.599643
episodes_test                           122.0
episode_length_test                 81.909836
returns_test                      -182.714971
return_std_test                     485.58192
average_reward_test                  -2.22989
round_time_test        0 days 00:00:10.823258
round_time_total       0 days 00:11:03.600797
loss_total                      116943924.782
loss_critic                     146182843.416
loss_actor                      -11759.630898
memory_size                        61496.3425 

=== epoch 1/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   69
episode_length                     143.246377
returns                           -325.099651
return_std                         685.378517
average_reward                      -2.267725
round_time             0 days 00:11:05.533510
episodes_test                           114.0
episode_length_test                 87.701754
returns_test                      -196.715251
return_std_test                    477.324056
average_reward_test                 -2.242434
round_time_test        0 days 00:00:10.644873
round_time_total       0 days 00:11:05.534622
loss_total                      127853488.408
loss_critic                     159819262.832
loss_actor                       -9620.748435
memory_size                        63130.6625 

=== epoch 1/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   86
episode_length                     110.895349
returns                           -249.518712
return_std                         582.809576
average_reward                      -2.245543
round_time             0 days 00:11:04.121738
episodes_test                           157.0
episode_length_test                 63.509554
returns_test                      -141.313167
return_std_test                    397.244791
average_reward_test                 -2.227036
round_time_test        0 days 00:00:10.641928
round_time_total       0 days 00:11:04.122834
loss_total                      135086919.798
loss_critic                      168860652.02
loss_actor                       -8019.746415
memory_size                         64667.754 

=== epoch 1/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   93
episode_length                       96.11828
returns                           -212.153727
return_std                          540.79662
average_reward                      -2.215278
round_time             0 days 00:11:05.677424
episodes_test                            67.0
episode_length_test                140.373134
returns_test                       -320.99038
return_std_test                    684.927558
average_reward_test                 -2.287217
round_time_test        0 days 00:00:10.978606
round_time_total       0 days 00:11:05.678554
loss_total                      137233728.016
loss_critic                     171543862.832
loss_actor                        -6823.01454
memory_size                         65994.023 

=== epoch 1/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:31,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  101
episode_length                      94.376238
returns                           -207.057806
return_std                         520.098631
average_reward                      -2.200146
round_time             0 days 00:11:05.034660
episodes_test                           149.0
episode_length_test                  67.09396
returns_test                      -149.770835
return_std_test                    377.938603
average_reward_test                 -2.231825
round_time_test        0 days 00:00:10.758824
round_time_total       0 days 00:11:05.035754
loss_total                      113719706.196
loss_critic                     142150892.124
loss_actor                        -5047.32458
memory_size                        67657.8845 

=== epoch 1/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  105
episode_length                       84.07619
returns                           -184.593038
return_std                         473.731454
average_reward                      -2.205329
round_time             0 days 00:11:05.746680
episodes_test                           133.0
episode_length_test                 69.030075
returns_test                      -155.438382
return_std_test                    400.208928
average_reward_test                 -2.257335
round_time_test        0 days 00:00:10.862181
round_time_total       0 days 00:11:05.747784
loss_total                      105348417.082
loss_critic                     131686369.934
loss_actor                       -3402.044951
memory_size                        69236.4245 

=== epoch 1/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  121
episode_length                      77.677686
returns                           -169.597175
return_std                          440.11006
average_reward                      -2.185927
round_time             0 days 00:11:05.067227
episodes_test                           138.0
episode_length_test                  72.23913
returns_test                      -160.167283
return_std_test                     408.40483
average_reward_test                 -2.216915
round_time_test        0 days 00:00:10.782551
round_time_total       0 days 00:11:05.068327
loss_total                       105013023.43
loss_critic                      131266745.72
loss_actor                       -1875.223606
memory_size                        70635.5275 

=== epoch 1/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  138
episode_length                       71.23913
returns                           -154.970181
return_std                         414.431973
average_reward                      -2.178443
round_time             0 days 00:11:05.979460
episodes_test                           144.0
episode_length_test                 67.319444
returns_test                      -148.865672
return_std_test                    409.295491
average_reward_test                 -2.220921
round_time_test        0 days 00:00:10.879002
round_time_total       0 days 00:11:05.980716
loss_total                        100283133.0
loss_critic                     125354085.456
loss_actor                        -686.210518
memory_size                         71889.594 

=== epoch 1/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:06,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  126
episode_length                      74.753968
returns                            -163.66681
return_std                         434.648702
average_reward                      -2.191667
round_time             0 days 00:11:05.727836
episodes_test                           142.0
episode_length_test                 70.232394
returns_test                      -157.136746
return_std_test                    412.471634
average_reward_test                 -2.240511
round_time_test        0 days 00:00:10.831066
round_time_total       0 days 00:11:05.728922
loss_total                       90674754.136
loss_critic                     113343307.804
loss_actor                         531.661883
memory_size                         73287.966 

=== epoch 1/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:16,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  147
episode_length                      63.394558
returns                           -139.434112
return_std                          382.20078
average_reward                      -2.211581
round_time             0 days 00:11:02.138996
episodes_test                           162.0
episode_length_test                 61.444444
returns_test                      -134.047597
return_std_test                    342.506619
average_reward_test                 -2.182951
round_time_test        0 days 00:00:10.623844
round_time_total       0 days 00:11:02.140108
loss_total                        89331433.66
loss_critic                     111663894.446
loss_actor                        1582.277831
memory_size                        74652.1685 

=== epoch 1/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  176
episode_length                      52.829545
returns                           -116.644978
return_std                         313.248515
average_reward                      -2.210492
round_time             0 days 00:11:03.860635
episodes_test                           190.0
episode_length_test                      52.4
returns_test                      -116.393032
return_std_test                    284.618713
average_reward_test                 -2.220592
round_time_test        0 days 00:00:10.762153
round_time_total       0 days 00:11:03.862148
loss_total                       87383589.084
loss_critic                     109228890.982
loss_actor                        2374.159021
memory_size                        75715.9155 

=== epoch 1/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:06,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  147
episode_length                      60.870748
returns                           -136.487492
return_std                         374.241712
average_reward                      -2.246172
round_time             0 days 00:11:06.585640
episodes_test                            77.0
episode_length_test                117.584416
returns_test                      -266.268095
return_std_test                    608.598889
average_reward_test                 -2.271343
round_time_test        0 days 00:00:10.671365
round_time_total       0 days 00:11:06.586773
loss_total                       83015520.158
loss_critic                     103768469.494
loss_actor                        3715.224706
memory_size                        77363.4885 

=== epoch 1/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:01,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  110
episode_length                           79.6
returns                           -179.884662
return_std                         469.716809
average_reward                      -2.259429
round_time             0 days 00:11:05.094308
episodes_test                           157.0
episode_length_test                 63.267516
returns_test                      -138.404945
return_std_test                    357.788941
average_reward_test                 -2.191211
round_time_test        0 days 00:00:10.829336
round_time_total       0 days 00:11:05.095403
loss_total                       72752646.156
loss_critic                      90939373.486
loss_actor                        5729.834751
memory_size                        79172.0255 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 2/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                   57
episode_length                      34.964912
returns                            -74.337735
return_std                          60.131091
average_reward                      -2.125131
round_time             0 days 00:10:54.474896
episodes_test                           109.0
episode_length_test                 82.779817
returns_test                      -186.922688
return_std_test                    474.090844
average_reward_test                 -2.254615
round_time_test        0 days 00:00:10.793885
round_time_total       0 days 00:10:54.476021
loss_total                       65726256.804
loss_critic                      82155858.682
loss_actor                        7843.486611
memory_size                        80461.3565 

=== epoch 2/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                   79
episode_length                      48.189873
returns                           -106.322852
return_std                          251.56114
average_reward                       -2.21236
round_time             0 days 00:10:56.425790
episodes_test                            95.0
episode_length_test                104.694737
returns_test                      -232.694469
return_std_test                     575.67924
average_reward_test                  -2.22393
round_time_test        0 days 00:00:10.870973
round_time_total       0 days 00:10:56.426911
loss_total                       62815653.832
loss_critic                      78517285.732
loss_actor                        9121.159733
memory_size                         81598.105 

=== epoch 2/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   86
episode_length                      69.116279
returns                           -155.031205
return_std                         411.100904
average_reward                      -2.246726
round_time             0 days 00:10:57.713953
episodes_test                           136.0
episode_length_test                 71.845588
returns_test                      -162.273396
return_std_test                     432.71127
average_reward_test                 -2.260037
round_time_test        0 days 00:00:11.081127
round_time_total       0 days 00:10:57.715035
loss_total                       61566736.186
loss_critic                      76955785.828
loss_actor                       10532.516669
memory_size                          83348.57 

=== epoch 2/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                  114
episode_length                       65.04386
returns                           -148.102036
return_std                         366.859345
average_reward                      -2.282235
round_time             0 days 00:10:56.081548
episodes_test                           168.0
episode_length_test                  59.39881
returns_test                      -132.765774
return_std_test                    341.548075
average_reward_test                 -2.235311
round_time_test        0 days 00:00:10.949116
round_time_total       0 days 00:10:56.082653
loss_total                        56583228.95
loss_critic                       70726007.78
loss_actor                       12109.354661
memory_size                        84891.5835 

=== epoch 2/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:56<00:00,  3.04it/s]
episodes                                  138
episode_length                      72.297101
returns                           -165.113924
return_std                         423.533072
average_reward                      -2.284444
round_time             0 days 00:10:57.403054
episodes_test                           127.0
episode_length_test                 76.023622
returns_test                      -170.941919
return_std_test                    421.968442
average_reward_test                 -2.247655
round_time_test        0 days 00:00:10.780247
round_time_total       0 days 00:10:57.404146
loss_total                       51060097.768
loss_critic                       63821835.27
loss_actor                       13143.689828
memory_size                         86366.541 

=== epoch 2/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                  101
episode_length                      98.455446
returns                            -228.49553
return_std                         533.881317
average_reward                       -2.32121
round_time             0 days 00:10:59.059893
episodes_test                           113.0
episode_length_test                 85.575221
returns_test                       -192.55645
return_std_test                    461.194185
average_reward_test                 -2.245267
round_time_test        0 days 00:00:10.735372
round_time_total       0 days 00:10:59.061236
loss_total                       47900444.781
loss_critic                       59872073.22
loss_actor                       13926.862297
memory_size                        87995.4345 

=== epoch 2/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   85
episode_length                      97.941176
returns                           -228.837511
return_std                         536.336839
average_reward                      -2.322636
round_time             0 days 00:11:02.885366
episodes_test                           129.0
episode_length_test                 76.124031
returns_test                      -169.632024
return_std_test                    445.910882
average_reward_test                 -2.235143
round_time_test        0 days 00:00:10.886731
round_time_total       0 days 00:11:02.886457
loss_total                       45009252.122
loss_critic                      56257842.753
loss_actor                       14885.787754
memory_size                        89612.7555 

=== epoch 2/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                  122
episode_length                      78.172131
returns                            -179.25781
return_std                         450.846996
average_reward                       -2.29715
round_time             0 days 00:11:03.556118
episodes_test                           150.0
episode_length_test                     66.66
returns_test                       -147.33069
return_std_test                    416.744185
average_reward_test                 -2.210007
round_time_test        0 days 00:00:10.746370
round_time_total       0 days 00:11:03.557218
loss_total                       42886955.938
loss_critic                      53604720.368
loss_actor                       15894.708747
memory_size                           91044.4 

=== epoch 2/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   99
episode_length                      93.919192
returns                           -213.059584
return_std                         532.341695
average_reward                      -2.268732
round_time             0 days 00:11:04.191611
episodes_test                           120.0
episode_length_test                    83.225
returns_test                      -184.880448
return_std_test                    493.143109
average_reward_test                 -2.221329
round_time_test        0 days 00:00:10.720760
round_time_total       0 days 00:11:04.192900
loss_total                       40594216.012
loss_critic                      50738654.938
loss_actor                        16456.47342
memory_size                          92566.85 

=== epoch 2/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   92
episode_length                     107.304348
returns                           -242.654377
return_std                          569.07242
average_reward                      -2.259389
round_time             0 days 00:11:04.225347
episodes_test                           151.0
episode_length_test                 61.298013
returns_test                      -136.452075
return_std_test                    333.898328
average_reward_test                 -2.225813
round_time_test        0 days 00:00:10.910716
round_time_total       0 days 00:11:04.226471
loss_total                       37680402.159
loss_critic                      47096235.219
loss_actor                       17066.257348
memory_size                         94453.413 

=== epoch 2/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:25,  2.68it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  109
episode_length                      91.477064
returns                           -205.224429
return_std                         501.509331
average_reward                      -2.245027
round_time             0 days 00:11:04.853451
episodes_test                           133.0
episode_length_test                  75.18797
returns_test                      -166.659735
return_std_test                    414.734941
average_reward_test                 -2.216574
round_time_test        0 days 00:00:10.849175
round_time_total       0 days 00:11:04.854558
loss_total                       33819620.553
loss_critic                      42270075.277
loss_actor                       17798.579011
memory_size                        95871.1745 

=== epoch 2/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:47,  2.60it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  107
episode_length                      84.373832
returns                           -188.994534
return_std                         465.594368
average_reward                      -2.238331
round_time             0 days 00:11:05.902237
episodes_test                            74.0
episode_length_test                134.689189
returns_test                      -305.535557
return_std_test                    626.607974
average_reward_test                 -2.268123
round_time_test        0 days 00:00:10.869280
round_time_total       0 days 00:11:05.903319
loss_total                      30401766.6305
loss_critic                      37997669.023
loss_actor                        18154.74169
memory_size                         97306.188 

=== epoch 2/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   84
episode_length                     108.642857
returns                           -245.419848
return_std                         563.403651
average_reward                      -2.258512
round_time             0 days 00:11:05.798961
episodes_test                            62.0
episode_length_test                160.951613
returns_test                      -364.897662
return_std_test                    731.790976
average_reward_test                 -2.267581
round_time_test        0 days 00:00:10.728142
round_time_total       0 days 00:11:05.800057
loss_total                       28061933.433
loss_critic                      35072750.814
loss_actor                       18661.893855
memory_size                         98934.207 

=== epoch 2/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  103
episode_length                      84.087379
returns                           -190.884616
return_std                         474.021877
average_reward                      -2.275267
round_time             0 days 00:11:05.267970
episodes_test                           145.0
episode_length_test                  68.77931
returns_test                      -156.255782
return_std_test                    372.553785
average_reward_test                 -2.271604
round_time_test        0 days 00:00:10.795940
round_time_total       0 days 00:11:05.269073
loss_total                       26298468.482
loss_critic                      32868287.898
loss_actor                        19188.68518
memory_size                         100564.33 

=== epoch 2/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                  106
episode_length                      93.801887
returns                           -213.999772
return_std                         508.318296
average_reward                      -2.284702
round_time             0 days 00:11:02.517700
episodes_test                           125.0
episode_length_test                    76.088
returns_test                      -171.374826
return_std_test                    433.067177
average_reward_test                  -2.25221
round_time_test        0 days 00:00:10.855825
round_time_total       0 days 00:11:02.518782
loss_total                       24125326.346
loss_critic                      30151759.222
loss_actor                       19592.741458
memory_size                       102074.6165 

=== epoch 2/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:29,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   78
episode_length                     119.423077
returns                           -273.884932
return_std                         611.492662
average_reward                      -2.293533
round_time             0 days 00:11:03.475701
episodes_test                           124.0
episode_length_test                 76.782258
returns_test                      -172.803419
return_std_test                    432.137702
average_reward_test                 -2.255224
round_time_test        0 days 00:00:10.829564
round_time_total       0 days 00:11:03.476826
loss_total                      22903402.0345
loss_critic                     28624271.1045
loss_actor                       19923.792571
memory_size                        103699.364 

=== epoch 2/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  119
episode_length                      79.915966
returns                           -181.245119
return_std                         467.046809
average_reward                      -2.269633
round_time             0 days 00:11:05.486352
episodes_test                            91.0
episode_length_test                103.032967
returns_test                      -226.076089
return_std_test                    523.677053
average_reward_test                 -2.198988
round_time_test        0 days 00:00:10.761389
round_time_total       0 days 00:11:05.487490
loss_total                       21566428.373
loss_critic                     26953013.7805
loss_actor                       20084.739772
memory_size                       105155.1195 

=== epoch 2/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  120
episode_length                      72.733333
returns                           -162.479388
return_std                         422.315638
average_reward                      -2.239861
round_time             0 days 00:11:06.119509
episodes_test                            90.0
episode_length_test                100.244444
returns_test                      -217.349632
return_std_test                    531.148805
average_reward_test                 -2.173598
round_time_test        0 days 00:00:10.791943
round_time_total       0 days 00:11:06.120607
loss_total                      20275935.6755
loss_critic                     25339863.8305
loss_actor                       20221.241213
memory_size                       106464.5385 

=== epoch 2/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:56,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  116
episode_length                       82.62931
returns                           -182.781741
return_std                         467.813584
average_reward                      -2.210798
round_time             0 days 00:11:05.760419
episodes_test                            92.0
episode_length_test                100.815217
returns_test                      -222.107821
return_std_test                    510.773706
average_reward_test                 -2.201092
round_time_test        0 days 00:00:10.780401
round_time_total       0 days 00:11:05.761848
loss_total                       19357939.188
loss_critic                     24192340.5555
loss_actor                       20331.913403
memory_size                       108026.1735 

=== epoch 2/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  120
episode_length                         83.025
returns                           -181.125596
return_std                         479.364761
average_reward                      -2.180306
round_time             0 days 00:11:06.851602
episodes_test                           151.0
episode_length_test                 65.774834
returns_test                      -143.459141
return_std_test                    364.821572
average_reward_test                 -2.182662
round_time_test        0 days 00:00:10.750494
round_time_total       0 days 00:11:06.852757
loss_total                      18017988.8995
loss_critic                     22517370.8965
loss_actor                       20459.230965
memory_size                       109587.5865 

=== epoch 2/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  155
episode_length                      61.045161
returns                           -130.771887
return_std                         348.706563
average_reward                      -2.130535
round_time             0 days 00:11:05.821332
episodes_test                           110.0
episode_length_test                 90.054545
returns_test                      -197.796617
return_std_test                    487.629404
average_reward_test                 -2.193649
round_time_test        0 days 00:00:10.873644
round_time_total       0 days 00:11:05.822508
loss_total                      17034930.4675
loss_critic                     21288579.0875
loss_actor                       20334.577205
memory_size                        110764.898 

=== epoch 2/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:24,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  123
episode_length                      75.430894
returns                            -160.99844
return_std                         414.877562
average_reward                       -2.14289
round_time             0 days 00:11:06.221591
episodes_test                           149.0
episode_length_test                 66.872483
returns_test                      -147.325235
return_std_test                    379.182444
average_reward_test                 -2.200989
round_time_test        0 days 00:00:10.739715
round_time_total       0 days 00:11:06.222749
loss_total                      16252611.5745
loss_critic                      20310744.493
loss_actor                       20078.415182
memory_size                       112159.3755 

=== epoch 2/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  126
episode_length                      75.968254
returns                           -162.716169
return_std                         408.433052
average_reward                      -2.143422
round_time             0 days 00:11:05.842243
episodes_test                           154.0
episode_length_test                 64.441558
returns_test                      -140.634673
return_std_test                    383.855052
average_reward_test                 -2.181638
round_time_test        0 days 00:00:10.698056
round_time_total       0 days 00:11:05.843328
loss_total                     15325446.90825
loss_critic                      19151840.321
loss_actor                       19872.016038
memory_size                        113642.122 

=== epoch 2/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  131
episode_length                      76.030534
returns                           -162.097257
return_std                         399.825564
average_reward                      -2.130496
round_time             0 days 00:11:07.139889
episodes_test                           120.0
episode_length_test                 80.183333
returns_test                      -173.828447
return_std_test                    443.554371
average_reward_test                 -2.169912
round_time_test        0 days 00:00:10.919721
round_time_total       0 days 00:11:07.141005
loss_total                       14084287.585
loss_critic                     17600451.7585
loss_actor                       19629.698799
memory_size                        115169.949 

=== epoch 2/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  133
episode_length                      67.323308
returns                           -141.499012
return_std                         354.820245
average_reward                       -2.11114
round_time             0 days 00:11:07.563171
episodes_test                           129.0
episode_length_test                 76.449612
returns_test                      -162.693746
return_std_test                    412.992934
average_reward_test                 -2.130414
round_time_test        0 days 00:00:10.782151
round_time_total       0 days 00:11:07.564276
loss_total                      13320330.7885
loss_critic                      16645573.273
loss_actor                       19359.620153
memory_size                        116442.678 

=== epoch 2/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                  145
episode_length                      63.193103
returns                           -132.599971
return_std                         342.386201
average_reward                        -2.1122
round_time             0 days 00:11:06.471193
episodes_test                           127.0
episode_length_test                 78.385827
returns_test                      -165.593717
return_std_test                    450.453598
average_reward_test                 -2.114022
round_time_test        0 days 00:00:10.837273
round_time_total       0 days 00:11:06.472278
loss_total                     12739724.10075
loss_critic                     15919923.6345
loss_actor                       18924.967743
memory_size                       117689.4145 

=== epoch 2/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:09,  2.53it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  151
episode_length                       61.97351
returns                           -129.335093
return_std                         323.875828
average_reward                      -2.091081
round_time             0 days 00:11:07.466045
episodes_test                           193.0
episode_length_test                 51.564767
returns_test                      -106.680136
return_std_test                    238.580337
average_reward_test                 -2.069814
round_time_test        0 days 00:00:10.904317
round_time_total       0 days 00:11:07.467149
loss_total                     11933934.59525
loss_critic                      14912792.743
loss_actor                       18501.004794
memory_size                        118787.695 

=== epoch 2/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  154
episode_length                      61.227273
returns                           -126.581499
return_std                         318.858015
average_reward                      -2.075656
round_time             0 days 00:11:06.063626
episodes_test                           115.0
episode_length_test                 86.669565
returns_test                      -186.512462
return_std_test                     434.69814
average_reward_test                 -2.153551
round_time_test        0 days 00:00:10.865334
round_time_total       0 days 00:11:06.064774
loss_total                     11051631.55325
loss_critic                      13810005.298
loss_actor                       18135.607051
memory_size                        120329.139 

=== epoch 2/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  165
episode_length                      60.545455
returns                           -124.039812
return_std                         310.835761
average_reward                      -2.050683
round_time             0 days 00:11:05.501359
episodes_test                           136.0
episode_length_test                 72.132353
returns_test                      -147.335021
return_std_test                    389.465138
average_reward_test                 -2.042117
round_time_test        0 days 00:00:10.582113
round_time_total       0 days 00:11:05.502460
loss_total                        10471265.98
loss_critic                      13084660.718
loss_actor                       17686.030572
memory_size                       121755.7385 

=== epoch 2/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:43,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  158
episode_length                      62.822785
returns                           -129.765984
return_std                         320.199669
average_reward                      -2.070203
round_time             0 days 00:11:08.964832
episodes_test                           126.0
episode_length_test                 79.134921
returns_test                      -169.105037
return_std_test                    420.879339
average_reward_test                 -2.135623
round_time_test        0 days 00:00:10.711739
round_time_total       0 days 00:11:08.966190
loss_total                        9947192.548
loss_critic                    12429661.60475
loss_actor                       17315.389363
memory_size                       123005.1275 

=== epoch 2/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                  117
episode_length                      80.931624
returns                           -167.009647
return_std                         407.781609
average_reward                      -2.069122
round_time             0 days 00:11:11.030435
episodes_test                           145.0
episode_length_test                 68.593103
returns_test                      -141.149857
return_std_test                    349.343006
average_reward_test                 -2.058623
round_time_test        0 days 00:00:10.906106
round_time_total       0 days 00:11:11.031513
loss_total                       9245286.3175
loss_critic                      11552354.459
loss_actor                        17012.83983
memory_size                         124634.02 

=== epoch 2/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  124
episode_length                      74.862903
returns                           -153.703044
return_std                         375.724387
average_reward                      -2.054751
round_time             0 days 00:11:06.678167
episodes_test                           111.0
episode_length_test                 89.693694
returns_test                      -183.183055
return_std_test                    473.886187
average_reward_test                 -2.040819
round_time_test        0 days 00:00:10.741513
round_time_total       0 days 00:11:06.679246
loss_total                      8732367.72275
loss_critic                      10911286.375
loss_actor                       16692.356379
memory_size                       126212.3745 

=== epoch 2/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:02,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  114
episode_length                      82.622807
returns                           -169.023798
return_std                         401.925082
average_reward                      -2.046354
round_time             0 days 00:11:08.328112
episodes_test                           115.0
episode_length_test                 86.904348
returns_test                      -180.792864
return_std_test                    452.292135
average_reward_test                 -2.079772
round_time_test        0 days 00:00:10.832401
round_time_total       0 days 00:11:08.329212
loss_total                     8259801.902875
loss_critic                    10320654.28275
loss_actor                       16391.643988
memory_size                       127742.5365 

=== epoch 2/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   97
episode_length                      94.680412
returns                           -195.260497
return_std                         441.053058
average_reward                      -2.063756
round_time             0 days 00:11:07.189962
episodes_test                            88.0
episode_length_test                107.215909
returns_test                      -216.215718
return_std_test                    519.255761
average_reward_test                  -2.02088
round_time_test        0 days 00:00:11.194201
round_time_total       0 days 00:11:07.191052
loss_total                      7674754.59775
loss_critic                     9589405.50925
loss_actor                       16150.333889
memory_size                        129263.706 

=== epoch 2/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:23,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  102
episode_length                       89.45098
returns                           -180.876511
return_std                         423.861424
average_reward                        -2.0203
round_time             0 days 00:11:07.992333
episodes_test                           112.0
episode_length_test                 80.723214
returns_test                      -166.254022
return_std_test                    398.527422
average_reward_test                 -2.050612
round_time_test        0 days 00:00:10.802819
round_time_total       0 days 00:11:07.993425
loss_total                        7242387.581
loss_critic                     9049017.40225
loss_actor                       15867.730213
memory_size                       130799.3695 

=== epoch 2/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:27,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                  111
episode_length                      85.747748
returns                            -172.94374
return_std                         400.263829
average_reward                       -2.01941
round_time             0 days 00:11:09.788488
episodes_test                           132.0
episode_length_test                 71.613636
returns_test                      -143.991617
return_std_test                     378.91799
average_reward_test                 -2.013215
round_time_test        0 days 00:00:10.977003
round_time_total       0 days 00:11:09.789717
loss_total                      6775707.88725
loss_critic                     8465749.44475
loss_actor                        15541.09771
memory_size                        132456.112 

=== epoch 2/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  104
episode_length                      89.759615
returns                           -180.190044
return_std                         415.783461
average_reward                      -2.015484
round_time             0 days 00:11:08.280965
episodes_test                           142.0
episode_length_test                 66.485915
returns_test                      -126.426769
return_std_test                    334.111614
average_reward_test                 -1.913701
round_time_test        0 days 00:00:10.768140
round_time_total       0 days 00:11:08.282043
loss_total                     6213883.048875
loss_critic                       7763546.209
loss_actor                       15229.860774
memory_size                        133993.066 

=== epoch 2/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  113
episode_length                      85.982301
returns                           -172.263781
return_std                         424.325657
average_reward                       -2.00559
round_time             0 days 00:11:08.259038
episodes_test                           128.0
episode_length_test                 74.898438
returns_test                      -146.909088
return_std_test                    343.356097
average_reward_test                 -1.965315
round_time_test        0 days 00:00:10.803786
round_time_total       0 days 00:11:08.260281
loss_total                     5901275.032625
loss_critic                    7372873.594625
loss_actor                       14880.285736
memory_size                        135490.954 

=== epoch 2/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:30,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  115
episode_length                      84.426087
returns                           -165.916306
return_std                         411.240662
average_reward                      -1.972325
round_time             0 days 00:11:08.274001
episodes_test                           105.0
episode_length_test                 95.209524
returns_test                      -185.871301
return_std_test                    465.702321
average_reward_test                 -1.952065
round_time_test        0 days 00:00:10.836901
round_time_total       0 days 00:11:08.275111
loss_total                     5561424.418625
loss_critic                    6948157.506125
loss_actor                       14491.572976
memory_size                       136809.7785 

=== epoch 2/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  143
episode_length                      68.692308
returns                           -133.898167
return_std                         338.993979
average_reward                      -1.951837
round_time             0 days 00:11:09.054156
episodes_test                           134.0
episode_length_test                 74.447761
returns_test                      -141.874789
return_std_test                    378.564411
average_reward_test                 -1.905915
round_time_test        0 days 00:00:10.653482
round_time_total       0 days 00:11:09.055251
loss_total                     5093015.761875
loss_critic                    6362719.606625
loss_actor                       14199.925766
memory_size                       138018.9335 

=== epoch 2/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:03,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  150
episode_length                      65.853333
returns                           -125.682032
return_std                         327.568263
average_reward                      -1.907423
round_time             0 days 00:11:09.348147
episodes_test                           105.0
episode_length_test                 87.295238
returns_test                      -169.577122
return_std_test                      409.4826
average_reward_test                 -1.949768
round_time_test        0 days 00:00:10.900063
round_time_total       0 days 00:11:09.349387
loss_total                        4867513.907
loss_critic                      6080924.1335
loss_actor                       13872.579145
memory_size                        139131.114 

=== epoch 2/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  158
episode_length                      62.924051
returns                           -119.369796
return_std                         310.430137
average_reward                      -1.894563
round_time             0 days 00:11:07.695683
episodes_test                           127.0
episode_length_test                 78.110236
returns_test                      -146.604822
return_std_test                    401.596072
average_reward_test                 -1.875644
round_time_test        0 days 00:00:10.843221
round_time_total       0 days 00:11:07.696950
loss_total                     4572289.726125
loss_critic                    5711989.413125
loss_actor                       13490.571678
memory_size                        140653.115 

=== epoch 2/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  171
episode_length                      55.959064
returns                           -104.074053
return_std                         259.983758
average_reward                      -1.859388
round_time             0 days 00:11:08.307874
episodes_test                           142.0
episode_length_test                 70.352113
returns_test                      -132.723394
return_std_test                    327.315561
average_reward_test                 -1.886295
round_time_test        0 days 00:00:10.768708
round_time_total       0 days 00:11:08.308971
loss_total                     4187462.496625
loss_critic                         5231048.5
loss_actor                       13118.127263
memory_size                       141722.3305 

=== epoch 2/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  156
episode_length                      61.641026
returns                           -114.096911
return_std                         301.261469
average_reward                      -1.856929
round_time             0 days 00:11:08.169244
episodes_test                           124.0
episode_length_test                 80.096774
returns_test                      -151.154913
return_std_test                    369.963661
average_reward_test                 -1.888457
round_time_test        0 days 00:00:10.810485
round_time_total       0 days 00:11:08.170362
loss_total                      3859840.73675
loss_critic                    4821617.835125
loss_actor                       12732.010861
memory_size                        143346.866 

=== epoch 2/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:17<00:00,  2.95it/s]
episodes                                  133
episode_length                      73.360902
returns                           -139.341344
return_std                         362.919208
average_reward                        -1.8993
round_time             0 days 00:11:18.366056
episodes_test                           144.0
episode_length_test                 66.111111
returns_test                      -119.066192
return_std_test                    311.200672
average_reward_test                 -1.805514
round_time_test        0 days 00:00:10.912373
round_time_total       0 days 00:11:18.367141
loss_total                     3625731.509187
loss_critic                    4529052.708125
loss_actor                       12446.408434
memory_size                         145063.46 

=== epoch 2/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:34<00:00,  2.88it/s]
episodes                                  140
episode_length                      65.142857
returns                           -123.005779
return_std                         318.946976
average_reward                      -1.897214
round_time             0 days 00:11:34.838739
episodes_test                            98.0
episode_length_test                 96.540816
returns_test                      -177.287347
return_std_test                    447.865307
average_reward_test                 -1.840881
round_time_test        0 days 00:00:11.054575
round_time_total       0 days 00:11:34.839989
loss_total                     3356721.146062
loss_critic                    4192865.056625
loss_actor                       12145.242714
memory_size                        146393.921 

=== epoch 2/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  130
episode_length                      75.307692
returns                           -142.608839
return_std                         366.140582
average_reward                      -1.883618
round_time             0 days 00:11:07.379404
episodes_test                           112.0
episode_length_test                 89.232143
returns_test                      -160.747933
return_std_test                    413.558001
average_reward_test                 -1.801305
round_time_test        0 days 00:00:11.096982
round_time_total       0 days 00:11:07.380515
loss_total                      3192144.81475
loss_critic                    3987227.344625
loss_actor                       11814.430318
memory_size                       147628.9175 

=== epoch 2/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  127
episode_length                       74.11811
returns                           -139.439923
return_std                         365.138172
average_reward                      -1.883326
round_time             0 days 00:11:09.427003
episodes_test                           112.0
episode_length_test                 82.339286
returns_test                      -149.308199
return_std_test                    352.571902
average_reward_test                 -1.817003
round_time_test        0 days 00:00:11.064236
round_time_total       0 days 00:11:09.428113
loss_total                     2996041.925687
loss_critic                     3742175.30825
loss_actor                       11508.146848
memory_size                       149233.0765 

=== epoch 2/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                  148
episode_length                      62.513514
returns                            -115.87142
return_std                         300.406776
average_reward                      -1.853938
round_time             0 days 00:11:09.369969
episodes_test                            99.0
episode_length_test                 95.929293
returns_test                      -170.230529
return_std_test                    421.771527
average_reward_test                 -1.777215
round_time_test        0 days 00:00:11.040699
round_time_total       0 days 00:11:09.371051
loss_total                      2762747.36275
loss_critic                    3450643.626125
loss_actor                        11162.06034
memory_size                        150456.946 

=== epoch 2/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                  141
episode_length                       61.87234
returns                           -110.212688
return_std                          284.11987
average_reward                      -1.799496
round_time             0 days 00:11:09.181295
episodes_test                           128.0
episode_length_test                 78.054688
returns_test                      -139.716912
return_std_test                    353.813239
average_reward_test                 -1.790325
round_time_test        0 days 00:00:10.776623
round_time_total       0 days 00:11:09.182592
loss_total                      2543925.64025
loss_critic                    3177192.695625
loss_actor                       10857.200241
memory_size                        151893.457 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 3/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<11:31,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.05it/s]
episodes                                   34
episode_length                           39.0
returns                             -69.94126
return_std                          71.585082
average_reward                      -1.829242
round_time             0 days 00:10:54.860518
episodes_test                           153.0
episode_length_test                 64.875817
returns_test                      -111.602027
return_std_test                     291.48379
average_reward_test                 -1.714917
round_time_test        0 days 00:00:10.861921
round_time_total       0 days 00:10:54.861632
loss_total                     2399024.550375
loss_critic                    2996152.569562
loss_actor                       10512.263903
memory_size                        153147.374 

=== epoch 3/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:32,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   47
episode_length                      65.234043
returns                           -118.937474
return_std                         267.122379
average_reward                      -1.814327
round_time             0 days 00:10:59.668597
episodes_test                           121.0
episode_length_test                  76.07438
returns_test                      -127.698583
return_std_test                    337.612018
average_reward_test                 -1.686598
round_time_test        0 days 00:00:10.717572
round_time_total       0 days 00:10:59.669922
loss_total                     2245078.778125
loss_critic                    2803810.052375
loss_actor                        10153.47671
memory_size                        154714.745 

=== epoch 3/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:03,  2.76it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   92
episode_length                      64.347826
returns                           -111.385437
return_std                         265.224426
average_reward                      -1.737452
round_time             0 days 00:11:01.343657
episodes_test                           114.0
episode_length_test                 85.973684
returns_test                      -145.124294
return_std_test                    330.130648
average_reward_test                 -1.691379
round_time_test        0 days 00:00:10.650069
round_time_total       0 days 00:11:01.344760
loss_total                     2080672.640531
loss_critic                    2598381.289187
loss_actor                        9837.866764
memory_size                       156150.6415 

=== epoch 3/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:12,  2.52it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                  104
episode_length                      70.576923
returns                           -121.695815
return_std                         296.602675
average_reward                      -1.718417
round_time             0 days 00:10:59.207910
episodes_test                            79.0
episode_length_test                117.848101
returns_test                      -197.661714
return_std_test                    460.449323
average_reward_test                 -1.682107
round_time_test        0 days 00:00:10.689938
round_time_total       0 days 00:10:59.209048
loss_total                     1972783.664719
loss_critic                    2463589.767063
loss_actor                        9559.079157
memory_size                        157526.753 

=== epoch 3/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                  123
episode_length                      74.105691
returns                           -127.849851
return_std                         295.809827
average_reward                      -1.733064
round_time             0 days 00:11:01.592202
episodes_test                            81.0
episode_length_test                 123.17284
returns_test                      -209.492871
return_std_test                     458.39708
average_reward_test                 -1.700823
round_time_test        0 days 00:00:10.921435
round_time_total       0 days 00:11:01.593294
loss_total                     1832726.944969
loss_critic                    2288572.834312
loss_actor                        9343.235178
memory_size                       159060.7025 

=== epoch 3/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                  108
episode_length                      89.083333
returns                           -150.533233
return_std                         345.693771
average_reward                      -1.688447
round_time             0 days 00:11:02.444969
episodes_test                           156.0
episode_length_test                 61.448718
returns_test                       -95.157326
return_std_test                    231.783533
average_reward_test                 -1.564893
round_time_test        0 days 00:00:10.926898
round_time_total       0 days 00:11:02.446119
loss_total                        1691062.615
loss_critic                    2111551.373812
loss_actor                        9107.440946
memory_size                        160742.289 

=== epoch 3/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                  120
episode_length                      82.433333
returns                           -133.951358
return_std                         319.083298
average_reward                      -1.627135
round_time             0 days 00:11:04.326940
episodes_test                            80.0
episode_length_test                   124.825
returns_test                      -207.272118
return_std_test                    449.718463
average_reward_test                  -1.65967
round_time_test        0 days 00:00:10.745130
round_time_total       0 days 00:11:04.328059
loss_total                     1599962.927063
loss_critic                    1997744.901438
loss_actor                        8834.890291
memory_size                        162128.304 

=== epoch 3/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   96
episode_length                       94.28125
returns                           -152.515931
return_std                         348.092533
average_reward                      -1.626928
round_time             0 days 00:11:04.634999
episodes_test                            83.0
episode_length_test                110.518072
returns_test                      -176.746408
return_std_test                    370.861119
average_reward_test                 -1.596122
round_time_test        0 days 00:00:10.764971
round_time_total       0 days 00:11:04.636139
loss_total                     1499407.775188
loss_critic                    1872112.738563
loss_actor                        8587.787041
memory_size                       163685.3405 

=== epoch 3/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   99
episode_length                      94.838384
returns                           -152.957945
return_std                          354.41131
average_reward                      -1.617619
round_time             0 days 00:11:10.298131
episodes_test                            65.0
episode_length_test                145.169231
returns_test                      -236.485577
return_std_test                    500.057563
average_reward_test                 -1.626062
round_time_test        0 days 00:00:10.788135
round_time_total       0 days 00:11:10.299694
loss_total                     1402162.312375
loss_critic                    1750616.030813
loss_actor                        8347.320867
memory_size                       165197.2125 

=== epoch 3/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   84
episode_length                     112.166667
returns                           -177.465345
return_std                         406.752918
average_reward                      -1.590231
round_time             0 days 00:11:07.956848
episodes_test                           101.0
episode_length_test                 92.920792
returns_test                      -148.119723
return_std_test                    343.825151
average_reward_test                 -1.597606
round_time_test        0 days 00:00:10.760892
round_time_total       0 days 00:11:07.957936
loss_total                     1298545.192156
loss_critic                    1621148.853781
loss_actor                        8130.435295
memory_size                        167087.081 

=== epoch 3/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:01,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   68
episode_length                          140.5
returns                           -222.973205
return_std                         471.987994
average_reward                      -1.580297
round_time             0 days 00:11:11.239268
episodes_test                            72.0
episode_length_test                126.361111
returns_test                       -198.64564
return_std_test                    432.631026
average_reward_test                 -1.580064
round_time_test        0 days 00:00:10.852659
round_time_total       0 days 00:11:11.240403
loss_total                     1226607.902656
loss_critic                    1531280.749031
loss_actor                        7916.409256
memory_size                       169021.0985 

=== epoch 3/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   63
episode_length                     148.968254
returns                           -235.366072
return_std                         489.359088
average_reward                      -1.576374
round_time             0 days 00:11:09.915023
episodes_test                            61.0
episode_length_test                150.868852
returns_test                      -229.854584
return_std_test                    441.669712
average_reward_test                 -1.526735
round_time_test        0 days 00:00:11.097494
round_time_total       0 days 00:11:09.916114
loss_total                        1132954.551
loss_critic                    1414265.144563
loss_actor                        7712.077134
memory_size                        170744.418 

=== epoch 3/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   68
episode_length                          135.0
returns                           -211.510467
return_std                         448.094866
average_reward                      -1.569473
round_time             0 days 00:11:03.730633
episodes_test                           108.0
episode_length_test                 84.157407
returns_test                       -125.49458
return_std_test                    273.355946
average_reward_test                 -1.496296
round_time_test        0 days 00:00:10.783545
round_time_total       0 days 00:11:03.731920
loss_total                     1046719.130969
loss_critic                    1306523.070938
loss_actor                        7503.288969
memory_size                        172104.504 

=== epoch 3/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   72
episode_length                         119.25
returns                           -183.851043
return_std                         401.134373
average_reward                      -1.544596
round_time             0 days 00:11:04.255183
episodes_test                           146.0
episode_length_test                 67.315068
returns_test                       -91.505724
return_std_test                    213.207697
average_reward_test                 -1.363466
round_time_test        0 days 00:00:10.803591
round_time_total       0 days 00:11:04.256490
loss_total                      974281.713984
loss_critic                    1216028.574328
loss_actor                        7294.192334
memory_size                       173658.0325 

=== epoch 3/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   91
episode_length                      103.43956
returns                           -155.187491
return_std                          350.30302
average_reward                       -1.50448
round_time             0 days 00:11:04.314257
episodes_test                           157.0
episode_length_test                 63.643312
returns_test                       -87.497122
return_std_test                    201.712376
average_reward_test                 -1.374603
round_time_test        0 days 00:00:10.779470
round_time_total       0 days 00:11:04.315552
loss_total                      872383.777813
loss_critic                    1088706.437594
loss_actor                         7093.06691
memory_size                       175288.8065 

=== epoch 3/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:52,  2.40it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  110
episode_length                      84.318182
returns                           -124.876979
return_std                         288.295609
average_reward                      -1.480806
round_time             0 days 00:11:07.955248
episodes_test                           129.0
episode_length_test                 76.658915
returns_test                      -112.082182
return_std_test                    216.126747
average_reward_test                 -1.463002
round_time_test        0 days 00:00:10.932304
round_time_total       0 days 00:11:07.956378
loss_total                      814569.784078
loss_critic                    1016501.367391
loss_actor                        6843.386206
memory_size                       176679.5655 

=== epoch 3/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:34,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                  100
episode_length                          92.85
returns                           -137.428014
return_std                          320.09261
average_reward                      -1.485215
round_time             0 days 00:11:05.683360
episodes_test                            76.0
episode_length_test                120.223684
returns_test                      -165.373183
return_std_test                     347.61886
average_reward_test                 -1.383852
round_time_test        0 days 00:00:10.866634
round_time_total       0 days 00:11:05.684650
loss_total                      731638.704547
loss_critic                     912898.445906
loss_actor                        6599.681818
memory_size                       178305.2835 

=== epoch 3/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   91
episode_length                      98.747253
returns                           -143.929235
return_std                          332.89079
average_reward                      -1.461369
round_time             0 days 00:11:03.073412
episodes_test                            55.0
episode_length_test                179.745455
returns_test                      -245.700148
return_std_test                    448.038741
average_reward_test                  -1.36691
round_time_test        0 days 00:00:10.839225
round_time_total       0 days 00:11:03.074510
loss_total                      657359.454969
loss_critic                     820107.608688
loss_actor                        6366.783352
memory_size                       180006.0195 

=== epoch 3/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   90
episode_length                     109.444444
returns                           -159.638718
return_std                         362.314842
average_reward                      -1.453252
round_time             0 days 00:10:59.445205
episodes_test                            80.0
episode_length_test                  124.2125
returns_test                      -174.833062
return_std_test                    358.014109
average_reward_test                 -1.405541
round_time_test        0 days 00:00:10.825921
round_time_total       0 days 00:10:59.446308
loss_total                      600357.687273
loss_critic                     748911.195641
loss_actor                        6143.604872
memory_size                        181624.802 

=== epoch 3/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:40,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   97
episode_length                      96.587629
returns                           -137.694164
return_std                         322.212008
average_reward                      -1.420405
round_time             0 days 00:11:02.482704
episodes_test                            79.0
episode_length_test                114.734177
returns_test                      -151.332666
return_std_test                    308.801022
average_reward_test                 -1.336666
round_time_test        0 days 00:00:10.652430
round_time_total       0 days 00:11:02.483794
loss_total                      548207.346063
loss_critic                     683779.207906
loss_actor                        5919.854088
memory_size                       183024.8505 

=== epoch 3/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   86
episode_length                     113.802326
returns                           -159.431953
return_std                         361.569284
average_reward                      -1.402734
round_time             0 days 00:11:03.800914
episodes_test                            80.0
episode_length_test                  123.7625
returns_test                      -159.943612
return_std_test                    350.510012
average_reward_test                 -1.291428
round_time_test        0 days 00:00:10.725747
round_time_total       0 days 00:11:03.802020
loss_total                      508168.559609
loss_critic                     633782.499727
loss_actor                        5712.756095
memory_size                       184644.3005 

=== epoch 3/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:56,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   95
episode_length                     101.905263
returns                           -137.657521
return_std                         312.163002
average_reward                        -1.3532
round_time             0 days 00:11:06.465499
episodes_test                            66.0
episode_length_test                150.575758
returns_test                      -190.148998
return_std_test                    353.303732
average_reward_test                 -1.263393
round_time_test        0 days 00:00:10.885249
round_time_total       0 days 00:11:06.466978
loss_total                      465334.780242
loss_critic                      580287.43825
loss_actor                        5524.106687
memory_size                        186372.863 

=== epoch 3/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  115
episode_length                      84.878261
returns                           -110.682058
return_std                         255.910058
average_reward                      -1.309903
round_time             0 days 00:11:04.733675
episodes_test                            61.0
episode_length_test                150.180328
returns_test                      -178.896036
return_std_test                    346.339854
average_reward_test                 -1.194623
round_time_test        0 days 00:00:10.879912
round_time_total       0 days 00:11:04.735063
loss_total                      422161.662195
loss_critic                     526372.435766
loss_actor                        5318.531323
memory_size                        187689.625 

=== epoch 3/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:40,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  109
episode_length                      87.605505
returns                           -110.826839
return_std                         253.426863
average_reward                       -1.25391
round_time             0 days 00:11:05.252801
episodes_test                            61.0
episode_length_test                     163.0
returns_test                      -191.898566
return_std_test                     375.39011
average_reward_test                 -1.176429
round_time_test        0 days 00:00:10.669539
round_time_total       0 days 00:11:05.253955
loss_total                      395826.624453
loss_critic                     493500.065094
loss_actor                        5132.828428
memory_size                        189186.242 

=== epoch 3/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   92
episode_length                     104.217391
returns                           -127.835614
return_std                         277.385347
average_reward                      -1.229936
round_time             0 days 00:11:04.762658
episodes_test                            56.0
episode_length_test                177.214286
returns_test                      -201.193244
return_std_test                    360.525312
average_reward_test                 -1.132708
round_time_test        0 days 00:00:10.688492
round_time_total       0 days 00:11:04.763743
loss_total                      373159.457867
loss_critic                     465209.254977
loss_actor                        4960.240512
memory_size                       190915.2145 

=== epoch 3/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   94
episode_length                      97.234043
returns                           -115.019748
return_std                         227.933438
average_reward                       -1.19637
round_time             0 days 00:11:06.311729
episodes_test                            50.0
episode_length_test                    182.56
returns_test                      -192.565447
return_std_test                    327.964114
average_reward_test                 -1.060317
round_time_test        0 days 00:00:10.923694
round_time_total       0 days 00:11:06.312874
loss_total                      346317.366031
loss_critic                      431696.18993
loss_actor                        4802.042029
memory_size                        192552.164 

=== epoch 3/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   85
episode_length                     111.717647
returns                           -130.640301
return_std                         261.753152
average_reward                      -1.168956
round_time             0 days 00:11:06.978873
episodes_test                            35.0
episode_length_test                285.114286
returns_test                      -327.600865
return_std_test                    411.777049
average_reward_test                 -1.147879
round_time_test        0 days 00:00:10.985049
round_time_total       0 days 00:11:06.980160
loss_total                      327177.305375
loss_critic                     407812.263305
loss_actor                         4637.44656
memory_size                         194226.27 

=== epoch 3/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   61
episode_length                     147.721311
returns                           -167.674416
return_std                         317.156585
average_reward                       -1.15396
round_time             0 days 00:11:05.120049
episodes_test                            62.0
episode_length_test                 157.66129
returns_test                      -171.540413
return_std_test                    300.941927
average_reward_test                 -1.089921
round_time_test        0 days 00:00:10.937551
round_time_total       0 days 00:11:05.121445
loss_total                      307550.100969
loss_critic                     383316.249883
loss_actor                        4485.479108
memory_size                        196066.118 

=== epoch 3/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   61
episode_length                     149.491803
returns                           -170.181024
return_std                         327.766844
average_reward                      -1.135954
round_time             0 days 00:11:04.248565
episodes_test                            53.0
episode_length_test                188.528302
returns_test                      -191.617641
return_std_test                    286.161271
average_reward_test                 -1.015708
round_time_test        0 days 00:00:10.817744
round_time_total       0 days 00:11:04.249990
loss_total                      297116.296488
loss_critic                     370308.870188
loss_actor                        4345.975946
memory_size                        197660.567 

=== epoch 3/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   64
episode_length                     144.046875
returns                           -161.865324
return_std                         318.799069
average_reward                      -1.121901
round_time             0 days 00:11:04.861165
episodes_test                            56.0
episode_length_test                171.946429
returns_test                      -178.220424
return_std_test                    290.531324
average_reward_test                 -1.034824
round_time_test        0 days 00:00:10.639618
round_time_total       0 days 00:11:04.862247
loss_total                      280696.643535
loss_critic                     349824.419094
loss_actor                        4185.516258
memory_size                        199348.858 

=== epoch 3/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   64
episode_length                     141.921875
returns                           -153.936929
return_std                          318.33982
average_reward                      -1.089488
round_time             0 days 00:11:05.359889
episodes_test                            41.0
episode_length_test                     242.0
returns_test                      -230.945315
return_std_test                    372.688203
average_reward_test                  -0.94811
round_time_test        0 days 00:00:10.803201
round_time_total       0 days 00:11:05.361117
loss_total                      267374.987922
loss_critic                     333203.528098
loss_actor                        4060.803108
memory_size                       200944.2335 

=== epoch 3/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:29,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   71
episode_length                     132.746479
returns                           -145.894333
return_std                         306.402223
average_reward                      -1.103782
round_time             0 days 00:11:02.539152
episodes_test                            56.0
episode_length_test                   177.625
returns_test                      -165.726741
return_std_test                    277.175203
average_reward_test                 -0.931481
round_time_test        0 days 00:00:10.671169
round_time_total       0 days 00:11:02.540592
loss_total                      263166.516105
loss_critic                     327973.702539
loss_actor                        3937.746855
memory_size                        202622.344 

=== epoch 3/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   61
episode_length                     147.639344
returns                           -162.838531
return_std                         329.567431
average_reward                      -1.100254
round_time             0 days 00:11:03.928065
episodes_test                            70.0
episode_length_test                142.528571
returns_test                       -132.72297
return_std_test                     238.39044
average_reward_test                 -0.929267
round_time_test        0 days 00:00:10.866381
round_time_total       0 days 00:11:03.929146
loss_total                      267979.274082
loss_critic                     334025.856676
loss_actor                        3792.920699
memory_size                       204381.3725 

=== epoch 3/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   68
episode_length                     138.588235
returns                           -148.704779
return_std                         310.302014
average_reward                      -1.079751
round_time             0 days 00:11:00.967107
episodes_test                            50.0
episode_length_test                     197.8
returns_test                      -158.469982
return_std_test                    241.276838
average_reward_test                 -0.799579
round_time_test        0 days 00:00:10.624683
round_time_total       0 days 00:11:00.968204
loss_total                      266199.660824
loss_critic                      331826.27677
loss_actor                        3693.173983
memory_size                       206088.3735 

=== epoch 3/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   64
episode_length                     150.890625
returns                           -161.883021
return_std                         315.513084
average_reward                      -1.075144
round_time             0 days 00:11:00.935458
episodes_test                            62.0
episode_length_test                161.112903
returns_test                      -139.258178
return_std_test                    256.914486
average_reward_test                 -0.864273
round_time_test        0 days 00:00:10.697082
round_time_total       0 days 00:11:00.936524
loss_total                      264163.369773
loss_critic                     329307.500199
loss_actor                        3586.826574
memory_size                        207777.715 

=== epoch 3/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   69
episode_length                     142.318841
returns                            -151.01712
return_std                         304.061731
average_reward                      -1.062459
round_time             0 days 00:11:03.935931
episodes_test                            62.0
episode_length_test                160.451613
returns_test                      -141.671704
return_std_test                     212.01088
average_reward_test                 -0.881383
round_time_test        0 days 00:00:10.673356
round_time_total       0 days 00:11:03.937050
loss_total                      273417.805496
loss_critic                     340902.113223
loss_actor                        3480.550004
memory_size                        209362.429 

=== epoch 3/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   75
episode_length                     132.613333
returns                           -137.153664
return_std                         263.311004
average_reward                      -1.030677
round_time             0 days 00:11:00.898536
episodes_test                            56.0
episode_length_test                177.696429
returns_test                       -137.37412
return_std_test                    254.341954
average_reward_test                 -0.770483
round_time_test        0 days 00:00:10.854358
round_time_total       0 days 00:11:00.899856
loss_total                      295194.563895
loss_critic                     368143.772996
loss_actor                        3397.701769
memory_size                        210958.286 

=== epoch 3/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:51,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   91
episode_length                     109.186813
returns                           -111.841372
return_std                         221.048668
average_reward                      -1.020235
round_time             0 days 00:11:02.094021
episodes_test                            47.0
episode_length_test                     196.0
returns_test                      -169.117518
return_std_test                    293.208508
average_reward_test                  -0.86897
round_time_test        0 days 00:00:10.890407
round_time_total       0 days 00:11:02.095220
loss_total                      345801.534082
loss_critic                     431424.820879
loss_actor                        3308.356417
memory_size                        212424.177 

=== epoch 3/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   77
episode_length                     119.376623
returns                           -122.260129
return_std                         235.345235
average_reward                      -1.038284
round_time             0 days 00:11:02.792483
episodes_test                            65.0
episode_length_test                141.569231
returns_test                      -136.167087
return_std_test                    244.054627
average_reward_test                 -0.971697
round_time_test        0 days 00:00:10.906284
round_time_total       0 days 00:11:02.793646
loss_total                      445493.911207
loss_critic                     556060.701781
loss_actor                        3226.708073
memory_size                       214115.2595 

=== epoch 3/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   77
episode_length                     128.688312
returns                           -134.125487
return_std                         261.986372
average_reward                      -1.039813
round_time             0 days 00:11:04.872313
episodes_test                            54.0
episode_length_test                176.944444
returns_test                      -166.187403
return_std_test                    302.647371
average_reward_test                 -0.930269
round_time_test        0 days 00:00:10.900565
round_time_total       0 days 00:11:04.873423
loss_total                      636559.424938
loss_critic                     794918.764109
loss_actor                        3122.016315
memory_size                       215953.0985 

=== epoch 3/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:28,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   76
episode_length                     130.684211
returns                           -141.564165
return_std                         270.681277
average_reward                      -1.080442
round_time             0 days 00:11:06.058314
episodes_test                            75.0
episode_length_test                    133.32
returns_test                      -129.960886
return_std_test                    258.190962
average_reward_test                 -0.974785
round_time_test        0 days 00:00:10.879483
round_time_total       0 days 00:11:06.059386
loss_total                       550362.72918
loss_critic                     687185.706023
loss_actor                        3070.772567
memory_size                       217569.8735 

=== epoch 3/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:28,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   69
episode_length                     140.753623
returns                           -155.399736
return_std                         307.828087
average_reward                      -1.095058
round_time             0 days 00:11:07.845551
episodes_test                            70.0
episode_length_test                     142.2
returns_test                      -141.211679
return_std_test                    275.075209
average_reward_test                 -0.988932
round_time_test        0 days 00:00:10.733528
round_time_total       0 days 00:11:07.846661
loss_total                      295597.083328
loss_critic                     368732.022219
loss_actor                        3057.305046
memory_size                       219164.7315 

=== epoch 3/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   78
episode_length                     122.333333
returns                           -130.729474
return_std                         268.540029
average_reward                      -1.072766
round_time             0 days 00:11:06.449483
episodes_test                            76.0
episode_length_test                123.052632
returns_test                       -119.13643
return_std_test                    255.808451
average_reward_test                 -0.970237
round_time_test        0 days 00:00:10.815648
round_time_total       0 days 00:11:06.450586
loss_total                      224955.147332
loss_critic                     280437.301203
loss_actor                        3026.513449
memory_size                       220570.5755 

=== epoch 3/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:37,  2.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   83
episode_length                      105.53012
returns                           -111.207386
return_std                         241.802573
average_reward                      -1.055818
round_time             0 days 00:11:07.694262
episodes_test                            60.0
episode_length_test                166.466667
returns_test                      -153.506025
return_std_test                    273.286805
average_reward_test                 -0.921623
round_time_test        0 days 00:00:10.763674
round_time_total       0 days 00:11:07.695395
loss_total                       211451.64282
loss_critic                     263573.332906
loss_actor                        2964.863401
memory_size                       222170.8565 

=== epoch 3/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                  107
episode_length                      92.205607
returns                            -95.109074
return_std                         213.746927
average_reward                      -1.027478
round_time             0 days 00:11:08.211544
episodes_test                            93.0
episode_length_test                107.215054
returns_test                       -84.892567
return_std_test                    193.578645
average_reward_test                 -0.791453
round_time_test        0 days 00:00:10.720674
round_time_total       0 days 00:11:08.212689
loss_total                      181826.419844
loss_critic                     226557.367137
loss_actor                        2902.615112
memory_size                        223648.062 

=== epoch 3/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   93
episode_length                      97.258065
returns                            -96.233939
return_std                         213.770796
average_reward                      -0.983918
round_time             0 days 00:11:10.122558
episodes_test                            49.0
episode_length_test                194.142857
returns_test                      -156.832296
return_std_test                    269.796681
average_reward_test                 -0.799766
round_time_test        0 days 00:00:10.988489
round_time_total       0 days 00:11:10.123724
loss_total                      166041.718398
loss_critic                     206848.056371
loss_actor                        2816.351998
memory_size                       225229.2185 

=== epoch 3/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:53,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   92
episode_length                      108.48913
returns                           -101.409526
return_std                         219.997951
average_reward                      -0.933984
round_time             0 days 00:11:09.391349
episodes_test                            52.0
episode_length_test                177.730769
returns_test                      -148.891986
return_std_test                    256.254941
average_reward_test                 -0.845819
round_time_test        0 days 00:00:10.854987
round_time_total       0 days 00:11:09.392464
loss_total                      164583.212232
loss_critic                     205045.657604
loss_actor                        2733.417163
memory_size                       227000.9615 

=== epoch 3/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   81
episode_length                     115.049383
returns                             -104.9565
return_std                         221.126984
average_reward                      -0.923229
round_time             0 days 00:11:09.587745
episodes_test                            29.0
episode_length_test                325.241379
returns_test                      -214.425852
return_std_test                    257.045358
average_reward_test                 -0.653011
round_time_test        0 days 00:00:10.846250
round_time_total       0 days 00:11:09.588827
loss_total                      182616.234785
loss_critic                     227605.872039
loss_actor                          2657.6691
memory_size                        228710.571 

=== epoch 3/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   85
episode_length                          114.8
returns                           -104.179356
return_std                         206.639993
average_reward                      -0.907216
round_time             0 days 00:11:10.700189
episodes_test                            77.0
episode_length_test                128.272727
returns_test                       -87.606138
return_std_test                    181.529189
average_reward_test                 -0.675273
round_time_test        0 days 00:00:10.738105
round_time_total       0 days 00:11:10.701359
loss_total                      213888.298584
loss_critic                     266715.356602
loss_actor                        2580.047877
memory_size                       230231.2465 

=== epoch 3/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:19,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   60
episode_length                          153.1
returns                           -138.143849
return_std                         247.147996
average_reward                      -0.906037
round_time             0 days 00:11:11.148611
episodes_test                            34.0
episode_length_test                279.235294
returns_test                      -238.417744
return_std_test                    332.252464
average_reward_test                 -0.866837
round_time_test        0 days 00:00:10.728538
round_time_total       0 days 00:11:11.149923
loss_total                      246479.521414
loss_critic                     307470.297805
loss_actor                        2516.392536
memory_size                       231909.3345 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 4/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<11:55,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   11
episode_length                     113.545455
returns                           -105.539004
return_std                           52.12019
average_reward                      -1.023335
round_time             0 days 00:10:58.423269
episodes_test                            36.0
episode_length_test                254.416667
returns_test                      -237.691794
return_std_test                    341.926925
average_reward_test                 -0.938224
round_time_test        0 days 00:00:10.775373
round_time_total       0 days 00:10:58.424395
loss_total                      303179.959207
loss_critic                     378361.574141
loss_actor                        2453.474361
memory_size                       233627.3615 

=== epoch 4/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:36,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   27
episode_length                      146.62963
returns                           -148.671396
return_std                          276.22421
average_reward                      -1.002352
round_time             0 days 00:11:00.022443
episodes_test                            40.0
episode_length_test                   232.325
returns_test                      -239.211939
return_std_test                    358.421459
average_reward_test                 -1.025153
round_time_test        0 days 00:00:10.709347
round_time_total       0 days 00:11:00.023760
loss_total                      414457.355477
loss_critic                     517467.377789
loss_actor                        2417.229524
memory_size                       235367.7655 

=== epoch 4/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   40
episode_length                        144.975
returns                           -146.912708
return_std                         277.804476
average_reward                      -1.027103
round_time             0 days 00:11:01.264542
episodes_test                            71.0
episode_length_test                136.323944
returns_test                      -144.782765
return_std_test                    303.067949
average_reward_test                 -1.058312
round_time_test        0 days 00:00:10.802745
round_time_total       0 days 00:11:01.266139
loss_total                      543528.227121
loss_critic                     678815.050969
loss_actor                        2380.884303
memory_size                        237060.227 

=== epoch 4/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   54
episode_length                     139.666667
returns                           -143.847909
return_std                         283.574342
average_reward                      -1.032912
round_time             0 days 00:11:01.199390
episodes_test                            72.0
episode_length_test                126.041667
returns_test                      -133.101321
return_std_test                    224.233292
average_reward_test                 -1.037847
round_time_test        0 days 00:00:10.750557
round_time_total       0 days 00:11:01.200752
loss_total                      449345.669594
loss_critic                      561081.83741
loss_actor                         2400.95875
memory_size                         238710.03 

=== epoch 4/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   65
episode_length                     153.338462
returns                           -158.131465
return_std                         308.641251
average_reward                      -1.027688
round_time             0 days 00:11:00.112506
episodes_test                            92.0
episode_length_test                108.695652
returns_test                       -99.810589
return_std_test                    191.028537
average_reward_test                 -0.918257
round_time_test        0 days 00:00:10.840170
round_time_total       0 days 00:11:00.113615
loss_total                      349487.054285
loss_critic                     436254.683539
loss_actor                        2416.509283
memory_size                        240471.876 

=== epoch 4/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   71
episode_length                     136.225352
returns                           -138.746122
return_std                         293.984519
average_reward                      -1.018526
round_time             0 days 00:11:01.733348
episodes_test                            65.0
episode_length_test                151.184615
returns_test                      -141.448611
return_std_test                    273.589821
average_reward_test                 -0.932558
round_time_test        0 days 00:00:10.849835
round_time_total       0 days 00:11:01.734437
loss_total                      403729.627777
loss_critic                     504061.107141
loss_actor                        2403.677417
memory_size                        242043.785 

=== epoch 4/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   80
episode_length                         106.45
returns                           -107.477816
return_std                         232.867789
average_reward                      -1.035304
round_time             0 days 00:11:03.885051
episodes_test                            36.0
episode_length_test                262.611111
returns_test                      -252.106436
return_std_test                    294.517052
average_reward_test                 -0.958025
round_time_test        0 days 00:00:10.766570
round_time_total       0 days 00:11:03.886347
loss_total                      648184.474973
loss_critic                     809639.683766
loss_actor                        2363.584544
memory_size                       243482.3515 

=== epoch 4/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   81
episode_length                     112.901235
returns                            -116.21156
return_std                         259.515012
average_reward                      -1.037789
round_time             0 days 00:11:06.849531
episodes_test                            64.0
episode_length_test                153.296875
returns_test                      -138.251358
return_std_test                    248.071947
average_reward_test                 -0.892828
round_time_test        0 days 00:00:10.667956
round_time_total       0 days 00:11:06.850695
loss_total                      714084.792301
loss_critic                     892019.121516
loss_actor                        2347.417282
memory_size                        245110.147 

=== epoch 4/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:04,  2.54it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   70
episode_length                     122.757143
returns                           -129.581212
return_std                         281.457456
average_reward                      -1.071488
round_time             0 days 00:11:08.298359
episodes_test                            71.0
episode_length_test                128.985915
returns_test                       -134.46377
return_std_test                    276.087523
average_reward_test                 -1.049723
round_time_test        0 days 00:00:10.766234
round_time_total       0 days 00:11:08.299552
loss_total                      706428.406227
loss_critic                     882452.671117
loss_actor                        2331.285927
memory_size                       246893.7175 

=== epoch 4/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   74
episode_length                     134.108108
returns                           -149.780407
return_std                         310.676008
average_reward                      -1.112195
round_time             0 days 00:11:08.228188
episodes_test                            91.0
episode_length_test                101.934066
returns_test                      -108.350216
return_std_test                    221.350323
average_reward_test                  -1.06266
round_time_test        0 days 00:00:10.902849
round_time_total       0 days 00:11:08.229284
loss_total                      701045.543867
loss_critic                     875723.787414
loss_actor                        2332.511854
memory_size                       248748.9965 

=== epoch 4/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   67
episode_length                      143.41791
returns                           -166.798405
return_std                         335.903117
average_reward                      -1.157756
round_time             0 days 00:11:09.106099
episodes_test                            80.0
episode_length_test                   117.725
returns_test                      -125.941246
return_std_test                    254.586487
average_reward_test                 -1.092535
round_time_test        0 days 00:00:10.738912
round_time_total       0 days 00:11:09.107199
loss_total                      994144.242102
loss_critic                    1242106.249109
loss_actor                        2296.128422
memory_size                         250330.26 

=== epoch 4/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:52,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   51
episode_length                     167.803922
returns                            -200.35204
return_std                         375.962448
average_reward                      -1.196872
round_time             0 days 00:11:07.987300
episodes_test                            99.0
episode_length_test                 99.626263
returns_test                      -107.209525
return_std_test                    223.220775
average_reward_test                 -1.071842
round_time_test        0 days 00:00:10.732277
round_time_total       0 days 00:11:07.988554
loss_total                      962729.213031
loss_critic                      1202836.4375
loss_actor                        2300.239071
memory_size                       252145.1265 

=== epoch 4/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   51
episode_length                     188.803922
returns                           -229.099023
return_std                         405.749455
average_reward                      -1.221959
round_time             0 days 00:11:06.887883
episodes_test                            70.0
episode_length_test                140.157143
returns_test                      -164.525665
return_std_test                    355.585349
average_reward_test                 -1.171022
round_time_test        0 days 00:00:11.140488
round_time_total       0 days 00:11:06.889000
loss_total                     1454669.466703
loss_critic                    1817775.171234
loss_actor                        2246.532439
memory_size                          253818.7 

=== epoch 4/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:02,  2.55it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   56
episode_length                     175.392857
returns                           -217.196869
return_std                         401.833165
average_reward                      -1.238831
round_time             0 days 00:11:04.256998
episodes_test                            60.0
episode_length_test                     160.8
returns_test                      -203.460246
return_std_test                     398.88613
average_reward_test                 -1.266482
round_time_test        0 days 00:00:10.770977
round_time_total       0 days 00:11:04.258084
loss_total                     1568536.839859
loss_critic                    1960111.824688
loss_actor                        2236.768663
memory_size                       255674.3685 

=== epoch 4/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   66
episode_length                          143.0
returns                           -180.625294
return_std                         347.963072
average_reward                      -1.263922
round_time             0 days 00:11:05.250242
episodes_test                           117.0
episode_length_test                 85.418803
returns_test                      -105.113822
return_std_test                    227.119083
average_reward_test                  -1.23008
round_time_test        0 days 00:00:10.812237
round_time_total       0 days 00:11:05.251534
loss_total                      592087.144398
loss_critic                     739526.991484
loss_actor                        2327.707345
memory_size                        257175.835 

=== epoch 4/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   64
episode_length                     142.734375
returns                           -181.745795
return_std                         354.238724
average_reward                      -1.270339
round_time             0 days 00:11:02.743120
episodes_test                            91.0
episode_length_test                109.186813
returns_test                      -134.318021
return_std_test                    294.547941
average_reward_test                 -1.228722
round_time_test        0 days 00:00:10.625943
round_time_total       0 days 00:11:02.744382
loss_total                      418256.450059
loss_critic                     522233.699508
loss_actor                        2347.414863
memory_size                       258923.6315 

=== epoch 4/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   69
episode_length                     133.231884
returns                           -167.858146
return_std                         339.961901
average_reward                      -1.254612
round_time             0 days 00:11:02.524827
episodes_test                            91.0
episode_length_test                 109.67033
returns_test                      -124.504816
return_std_test                    285.079653
average_reward_test                 -1.136199
round_time_test        0 days 00:00:10.840842
round_time_total       0 days 00:11:02.525899
loss_total                      352140.994441
loss_critic                      439585.49102
loss_actor                        2362.976607
memory_size                       260623.3885 

=== epoch 4/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   82
episode_length                     111.317073
returns                           -133.788712
return_std                         282.939932
average_reward                      -1.198731
round_time             0 days 00:11:03.923078
episodes_test                            62.0
episode_length_test                148.354839
returns_test                      -158.744388
return_std_test                     321.33304
average_reward_test                 -1.066897
round_time_test        0 days 00:00:10.833194
round_time_total       0 days 00:11:03.924158
loss_total                      378122.931301
loss_critic                     472065.704688
loss_actor                        2351.807771
memory_size                       262151.0155 

=== epoch 4/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   90
episode_length                     109.444444
returns                           -126.886642
return_std                         261.645262
average_reward                      -1.167041
round_time             0 days 00:11:03.963175
episodes_test                            47.0
episode_length_test                209.914894
returns_test                      -212.566525
return_std_test                    345.491261
average_reward_test                 -1.012899
round_time_test        0 days 00:00:10.819685
round_time_total       0 days 00:11:03.964303
loss_total                      483267.117527
loss_critic                     603499.360844
loss_actor                        2338.104125
memory_size                       263672.6575 

=== epoch 4/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   85
episode_length                          112.0
returns                           -122.812035
return_std                         260.393087
average_reward                       -1.10595
round_time             0 days 00:11:05.101787
episodes_test                            46.0
episode_length_test                215.304348
returns_test                      -194.036412
return_std_test                     314.20574
average_reward_test                 -0.904655
round_time_test        0 days 00:00:10.773976
round_time_total       0 days 00:11:05.102878
loss_total                      521138.559613
loss_critic                     650835.304242
loss_actor                        2351.531544
memory_size                        265354.022 

=== epoch 4/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   86
episode_length                     100.127907
returns                           -107.929579
return_std                         233.012657
average_reward                      -1.086185
round_time             0 days 00:11:05.180627
episodes_test                            90.0
episode_length_test                102.411111
returns_test                       -98.283196
return_std_test                    204.708546
average_reward_test                 -0.956647
round_time_test        0 days 00:00:10.770271
round_time_total       0 days 00:11:05.181722
loss_total                      683027.932934
loss_critic                     853200.272078
loss_actor                        2338.522138
memory_size                        266899.011 

=== epoch 4/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   80
episode_length                        121.275
returns                           -131.283307
return_std                         265.999437
average_reward                      -1.081561
round_time             0 days 00:11:04.670230
episodes_test                            25.0
episode_length_test                     379.8
returns_test                      -402.573834
return_std_test                    483.593833
average_reward_test                 -1.059456
round_time_test        0 days 00:00:10.712045
round_time_total       0 days 00:11:04.671523
loss_total                       768037.40257
loss_critic                     959463.630258
loss_actor                        2332.420501
memory_size                        268670.493 

=== epoch 4/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   68
episode_length                     139.279412
returns                           -156.482417
return_std                         313.638878
average_reward                      -1.119709
round_time             0 days 00:11:04.671712
episodes_test                            66.0
episode_length_test                137.393939
returns_test                      -139.207982
return_std_test                    267.824158
average_reward_test                 -1.015854
round_time_test        0 days 00:00:10.845500
round_time_total       0 days 00:11:04.672819
loss_total                      718965.018937
loss_critic                     898119.350188
loss_actor                        2347.631858
memory_size                        270444.026 

=== epoch 4/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   60
episode_length                     137.066667
returns                           -155.555013
return_std                         307.506851
average_reward                      -1.127594
round_time             0 days 00:11:03.747874
episodes_test                            65.0
episode_length_test                153.815385
returns_test                       -171.36811
return_std_test                     323.62006
average_reward_test                 -1.114135
round_time_test        0 days 00:00:10.820892
round_time_total       0 days 00:11:03.749014
loss_total                      752428.352945
loss_critic                     939945.171023
loss_actor                          2361.0202
memory_size                       272119.3165 

=== epoch 4/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   56
episode_length                     158.035714
returns                           -181.998888
return_std                         343.280873
average_reward                      -1.152478
round_time             0 days 00:11:03.030526
episodes_test                            89.0
episode_length_test                111.910112
returns_test                      -120.729303
return_std_test                    258.252744
average_reward_test                 -1.075418
round_time_test        0 days 00:00:10.655243
round_time_total       0 days 00:11:03.031706
loss_total                      752996.329609
loss_critic                     940650.512813
loss_actor                         2379.53576
memory_size                       273906.6135 

=== epoch 4/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:30,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   55
episode_length                     158.454545
returns                           -183.749396
return_std                         348.750964
average_reward                      -1.169007
round_time             0 days 00:11:03.377550
episodes_test                           106.0
episode_length_test                 93.650943
returns_test                      -103.816586
return_std_test                    219.369738
average_reward_test                 -1.105613
round_time_test        0 days 00:00:10.688761
round_time_total       0 days 00:11:03.378700
loss_total                      838061.719578
loss_critic                    1046983.328687
loss_actor                         2375.21659
memory_size                       275564.4885 

=== epoch 4/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   66
episode_length                     127.772727
returns                            -150.35372
return_std                         303.637504
average_reward                      -1.179533
round_time             0 days 00:11:05.368999
episodes_test                            88.0
episode_length_test                113.590909
returns_test                      -121.056783
return_std_test                    260.084873
average_reward_test                 -1.065894
round_time_test        0 days 00:00:10.924451
round_time_total       0 days 00:11:05.370093
loss_total                      876389.729828
loss_critic                    1094889.354016
loss_actor                         2391.16247
memory_size                       277172.3235 

=== epoch 4/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   80
episode_length                         117.25
returns                           -139.938117
return_std                         281.686282
average_reward                      -1.191862
round_time             0 days 00:11:00.483463
episodes_test                            64.0
episode_length_test                155.609375
returns_test                      -186.944329
return_std_test                    339.855967
average_reward_test                 -1.201737
round_time_test        0 days 00:00:10.711107
round_time_total       0 days 00:11:00.484569
loss_total                      638301.288703
loss_critic                      797265.55632
loss_actor                        2444.164221
memory_size                         278734.91 

=== epoch 4/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   92
episode_length                     107.630435
returns                           -127.646161
return_std                         271.887812
average_reward                      -1.185448
round_time             0 days 00:11:04.512231
episodes_test                            92.0
episode_length_test                108.478261
returns_test                       -123.48888
return_std_test                    274.829945
average_reward_test                   -1.1393
round_time_test        0 days 00:00:10.777749
round_time_total       0 days 00:11:04.513631
loss_total                      555021.394992
loss_critic                     693161.613344
loss_actor                        2460.468845
memory_size                       280052.8605 

=== epoch 4/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   89
episode_length                      98.179775
returns                           -116.852169
return_std                         247.411421
average_reward                      -1.185037
round_time             0 days 00:11:05.216353
episodes_test                            58.0
episode_length_test                170.827586
returns_test                      -181.874709
return_std_test                     341.34174
average_reward_test                   -1.0656
round_time_test        0 days 00:00:10.822172
round_time_total       0 days 00:11:05.217685
loss_total                      511201.379414
loss_critic                     638389.152883
loss_actor                        2450.239987
memory_size                       281677.8985 

=== epoch 4/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                  104
episode_length                      92.298077
returns                           -108.088116
return_std                         226.259464
average_reward                       -1.16356
round_time             0 days 00:11:05.502999
episodes_test                           115.0
episode_length_test                 86.347826
returns_test                       -86.812336
return_std_test                    175.618438
average_reward_test                 -1.003033
round_time_test        0 days 00:00:10.820681
round_time_total       0 days 00:11:05.504139
loss_total                       476678.47441
loss_critic                     595236.092184
loss_actor                        2447.960483
memory_size                       283337.7285 

=== epoch 4/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                  101
episode_length                      91.574257
returns                           -104.551109
return_std                         221.001823
average_reward                      -1.148424
round_time             0 days 00:11:07.215723
episodes_test                            39.0
episode_length_test                 255.25641
returns_test                      -240.788269
return_std_test                    364.493032
average_reward_test                 -0.938749
round_time_test        0 days 00:00:10.790159
round_time_total       0 days 00:11:07.217018
loss_total                      438640.809988
loss_critic                     547689.869988
loss_actor                        2444.531912
memory_size                       284851.5485 

=== epoch 4/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   81
episode_length                     121.876543
returns                            -137.35482
return_std                          289.27534
average_reward                      -1.130283
round_time             0 days 00:11:10.270672
episodes_test                            52.0
episode_length_test                186.711538
returns_test                      -175.543411
return_std_test                    298.165499
average_reward_test                 -0.940237
round_time_test        0 days 00:00:10.748526
round_time_total       0 days 00:11:10.271903
loss_total                      387202.470012
loss_critic                     483394.171469
loss_actor                        2435.630587
memory_size                       286595.1195 

=== epoch 4/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:29,  2.66it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   68
episode_length                     134.647059
returns                            -150.71393
return_std                         299.726884
average_reward                      -1.106611
round_time             0 days 00:11:06.620747
episodes_test                            51.0
episode_length_test                176.803922
returns_test                       -155.02106
return_std_test                     277.20189
average_reward_test                 -0.870558
round_time_test        0 days 00:00:10.788057
round_time_total       0 days 00:11:06.621840
loss_total                      354695.591035
loss_critic                     442769.653742
loss_actor                        2399.308678
memory_size                        288277.402 

=== epoch 4/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:52,  2.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   68
episode_length                     138.941176
returns                            -150.05655
return_std                         287.416434
average_reward                      -1.090634
round_time             0 days 00:11:09.931055
episodes_test                            49.0
episode_length_test                195.591837
returns_test                      -168.552573
return_std_test                    278.331652
average_reward_test                  -0.85494
round_time_test        0 days 00:00:10.776469
round_time_total       0 days 00:11:09.932557
loss_total                      346819.296945
loss_critic                     432927.736582
loss_actor                        2385.508473
memory_size                       290114.8815 

=== epoch 4/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:48,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   65
episode_length                     135.107692
returns                           -144.152035
return_std                         268.105177
average_reward                      -1.069024
round_time             0 days 00:11:07.630308
episodes_test                            69.0
episode_length_test                144.405797
returns_test                      -118.713395
return_std_test                    204.954641
average_reward_test                 -0.821755
round_time_test        0 days 00:00:10.844715
round_time_total       0 days 00:11:07.631393
loss_total                      357226.998812
loss_critic                      445941.46425
loss_actor                        2369.105728
memory_size                       291633.4315 

=== epoch 4/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:44,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   62
episode_length                     142.435484
returns                           -147.482046
return_std                         264.695902
average_reward                      -1.033928
round_time             0 days 00:11:10.143234
episodes_test                            62.0
episode_length_test                158.145161
returns_test                      -137.227116
return_std_test                    258.925278
average_reward_test                 -0.871296
round_time_test        0 days 00:00:10.798928
round_time_total       0 days 00:11:10.144317
loss_total                      409858.733125
loss_critic                     511743.282105
loss_actor                        2320.501373
memory_size                        293264.873 

=== epoch 4/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:51,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   64
episode_length                         155.75
returns                           -158.402588
return_std                         272.302125
average_reward                      -1.019774
round_time             0 days 00:11:08.073851
episodes_test                            33.0
episode_length_test                302.909091
returns_test                      -215.006748
return_std_test                    284.081239
average_reward_test                 -0.709944
round_time_test        0 days 00:00:10.690382
round_time_total       0 days 00:11:08.074933
loss_total                      412555.807027
loss_critic                      515119.16848
loss_actor                        2302.330114
memory_size                       295058.4535 

=== epoch 4/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   69
episode_length                     140.565217
returns                           -141.477996
return_std                         239.003433
average_reward                      -1.004971
round_time             0 days 00:11:06.282115
episodes_test                            48.0
episode_length_test                  191.6875
returns_test                      -157.960418
return_std_test                    240.082434
average_reward_test                 -0.821214
round_time_test        0 days 00:00:10.621958
round_time_total       0 days 00:11:06.283406
loss_total                      372783.824145
loss_critic                     465403.945066
loss_actor                        2303.304625
memory_size                        296641.641 

=== epoch 4/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   65
episode_length                     148.046154
returns                           -145.643625
return_std                         243.951668
average_reward                      -0.979864
round_time             0 days 00:11:09.599867
episodes_test                            45.0
episode_length_test                220.066667
returns_test                      -159.666788
return_std_test                    247.449287
average_reward_test                 -0.725024
round_time_test        0 days 00:00:10.796827
round_time_total       0 days 00:11:09.600971
loss_total                      426486.749129
loss_critic                     532535.917516
loss_actor                        2290.037188
memory_size                        298451.569 

=== epoch 4/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:46,  2.61it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   51
episode_length                     170.647059
returns                           -165.450251
return_std                         271.158153
average_reward                      -0.978215
round_time             0 days 00:11:07.081211
episodes_test                            67.0
episode_length_test                147.223881
returns_test                      -114.310365
return_std_test                    213.357319
average_reward_test                 -0.773655
round_time_test        0 days 00:00:10.947237
round_time_total       0 days 00:11:07.082309
loss_total                      521157.023645
loss_critic                     650877.439109
loss_actor                         2275.31776
memory_size                       300241.4395 

=== epoch 4/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   53
episode_length                     178.509434
returns                           -178.317177
return_std                         301.251637
average_reward                      -0.996701
round_time             0 days 00:11:08.949015
episodes_test                            62.0
episode_length_test                150.693548
returns_test                      -125.238145
return_std_test                    208.338771
average_reward_test                 -0.834543
round_time_test        0 days 00:00:10.622372
round_time_total       0 days 00:11:08.950160
loss_total                      631582.010473
loss_critic                     788908.617188
loss_actor                        2275.530247
memory_size                       301935.6735 

=== epoch 4/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   56
episode_length                     178.232143
returns                           -176.980294
return_std                         301.691579
average_reward                      -0.991875
round_time             0 days 00:11:11.884295
episodes_test                            65.0
episode_length_test                148.676923
returns_test                      -130.986913
return_std_test                    242.803895
average_reward_test                 -0.879731
round_time_test        0 days 00:00:10.847915
round_time_total       0 days 00:11:11.885420
loss_total                      597220.702281
loss_critic                     745954.055145
loss_actor                        2287.241072
memory_size                       303689.4815 

=== epoch 4/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:58,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   55
episode_length                     176.690909
returns                           -180.186798
return_std                         314.492219
average_reward                      -1.024567
round_time             0 days 00:11:08.267348
episodes_test                            59.0
episode_length_test                154.050847
returns_test                      -148.442257
return_std_test                    290.701195
average_reward_test                 -0.957507
round_time_test        0 days 00:00:10.887876
round_time_total       0 days 00:11:08.268727
loss_total                      625868.797375
loss_critic                     781766.278711
loss_actor                        2278.818782
memory_size                       305263.7165 

=== epoch 4/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   68
episode_length                         136.75
returns                           -145.410985
return_std                         275.646782
average_reward                      -1.056977
round_time             0 days 00:11:10.823373
episodes_test                            50.0
episode_length_test                    197.42
returns_test                      -191.331711
return_std_test                    278.823545
average_reward_test                  -0.96715
round_time_test        0 days 00:00:10.738923
round_time_total       0 days 00:11:10.824864
loss_total                      553763.990801
loss_critic                     691629.082004
loss_actor                        2303.577143
memory_size                        307018.795 

=== epoch 4/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   67
episode_length                     135.253731
returns                           -147.812047
return_std                         279.556526
average_reward                      -1.090523
round_time             0 days 00:11:11.013282
episodes_test                            36.0
episode_length_test                271.777778
returns_test                      -281.548252
return_std_test                    412.924339
average_reward_test                 -1.034622
round_time_test        0 days 00:00:10.687196
round_time_total       0 days 00:11:11.014386
loss_total                      533672.771918
loss_critic                     666515.760324
loss_actor                        2300.773502
memory_size                       308540.0675 

=== epoch 4/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   57
episode_length                     150.192982
returns                           -165.231472
return_std                         302.893451
average_reward                      -1.101561
round_time             0 days 00:11:06.852636
episodes_test                           107.0
episode_length_test                 93.448598
returns_test                        -92.00419
return_std_test                    171.890893
average_reward_test                 -0.984461
round_time_test        0 days 00:00:10.775405
round_time_total       0 days 00:11:06.853753
loss_total                      436303.128746
loss_critic                     544801.661785
loss_actor                        2308.958683
memory_size                        310410.992 

=== epoch 4/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   54
episode_length                     166.481481
returns                           -188.420948
return_std                         335.230374
average_reward                      -1.121759
round_time             0 days 00:11:09.519588
episodes_test                            61.0
episode_length_test                163.590164
returns_test                      -165.600353
return_std_test                    275.395802
average_reward_test                 -1.012073
round_time_test        0 days 00:00:10.919220
round_time_total       0 days 00:11:09.520901
loss_total                      424064.972418
loss_critic                     529510.044781
loss_actor                        2284.646046
memory_size                       312208.0695 

=== epoch 4/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:07<00:00,  3.00it/s]
episodes                                   47
episode_length                     178.212766
returns                           -197.821659
return_std                          350.49424
average_reward                      -1.114221
round_time             0 days 00:11:07.807428
episodes_test                            51.0
episode_length_test                189.215686
returns_test                      -183.643763
return_std_test                     259.68328
average_reward_test                  -0.96157
round_time_test        0 days 00:00:10.768164
round_time_total       0 days 00:11:07.808514
loss_total                      464464.967557
loss_critic                     580017.363004
loss_actor                        2255.350106
memory_size                        314027.193 

=== epoch 4/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   41
episode_length                     225.682927
returns                           -249.984036
return_std                         394.092939
average_reward                      -1.104343
round_time             0 days 00:11:08.675373
episodes_test                            42.0
episode_length_test                230.571429
returns_test                      -197.857886
return_std_test                     310.52373
average_reward_test                 -0.853465
round_time_test        0 days 00:00:10.801646
round_time_total       0 days 00:11:08.676507
loss_total                      505650.552273
loss_critic                     631504.231848
loss_actor                         2235.79346
memory_size                       315751.4315 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 5/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                    6
episode_length                     213.166667
returns                           -229.522588
return_std                         361.408906
average_reward                      -1.099376
round_time             0 days 00:11:00.690109
episodes_test                            33.0
episode_length_test                273.090909
returns_test                      -242.876272
return_std_test                    332.606733
average_reward_test                 -0.893721
round_time_test        0 days 00:00:10.761375
round_time_total       0 days 00:11:00.691196
loss_total                      672775.825105
loss_critic                     840420.809469
loss_actor                        2195.824249
memory_size                       317526.0625 

=== epoch 5/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   23
episode_length                     147.217391
returns                           -162.196375
return_std                          288.56773
average_reward                      -1.115341
round_time             0 days 00:10:59.547318
episodes_test                            51.0
episode_length_test                186.568627
returns_test                      -171.530938
return_std_test                    328.490123
average_reward_test                 -0.926583
round_time_test        0 days 00:00:10.849974
round_time_total       0 days 00:10:59.548407
loss_total                      871450.348617
loss_critic                    1088771.460773
loss_actor                        2165.824672
memory_size                       319225.8675 

=== epoch 5/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   30
episode_length                     193.033333
returns                           -213.047876
return_std                           353.9488
average_reward                      -1.101123
round_time             0 days 00:10:59.482509
episodes_test                            58.0
episode_length_test                171.810345
returns_test                      -167.812529
return_std_test                    318.016242
average_reward_test                 -0.976276
round_time_test        0 days 00:00:10.793982
round_time_total       0 days 00:10:59.483816
loss_total                     1072972.121316
loss_critic                    1340680.590453
loss_actor                        2138.153018
memory_size                        320974.859 

=== epoch 5/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:59<00:00,  3.03it/s]
episodes                                   37
episode_length                     213.459459
returns                           -241.091187
return_std                         399.367059
average_reward                      -1.127713
round_time             0 days 00:10:59.759811
episodes_test                            43.0
episode_length_test                227.953488
returns_test                      -239.281653
return_std_test                    396.770766
average_reward_test                 -1.046113
round_time_test        0 days 00:00:10.848120
round_time_total       0 days 00:10:59.760957
loss_total                     1406796.177875
loss_critic                    1757966.891031
loss_actor                        2113.205108
memory_size                        322860.934 

=== epoch 5/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                   40
episode_length                          248.8
returns                           -288.250273
return_std                         445.732974
average_reward                       -1.15725
round_time             0 days 00:11:01.676101
episodes_test                            53.0
episode_length_test                180.792453
returns_test                      -190.328258
return_std_test                    323.671126
average_reward_test                 -1.053973
round_time_test        0 days 00:00:10.783252
round_time_total       0 days 00:11:01.677334
loss_total                     1785836.660055
loss_critic                     2231781.59857
loss_actor                        2056.750871
memory_size                       324718.7355 

=== epoch 5/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:49,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   46
episode_length                     203.565217
returns                           -243.223684
return_std                         413.599198
average_reward                      -1.198961
round_time             0 days 00:11:01.545122
episodes_test                            37.0
episode_length_test                259.324324
returns_test                      -330.594496
return_std_test                    442.362217
average_reward_test                 -1.274373
round_time_test        0 days 00:00:10.778679
round_time_total       0 days 00:11:01.546221
loss_total                     2461628.147156
loss_critic                    3076545.879422
loss_actor                        1957.012417
memory_size                        326465.451 

=== epoch 5/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:43,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   35
episode_length                     270.571429
returns                           -349.363172
return_std                         525.744982
average_reward                      -1.291718
round_time             0 days 00:11:02.214507
episodes_test                            26.0
episode_length_test                355.538462
returns_test                      -493.491229
return_std_test                    592.415836
average_reward_test                 -1.386039
round_time_test        0 days 00:00:10.800393
round_time_total       0 days 00:11:02.215594
loss_total                        3141440.046
loss_critic                    3926392.843063
loss_actor                        1628.588965
memory_size                       328324.4185 

=== epoch 5/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:54,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   42
episode_length                     217.857143
returns                           -298.239494
return_std                          487.57185
average_reward                      -1.357113
round_time             0 days 00:11:01.523952
episodes_test                            42.0
episode_length_test                218.547619
returns_test                      -282.408667
return_std_test                    439.072363
average_reward_test                 -1.290934
round_time_test        0 days 00:00:10.660672
round_time_total       0 days 00:11:01.525312
loss_total                     1530936.266734
loss_critic                    1913245.530703
loss_actor                        1699.082122
memory_size                       330110.5655 

=== epoch 5/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:56,  2.79it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   59
episode_length                     139.576271
returns                           -190.064166
return_std                         381.350987
average_reward                      -1.335591
round_time             0 days 00:11:04.696003
episodes_test                           109.0
episode_length_test                 91.733945
returns_test                       -98.910654
return_std_test                    153.346842
average_reward_test                 -1.078165
round_time_test        0 days 00:00:10.605975
round_time_total       0 days 00:11:04.697119
loss_total                      900333.010863
loss_critic                    1124972.074242
loss_actor                        1776.677712
memory_size                        331502.566 

=== epoch 5/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:43,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   61
episode_length                      151.04918
returns                           -197.000505
return_std                         384.451153
average_reward                      -1.276718
round_time             0 days 00:11:03.021213
episodes_test                            48.0
episode_length_test                  195.6875
returns_test                      -199.611361
return_std_test                    336.756957
average_reward_test                  -1.02502
round_time_test        0 days 00:00:10.822595
round_time_total       0 days 00:11:03.022459
loss_total                      233764.261865
loss_critic                     291763.450963
loss_actor                        1767.485943
memory_size                         333244.61 

=== epoch 5/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   58
episode_length                     156.448276
returns                           -178.274673
return_std                         340.091143
average_reward                      -1.173327
round_time             0 days 00:11:06.152573
episodes_test                            23.0
episode_length_test                409.695652
returns_test                      -248.202342
return_std_test                    215.827254
average_reward_test                 -0.602478
round_time_test        0 days 00:00:11.062415
round_time_total       0 days 00:11:06.153763
loss_total                       156025.93604
loss_critic                     194635.720854
loss_actor                        1586.783338
memory_size                        335110.445 

=== epoch 5/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   55
episode_length                     157.963636
returns                           -157.931813
return_std                         272.333516
average_reward                      -1.023154
round_time             0 days 00:11:03.266563
episodes_test                            18.0
episode_length_test                543.777778
returns_test                      -422.376638
return_std_test                    397.282282
average_reward_test                 -0.783423
round_time_test        0 days 00:00:10.937852
round_time_total       0 days 00:11:03.267692
loss_total                       90509.793559
loss_critic                     112770.160812
loss_actor                         1468.31666
memory_size                        336945.063 

=== epoch 5/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   46
episode_length                     212.413043
returns                           -187.116303
return_std                          312.12809
average_reward                      -0.879056
round_time             0 days 00:11:01.169119
episodes_test                            14.0
episode_length_test                680.142857
returns_test                      -484.352846
return_std_test                    376.544926
average_reward_test                 -0.721702
round_time_test        0 days 00:00:11.177848
round_time_total       0 days 00:11:01.170616
loss_total                       39731.337814
loss_critic                      49320.311633
loss_actor                        1375.438892
memory_size                       338840.6325 

=== epoch 5/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:39,  2.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   38
episode_length                     259.421053
returns                           -200.428732
return_std                          297.05525
average_reward                      -0.776381
round_time             0 days 00:11:02.886259
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -339.098961
return_std_test                    183.497697
average_reward_test                 -0.339099
round_time_test        0 days 00:00:11.178830
round_time_total       0 days 00:11:02.887577
loss_total                       19901.661464
loss_critic                      24568.804603
loss_actor                        1233.087246
memory_size                       340593.2155 

=== epoch 5/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:01,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   35
episode_length                     251.485714
returns                           -173.308036
return_std                         251.943741
average_reward                       -0.67299
round_time             0 days 00:11:01.820065
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       -53.692474
return_std_test                      49.62099
average_reward_test                 -0.053692
round_time_test        0 days 00:00:11.270148
round_time_total       0 days 00:11:01.821395
loss_total                       14635.642134
loss_critic                      18018.935786
loss_actor                        1102.466219
memory_size                        342392.926 

=== epoch 5/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:40,  2.63it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   34
episode_length                     278.970588
returns                           -169.543752
return_std                         220.642695
average_reward                      -0.626982
round_time             0 days 00:11:03.042313
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        35.837541
return_std_test                     15.969891
average_reward_test                  0.035838
round_time_test        0 days 00:00:11.284574
round_time_total       0 days 00:11:03.043450
loss_total                       11599.856628
loss_critic                      14251.610669
loss_actor                         992.839396
memory_size                        344311.871 

=== epoch 5/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   36
episode_length                     242.888889
returns                           -141.090325
return_std                         191.633034
average_reward                      -0.577488
round_time             0 days 00:11:03.688240
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       356.966003
return_std_test                     12.414353
average_reward_test                  0.356966
round_time_test        0 days 00:00:11.261312
round_time_total       0 days 00:11:03.689356
loss_total                        9593.925143
loss_critic                       11768.79775
loss_actor                         894.433909
memory_size                        346093.201 

=== epoch 5/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   35
episode_length                     268.028571
returns                            -148.53813
return_std                         193.954208
average_reward                      -0.548561
round_time             0 days 00:11:04.491855
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       193.887392
return_std_test                     68.105808
average_reward_test                  0.193887
round_time_test        0 days 00:00:11.459052
round_time_total       0 days 00:11:04.492942
loss_total                        8127.145297
loss_critic                       9955.890264
loss_actor                         812.164707
memory_size                          347975.5 

=== epoch 5/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:40,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.00it/s]
episodes                                   22
episode_length                     385.181818
returns                           -211.371042
return_std                         242.331542
average_reward                      -0.543918
round_time             0 days 00:11:06.245290
episodes_test                            10.0
episode_length_test                     903.2
returns_test                       218.392546
return_std_test                    129.316813
average_reward_test                  0.240521
round_time_test        0 days 00:00:11.413411
round_time_total       0 days 00:11:06.246393
loss_total                        6729.109664
loss_critic                       8225.317739
loss_actor                         744.276784
memory_size                        349891.298 

=== epoch 5/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                   25
episode_length                         394.12
returns                           -219.716199
return_std                         242.818486
average_reward                      -0.554915
round_time             0 days 00:11:01.655729
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -280.480567
return_std_test                     24.815252
average_reward_test                 -0.280481
round_time_test        0 days 00:00:10.989010
round_time_total       0 days 00:11:01.656820
loss_total                        6017.641161
loss_critic                       7350.230235
loss_actor                         687.284364
memory_size                       351794.0095 

=== epoch 5/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:42,  2.62it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   23
episode_length                     385.434783
returns                           -203.422383
return_std                         215.538626
average_reward                      -0.552605
round_time             0 days 00:11:03.552302
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       364.344138
return_std_test                    155.124805
average_reward_test                  0.364344
round_time_test        0 days 00:00:11.310904
round_time_total       0 days 00:11:03.553395
loss_total                        5327.741814
loss_critic                       6502.248093
loss_actor                         629.716277
memory_size                       353662.6745 

=== epoch 5/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   22
episode_length                     418.909091
returns                           -230.127291
return_std                         218.854587
average_reward                      -0.555401
round_time             0 days 00:11:01.466360
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -183.479802
return_std_test                     67.655008
average_reward_test                  -0.18348
round_time_test        0 days 00:00:11.235815
round_time_total       0 days 00:11:01.467544
loss_total                        4795.637045
loss_critic                       5850.225944
loss_actor                         577.281035
memory_size                       355476.0145 

=== epoch 5/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:21,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   21
episode_length                     433.333333
returns                            -252.59429
return_std                         240.887225
average_reward                      -0.578834
round_time             0 days 00:11:04.299680
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -219.258069
return_std_test                     59.584773
average_reward_test                 -0.219258
round_time_test        0 days 00:00:11.496503
round_time_total       0 days 00:11:04.300923
loss_total                         4374.25552
loss_critic                       5335.511601
loss_actor                          529.23082
memory_size                       357414.5495 

=== epoch 5/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   20
episode_length                          427.8
returns                           -259.177161
return_std                         237.652234
average_reward                      -0.589148
round_time             0 days 00:11:01.699102
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                      -379.213613
return_std_test                     113.73315
average_reward_test                 -0.379214
round_time_test        0 days 00:00:11.209944
round_time_total       0 days 00:11:01.700218
loss_total                        3933.469266
loss_critic                       4794.724493
loss_actor                         488.448027
memory_size                       359342.3865 

=== epoch 5/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:01,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   24
episode_length                     405.666667
returns                           -241.434542
return_std                          241.26679
average_reward                      -0.594035
round_time             0 days 00:11:01.120531
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         0.584417
return_std_test                     80.740376
average_reward_test                  0.000584
round_time_test        0 days 00:00:11.381241
round_time_total       0 days 00:11:01.121643
loss_total                        3599.025536
loss_critic                       4384.526916
loss_actor                         457.019696
memory_size                       361178.2575 

=== epoch 5/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:19,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   21
episode_length                     439.666667
returns                           -264.372458
return_std                         253.415638
average_reward                      -0.604822
round_time             0 days 00:11:01.945862
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       -36.030371
return_std_test                     64.632666
average_reward_test                  -0.03603
round_time_test        0 days 00:00:11.198263
round_time_total       0 days 00:11:01.947131
loss_total                        3370.334151
loss_critic                       4107.154784
loss_actor                         423.051341
memory_size                        363041.395 

=== epoch 5/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:50,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   20
episode_length                         441.65
returns                           -265.738819
return_std                         256.314124
average_reward                      -0.601419
round_time             0 days 00:11:01.319519
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        42.048833
return_std_test                     51.918819
average_reward_test                  0.042049
round_time_test        0 days 00:00:11.262138
round_time_total       0 days 00:11:01.320607
loss_total                        3089.538113
loss_critic                       3763.145677
loss_actor                         395.107596
memory_size                        364957.512 

=== epoch 5/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<13:46,  2.41it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   24
episode_length                        385.875
returns                           -218.399562
return_std                          239.20795
average_reward                      -0.577574
round_time             0 days 00:11:02.195595
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        71.428269
return_std_test                     86.526813
average_reward_test                  0.071428
round_time_test        0 days 00:00:11.303847
round_time_total       0 days 00:11:02.196900
loss_total                        2837.654038
loss_critic                       3453.888837
loss_actor                         372.714584
memory_size                       366868.0955 

=== epoch 5/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   25
episode_length                         362.92
returns                           -203.715645
return_std                         247.675341
average_reward                      -0.561191
round_time             0 days 00:11:06.079397
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        -1.916294
return_std_test                     92.692445
average_reward_test                 -0.001916
round_time_test        0 days 00:00:11.451236
round_time_total       0 days 00:11:06.080517
loss_total                        2687.352777
loss_critic                       3270.687352
loss_actor                         354.014266
memory_size                       368713.4725 

=== epoch 5/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   21
episode_length                     470.333333
returns                           -254.736821
return_std                         256.056521
average_reward                      -0.543298
round_time             0 days 00:11:04.621356
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       361.146909
return_std_test                    130.135383
average_reward_test                  0.361147
round_time_test        0 days 00:00:11.359793
round_time_total       0 days 00:11:04.622476
loss_total                        2581.620145
loss_critic                       3143.121328
loss_actor                         335.615201
memory_size                       370615.5775 

=== epoch 5/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:15,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   21
episode_length                     439.904762
returns                           -227.633082
return_std                          238.47362
average_reward                       -0.52184
round_time             0 days 00:11:05.436923
episodes_test                            10.0
episode_length_test                     928.3
returns_test                       307.136954
return_std_test                     97.089203
average_reward_test                  0.332437
round_time_test        0 days 00:00:11.320962
round_time_total       0 days 00:11:05.438022
loss_total                        2480.765411
loss_critic                       3021.530453
loss_actor                         317.705023
memory_size                       372506.2825 

=== epoch 5/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   24
episode_length                     402.541667
returns                           -203.372634
return_std                         217.318554
average_reward                      -0.509463
round_time             0 days 00:11:01.844110
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       267.118042
return_std_test                      52.66319
average_reward_test                  0.267118
round_time_test        0 days 00:00:11.205341
round_time_total       0 days 00:11:01.845189
loss_total                        2325.495452
loss_critic                       2832.314352
loss_actor                         298.219671
memory_size                        374404.295 

=== epoch 5/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   31
episode_length                      294.83871
returns                           -131.311618
return_std                         170.013132
average_reward                      -0.462767
round_time             0 days 00:11:03.624570
episodes_test                            10.0
episode_length_test                     979.0
returns_test                       393.134651
return_std_test                     44.998134
average_reward_test                  0.403038
round_time_test        0 days 00:00:11.349568
round_time_total       0 days 00:11:03.625953
loss_total                        2222.519363
loss_critic                       2707.575004
loss_actor                         282.296603
memory_size                        376142.808 

=== epoch 5/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:57,  2.57it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   32
episode_length                       280.8125
returns                           -126.865437
return_std                         166.481752
average_reward                      -0.451502
round_time             0 days 00:11:04.031180
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       291.869409
return_std_test                     34.448502
average_reward_test                  0.291869
round_time_test        0 days 00:00:11.450525
round_time_total       0 days 00:11:04.032303
loss_total                         2096.60352
loss_critic                       2553.949661
loss_actor                         267.218769
memory_size                       377930.3885 

=== epoch 5/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                   33
episode_length                     289.939394
returns                           -127.441654
return_std                         181.585844
average_reward                      -0.439539
round_time             0 days 00:11:01.596397
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       432.471511
return_std_test                     50.084132
average_reward_test                  0.432472
round_time_test        0 days 00:00:11.363663
round_time_total       0 days 00:11:01.597493
loss_total                        2013.917059
loss_critic                       2454.398062
loss_actor                         251.992884
memory_size                       379773.5315 

=== epoch 5/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   35
episode_length                     265.342857
returns                            -109.60274
return_std                          167.45375
average_reward                      -0.421679
round_time             0 days 00:11:05.320779
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       412.133241
return_std_test                     47.951354
average_reward_test                  0.412133
round_time_test        0 days 00:00:11.262287
round_time_total       0 days 00:11:05.322126
loss_total                          1894.0139
loss_critic                       2307.882302
loss_actor                         238.540141
memory_size                        381668.847 

=== epoch 5/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   36
episode_length                     275.305556
returns                           -107.586517
return_std                         158.515434
average_reward                      -0.392023
round_time             0 days 00:11:02.566068
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       131.112165
return_std_test                     81.782484
average_reward_test                  0.131112
round_time_test        0 days 00:00:11.450647
round_time_total       0 days 00:11:02.567610
loss_total                        1815.402085
loss_critic                       2213.293516
loss_actor                         223.836208
memory_size                       383521.4155 

=== epoch 5/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:07,  2.74it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   27
episode_length                     312.407407
returns                           -120.590758
return_std                         160.045148
average_reward                      -0.399338
round_time             0 days 00:11:05.666116
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       466.157295
return_std_test                     95.039014
average_reward_test                  0.466157
round_time_test        0 days 00:00:11.272672
round_time_total       0 days 00:11:05.667221
loss_total                        1755.734763
loss_critic                       2142.202123
loss_actor                         209.865178
memory_size                        385359.517 

=== epoch 5/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:02<00:00,  3.02it/s]
episodes                                   34
episode_length                     281.882353
returns                           -109.644868
return_std                         144.839479
average_reward                      -0.395427
round_time             0 days 00:11:03.403545
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       319.213399
return_std_test                     63.800081
average_reward_test                  0.319213
round_time_test        0 days 00:00:11.414198
round_time_total       0 days 00:11:03.404909
loss_total                        1672.817935
loss_critic                       2042.042499
loss_actor                         195.919547
memory_size                        387213.497 

=== epoch 5/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   44
episode_length                         209.25
returns                            -76.970798
return_std                         103.480861
average_reward                      -0.378372
round_time             0 days 00:11:03.697366
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        333.71893
return_std_test                     38.909754
average_reward_test                  0.333719
round_time_test        0 days 00:00:11.344858
round_time_total       0 days 00:11:03.698448
loss_total                        1622.274395
loss_critic                       1982.269943
loss_actor                         182.292065
memory_size                         388867.63 

=== epoch 5/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:10,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   45
episode_length                          200.8
returns                            -76.400547
return_std                          108.76065
average_reward                      -0.376175
round_time             0 days 00:11:09.362714
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       397.854979
return_std_test                     41.686798
average_reward_test                  0.397855
round_time_test        0 days 00:00:11.168707
round_time_total       0 days 00:11:09.363827
loss_total                        1561.616049
loss_critic                       1909.289391
loss_actor                         170.922555
memory_size                        390583.169 

=== epoch 5/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   41
episode_length                     220.682927
returns                            -81.760241
return_std                         119.774583
average_reward                      -0.367141
round_time             0 days 00:11:08.899665
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       282.620775
return_std_test                     29.272766
average_reward_test                  0.282621
round_time_test        0 days 00:00:11.512946
round_time_total       0 days 00:11:08.900766
loss_total                         1513.38465
loss_critic                       1852.191503
loss_actor                         158.157117
memory_size                       392454.2185 

=== epoch 5/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:42,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   42
episode_length                     211.738095
returns                            -74.285753
return_std                         107.828311
average_reward                      -0.356779
round_time             0 days 00:11:09.563309
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       117.670017
return_std_test                     34.568814
average_reward_test                   0.11767
round_time_test        0 days 00:00:11.276131
round_time_total       0 days 00:11:09.564430
loss_total                        1457.757058
loss_critic                       1786.047713
loss_actor                         144.594315
memory_size                        394339.731 

=== epoch 5/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:51,  2.81it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
episodes                                   41
episode_length                     240.439024
returns                            -76.870622
return_std                          116.30329
average_reward                      -0.317452
round_time             0 days 00:11:10.761846
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        388.36927
return_std_test                     46.526256
average_reward_test                  0.388369
round_time_test        0 days 00:00:11.244941
round_time_total       0 days 00:11:10.763016
loss_total                        1412.473521
loss_critic                       1732.452907
loss_actor                         132.555864
memory_size                       396167.9135 

=== epoch 5/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:15,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:08<00:00,  2.99it/s]
episodes                                   36
episode_length                     268.777778
returns                            -79.806082
return_std                         120.448315
average_reward                      -0.296117
round_time             0 days 00:11:09.496608
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       349.151778
return_std_test                     22.360055
average_reward_test                  0.349152
round_time_test        0 days 00:00:11.385458
round_time_total       0 days 00:11:09.497700
loss_total                        1366.039778
loss_critic                        1676.83153
loss_actor                         122.872653
memory_size                       397883.0525 

=== epoch 5/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:11<00:00,  2.98it/s]
episodes                                   35
episode_length                          264.4
returns                            -73.166022
return_std                          99.109424
average_reward                      -0.285074
round_time             0 days 00:11:11.587943
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       482.615709
return_std_test                     47.047254
average_reward_test                  0.482616
round_time_test        0 days 00:00:11.220247
round_time_total       0 days 00:11:11.589051
loss_total                        1324.799192
loss_critic                       1628.174572
loss_actor                         111.297555
memory_size                        399688.585 

=== epoch 5/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:58,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:12<00:00,  2.98it/s]
episodes                                   38
episode_length                     254.973684
returns                             -71.07039
return_std                          95.525734
average_reward                      -0.283356
round_time             0 days 00:11:12.781110
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       214.523807
return_std_test                     16.373449
average_reward_test                  0.214524
round_time_test        0 days 00:00:10.951199
round_time_total       0 days 00:11:12.782217
loss_total                        1281.263178
loss_critic                       1576.857557
loss_actor                           98.88555
memory_size                       401564.4845 

=== epoch 5/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:12,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:09<00:00,  2.99it/s]
episodes                                   39
episode_length                     245.410256
returns                            -66.235403
return_std                          86.253003
average_reward                      -0.271717
round_time             0 days 00:11:10.230797
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        126.71268
return_std_test                     33.317841
average_reward_test                  0.126713
round_time_test        0 days 00:00:11.188819
round_time_total       0 days 00:11:10.231907
loss_total                        1278.694218
loss_critic                       1577.072539
loss_actor                          85.180817
memory_size                       403370.3285 

=== epoch 5/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:27,  2.90it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:06<00:00,  3.00it/s]
episodes                                   35
episode_length                     277.885714
returns                            -79.285858
return_std                         102.032411
average_reward                      -0.277695
round_time             0 days 00:11:07.303897
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       287.350489
return_std_test                      21.64572
average_reward_test                   0.28735
round_time_test        0 days 00:00:11.080304
round_time_total       0 days 00:11:07.305002
loss_total                        1234.055027
loss_critic                       1524.018358
loss_actor                          74.201586
memory_size                        405213.295 

=== epoch 5/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:54,  2.80it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:10<00:00,  2.98it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   31
episode_length                     289.870968
returns                            -83.678662
return_std                         112.058332
average_reward                      -0.283736
round_time             0 days 00:11:11.138656
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       253.304786
return_std_test                     72.417808
average_reward_test                  0.253305
round_time_test        0 days 00:00:11.290409
round_time_total       0 days 00:11:11.139756
loss_total                        1232.807133
loss_critic                       1525.063115
loss_actor                          63.783098
memory_size                        407042.052 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 6/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:02<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                    6
episode_length                     263.333333
returns                            -77.709938
return_std                         101.165879
average_reward                      -0.276728
round_time             0 days 00:10:58.415315
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       355.081985
return_std_test                     22.127968
average_reward_test                  0.355082
round_time_test        0 days 00:00:11.270966
round_time_total       0 days 00:10:58.416614
loss_total                        1182.159443
loss_critic                       1464.777313
loss_actor                          51.687863
memory_size                       408886.5525 

=== epoch 6/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   13
episode_length                     268.230769
returns                            -67.263639
return_std                           98.87899
average_reward                      -0.264159
round_time             0 days 00:10:59.230292
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       298.508094
return_std_test                     17.779988
average_reward_test                  0.298508
round_time_test        0 days 00:00:11.111932
round_time_total       0 days 00:10:59.231387
loss_total                        1173.544197
loss_critic                       1456.647416
loss_actor                          41.131228
memory_size                       410731.2025 

=== epoch 6/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   18
episode_length                     314.222222
returns                            -83.203068
return_std                         109.496638
average_reward                      -0.262867
round_time             0 days 00:11:02.129589
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       449.875268
return_std_test                     16.119796
average_reward_test                  0.449875
round_time_test        0 days 00:00:11.234519
round_time_total       0 days 00:11:02.130943
loss_total                         1151.73229
loss_critic                       1432.043571
loss_actor                           30.48707
memory_size                       412592.5605 

=== epoch 6/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:35,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   22
episode_length                     361.909091
returns                            -94.157971
return_std                         118.819852
average_reward                      -0.258466
round_time             0 days 00:11:01.379246
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       377.563505
return_std_test                     13.811987
average_reward_test                  0.377564
round_time_test        0 days 00:00:11.278240
round_time_total       0 days 00:11:01.380358
loss_total                        1133.642796
loss_critic                       1412.475231
loss_actor                          18.312961
memory_size                        414485.379 

=== epoch 6/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                   32
episode_length                      309.46875
returns                            -73.001288
return_std                         106.529662
average_reward                      -0.237053
round_time             0 days 00:10:56.560368
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       237.794831
return_std_test                     14.008825
average_reward_test                  0.237795
round_time_test        0 days 00:00:11.000102
round_time_total       0 days 00:10:56.561847
loss_total                        1132.578478
loss_critic                       1414.029374
loss_actor                           6.774796
memory_size                        416328.975 

=== epoch 6/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   28
episode_length                     323.642857
returns                             -73.22015
return_std                         107.545389
average_reward                      -0.232456
round_time             0 days 00:10:59.411085
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       427.457209
return_std_test                     13.014907
average_reward_test                  0.427457
round_time_test        0 days 00:00:11.239128
round_time_total       0 days 00:10:59.412191
loss_total                        1126.518087
loss_critic                       1409.037541
loss_actor                          -3.559833
memory_size                        418140.239 

=== epoch 6/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.03it/s]
episodes                                   26
episode_length                     337.115385
returns                            -71.804439
return_std                         105.732798
average_reward                      -0.221479
round_time             0 days 00:11:01.555452
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       384.906291
return_std_test                     34.528362
average_reward_test                  0.384906
round_time_test        0 days 00:00:11.256306
round_time_total       0 days 00:11:01.556580
loss_total                        1133.449392
loss_critic                       1420.504072
loss_actor                         -14.769426
memory_size                       420030.5085 

=== epoch 6/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:05,  2.75it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   27
episode_length                     345.259259
returns                            -69.306726
return_std                         106.213242
average_reward                      -0.206585
round_time             0 days 00:11:00.827801
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       235.111261
return_std_test                     18.574183
average_reward_test                  0.235111
round_time_test        0 days 00:00:11.088016
round_time_total       0 days 00:11:00.828873
loss_total                        1101.361752
loss_critic                       1382.623226
loss_actor                          -23.68424
memory_size                        421886.764 

=== epoch 6/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:04<00:00,  3.01it/s]
episodes                                   29
episode_length                     308.448276
returns                            -57.914376
return_std                          94.524057
average_reward                      -0.186344
round_time             0 days 00:11:05.065332
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       321.552201
return_std_test                     28.670665
average_reward_test                  0.321552
round_time_test        0 days 00:00:11.267731
round_time_total       0 days 00:11:05.066428
loss_total                        1118.337921
loss_critic                       1406.366722
loss_actor                         -33.777381
memory_size                       423729.4905 

=== epoch 6/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.02it/s]
episodes                                   21
episode_length                     403.333333
returns                            -75.303815
return_std                          99.844412
average_reward                      -0.200089
round_time             0 days 00:11:03.863692
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       252.716409
return_std_test                     18.273537
average_reward_test                  0.252716
round_time_test        0 days 00:00:11.009412
round_time_total       0 days 00:11:03.864831
loss_total                        1114.471686
loss_critic                       1404.043839
loss_actor                         -43.817024
memory_size                        425636.523 

=== epoch 6/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:03<00:00,  3.01it/s]
episodes                                   27
episode_length                     335.333333
returns                            -69.876617
return_std                          86.846863
average_reward                      -0.219716
round_time             0 days 00:11:04.278738
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        265.68731
return_std_test                     56.970385
average_reward_test                  0.265687
round_time_test        0 days 00:00:11.455584
round_time_total       0 days 00:11:04.279851
loss_total                        1108.297871
loss_critic                       1398.753292
loss_actor                         -53.523904
memory_size                        427506.399 

=== epoch 6/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:57,  2.78it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:01<00:00,  3.02it/s]
episodes                                   29
episode_length                      335.62069
returns                            -76.003732
return_std                          98.147053
average_reward                      -0.228837
round_time             0 days 00:11:01.917185
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       364.254413
return_std_test                      9.569939
average_reward_test                  0.364254
round_time_test        0 days 00:00:11.103405
round_time_total       0 days 00:11:01.918290
loss_total                        1109.562951
loss_critic                       1402.703109
loss_actor                         -62.997787
memory_size                        429314.708 

=== epoch 6/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:00<00:00,  3.03it/s]
episodes                                   27
episode_length                     326.074074
returns                            -82.269188
return_std                          93.459669
average_reward                      -0.244472
round_time             0 days 00:11:01.103159
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       434.274349
return_std_test                      35.96131
average_reward_test                  0.434274
round_time_test        0 days 00:00:11.381398
round_time_total       0 days 00:11:01.104237
loss_total                        1128.181848
loss_critic                       1428.465288
loss_actor                         -72.952009
memory_size                         431144.39 

=== epoch 6/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   27
episode_length                     356.444444
returns                            -87.026656
return_std                          96.152869
average_reward                      -0.243454
round_time             0 days 00:10:58.350002
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       311.473894
return_std_test                     11.508406
average_reward_test                  0.311474
round_time_test        0 days 00:00:11.118767
round_time_total       0 days 00:10:58.351111
loss_total                        1116.810264
loss_critic                       1416.659627
loss_actor                         -82.587292
memory_size                       433032.9965 

=== epoch 6/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:16,  2.95it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   35
episode_length                     271.028571
returns                            -63.385979
return_std                          80.673834
average_reward                       -0.23361
round_time             0 days 00:10:58.508419
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       300.284571
return_std_test                     30.331856
average_reward_test                  0.300285
round_time_test        0 days 00:00:11.171768
round_time_total       0 days 00:10:58.509700
loss_total                        1122.116648
loss_critic                        1425.26995
loss_actor                         -90.496655
memory_size                       434835.9545 

=== epoch 6/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:11,  3.26it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:58<00:00,  3.04it/s]
episodes                                   33
episode_length                     281.848485
returns                            -55.575015
return_std                          70.045986
average_reward                      -0.203821
round_time             0 days 00:10:59.499329
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       495.663627
return_std_test                     14.107895
average_reward_test                  0.495664
round_time_test        0 days 00:00:11.081386
round_time_total       0 days 00:10:59.500450
loss_total                        1136.925727
loss_critic                       1445.854311
loss_actor                         -98.788708
memory_size                       436639.1235 

=== epoch 6/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   31
episode_length                     302.806452
returns                            -55.516163
return_std                          61.841566
average_reward                      -0.188836
round_time             0 days 00:10:54.065584
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       460.756568
return_std_test                     16.794449
average_reward_test                  0.460757
round_time_test        0 days 00:00:11.291703
round_time_total       0 days 00:10:54.066693
loss_total                        1127.100601
loss_critic                       1435.440674
loss_actor                        -106.259795
memory_size                       438466.8005 

=== epoch 6/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:57<00:00,  3.04it/s]
episodes                                   37
episode_length                     267.081081
returns                            -46.657509
return_std                          62.331468
average_reward                      -0.178277
round_time             0 days 00:10:57.791518
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       479.747502
return_std_test                     20.778894
average_reward_test                  0.479748
round_time_test        0 days 00:00:11.145287
round_time_total       0 days 00:10:57.792619
loss_total                        1129.177643
loss_critic                       1439.776758
loss_actor                        -113.218915
memory_size                        440345.487 

=== epoch 6/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                   38
episode_length                     258.447368
returns                            -50.795607
return_std                          64.765901
average_reward                      -0.199143
round_time             0 days 00:10:56.530056
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       468.384385
return_std_test                     13.920574
average_reward_test                  0.468384
round_time_test        0 days 00:00:11.398276
round_time_total       0 days 00:10:56.531530
loss_total                         1127.05297
loss_critic                       1438.848111
loss_actor                        -120.127694
memory_size                        442097.258 

=== epoch 6/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:17,  2.71it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                   33
episode_length                     270.969697
returns                            -57.050043
return_std                          71.058724
average_reward                      -0.200203
round_time             0 days 00:10:49.293642
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       413.417228
return_std_test                     19.214276
average_reward_test                  0.413417
round_time_test        0 days 00:00:11.062024
round_time_total       0 days 00:10:49.294775
loss_total                        1123.651609
loss_critic                        1436.26809
loss_actor                        -126.814414
memory_size                        443938.075 

=== epoch 6/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:49<00:00,  3.08it/s]
episodes                                   33
episode_length                     272.515152
returns                            -51.487456
return_std                          76.438267
average_reward                      -0.184034
round_time             0 days 00:10:49.995589
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       535.611571
return_std_test                     16.225633
average_reward_test                  0.535612
round_time_test        0 days 00:00:11.035784
round_time_total       0 days 00:10:49.996679
loss_total                        1145.512592
loss_critic                       1465.249899
loss_actor                        -133.436731
memory_size                        445789.002 

=== epoch 6/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:50<00:00,  3.07it/s]
episodes                                   33
episode_length                     287.151515
returns                            -50.656635
return_std                           66.28075
average_reward                      -0.177949
round_time             0 days 00:10:51.128326
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       419.722279
return_std_test                     17.219174
average_reward_test                  0.419722
round_time_test        0 days 00:00:11.245372
round_time_total       0 days 00:10:51.129406
loss_total                        1164.605148
loss_critic                       1490.615823
loss_actor                        -139.437658
memory_size                        447646.987 

=== epoch 6/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                   32
episode_length                      296.90625
returns                            -52.068615
return_std                          67.654077
average_reward                      -0.181055
round_time             0 days 00:10:48.951844
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       350.642293
return_std_test                     17.183085
average_reward_test                  0.350642
round_time_test        0 days 00:00:11.134860
round_time_total       0 days 00:10:48.952919
loss_total                        1157.982839
loss_critic                       1483.989757
loss_actor                        -146.044928
memory_size                        449474.135 

=== epoch 6/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:49<00:00,  3.08it/s]
episodes                                   28
episode_length                     348.642857
returns                            -56.699239
return_std                          73.505303
average_reward                      -0.167593
round_time             0 days 00:10:49.752458
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       521.989651
return_std_test                     29.901077
average_reward_test                   0.52199
round_time_test        0 days 00:00:11.170186
round_time_total       0 days 00:10:49.753565
loss_total                        1171.643345
loss_critic                       1502.663272
loss_actor                        -152.436466
memory_size                        451334.752 

=== epoch 6/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:52,  2.58it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:50<00:00,  3.08it/s]
episodes                                   34
episode_length                     270.705882
returns                            -44.137074
return_std                          63.840723
average_reward                      -0.165256
round_time             0 days 00:10:50.829127
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       511.206074
return_std_test                      23.41363
average_reward_test                  0.511206
round_time_test        0 days 00:00:11.110392
round_time_total       0 days 00:10:50.830208
loss_total                        1184.444498
loss_critic                       1520.257186
loss_actor                        -158.806356
memory_size                       453166.2625 

=== epoch 6/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:50<00:00,  3.08it/s]
episodes                                   33
episode_length                     295.484848
returns                            -46.020407
return_std                          60.590127
average_reward                      -0.156542
round_time             0 days 00:10:50.553059
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        344.28024
return_std_test                     52.485153
average_reward_test                   0.34428
round_time_test        0 days 00:00:11.037012
round_time_total       0 days 00:10:50.554304
loss_total                        1200.170903
loss_critic                        1540.99103
loss_actor                        -163.109704
memory_size                       454931.3515 

=== epoch 6/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:31,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                   30
episode_length                     326.333333
returns                            -57.415633
return_std                          81.104367
average_reward                      -0.180852
round_time             0 days 00:10:49.069699
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       518.995832
return_std_test                     17.063665
average_reward_test                  0.518996
round_time_test        0 days 00:00:11.297322
round_time_total       0 days 00:10:49.070892
loss_total                        1176.268878
loss_critic                       1512.335501
loss_actor                        -167.997725
memory_size                        456842.731 

=== epoch 6/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:49<00:00,  3.08it/s]
episodes                                   31
episode_length                     288.258065
returns                            -47.049885
return_std                          77.281076
average_reward                      -0.162231
round_time             0 days 00:10:50.532076
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       783.016451
return_std_test                     79.655681
average_reward_test                  0.783016
round_time_test        0 days 00:00:11.010148
round_time_total       0 days 00:10:50.533155
loss_total                        1194.319655
loss_critic                       1535.780643
loss_actor                        -171.524409
memory_size                       458693.5785 

=== epoch 6/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:21,  2.69it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:49<00:00,  3.08it/s]
episodes                                   36
episode_length                     271.861111
returns                            -41.963911
return_std                          72.519604
average_reward                      -0.149575
round_time             0 days 00:10:50.059361
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       588.813117
return_std_test                    107.840253
average_reward_test                  0.588813
round_time_test        0 days 00:00:11.215549
round_time_total       0 days 00:10:50.060469
loss_total                        1185.201201
loss_critic                       1525.431114
loss_actor                        -175.718553
memory_size                        460469.885 

=== epoch 6/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:48<00:00,  3.08it/s]
episodes                                   38
episode_length                     257.947368
returns                            -41.105045
return_std                          75.962325
average_reward                       -0.15427
round_time             0 days 00:10:49.272148
episodes_test                            10.0
episode_length_test                     900.8
returns_test                        476.83858
return_std_test                    159.152391
average_reward_test                  0.528816
round_time_test        0 days 00:00:11.123883
round_time_total       0 days 00:10:49.273471
loss_total                        1192.406289
loss_critic                        1535.35271
loss_actor                         -179.37951
memory_size                       462306.2615 

=== epoch 6/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:51<00:00,  3.07it/s]
episodes                                   43
episode_length                     208.860465
returns                            -30.531146
return_std                          66.932674
average_reward                      -0.150458
round_time             0 days 00:10:52.329446
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       477.976273
return_std_test                      6.313609
average_reward_test                  0.477976
round_time_test        0 days 00:00:11.062271
round_time_total       0 days 00:10:52.330532
loss_total                        1205.470253
loss_critic                       1552.805902
loss_actor                        -183.872445
memory_size                       463895.9225 

=== epoch 6/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:51<00:00,  3.07it/s]
episodes                                   47
episode_length                     190.297872
returns                            -22.865288
return_std                           55.07673
average_reward                      -0.122297
round_time             0 days 00:10:52.315279
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       381.554278
return_std_test                     72.142145
average_reward_test                  0.381554
round_time_test        0 days 00:00:10.876677
round_time_total       0 days 00:10:52.316426
loss_total                        1215.234803
loss_critic                       1566.063395
loss_actor                        -188.079679
memory_size                          465681.3 

=== epoch 6/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:56<00:00,  3.05it/s]
episodes                                   42
episode_length                     213.142857
returns                            -26.738733
return_std                          56.847963
average_reward                      -0.130944
round_time             0 days 00:10:56.599801
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       845.220549
return_std_test                     28.124538
average_reward_test                  0.845221
round_time_test        0 days 00:00:11.277120
round_time_total       0 days 00:10:56.601057
loss_total                        1231.083244
loss_critic                       1586.829131
loss_actor                        -191.900411
memory_size                       467551.5335 

=== epoch 6/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   40
episode_length                        218.275
returns                            -28.256196
return_std                          59.547053
average_reward                      -0.132171
round_time             0 days 00:10:54.382190
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       591.084017
return_std_test                     30.417941
average_reward_test                  0.591084
round_time_test        0 days 00:00:11.196979
round_time_total       0 days 00:10:54.383446
loss_total                        1228.723094
loss_critic                       1584.807627
loss_actor                        -195.615145
memory_size                         469435.89 

=== epoch 6/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:31,  2.65it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   31
episode_length                      317.83871
returns                            -36.858506
return_std                          66.469625
average_reward                      -0.115383
round_time             0 days 00:10:53.806698
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       664.174469
return_std_test                     67.113981
average_reward_test                  0.664174
round_time_test        0 days 00:00:10.841764
round_time_total       0 days 00:10:53.807778
loss_total                        1230.909755
loss_critic                       1588.306805
loss_actor                        -198.678564
memory_size                        471337.249 

=== epoch 6/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:26,  2.67it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                   24
episode_length                     382.916667
returns                            -50.078839
return_std                          76.803145
average_reward                      -0.131144
round_time             0 days 00:10:55.951364
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       463.835937
return_std_test                     49.274182
average_reward_test                  0.463836
round_time_test        0 days 00:00:11.104377
round_time_total       0 days 00:10:55.952453
loss_total                        1225.751493
loss_critic                       1582.953521
loss_actor                        -203.056726
memory_size                       473219.3535 

=== epoch 6/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:25,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   24
episode_length                          396.5
returns                            -54.580672
return_std                           79.61747
average_reward                      -0.138061
round_time             0 days 00:10:54.124371
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       414.055445
return_std_test                     29.222121
average_reward_test                  0.414055
round_time_test        0 days 00:00:11.002478
round_time_total       0 days 00:10:54.125675
loss_total                        1254.066073
loss_critic                       1619.363538
loss_actor                        -207.123889
memory_size                         475086.93 

=== epoch 6/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:13,  2.72it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   21
episode_length                     439.285714
returns                            -50.766211
return_std                          74.767902
average_reward                      -0.123554
round_time             0 days 00:10:53.703329
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       493.430008
return_std_test                     16.442329
average_reward_test                   0.49343
round_time_test        0 days 00:00:11.268232
round_time_total       0 days 00:10:53.704437
loss_total                        1257.555134
loss_critic                       1624.624822
loss_actor                        -210.723731
memory_size                       476991.7955 

=== epoch 6/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:48,  2.82it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   24
episode_length                         392.75
returns                            -43.464609
return_std                          77.629452
average_reward                       -0.11489
round_time             0 days 00:10:54.032950
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       703.281556
return_std_test                     48.394115
average_reward_test                  0.703282
round_time_test        0 days 00:00:11.254576
round_time_total       0 days 00:10:54.034152
loss_total                        1231.829622
loss_critic                       1593.390683
loss_actor                        -214.414739
memory_size                       478896.3855 

=== epoch 6/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   34
episode_length                     264.470588
returns                            -28.793956
return_std                          61.530565
average_reward                      -0.117365
round_time             0 days 00:10:54.134706
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       562.479323
return_std_test                     76.598571
average_reward_test                  0.562479
round_time_test        0 days 00:00:10.982520
round_time_total       0 days 00:10:54.135964
loss_total                        1267.579569
loss_critic                       1638.925023
loss_actor                        -217.802359
memory_size                        480646.107 

=== epoch 6/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:55<00:00,  3.05it/s]
episodes                                   40
episode_length                        225.625
returns                            -25.374184
return_std                          53.028776
average_reward                      -0.115147
round_time             0 days 00:10:55.942905
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       465.661093
return_std_test                     55.850984
average_reward_test                  0.465661
round_time_test        0 days 00:00:11.111589
round_time_total       0 days 00:10:55.944014
loss_total                        1277.761572
loss_critic                       1652.112956
loss_actor                        -219.644075
memory_size                        482279.132 

=== epoch 6/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:36,  2.64it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:56<00:00,  3.05it/s]
episodes                                   43
episode_length                     206.395349
returns                            -20.960082
return_std                          49.808068
average_reward                      -0.092468
round_time             0 days 00:10:57.367294
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       576.052232
return_std_test                     30.405578
average_reward_test                  0.576052
round_time_test        0 days 00:00:11.120039
round_time_total       0 days 00:10:57.368389
loss_total                         1277.43405
loss_critic                        1652.00975
loss_actor                        -220.868874
memory_size                        484049.236 

=== epoch 6/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:53<00:00,  3.06it/s]
episodes                                   46
episode_length                     203.978261
returns                            -20.748385
return_std                          45.901744
average_reward                       -0.10021
round_time             0 days 00:10:53.880634
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        753.50239
return_std_test                     43.425994
average_reward_test                  0.753502
round_time_test        0 days 00:00:11.196830
round_time_total       0 days 00:10:53.881737
loss_total                        1275.881638
loss_critic                       1650.588429
loss_actor                        -222.945644
memory_size                       485884.4195 

=== epoch 6/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:10,  2.73it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                   46
episode_length                     186.913043
returns                            -15.862383
return_std                          44.908688
average_reward                      -0.097976
round_time             0 days 00:10:54.732635
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       561.328333
return_std_test                     71.347676
average_reward_test                  0.561328
round_time_test        0 days 00:00:11.109172
round_time_total       0 days 00:10:54.733813
loss_total                        1284.050991
loss_critic                       1661.380964
loss_actor                         -225.26901
memory_size                       487704.1375 

=== epoch 6/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:06,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:56<00:00,  3.05it/s]
episodes                                   42
episode_length                     233.714286
returns                            -17.753134
return_std                          51.820275
average_reward                      -0.072066
round_time             0 days 00:10:56.514729
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        514.86218
return_std_test                     17.965351
average_reward_test                  0.514862
round_time_test        0 days 00:00:11.081492
round_time_total       0 days 00:10:56.515818
loss_total                        1301.022992
loss_critic                       1683.170288
loss_actor                        -227.566302
memory_size                       489482.1105 

=== epoch 6/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:54<00:00,  3.06it/s]
episodes                                   36
episode_length                     244.888889
returns                            -13.421735
return_std                          53.879065
average_reward                      -0.068958
round_time             0 days 00:10:55.045351
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       516.828848
return_std_test                     39.443315
average_reward_test                  0.516829
round_time_test        0 days 00:00:11.334351
round_time_total       0 days 00:10:55.046437
loss_total                        1283.064848
loss_critic                       1661.300979
loss_actor                        -229.879796
memory_size                         491313.36 

=== epoch 6/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:39,  2.85it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                   30
episode_length                     311.133333
returns                            -26.792343
return_std                          70.046668
average_reward                      -0.091948
round_time             0 days 00:10:19.011333
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       554.277819
return_std_test                     48.647454
average_reward_test                  0.554278
round_time_test        0 days 00:00:10.963942
round_time_total       0 days 00:10:19.012446
loss_total                        1316.200008
loss_critic                       1703.477456
loss_actor                        -232.909913
memory_size                       493216.9525 

=== epoch 6/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   35
episode_length                     278.542857
returns                            -22.249958
return_std                          61.132661
average_reward                      -0.072089
round_time             0 days 00:10:17.134684
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       644.978716
return_std_test                     86.197663
average_reward_test                  0.644979
round_time_test        0 days 00:00:11.119200
round_time_total       0 days 00:10:17.135764
loss_total                        1291.762583
loss_critic                       1673.530442
loss_actor                        -235.308972
memory_size                       495055.6525 

=== epoch 6/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   35
episode_length                     280.514286
returns                            -19.338597
return_std                          58.067196
average_reward                      -0.068422
round_time             0 days 00:10:16.706009
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       679.089216
return_std_test                     95.674345
average_reward_test                  0.679089
round_time_test        0 days 00:00:11.009059
round_time_total       0 days 00:10:16.707291
loss_total                        1292.410038
loss_critic                       1674.878172
loss_actor                        -237.462613
memory_size                       496892.7415 

=== epoch 6/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   31
episode_length                      292.16129
returns                            -21.964253
return_std                          63.720317
average_reward                      -0.082432
round_time             0 days 00:10:16.183727
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       376.905887
return_std_test                     22.722925
average_reward_test                  0.376906
round_time_test        0 days 00:00:11.043196
round_time_total       0 days 00:10:16.184830
loss_total                        1329.379292
loss_critic                       1722.048017
loss_actor                        -241.295724
memory_size                        498705.004 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 7/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:05<00:00,  3.30it/s]
episodes                                    7
episode_length                     252.571429
returns                            -21.736839
return_std                          78.739715
average_reward                      -0.094481
round_time             0 days 00:10:05.801079
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       454.607852
return_std_test                     69.652876
average_reward_test                  0.454608
round_time_test        0 days 00:00:10.962885
round_time_total       0 days 00:10:05.802187
loss_total                        1310.096761
loss_critic                       1698.625006
loss_actor                        -244.016328
memory_size                       500523.9405 

=== epoch 7/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                    9
episode_length                     345.111111
returns                            -26.001956
return_std                          94.728558
average_reward                       -0.09187
round_time             0 days 00:10:09.812209
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       502.671088
return_std_test                     15.748014
average_reward_test                  0.502671
round_time_test        0 days 00:00:11.100045
round_time_total       0 days 00:10:09.813288
loss_total                        1301.419491
loss_critic                       1688.381671
loss_actor                        -246.429348
memory_size                        502378.634 

=== epoch 7/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   12
episode_length                     444.666667
returns                             -48.19269
return_std                          96.611676
average_reward                      -0.110762
round_time             0 days 00:10:07.818598
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       590.328711
return_std_test                     92.460621
average_reward_test                  0.590329
round_time_test        0 days 00:00:11.084260
round_time_total       0 days 00:10:07.819679
loss_total                        1345.818848
loss_critic                       1744.408403
loss_actor                        -248.539499
memory_size                       504310.4435 

=== epoch 7/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   16
episode_length                       475.0625
returns                            -53.096052
return_std                          92.789838
average_reward                      -0.116299
round_time             0 days 00:10:07.990586
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       700.256166
return_std_test                     95.676473
average_reward_test                  0.700256
round_time_test        0 days 00:00:10.896769
round_time_total       0 days 00:10:07.991655
loss_total                        1335.228893
loss_critic                       1731.758051
loss_actor                        -250.887861
memory_size                       506235.1735 

=== epoch 7/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   18
episode_length                     533.388889
returns                            -62.305365
return_std                          92.182029
average_reward                      -0.112736
round_time             0 days 00:10:08.210547
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       370.556159
return_std_test                     64.013032
average_reward_test                  0.370556
round_time_test        0 days 00:00:11.057186
round_time_total       0 days 00:10:08.211644
loss_total                        1322.786551
loss_critic                       1716.469759
loss_actor                        -251.946396
memory_size                        508152.451 

=== epoch 7/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:26,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:06<00:00,  3.30it/s]
episodes                                   14
episode_length                     650.428571
returns                            -77.766323
return_std                          86.894639
average_reward                       -0.12347
round_time             0 days 00:10:07.228018
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       447.152015
return_std_test                     95.900747
average_reward_test                  0.447152
round_time_test        0 days 00:00:10.816316
round_time_total       0 days 00:10:07.229550
loss_total                        1343.647701
loss_critic                        1743.12902
loss_actor                        -254.277694
memory_size                       510075.7375 

=== epoch 7/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   21
episode_length                     451.190476
returns                            -62.565171
return_std                           70.83095
average_reward                       -0.13711
round_time             0 days 00:10:08.680287
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       573.374579
return_std_test                     77.884673
average_reward_test                  0.573375
round_time_test        0 days 00:00:10.930674
round_time_total       0 days 00:10:08.681543
loss_total                        1316.259585
loss_critic                       1709.327579
loss_actor                         -256.01251
memory_size                       511932.2065 

=== epoch 7/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   28
episode_length                     341.321429
returns                            -42.993571
return_std                          71.260259
average_reward                      -0.124546
round_time             0 days 00:10:09.655268
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       472.500345
return_std_test                     69.751662
average_reward_test                    0.4725
round_time_test        0 days 00:00:11.109293
round_time_total       0 days 00:10:09.656347
loss_total                        1339.081715
loss_critic                       1738.734098
loss_actor                        -259.527941
memory_size                       513711.8675 

=== epoch 7/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:09,  3.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:06<00:00,  3.30it/s]
episodes                                   34
episode_length                     264.029412
returns                            -23.494116
return_std                          61.619249
average_reward                       -0.10282
round_time             0 days 00:10:07.512942
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       493.569534
return_std_test                    115.890482
average_reward_test                   0.49357
round_time_test        0 days 00:00:10.986167
round_time_total       0 days 00:10:07.514026
loss_total                        1361.535393
loss_critic                       1767.069252
loss_actor                        -260.600156
memory_size                       515447.5555 

=== epoch 7/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   36
episode_length                     252.416667
returns                            -24.691578
return_std                           66.72397
average_reward                      -0.114059
round_time             0 days 00:10:08.222594
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       516.262399
return_std_test                     80.419829
average_reward_test                  0.516262
round_time_test        0 days 00:00:10.976542
round_time_total       0 days 00:10:08.223928
loss_total                        1361.169301
loss_critic                       1766.835813
loss_actor                        -261.496868
memory_size                       517282.3915 

=== epoch 7/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   40
episode_length                          242.1
returns                            -24.973625
return_std                           65.63387
average_reward                      -0.100377
round_time             0 days 00:10:08.716431
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       310.754232
return_std_test                     73.373584
average_reward_test                  0.310754
round_time_test        0 days 00:00:11.022135
round_time_total       0 days 00:10:08.717787
loss_total                        1371.166668
loss_critic                       1779.749553
loss_actor                        -263.164992
memory_size                        519104.657 

=== epoch 7/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.27it/s]
episodes                                   37
episode_length                     244.702703
returns                            -17.137453
return_std                          62.972893
average_reward                      -0.074986
round_time             0 days 00:10:11.292844
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       375.320374
return_std_test                     14.452307
average_reward_test                   0.37532
round_time_test        0 days 00:00:11.062838
round_time_total       0 days 00:10:11.293926
loss_total                         1365.81546
loss_critic                       1773.528079
loss_actor                         -265.03514
memory_size                       520926.1175 

=== epoch 7/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   37
episode_length                     259.621622
returns                            -19.959397
return_std                           62.32435
average_reward                      -0.067794
round_time             0 days 00:10:12.360428
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       371.435681
return_std_test                     29.838195
average_reward_test                  0.371436
round_time_test        0 days 00:00:10.990380
round_time_total       0 days 00:10:12.361527
loss_total                        1378.642857
loss_critic                       1790.031261
loss_actor                        -266.910877
memory_size                       522811.0435 

=== epoch 7/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   34
episode_length                     262.882353
returns                            -20.238356
return_std                          61.737824
average_reward                      -0.092636
round_time             0 days 00:10:13.589995
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       549.750635
return_std_test                     121.18166
average_reward_test                  0.549751
round_time_test        0 days 00:00:10.920978
round_time_total       0 days 00:10:13.591073
loss_total                        1367.587962
loss_critic                       1776.225982
loss_actor                        -266.964232
memory_size                          524555.2 

=== epoch 7/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:26,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.27it/s]
episodes                                   34
episode_length                          290.0
returns                            -21.157586
return_std                          70.931134
average_reward                      -0.071009
round_time             0 days 00:10:12.681404
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        289.48532
return_std_test                     24.571986
average_reward_test                  0.289485
round_time_test        0 days 00:00:10.936934
round_time_total       0 days 00:10:12.682548
loss_total                        1352.854295
loss_critic                       1758.067025
loss_actor                         -267.99675
memory_size                        526441.266 

=== epoch 7/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   31
episode_length                      315.16129
returns                            -22.414453
return_std                           80.56866
average_reward                      -0.068037
round_time             0 days 00:10:15.413751
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       556.444099
return_std_test                    142.300696
average_reward_test                  0.556444
round_time_test        0 days 00:00:11.122732
round_time_total       0 days 00:10:15.415249
loss_total                        1366.436418
loss_critic                       1775.779502
loss_actor                        -270.936039
memory_size                       528327.8625 

=== epoch 7/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   29
episode_length                      322.37931
returns                             -30.42389
return_std                           86.19516
average_reward                      -0.097181
round_time             0 days 00:10:14.579875
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                         413.3322
return_std_test                     99.841232
average_reward_test                  0.413332
round_time_test        0 days 00:00:10.997813
round_time_total       0 days 00:10:14.580946
loss_total                        1410.381498
loss_critic                       1830.995041
loss_actor                        -272.072804
memory_size                       530212.5045 

=== epoch 7/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   21
episode_length                     431.285714
returns                            -47.690974
return_std                          95.096644
average_reward                      -0.112386
round_time             0 days 00:10:14.350718
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        585.93775
return_std_test                     89.143859
average_reward_test                  0.585938
round_time_test        0 days 00:00:10.931091
round_time_total       0 days 00:10:14.351945
loss_total                        1393.000798
loss_critic                       1809.009801
loss_actor                        -271.035346
memory_size                        532103.117 

=== epoch 7/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   24
episode_length                     382.916667
returns                            -39.408633
return_std                          82.427785
average_reward                      -0.105529
round_time             0 days 00:10:15.732930
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       530.774764
return_std_test                    222.022053
average_reward_test                  0.530775
round_time_test        0 days 00:00:10.990660
round_time_total       0 days 00:10:15.734010
loss_total                        1348.813973
loss_critic                       1754.107922
loss_actor                        -272.361931
memory_size                        533924.145 

=== epoch 7/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   23
episode_length                     421.782609
returns                            -47.406837
return_std                          80.769813
average_reward                      -0.107843
round_time             0 days 00:10:16.460180
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       530.896752
return_std_test                    142.627995
average_reward_test                  0.530897
round_time_test        0 days 00:00:11.058565
round_time_total       0 days 00:10:16.461254
loss_total                        1364.824569
loss_critic                       1774.364103
loss_actor                        -273.333679
memory_size                        535829.043 

=== epoch 7/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   28
episode_length                     323.714286
returns                            -33.026825
return_std                          67.372029
average_reward                      -0.108565
round_time             0 days 00:10:15.980615
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       457.139771
return_std_test                    168.053986
average_reward_test                   0.45714
round_time_test        0 days 00:00:11.024552
round_time_total       0 days 00:10:15.981712
loss_total                         1350.42697
loss_critic                       1757.311204
loss_actor                          -277.1101
memory_size                       537702.8565 

=== epoch 7/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   30
episode_length                          325.0
returns                            -25.655556
return_std                           59.58917
average_reward                      -0.081359
round_time             0 days 00:10:16.177708
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       518.203011
return_std_test                     55.458004
average_reward_test                  0.518203
round_time_test        0 days 00:00:11.119963
round_time_total       0 days 00:10:16.178791
loss_total                        1375.589074
loss_critic                       1789.004831
loss_actor                        -278.074079
memory_size                       539485.9125 

=== epoch 7/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   31
episode_length                     295.032258
returns                            -24.148182
return_std                          65.218876
average_reward                      -0.082999
round_time             0 days 00:10:14.520119
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       551.389207
return_std_test                      94.51837
average_reward_test                  0.551389
round_time_test        0 days 00:00:10.862867
round_time_total       0 days 00:10:14.521202
loss_total                         1367.54235
loss_critic                       1779.182327
loss_actor                        -279.017688
memory_size                        541342.986 

=== epoch 7/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   38
episode_length                     243.552632
returns                            -15.021243
return_std                          55.364476
average_reward                      -0.063894
round_time             0 days 00:10:14.712726
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       416.492418
return_std_test                     25.608942
average_reward_test                  0.416492
round_time_test        0 days 00:00:10.910208
round_time_total       0 days 00:10:14.714033
loss_total                        1361.620387
loss_critic                       1772.158445
loss_actor                         -280.53196
memory_size                       543107.0925 

=== epoch 7/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   40
episode_length                        237.175
returns                            -12.271784
return_std                          58.989491
average_reward                      -0.041899
round_time             0 days 00:10:13.378921
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       438.485496
return_std_test                    158.369533
average_reward_test                  0.438485
round_time_test        0 days 00:00:10.925527
round_time_total       0 days 00:10:13.380005
loss_total                        1371.408699
loss_critic                        1784.29956
loss_actor                        -280.154868
memory_size                        544819.522 

=== epoch 7/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.27it/s]
episodes                                   38
episode_length                     229.763158
returns                             -6.026241
return_std                          59.676985
average_reward                      -0.038191
round_time             0 days 00:10:11.444791
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        506.40328
return_std_test                     10.950621
average_reward_test                  0.506403
round_time_test        0 days 00:00:10.981114
round_time_total       0 days 00:10:11.445984
loss_total                        1377.754107
loss_critic                       1792.542078
loss_actor                        -281.397909
memory_size                         546629.15 

=== epoch 7/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   36
episode_length                     249.777778
returns                             -6.717745
return_std                           70.33866
average_reward                      -0.038931
round_time             0 days 00:10:12.212889
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       511.433593
return_std_test                    110.906081
average_reward_test                  0.511434
round_time_test        0 days 00:00:10.845579
round_time_total       0 days 00:10:12.213977
loss_total                        1411.403193
loss_critic                       1834.706201
loss_actor                        -281.808967
memory_size                        548494.735 

=== epoch 7/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   38
episode_length                     239.631579
returns                              1.039927
return_std                          60.489574
average_reward                       -0.01119
round_time             0 days 00:10:11.903302
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        635.70324
return_std_test                    108.851181
average_reward_test                  0.635703
round_time_test        0 days 00:00:10.613019
round_time_total       0 days 00:10:11.904663
loss_total                        1378.470914
loss_critic                       1793.624441
loss_actor                        -282.143313
memory_size                        550380.957 

=== epoch 7/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   23
episode_length                      404.26087
returns                            -15.737455
return_std                          95.323921
average_reward                      -0.048999
round_time             0 days 00:10:12.494532
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       422.767938
return_std_test                     11.186348
average_reward_test                  0.422768
round_time_test        0 days 00:00:10.863253
round_time_total       0 days 00:10:12.495619
loss_total                        1375.398985
loss_critic                       1789.766553
loss_actor                        -282.071411
memory_size                        552304.691 

=== epoch 7/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:59,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   24
episode_length                     407.083333
returns                             -36.60971
return_std                          93.131071
average_reward                      -0.080536
round_time             0 days 00:10:12.443952
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       685.866431
return_std_test                      59.86069
average_reward_test                  0.685866
round_time_test        0 days 00:00:11.151304
round_time_total       0 days 00:10:12.445033
loss_total                        1386.779165
loss_critic                       1803.939922
loss_actor                        -281.863988
memory_size                       554203.8805 

=== epoch 7/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   26
episode_length                          356.5
returns                            -22.439043
return_std                           88.32178
average_reward                      -0.059804
round_time             0 days 00:10:11.674194
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       613.453065
return_std_test                     62.842626
average_reward_test                  0.613453
round_time_test        0 days 00:00:10.896667
round_time_total       0 days 00:10:11.675505
loss_total                        1379.833594
loss_critic                       1795.570701
loss_actor                        -283.114965
memory_size                       555987.3875 

=== epoch 7/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   32
episode_length                       307.0625
returns                            -17.246433
return_std                          75.758654
average_reward                      -0.059477
round_time             0 days 00:10:12.420284
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       567.139244
return_std_test                      44.39816
average_reward_test                  0.567139
round_time_test        0 days 00:00:10.886576
round_time_total       0 days 00:10:12.421364
loss_total                         1383.23016
loss_critic                       1800.109458
loss_actor                        -284.287146
memory_size                       557797.3375 

=== epoch 7/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   33
episode_length                     290.848485
returns                            -24.266979
return_std                          79.440053
average_reward                      -0.087718
round_time             0 days 00:10:14.352239
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        512.48513
return_std_test                    125.614947
average_reward_test                  0.512485
round_time_test        0 days 00:00:10.975180
round_time_total       0 days 00:10:14.353324
loss_total                        1359.271609
loss_critic                       1770.355485
loss_actor                        -285.064014
memory_size                       559597.3825 

=== epoch 7/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:33,  2.87it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.27it/s]
episodes                                   42
episode_length                     216.047619
returns                             -6.758584
return_std                          60.294319
average_reward                      -0.045625
round_time             0 days 00:10:13.051823
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       615.843066
return_std_test                     74.453841
average_reward_test                  0.615843
round_time_test        0 days 00:00:10.929215
round_time_total       0 days 00:10:13.053343
loss_total                        1368.577958
loss_critic                       1782.183105
loss_actor                        -285.842754
memory_size                       561336.7125 

=== epoch 7/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   44
episode_length                     217.295455
returns                            -10.519537
return_std                          62.876182
average_reward                      -0.047334
round_time             0 days 00:10:08.167845
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       453.921665
return_std_test                     97.572668
average_reward_test                  0.453922
round_time_test        0 days 00:00:11.130355
round_time_total       0 days 00:10:08.168935
loss_total                        1378.726308
loss_critic                       1794.893495
loss_actor                        -285.942559
memory_size                       563157.5605 

=== epoch 7/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:44,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   38
episode_length                     249.342105
returns                            -17.950301
return_std                          68.853646
average_reward                      -0.074841
round_time             0 days 00:10:10.177377
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       499.128797
return_std_test                    114.195909
average_reward_test                  0.499129
round_time_test        0 days 00:00:11.041474
round_time_total       0 days 00:10:10.178458
loss_total                        1385.240923
loss_critic                       1803.002016
loss_actor                        -285.803579
memory_size                       564988.2705 

=== epoch 7/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   36
episode_length                     239.611111
returns                            -11.288198
return_std                          60.535103
average_reward                      -0.072499
round_time             0 days 00:10:14.545353
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       401.478714
return_std_test                     80.257364
average_reward_test                  0.401479
round_time_test        0 days 00:00:10.918893
round_time_total       0 days 00:10:14.546454
loss_total                        1346.722126
loss_critic                       1755.055503
loss_actor                        -286.611498
memory_size                       566819.4375 

=== epoch 7/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:59,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   41
episode_length                     231.609756
returns                            -14.591814
return_std                          66.191268
average_reward                      -0.067697
round_time             0 days 00:10:10.290575
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       547.371662
return_std_test                    118.925429
average_reward_test                  0.547372
round_time_test        0 days 00:00:10.942652
round_time_total       0 days 00:10:10.291683
loss_total                        1419.217617
loss_critic                       1846.034387
loss_actor                        -288.049593
memory_size                          568597.4 

=== epoch 7/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   36
episode_length                     265.055556
returns                            -27.241193
return_std                          80.699122
average_reward                      -0.105307
round_time             0 days 00:10:14.977454
episodes_test                            10.0
episode_length_test                     901.7
returns_test                       497.883978
return_std_test                      197.5341
average_reward_test                  0.534131
round_time_test        0 days 00:00:11.042824
round_time_total       0 days 00:10:14.978912
loss_total                        1403.569354
loss_critic                       1826.810954
loss_actor                        -289.397174
memory_size                        570383.968 

=== epoch 7/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   33
episode_length                     281.575758
returns                            -22.347898
return_std                          80.098974
average_reward                      -0.086373
round_time             0 days 00:10:16.700081
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       321.068715
return_std_test                    101.714087
average_reward_test                  0.321069
round_time_test        0 days 00:00:10.925651
round_time_total       0 days 00:10:16.701154
loss_total                         1407.56756
loss_critic                       1831.956391
loss_actor                        -289.987874
memory_size                       572262.7355 

=== epoch 7/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   40
episode_length                        229.025
returns                            -14.939575
return_std                          71.823634
average_reward                      -0.072369
round_time             0 days 00:10:17.522432
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       498.753294
return_std_test                    181.293064
average_reward_test                  0.498753
round_time_test        0 days 00:00:10.942138
round_time_total       0 days 00:10:17.523511
loss_total                        1381.858372
loss_critic                       1800.051098
loss_actor                        -290.912653
memory_size                       574020.7665 

=== epoch 7/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   39
episode_length                     232.025641
returns                            -13.946198
return_std                          73.506799
average_reward                      -0.076127
round_time             0 days 00:10:16.471292
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       730.921839
return_std_test                     15.301128
average_reward_test                  0.730922
round_time_test        0 days 00:00:11.050951
round_time_total       0 days 00:10:16.472382
loss_total                        1398.300168
loss_critic                       1821.087067
loss_actor                        -292.847549
memory_size                        575826.333 

=== epoch 7/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   32
episode_length                          280.0
returns                            -21.273542
return_std                          80.422902
average_reward                      -0.083728
round_time             0 days 00:10:16.327990
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       379.242717
return_std_test                     35.293032
average_reward_test                  0.379243
round_time_test        0 days 00:00:11.078505
round_time_total       0 days 00:10:16.329082
loss_total                        1420.698377
loss_critic                       1849.073073
loss_actor                        -292.800534
memory_size                        577713.365 

=== epoch 7/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   34
episode_length                     266.235294
returns                            -20.015174
return_std                          74.936924
average_reward                      -0.071919
round_time             0 days 00:10:14.521636
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       328.144444
return_std_test                     23.428073
average_reward_test                  0.328144
round_time_test        0 days 00:00:11.002519
round_time_total       0 days 00:10:14.522696
loss_total                        1408.899194
loss_critic                       1834.068769
loss_actor                        -291.779239
memory_size                       579541.9155 

=== epoch 7/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   33
episode_length                     287.181818
returns                            -32.454172
return_std                          87.677864
average_reward                      -0.116071
round_time             0 days 00:10:16.915376
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       443.281791
return_std_test                    118.788567
average_reward_test                  0.443282
round_time_test        0 days 00:00:10.971807
round_time_total       0 days 00:10:16.916664
loss_total                        1382.116242
loss_critic                       1800.625312
loss_actor                        -291.920158
memory_size                        581404.148 

=== epoch 7/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   33
episode_length                     291.757576
returns                            -29.241742
return_std                          85.525148
average_reward                      -0.102065
round_time             0 days 00:10:16.690714
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       484.665624
return_std_test                    178.594145
average_reward_test                  0.484666
round_time_test        0 days 00:00:10.997966
round_time_total       0 days 00:10:16.692021
loss_total                        1383.423883
loss_critic                       1802.392315
loss_actor                        -292.449957
memory_size                       583233.4765 

=== epoch 7/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   29
episode_length                     299.344828
returns                            -35.060831
return_std                          87.684738
average_reward                      -0.126155
round_time             0 days 00:10:16.601454
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       625.467196
return_std_test                    141.453172
average_reward_test                  0.625467
round_time_test        0 days 00:00:11.046165
round_time_total       0 days 00:10:16.602517
loss_total                        1377.492813
loss_critic                        1795.16845
loss_actor                        -293.209864
memory_size                       585046.2595 

=== epoch 7/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:33,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   32
episode_length                       284.4375
returns                            -16.148723
return_std                          87.530216
average_reward                      -0.070976
round_time             0 days 00:10:16.249447
episodes_test                            10.0
episode_length_test                     912.4
returns_test                       597.685042
return_std_test                    173.198344
average_reward_test                  0.653859
round_time_test        0 days 00:00:10.873524
round_time_total       0 days 00:10:16.250526
loss_total                        1432.232238
loss_critic                       1863.580571
loss_actor                        -293.161214
memory_size                       586908.0605 

=== epoch 7/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   30
episode_length                     317.266667
returns                            -22.239671
return_std                          88.350271
average_reward                      -0.073455
round_time             0 days 00:10:18.555713
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       411.031684
return_std_test                    177.836897
average_reward_test                  0.411032
round_time_test        0 days 00:00:11.230392
round_time_total       0 days 00:10:18.556820
loss_total                        1386.654337
loss_critic                       1806.752605
loss_actor                        -293.738865
memory_size                       588766.4185 

=== epoch 7/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.24it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   32
episode_length                      281.03125
returns                             -8.282599
return_std                          75.125126
average_reward                      -0.053949
round_time             0 days 00:10:18.515611
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       424.380041
return_std_test                     31.672346
average_reward_test                   0.42438
round_time_test        0 days 00:00:11.061252
round_time_total       0 days 00:10:18.516823
loss_total                        1383.717345
loss_critic                       1803.436806
loss_actor                        -295.160623
memory_size                        590612.458 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 8/10 ===== round 1/50 ======================================
  0%|          | 6/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:04<00:00,  3.31it/s]
episodes                                    7
episode_length                     204.714286
returns                            -19.665063
return_std                          79.573741
average_reward                      -0.116533
round_time             0 days 00:10:04.203480
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        443.29459
return_std_test                      66.20729
average_reward_test                  0.443295
round_time_test        0 days 00:00:10.998025
round_time_total       0 days 00:10:04.204577
loss_total                        1401.119679
loss_critic                       1825.536513
loss_actor                        -296.547785
memory_size                        592403.152 

=== epoch 8/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:40,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   10
episode_length                          366.1
returns                            -46.542484
return_std                         113.226464
average_reward                      -0.118166
round_time             0 days 00:10:08.565075
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       487.592753
return_std_test                      99.94248
average_reward_test                  0.487593
round_time_test        0 days 00:00:10.932899
round_time_total       0 days 00:10:08.566508
loss_total                        1389.997967
loss_critic                       1811.947978
loss_actor                        -297.802207
memory_size                        594291.472 

=== epoch 8/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   13
episode_length                     385.538462
returns                            -42.607236
return_std                         108.959513
average_reward                       -0.12559
round_time             0 days 00:10:09.221295
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       571.286196
return_std_test                    107.073428
average_reward_test                  0.571286
round_time_test        0 days 00:00:10.887819
round_time_total       0 days 00:10:09.222365
loss_total                        1405.349218
loss_critic                       1831.254234
loss_actor                        -298.270968
memory_size                        596210.668 

=== epoch 8/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   16
episode_length                       444.4375
returns                            -58.380185
return_std                         107.755594
average_reward                      -0.133287
round_time             0 days 00:10:08.520538
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       556.796583
return_std_test                    167.584538
average_reward_test                  0.556797
round_time_test        0 days 00:00:10.868651
round_time_total       0 days 00:10:08.521933
loss_total                        1406.147286
loss_critic                       1832.482211
loss_actor                        -299.192547
memory_size                        598130.187 

=== epoch 8/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   20
episode_length                         462.95
returns                             -65.88554
return_std                          109.61539
average_reward                      -0.141597
round_time             0 days 00:10:09.606604
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       675.696084
return_std_test                      9.186528
average_reward_test                  0.675696
round_time_test        0 days 00:00:10.926117
round_time_total       0 days 00:10:09.608102
loss_total                        1387.610355
loss_critic                       1809.508764
loss_actor                        -299.983404
memory_size                        600035.705 

=== epoch 8/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   20
episode_length                          462.2
returns                            -69.082336
return_std                         105.632906
average_reward                      -0.145515
round_time             0 days 00:10:09.989238
episodes_test                            10.0
episode_length_test                     907.9
returns_test                       708.410429
return_std_test                    215.666262
average_reward_test                  0.776243
round_time_test        0 days 00:00:11.182509
round_time_total       0 days 00:10:09.990319
loss_total                         1379.20635
loss_critic                       1799.072372
loss_actor                        -300.257863
memory_size                        601932.988 

=== epoch 8/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:38,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   24
episode_length                     370.458333
returns                            -54.204851
return_std                          89.319025
average_reward                      -0.156361
round_time             0 days 00:10:10.069331
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       640.267285
return_std_test                    101.873621
average_reward_test                  0.640267
round_time_test        0 days 00:00:10.948374
round_time_total       0 days 00:10:10.070441
loss_total                        1393.705816
loss_critic                       1817.281441
loss_actor                        -300.596799
memory_size                        603731.081 

=== epoch 8/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:37,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   25
episode_length                          387.2
returns                            -61.943196
return_std                          89.806353
average_reward                      -0.163065
round_time             0 days 00:10:09.358706
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       563.206883
return_std_test                    213.993313
average_reward_test                  0.563207
round_time_test        0 days 00:00:10.944526
round_time_total       0 days 00:10:09.359792
loss_total                        1403.504138
loss_critic                       1829.717577
loss_actor                        -301.349747
memory_size                        605593.672 

=== epoch 8/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:51,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.28it/s]
episodes                                   30
episode_length                     308.833333
returns                            -46.758217
return_std                          92.005226
average_reward                       -0.15946
round_time             0 days 00:10:11.197378
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       491.585403
return_std_test                    171.572822
average_reward_test                  0.491585
round_time_test        0 days 00:00:11.125687
round_time_total       0 days 00:10:11.198466
loss_total                        1423.893449
loss_critic                       1855.631193
loss_actor                        -303.057651
memory_size                       607455.1295 

=== epoch 8/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   34
episode_length                     269.411765
returns                            -33.181086
return_std                          87.785189
average_reward                      -0.130024
round_time             0 days 00:10:07.777545
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       726.432727
return_std_test                     100.78303
average_reward_test                  0.726433
round_time_test        0 days 00:00:11.170475
round_time_total       0 days 00:10:07.778625
loss_total                        1412.056415
loss_critic                       1840.588251
loss_actor                        -302.071049
memory_size                        609261.663 

=== epoch 8/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.28it/s]
episodes                                   32
episode_length                      287.34375
returns                            -34.246707
return_std                          93.248043
average_reward                      -0.126128
round_time             0 days 00:10:11.072812
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       629.898734
return_std_test                    114.096666
average_reward_test                  0.629899
round_time_test        0 days 00:00:10.990959
round_time_total       0 days 00:10:11.073895
loss_total                        1394.982376
loss_critic                       1818.913101
loss_actor                        -300.740658
memory_size                        611073.383 

=== epoch 8/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.27it/s]
episodes                                   32
episode_length                      289.40625
returns                            -28.028248
return_std                          90.135958
average_reward                      -0.102985
round_time             0 days 00:10:11.269611
episodes_test                            10.0
episode_length_test                     909.7
returns_test                       550.197729
return_std_test                    201.740918
average_reward_test                  0.600172
round_time_test        0 days 00:00:10.992819
round_time_total       0 days 00:10:11.270683
loss_total                        1378.710711
loss_critic                       1798.763357
loss_actor                        -301.499991
memory_size                        612906.437 

=== epoch 8/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:00,  3.02it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   30
episode_length                          277.4
returns                            -24.352025
return_std                          88.201545
average_reward                      -0.106477
round_time             0 days 00:10:10.344000
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       774.463695
return_std_test                     48.957885
average_reward_test                  0.774464
round_time_test        0 days 00:00:10.925810
round_time_total       0 days 00:10:10.345074
loss_total                        1383.339107
loss_critic                       1804.682818
loss_actor                        -302.035862
memory_size                        614812.182 

=== epoch 8/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   28
episode_length                     321.214286
returns                            -26.112404
return_std                           88.21363
average_reward                      -0.088224
round_time             0 days 00:10:12.432386
episodes_test                            10.0
episode_length_test                     905.5
returns_test                       573.476431
return_std_test                     179.80511
average_reward_test                  0.629718
round_time_test        0 days 00:00:10.970946
round_time_total       0 days 00:10:12.433689
loss_total                        1405.794345
loss_critic                       1832.837324
loss_actor                        -302.377687
memory_size                        616712.966 

=== epoch 8/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   25
episode_length                         374.64
returns                            -40.754774
return_std                         101.184051
average_reward                      -0.114352
round_time             0 days 00:10:10.209381
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       534.689231
return_std_test                    105.159092
average_reward_test                  0.534689
round_time_test        0 days 00:00:10.893971
round_time_total       0 days 00:10:10.210448
loss_total                        1406.360431
loss_critic                       1833.721066
loss_actor                        -303.082235
memory_size                         618573.59 

=== epoch 8/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:01,  3.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   22
episode_length                     442.318182
returns                            -42.543667
return_std                         105.566828
average_reward                      -0.095677
round_time             0 days 00:10:12.381670
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       338.247325
return_std_test                    151.516751
average_reward_test                  0.338247
round_time_test        0 days 00:00:10.955590
round_time_total       0 days 00:10:12.382743
loss_total                        1397.569313
loss_critic                        1822.77515
loss_actor                        -303.254162
memory_size                        620487.705 

=== epoch 8/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   19
episode_length                     524.263158
returns                            -64.858286
return_std                         116.519447
average_reward                      -0.122482
round_time             0 days 00:10:11.989048
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       456.725831
return_std_test                      15.76859
average_reward_test                  0.456726
round_time_test        0 days 00:00:11.034120
round_time_total       0 days 00:10:11.990438
loss_total                        1378.669814
loss_critic                       1799.362142
loss_actor                        -304.099617
memory_size                         622399.59 

=== epoch 8/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   25
episode_length                         384.72
returns                            -37.802824
return_std                         106.237297
average_reward                      -0.107568
round_time             0 days 00:10:13.660131
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       409.365015
return_std_test                     91.519364
average_reward_test                  0.409365
round_time_test        0 days 00:00:11.033741
round_time_total       0 days 00:10:13.661429
loss_total                        1375.103223
loss_critic                       1795.116722
loss_actor                        -304.950895
memory_size                       624229.2125 

=== epoch 8/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.27it/s]
episodes                                   31
episode_length                     288.354839
returns                            -32.476485
return_std                         103.350801
average_reward                      -0.123914
round_time             0 days 00:10:12.607023
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       390.462107
return_std_test                    130.343867
average_reward_test                  0.390462
round_time_test        0 days 00:00:11.180023
round_time_total       0 days 00:10:12.608127
loss_total                        1403.946146
loss_critic                       1831.086229
loss_actor                        -304.614312
memory_size                       625999.6415 

=== epoch 8/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:17,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   31
episode_length                     283.032258
returns                             -31.26848
return_std                         101.244861
average_reward                      -0.119464
round_time             0 days 00:10:13.382859
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       769.005211
return_std_test                      34.16888
average_reward_test                  0.769005
round_time_test        0 days 00:00:11.033059
round_time_total       0 days 00:10:13.383931
loss_total                        1397.788404
loss_critic                       1823.377083
loss_actor                        -304.566443
memory_size                       627766.3215 

=== epoch 8/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:14,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   36
episode_length                     242.527778
returns                            -26.888314
return_std                          96.601585
average_reward                      -0.104938
round_time             0 days 00:10:14.710251
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       797.371619
return_std_test                      17.01663
average_reward_test                  0.797372
round_time_test        0 days 00:00:10.947818
round_time_total       0 days 00:10:14.711334
loss_total                        1363.512719
loss_critic                       1780.579717
loss_actor                        -304.755405
memory_size                        629630.713 

=== epoch 8/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   35
episode_length                     273.542857
returns                            -28.111479
return_std                          97.370528
average_reward                      -0.099683
round_time             0 days 00:10:14.134428
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        556.33319
return_std_test                    102.351806
average_reward_test                  0.556333
round_time_test        0 days 00:00:11.048380
round_time_total       0 days 00:10:14.135497
loss_total                        1381.127233
loss_critic                       1802.807694
loss_actor                        -305.594724
memory_size                       631504.5235 

=== epoch 8/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   34
episode_length                     270.676471
returns                            -22.178185
return_std                          93.146907
average_reward                      -0.095656
round_time             0 days 00:10:14.661386
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       556.507716
return_std_test                     20.286835
average_reward_test                  0.556508
round_time_test        0 days 00:00:10.920210
round_time_total       0 days 00:10:14.662476
loss_total                        1415.926588
loss_critic                       1846.233349
loss_actor                         -305.30059
memory_size                       633334.7425 

=== epoch 8/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   28
episode_length                     332.964286
returns                            -33.350763
return_std                         104.401481
average_reward                      -0.101682
round_time             0 days 00:10:16.277172
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       662.509618
return_std_test                     65.693628
average_reward_test                   0.66251
round_time_test        0 days 00:00:10.926308
round_time_total       0 days 00:10:16.278245
loss_total                        1373.182762
loss_critic                       1792.443721
loss_actor                        -303.861195
memory_size                        635218.108 

=== epoch 8/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   28
episode_length                     333.357143
returns                            -30.124309
return_std                          96.144294
average_reward                      -0.093658
round_time             0 days 00:10:13.293809
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       636.355496
return_std_test                     39.947191
average_reward_test                  0.636355
round_time_test        0 days 00:00:10.913580
round_time_total       0 days 00:10:13.294876
loss_total                        1376.774224
loss_critic                       1797.176406
loss_actor                        -304.834632
memory_size                       637088.9285 

=== epoch 8/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:36,  2.86it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   25
episode_length                         352.36
returns                            -46.602965
return_std                         110.911725
average_reward                      -0.144072
round_time             0 days 00:10:17.528984
episodes_test                            10.0
episode_length_test                     905.3
returns_test                       708.693692
return_std_test                    224.405061
average_reward_test                  0.779948
round_time_test        0 days 00:00:10.848666
round_time_total       0 days 00:10:17.530079
loss_total                        1378.508662
loss_critic                       1799.600971
loss_actor                        -305.860698
memory_size                       638931.7535 

=== epoch 8/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:46,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   33
episode_length                     299.454545
returns                            -37.507798
return_std                         102.857946
average_reward                      -0.125287
round_time             0 days 00:10:13.382068
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       796.705076
return_std_test                     39.480929
average_reward_test                  0.796705
round_time_test        0 days 00:00:10.898376
round_time_total       0 days 00:10:13.383555
loss_total                        1379.650607
loss_critic                       1801.148661
loss_actor                        -306.341727
memory_size                        640702.645 

=== epoch 8/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   37
episode_length                     246.297297
returns                            -17.207098
return_std                          80.232813
average_reward                      -0.090111
round_time             0 days 00:10:13.576194
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       720.077611
return_std_test                      36.27907
average_reward_test                  0.720078
round_time_test        0 days 00:00:10.807062
round_time_total       0 days 00:10:13.577279
loss_total                        1374.199337
loss_critic                       1794.535656
loss_actor                        -307.146058
memory_size                       642488.3335 

=== epoch 8/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   40
episode_length                         233.95
returns                              -15.2646
return_std                          82.443439
average_reward                      -0.075878
round_time             0 days 00:10:14.247079
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       580.084722
return_std_test                    205.110721
average_reward_test                  0.580085
round_time_test        0 days 00:00:10.921965
round_time_total       0 days 00:10:14.248394
loss_total                        1404.816808
loss_critic                       1833.077986
loss_actor                        -308.228022
memory_size                       644223.9085 

=== epoch 8/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.27it/s]
episodes                                   37
episode_length                     266.324324
returns                            -26.631985
return_std                          92.450896
average_reward                      -0.098126
round_time             0 days 00:10:12.574800
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       712.489071
return_std_test                     23.666808
average_reward_test                  0.712489
round_time_test        0 days 00:00:10.908626
round_time_total       0 days 00:10:12.576106
loss_total                        1437.833588
loss_critic                       1874.370878
loss_actor                        -308.315698
memory_size                        646107.565 

=== epoch 8/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   36
episode_length                     250.722222
returns                            -18.006249
return_std                          83.850106
average_reward                       -0.08648
round_time             0 days 00:10:12.048238
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       578.237647
return_std_test                    130.726439
average_reward_test                  0.578238
round_time_test        0 days 00:00:10.894123
round_time_total       0 days 00:10:12.049446
loss_total                        1413.657366
loss_critic                       1844.118783
loss_actor                        -308.188437
memory_size                       648024.5925 

=== epoch 8/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:40,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   32
episode_length                      307.09375
returns                            -28.932639
return_std                           91.41136
average_reward                      -0.090828
round_time             0 days 00:10:13.893692
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       625.376854
return_std_test                     43.864557
average_reward_test                  0.625377
round_time_test        0 days 00:00:10.919709
round_time_total       0 days 00:10:13.894788
loss_total                        1420.003571
loss_critic                       1851.869977
loss_actor                         -307.46218
memory_size                        649859.855 

=== epoch 8/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   25
episode_length                         369.56
returns                            -46.613129
return_std                          94.014958
average_reward                       -0.13271
round_time             0 days 00:10:11.601721
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       727.583058
return_std_test                     32.493218
average_reward_test                  0.727583
round_time_test        0 days 00:00:10.867879
round_time_total       0 days 00:10:11.602991
loss_total                        1418.624095
loss_critic                       1850.183486
loss_actor                        -307.613587
memory_size                        651742.102 

=== epoch 8/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:20,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.28it/s]
episodes                                   29
episode_length                     306.896552
returns                            -26.522484
return_std                          86.652842
average_reward                      -0.096107
round_time             0 days 00:10:10.948660
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       697.158587
return_std_test                     42.123719
average_reward_test                  0.697159
round_time_test        0 days 00:00:11.043259
round_time_total       0 days 00:10:10.949739
loss_total                        1422.934427
loss_critic                       1855.615242
loss_actor                        -307.788952
memory_size                       653521.9715 

=== epoch 8/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   32
episode_length                         272.25
returns                            -13.334308
return_std                          75.232005
average_reward                       -0.06217
round_time             0 days 00:10:12.189077
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       519.498146
return_std_test                    145.251914
average_reward_test                  0.519498
round_time_test        0 days 00:00:10.877086
round_time_total       0 days 00:10:12.190300
loss_total                        1386.341452
loss_critic                       1809.640581
loss_actor                        -306.855181
memory_size                        655334.045 

=== epoch 8/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   30
episode_length                          312.7
returns                            -24.033242
return_std                          86.195619
average_reward                      -0.081721
round_time             0 days 00:10:14.694183
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        701.71104
return_std_test                     19.230912
average_reward_test                  0.701711
round_time_test        0 days 00:00:10.888342
round_time_total       0 days 00:10:14.695253
loss_total                        1373.807956
loss_critic                       1794.254057
loss_actor                        -307.976563
memory_size                        657230.077 

=== epoch 8/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   28
episode_length                     355.821429
returns                            -33.253086
return_std                          98.110583
average_reward                      -0.093619
round_time             0 days 00:10:13.560301
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       435.053948
return_std_test                    163.927507
average_reward_test                  0.435054
round_time_test        0 days 00:00:10.920385
round_time_total       0 days 00:10:13.561792
loss_total                        1415.168616
loss_critic                       1846.318096
loss_actor                        -309.429431
memory_size                       659172.9145 

=== epoch 8/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   28
episode_length                     327.464286
returns                            -27.235357
return_std                           95.47457
average_reward                      -0.094932
round_time             0 days 00:10:17.322840
episodes_test                            10.0
episode_length_test                     905.3
returns_test                       549.319422
return_std_test                     170.68236
average_reward_test                  0.605004
round_time_test        0 days 00:00:10.839249
round_time_total       0 days 00:10:17.323915
loss_total                        1412.228665
loss_critic                        1842.61149
loss_actor                        -309.302768
memory_size                       661017.2175 

=== epoch 8/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   22
episode_length                     406.772727
returns                            -55.437837
return_std                         104.266313
average_reward                      -0.144254
round_time             0 days 00:10:17.047206
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       735.902866
return_std_test                      34.72075
average_reward_test                  0.735903
round_time_test        0 days 00:00:10.853158
round_time_total       0 days 00:10:17.048517
loss_total                        1381.523501
loss_critic                       1803.979031
loss_actor                        -308.298755
memory_size                       662878.9945 

=== epoch 8/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:43,  3.10it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   24
episode_length                     377.791667
returns                            -58.123351
return_std                         106.721888
average_reward                      -0.149911
round_time             0 days 00:10:15.270906
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       454.685959
return_std_test                     25.157055
average_reward_test                  0.454686
round_time_test        0 days 00:00:11.041433
round_time_total       0 days 00:10:15.272347
loss_total                        1381.737856
loss_critic                       1804.689413
loss_actor                         -310.06849
memory_size                        664730.887 

=== epoch 8/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:31,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   25
episode_length                         348.04
returns                            -40.835005
return_std                          95.702036
average_reward                      -0.131408
round_time             0 days 00:10:14.641228
episodes_test                            10.0
episode_length_test                     906.0
returns_test                       525.130051
return_std_test                    153.464072
average_reward_test                  0.588546
round_time_test        0 days 00:00:10.979654
round_time_total       0 days 00:10:14.642335
loss_total                        1389.508601
loss_critic                       1814.618805
loss_actor                        -310.932348
memory_size                        666595.962 

=== epoch 8/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:47,  3.39it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   28
episode_length                     347.142857
returns                            -40.027288
return_std                          93.853953
average_reward                      -0.111493
round_time             0 days 00:10:17.905042
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       661.708282
return_std_test                     65.798801
average_reward_test                  0.661708
round_time_test        0 days 00:00:11.005924
round_time_total       0 days 00:10:17.906131
loss_total                        1390.371901
loss_critic                       1815.715587
loss_actor                        -311.002998
memory_size                        668466.729 

=== epoch 8/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   35
episode_length                     270.657143
returns                            -12.883028
return_std                          81.137546
average_reward                      -0.051931
round_time             0 days 00:10:15.655420
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       548.227212
return_std_test                      93.39051
average_reward_test                  0.548227
round_time_test        0 days 00:00:10.970853
round_time_total       0 days 00:10:15.656904
loss_total                        1395.846309
loss_critic                       1822.551171
loss_actor                        -310.973274
memory_size                        670265.725 

=== epoch 8/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   41
episode_length                     235.731707
returns                            -10.984301
return_std                          72.377815
average_reward                      -0.029997
round_time             0 days 00:10:17.960342
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        689.52722
return_std_test                     44.222664
average_reward_test                  0.689527
round_time_test        0 days 00:00:10.710661
round_time_total       0 days 00:10:17.961421
loss_total                        1400.914159
loss_critic                       1829.158756
loss_actor                        -312.064342
memory_size                        671938.851 

=== epoch 8/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:38,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   40
episode_length                          224.1
returns                             -3.985118
return_std                          60.072209
average_reward                      -0.042487
round_time             0 days 00:10:15.376435
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       652.818209
return_std_test                     32.180112
average_reward_test                  0.652818
round_time_test        0 days 00:00:11.093137
round_time_total       0 days 00:10:15.377525
loss_total                        1414.948554
loss_critic                       1846.783415
loss_actor                        -312.391015
memory_size                         673688.11 

=== epoch 8/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   44
episode_length                     216.818182
returns                             -5.752184
return_std                          59.474968
average_reward                       -0.03633
round_time             0 days 00:10:14.937725
episodes_test                            11.0
episode_length_test                846.545455
returns_test                       623.062594
return_std_test                    283.408533
average_reward_test                  0.743826
round_time_test        0 days 00:00:10.774305
round_time_total       0 days 00:10:14.938810
loss_total                        1416.707547
loss_critic                       1848.938418
loss_actor                        -312.216066
memory_size                       675549.3455 

=== epoch 8/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   44
episode_length                     214.909091
returns                            -10.463299
return_std                          65.199285
average_reward                      -0.040936
round_time             0 days 00:10:17.846164
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       685.580705
return_std_test                     82.491185
average_reward_test                  0.685581
round_time_test        0 days 00:00:10.956390
round_time_total       0 days 00:10:17.847233
loss_total                        1388.876168
loss_critic                       1814.215985
loss_actor                        -312.483222
memory_size                       677396.7125 

=== epoch 8/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   37
episode_length                     267.810811
returns                            -28.752223
return_std                           76.59819
average_reward                      -0.103986
round_time             0 days 00:10:17.591499
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       684.384421
return_std_test                     34.001137
average_reward_test                  0.684384
round_time_test        0 days 00:00:10.844852
round_time_total       0 days 00:10:17.592721
loss_total                        1387.638433
loss_critic                       1812.660279
loss_actor                        -312.449065
memory_size                        679237.332 

=== epoch 8/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:12,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   27
episode_length                     334.222222
returns                            -40.524937
return_std                          87.753895
average_reward                       -0.13977
round_time             0 days 00:10:18.229960
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       539.354606
return_std_test                    109.128227
average_reward_test                  0.539355
round_time_test        0 days 00:00:10.908905
round_time_total       0 days 00:10:18.231075
loss_total                        1389.958428
loss_critic                       1815.711529
loss_actor                        -313.054101
memory_size                       681120.5965 

=== epoch 8/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:50,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   36
episode_length                     250.555556
returns                            -29.432683
return_std                          85.655096
average_reward                      -0.119762
round_time             0 days 00:10:17.637161
episodes_test                            10.0
episode_length_test                     906.0
returns_test                       587.752045
return_std_test                    187.260222
average_reward_test                  0.658601
round_time_test        0 days 00:00:11.005763
round_time_total       0 days 00:10:17.638735
loss_total                        1410.465362
loss_critic                       1841.689843
loss_actor                        -314.432703
memory_size                        682852.819 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 9/10 ===== round 1/50 ======================================
  0%|          | 7/2000 [00:02<10:55,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:05<00:00,  3.30it/s]
episodes                                   18
episode_length                     101.222222
returns                             14.321464
return_std                           42.35034
average_reward                       0.126421
round_time             0 days 00:10:05.570938
episodes_test                            10.0
episode_length_test                     909.9
returns_test                       570.165998
return_std_test                    154.807124
average_reward_test                  0.627005
round_time_test        0 days 00:00:10.961870
round_time_total       0 days 00:10:05.572035
loss_total                        1405.767484
loss_critic                       1835.879096
loss_actor                        -314.679107
memory_size                       684477.7785 

=== epoch 9/10 ===== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   27
episode_length                     146.555556
returns                             10.764645
return_std                          57.410607
average_reward                       0.075454
round_time             0 days 00:10:08.428820
episodes_test                            11.0
episode_length_test                904.909091
returns_test                       537.145403
return_std_test                    219.365391
average_reward_test                  0.593856
round_time_test        0 days 00:00:10.914152
round_time_total       0 days 00:10:08.429909
loss_total                        1395.416829
loss_critic                       1822.998403
loss_actor                        -314.909591
memory_size                       686211.6235 

=== epoch 9/10 ===== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   30
episode_length                          199.7
returns                             -7.256973
return_std                          85.392162
average_reward                      -0.037798
round_time             0 days 00:10:08.662332
episodes_test                            13.0
episode_length_test                755.076923
returns_test                       578.621276
return_std_test                    276.522198
average_reward_test                  0.771918
round_time_test        0 days 00:00:10.807563
round_time_total       0 days 00:10:08.663432
loss_total                        1402.229797
loss_critic                       1831.464296
loss_actor                        -314.708331
memory_size                        688029.689 

=== epoch 9/10 ===== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:08<00:00,  3.29it/s]
episodes                                   39
episode_length                     200.615385
returns                             -4.216285
return_std                          81.893882
average_reward                      -0.016563
round_time             0 days 00:10:08.927519
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       647.821917
return_std_test                     53.145325
average_reward_test                  0.647822
round_time_test        0 days 00:00:11.070118
round_time_total       0 days 00:10:08.928583
loss_total                        1408.790822
loss_critic                       1839.938044
loss_actor                        -315.798193
memory_size                         689806.93 

=== epoch 9/10 ===== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   42
episode_length                          237.0
returns                            -11.852089
return_std                           84.71975
average_reward                      -0.045254
round_time             0 days 00:10:09.638864
episodes_test                            10.0
episode_length_test                     910.3
returns_test                          602.441
return_std_test                    167.463927
average_reward_test                  0.656411
round_time_test        0 days 00:00:10.994539
round_time_total       0 days 00:10:09.639939
loss_total                        1442.251954
loss_critic                       1881.503905
loss_actor                        -314.755969
memory_size                       691722.8365 

=== epoch 9/10 ===== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:08,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   30
episode_length                     300.266667
returns                             -21.74193
return_std                          96.900363
average_reward                      -0.090938
round_time             0 days 00:10:12.040915
episodes_test                            11.0
episode_length_test                863.090909
returns_test                        537.60458
return_std_test                    174.166798
average_reward_test                  0.623953
round_time_test        0 days 00:00:10.940835
round_time_total       0 days 00:10:12.041997
loss_total                         1381.87932
loss_critic                       1806.037715
loss_actor                        -314.754395
memory_size                       693627.6055 

=== epoch 9/10 ===== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:20,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:10<00:00,  3.28it/s]
episodes                                   24
episode_length                        373.875
returns                            -49.451832
return_std                         116.465773
average_reward                       -0.14625
round_time             0 days 00:10:10.677618
episodes_test                            11.0
episode_length_test                830.818182
returns_test                       604.591382
return_std_test                     257.24569
average_reward_test                  0.734831
round_time_test        0 days 00:00:11.168127
round_time_total       0 days 00:10:10.679085
loss_total                        1409.531106
loss_critic                       1840.578608
loss_actor                        -314.659034
memory_size                       695474.4925 

=== epoch 9/10 ===== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   26
episode_length                     372.423077
returns                              -40.7483
return_std                         115.501417
average_reward                      -0.108304
round_time             0 days 00:10:16.599248
episodes_test                            10.0
episode_length_test                     972.9
returns_test                       640.146307
return_std_test                       90.5546
average_reward_test                  0.663699
round_time_test        0 days 00:00:10.970902
round_time_total       0 days 00:10:16.600378
loss_total                        1428.396737
loss_critic                       1864.156836
loss_actor                        -314.643784
memory_size                       697383.8275 

=== epoch 9/10 ===== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:37,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   25
episode_length                         333.52
returns                            -37.703427
return_std                         113.450484
average_reward                      -0.135176
round_time             0 days 00:10:16.450530
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       616.198202
return_std_test                     62.643996
average_reward_test                  0.616198
round_time_test        0 days 00:00:10.998865
round_time_total       0 days 00:10:16.451601
loss_total                        1393.621917
loss_critic                       1820.844956
loss_actor                        -315.270365
memory_size                       699195.6175 

=== epoch 9/10 ===== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:09,  2.98it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   29
episode_length                     306.310345
returns                            -32.996143
return_std                         112.527573
average_reward                      -0.121985
round_time             0 days 00:10:14.762662
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       546.346764
return_std_test                      55.75337
average_reward_test                  0.546347
round_time_test        0 days 00:00:10.845955
round_time_total       0 days 00:10:14.763794
loss_total                        1412.022887
loss_critic                       1844.188405
loss_actor                        -316.639323
memory_size                        701042.912 

=== epoch 9/10 ===== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   27
episode_length                     338.925926
returns                            -46.043983
return_std                         115.637313
average_reward                      -0.149794
round_time             0 days 00:10:16.972015
episodes_test                            11.0
episode_length_test                842.909091
returns_test                       497.193674
return_std_test                    190.049923
average_reward_test                  0.592836
round_time_test        0 days 00:00:10.915810
round_time_total       0 days 00:10:16.973140
loss_total                        1414.972145
loss_critic                        1848.09129
loss_actor                        -317.504564
memory_size                        702898.709 

=== epoch 9/10 ===== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<09:42,  3.43it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   33
episode_length                     273.060606
returns                            -23.756211
return_std                          97.809015
average_reward                      -0.097969
round_time             0 days 00:10:16.075701
episodes_test                            14.0
episode_length_test                673.642857
returns_test                        491.81014
return_std_test                    306.741245
average_reward_test                  0.738099
round_time_test        0 days 00:00:11.071618
round_time_total       0 days 00:10:16.077146
loss_total                        1441.357061
loss_critic                       1881.478295
loss_actor                         -319.12799
memory_size                        704687.062 

=== epoch 9/10 ===== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   32
episode_length                         267.25
returns                             -31.50848
return_std                         103.420192
average_reward                      -0.121841
round_time             0 days 00:10:14.638333
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       638.399695
return_std_test                     38.936825
average_reward_test                    0.6384
round_time_test        0 days 00:00:11.112503
round_time_total       0 days 00:10:14.639425
loss_total                        1442.696622
loss_critic                       1883.086317
loss_actor                        -318.862289
memory_size                       706531.3995 

=== epoch 9/10 ===== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   33
episode_length                     276.787879
returns                            -27.382545
return_std                          95.479248
average_reward                      -0.110398
round_time             0 days 00:10:16.375989
episodes_test                            11.0
episode_length_test                825.272727
returns_test                       555.315454
return_std_test                    242.822458
average_reward_test                  0.677902
round_time_test        0 days 00:00:10.907424
round_time_total       0 days 00:10:16.377446
loss_total                         1438.41852
loss_critic                       1877.833238
loss_actor                        -319.240491
memory_size                         708355.99 

=== epoch 9/10 ===== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   32
episode_length                        274.375
returns                             -28.67071
return_std                          98.777931
average_reward                      -0.124985
round_time             0 days 00:10:14.512414
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       625.027666
return_std_test                     68.894825
average_reward_test                  0.625028
round_time_test        0 days 00:00:10.817626
round_time_total       0 days 00:10:14.513495
loss_total                        1427.880844
loss_critic                       1864.284431
loss_actor                        -317.733631
memory_size                       710196.1825 

=== epoch 9/10 ===== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.13it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.26it/s]
episodes                                   33
episode_length                     291.757576
returns                            -36.303383
return_std                         101.005991
average_reward                      -0.112659
round_time             0 days 00:10:13.534320
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       692.955101
return_std_test                     43.704942
average_reward_test                  0.692955
round_time_test        0 days 00:00:10.939707
round_time_total       0 days 00:10:13.535403
loss_total                        1404.321447
loss_critic                       1834.790413
loss_actor                        -317.554544
memory_size                       712084.7045 

=== epoch 9/10 ===== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   26
episode_length                     377.846154
returns                            -59.224457
return_std                         107.839709
average_reward                      -0.145727
round_time             0 days 00:10:15.747537
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        624.52459
return_std_test                     37.613174
average_reward_test                  0.624525
round_time_test        0 days 00:00:10.908187
round_time_total       0 days 00:10:15.748629
loss_total                        1392.622977
loss_critic                       1820.369408
loss_actor                        -318.362878
memory_size                        713959.523 

=== epoch 9/10 ===== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:04,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   33
episode_length                     294.151515
returns                            -36.589516
return_std                         103.412773
average_reward                      -0.127177
round_time             0 days 00:10:15.938724
episodes_test                            10.0
episode_length_test                     910.9
returns_test                       570.945774
return_std_test                     161.41954
average_reward_test                  0.633523
round_time_test        0 days 00:00:10.943355
round_time_total       0 days 00:10:15.940061
loss_total                        1380.965499
loss_critic                       1806.055608
loss_actor                        -319.395061
memory_size                       715798.8815 

=== epoch 9/10 ===== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   41
episode_length                      236.02439
returns                            -21.421324
return_std                          91.651155
average_reward                      -0.094675
round_time             0 days 00:10:14.693963
episodes_test                            15.0
episode_length_test                625.333333
returns_test                       446.971657
return_std_test                    324.134639
average_reward_test                  0.718248
round_time_test        0 days 00:00:10.887985
round_time_total       0 days 00:10:14.695059
loss_total                        1403.783687
loss_critic                       1834.642133
loss_actor                        -319.650232
memory_size                       717463.4945 

=== epoch 9/10 ===== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   46
episode_length                     203.891304
returns                            -12.708173
return_std                          77.824098
average_reward                      -0.071714
round_time             0 days 00:10:16.202470
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       691.066773
return_std_test                     55.206194
average_reward_test                  0.691067
round_time_test        0 days 00:00:10.865840
round_time_total       0 days 00:10:16.203537
loss_total                        1421.576972
loss_critic                        1857.03679
loss_actor                        -320.262427
memory_size                       719177.8805 

=== epoch 9/10 ===== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:17,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:12<00:00,  3.27it/s]
episodes                                   52
episode_length                     161.115385
returns                               5.45678
return_std                          58.548608
average_reward                      -0.019268
round_time             0 days 00:10:12.723934
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       722.860268
return_std_test                     42.173983
average_reward_test                   0.72286
round_time_test        0 days 00:00:10.842573
round_time_total       0 days 00:10:12.725004
loss_total                        1405.671052
loss_critic                       1837.177239
loss_actor                        -320.353824
memory_size                       720768.3265 

=== epoch 9/10 ===== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:35,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   59
episode_length                     163.542373
returns                               0.81829
return_std                          69.551467
average_reward                       -0.00031
round_time             0 days 00:10:14.124575
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                        771.06429
return_std_test                     47.466121
average_reward_test                  0.771064
round_time_test        0 days 00:00:11.321819
round_time_total       0 days 00:10:14.125658
loss_total                        1391.642289
loss_critic                        1819.93786
loss_actor                         -321.54012
memory_size                        722608.709 

=== epoch 9/10 ===== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   55
episode_length                     167.818182
returns                              1.440166
return_std                          69.883184
average_reward                      -0.001513
round_time             0 days 00:10:14.130129
episodes_test                            11.0
episode_length_test                     852.0
returns_test                       652.816725
return_std_test                    218.302052
average_reward_test                  0.770225
round_time_test        0 days 00:00:10.964078
round_time_total       0 days 00:10:14.131216
loss_total                        1406.543114
loss_critic                       1838.430545
loss_actor                         -321.00673
memory_size                       724379.9705 

=== epoch 9/10 ===== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:11,  2.97it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   47
episode_length                          190.0
returns                             -8.621719
return_std                          80.118461
average_reward                      -0.054418
round_time             0 days 00:10:12.049938
episodes_test                            12.0
episode_length_test                    763.25
returns_test                       573.652394
return_std_test                    296.318428
average_reward_test                  0.744168
round_time_test        0 days 00:00:10.836655
round_time_total       0 days 00:10:12.051020
loss_total                        1391.805203
loss_critic                       1820.333462
loss_actor                        -322.307969
memory_size                       726212.7865 

=== epoch 9/10 ===== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:13,  2.96it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   47
episode_length                     210.255319
returns                             -8.965158
return_std                          90.018076
average_reward                      -0.043385
round_time             0 days 00:10:14.469752
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       709.563743
return_std_test                     42.158385
average_reward_test                  0.709564
round_time_test        0 days 00:00:10.746574
round_time_total       0 days 00:10:14.470829
loss_total                        1423.342036
loss_critic                       1860.005673
loss_actor                        -323.312645
memory_size                        728013.922 

=== epoch 9/10 ===== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:41,  2.84it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   43
episode_length                     204.883721
returns                             -8.796961
return_std                          82.166322
average_reward                      -0.058314
round_time             0 days 00:10:14.554491
episodes_test                            10.0
episode_length_test                     907.7
returns_test                       693.238555
return_std_test                    207.090506
average_reward_test                   0.75658
round_time_test        0 days 00:00:10.861751
round_time_total       0 days 00:10:14.555586
loss_total                        1449.179171
loss_critic                       1892.231595
loss_actor                        -323.030654
memory_size                       729659.4765 

=== epoch 9/10 ===== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:32,  2.88it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   39
episode_length                     217.205128
returns                            -10.828467
return_std                          81.576605
average_reward                      -0.073858
round_time             0 days 00:10:16.109771
episodes_test                            12.0
episode_length_test                    765.25
returns_test                        547.70167
return_std_test                    284.831253
average_reward_test                  0.715615
round_time_test        0 days 00:00:10.822764
round_time_total       0 days 00:10:16.111146
loss_total                        1435.958123
loss_critic                       1875.918309
loss_actor                        -323.882744
memory_size                        731512.128 

=== epoch 9/10 ===== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   36
episode_length                     251.472222
returns                            -19.837012
return_std                          93.490375
average_reward                      -0.096437
round_time             0 days 00:10:14.105816
episodes_test                            11.0
episode_length_test                896.545455
returns_test                       600.340522
return_std_test                    194.472729
average_reward_test                  0.676101
round_time_test        0 days 00:00:10.811442
round_time_total       0 days 00:10:14.106906
loss_total                        1411.508506
loss_critic                       1845.334006
loss_actor                        -323.793614
memory_size                        733440.025 

=== epoch 9/10 ===== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   33
episode_length                     252.484848
returns                            -12.701939
return_std                          91.358188
average_reward                      -0.088754
round_time             0 days 00:10:15.202506
episodes_test                            14.0
episode_length_test                691.785714
returns_test                         530.0574
return_std_test                    309.707262
average_reward_test                  0.761571
round_time_test        0 days 00:00:10.837063
round_time_total       0 days 00:10:15.203593
loss_total                        1427.227648
loss_critic                       1865.215248
loss_actor                        -324.722877
memory_size                        735303.091 

=== epoch 9/10 ===== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:46,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   30
episode_length                     326.966667
returns                            -41.929018
return_std                         108.118786
average_reward                      -0.123024
round_time             0 days 00:10:13.849563
episodes_test                            11.0
episode_length_test                833.181818
returns_test                       521.347268
return_std_test                    213.674323
average_reward_test                  0.627902
round_time_test        0 days 00:00:11.057097
round_time_total       0 days 00:10:13.850627
loss_total                        1436.165788
loss_critic                       1876.215026
loss_actor                        -324.031284
memory_size                        737162.794 

=== epoch 9/10 ===== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:07,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   28
episode_length                     328.678571
returns                            -50.018051
return_std                         111.575703
average_reward                      -0.153471
round_time             0 days 00:10:15.158314
episodes_test                            10.0
episode_length_test                     923.7
returns_test                       641.319757
return_std_test                    148.858738
average_reward_test                   0.69982
round_time_test        0 days 00:00:10.958487
round_time_total       0 days 00:10:15.159610
loss_total                        1429.650584
loss_critic                       1867.782872
loss_actor                        -322.878688
memory_size                       738912.1945 

=== epoch 9/10 ===== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   28
episode_length                     342.107143
returns                            -53.035255
return_std                         108.986723
average_reward                      -0.152838
round_time             0 days 00:10:17.615441
episodes_test                            11.0
episode_length_test                828.454545
returns_test                        582.76552
return_std_test                    253.341123
average_reward_test                  0.694234
round_time_test        0 days 00:00:11.216543
round_time_total       0 days 00:10:17.616530
loss_total                        1410.373131
loss_critic                       1843.497056
loss_actor                        -322.122694
memory_size                       740838.0265 

=== epoch 9/10 ===== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.25it/s]
episodes                                   33
episode_length                     281.090909
returns                            -31.918129
return_std                          90.341702
average_reward                      -0.121351
round_time             0 days 00:10:15.173154
episodes_test                            11.0
episode_length_test                824.454545
returns_test                       608.267557
return_std_test                    272.619529
average_reward_test                  0.739771
round_time_test        0 days 00:00:10.984018
round_time_total       0 days 00:10:15.174220
loss_total                        1390.233486
loss_critic                       1818.427186
loss_actor                        -322.541437
memory_size                        742618.291 

=== epoch 9/10 ===== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.25it/s]
episodes                                   33
episode_length                     295.606061
returns                            -40.865372
return_std                          94.068978
average_reward                      -0.140513
round_time             0 days 00:10:16.701142
episodes_test                            11.0
episode_length_test                826.090909
returns_test                       640.764444
return_std_test                    289.097999
average_reward_test                  0.770456
round_time_test        0 days 00:00:11.134473
round_time_total       0 days 00:10:16.702396
loss_total                        1377.448221
loss_critic                       1802.781229
loss_actor                        -323.883937
memory_size                        744485.764 

=== epoch 9/10 ===== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   31
episode_length                     299.741935
returns                            -38.572982
return_std                          95.628767
average_reward                      -0.134704
round_time             0 days 00:10:17.788742
episodes_test                            10.0
episode_length_test                     910.4
returns_test                       621.923161
return_std_test                     175.15215
average_reward_test                  0.679757
round_time_test        0 days 00:00:11.145948
round_time_total       0 days 00:10:17.789824
loss_total                        1404.525644
loss_critic                       1836.760724
loss_actor                        -324.414811
memory_size                       746324.2225 

=== epoch 9/10 ===== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:26,  2.91it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   35
episode_length                     265.028571
returns                            -27.448224
return_std                          87.013377
average_reward                       -0.11831
round_time             0 days 00:10:16.341087
episodes_test                            11.0
episode_length_test                853.272727
returns_test                       568.033938
return_std_test                    225.570121
average_reward_test                  0.663078
round_time_test        0 days 00:00:10.965297
round_time_total       0 days 00:10:16.342158
loss_total                          1382.7216
loss_critic                       1809.603747
loss_actor                        -324.807108
memory_size                        748122.069 

=== epoch 9/10 ===== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   34
episode_length                     285.735294
returns                              -33.5589
return_std                          96.256104
average_reward                      -0.116879
round_time             0 days 00:10:14.851131
episodes_test                            12.0
episode_length_test                762.416667
returns_test                        509.82606
return_std_test                    265.409649
average_reward_test                  0.668364
round_time_test        0 days 00:00:10.904841
round_time_total       0 days 00:10:14.852360
loss_total                        1403.979337
loss_critic                       1836.241956
loss_actor                        -325.071258
memory_size                       749976.1545 

=== epoch 9/10 ===== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   29
episode_length                     293.586207
returns                            -39.763238
return_std                          98.971175
average_reward                      -0.149436
round_time             0 days 00:10:16.285300
episodes_test                            11.0
episode_length_test                854.272727
returns_test                       529.507318
return_std_test                    205.829534
average_reward_test                  0.632412
round_time_test        0 days 00:00:11.077463
round_time_total       0 days 00:10:16.286383
loss_total                        1437.393298
loss_critic                       1878.119545
loss_actor                        -325.511817
memory_size                       751869.9755 

=== epoch 9/10 ===== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   33
episode_length                     294.727273
returns                            -42.075852
return_std                          96.210016
average_reward                      -0.130049
round_time             0 days 00:10:17.306765
episodes_test                            12.0
episode_length_test                771.416667
returns_test                       520.044015
return_std_test                    261.212183
average_reward_test                  0.672443
round_time_test        0 days 00:00:10.895963
round_time_total       0 days 00:10:17.307853
loss_total                         1391.65137
loss_critic                       1821.330155
loss_actor                        -327.063904
memory_size                        753711.913 

=== epoch 9/10 ===== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:44,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   38
episode_length                     252.447368
returns                            -27.744714
return_std                          89.545106
average_reward                      -0.111319
round_time             0 days 00:10:15.842687
episodes_test                            12.0
episode_length_test                     766.5
returns_test                       555.625967
return_std_test                    300.057157
average_reward_test                  0.718323
round_time_test        0 days 00:00:11.021678
round_time_total       0 days 00:10:15.843780
loss_total                        1413.181947
loss_critic                       1848.493952
loss_actor                        -328.066192
memory_size                       755484.4235 

=== epoch 9/10 ===== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:29,  2.89it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:18<00:00,  3.23it/s]
episodes                                   34
episode_length                     255.088235
returns                            -26.801548
return_std                           81.15039
average_reward                      -0.117678
round_time             0 days 00:10:18.974134
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       660.189594
return_std_test                     28.943352
average_reward_test                   0.66019
round_time_test        0 days 00:00:10.978955
round_time_total       0 days 00:10:18.975576
loss_total                        1405.694369
loss_critic                       1839.212035
loss_actor                        -328.376424
memory_size                       757180.4375 

=== epoch 9/10 ===== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   37
episode_length                     248.945946
returns                            -19.298578
return_std                          71.754953
average_reward                      -0.094536
round_time             0 days 00:10:16.097819
episodes_test                            12.0
episode_length_test                     762.0
returns_test                       510.736144
return_std_test                    268.645431
average_reward_test                  0.662035
round_time_test        0 days 00:00:10.951118
round_time_total       0 days 00:10:16.099036
loss_total                        1414.161959
loss_critic                       1849.595145
loss_actor                        -327.570907
memory_size                        759082.766 

=== epoch 9/10 ===== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<12:00,  2.77it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   44
episode_length                     221.454545
returns                            -11.663094
return_std                          61.812792
average_reward                      -0.058379
round_time             0 days 00:10:15.591808
episodes_test                            10.0
episode_length_test                     901.4
returns_test                        588.04137
return_std_test                    201.182747
average_reward_test                  0.656763
round_time_test        0 days 00:00:10.987221
round_time_total       0 days 00:10:15.592879
loss_total                        1413.595036
loss_critic                       1848.657655
loss_actor                        -326.655575
memory_size                       760884.2575 

=== epoch 9/10 ===== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   39
episode_length                     224.717949
returns                             -8.812235
return_std                           58.80179
average_reward                       -0.06921
round_time             0 days 00:10:16.941196
episodes_test                            12.0
episode_length_test                762.416667
returns_test                       498.990933
return_std_test                    271.018977
average_reward_test                  0.667362
round_time_test        0 days 00:00:10.842057
round_time_total       0 days 00:10:16.942498
loss_total                        1398.910589
loss_critic                       1830.342931
loss_actor                        -326.818906
memory_size                        762609.448 

=== epoch 9/10 ===== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:29,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   38
episode_length                     258.763158
returns                            -24.646545
return_std                          83.796365
average_reward                      -0.092657
round_time             0 days 00:10:14.689190
episodes_test                            10.0
episode_length_test                     920.5
returns_test                       686.706754
return_std_test                    178.181636
average_reward_test                   0.75707
round_time_test        0 days 00:00:11.021426
round_time_total       0 days 00:10:14.690448
loss_total                        1422.923969
loss_critic                       1860.281392
loss_actor                        -326.505851
memory_size                        764521.167 

=== epoch 9/10 ===== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:03,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   37
episode_length                          240.0
returns                             -23.45795
return_std                          85.303062
average_reward                       -0.11161
round_time             0 days 00:10:17.055144
episodes_test                            10.0
episode_length_test                     903.7
returns_test                       666.104159
return_std_test                    207.605182
average_reward_test                  0.745176
round_time_test        0 days 00:00:11.008978
round_time_total       0 days 00:10:17.056346
loss_total                        1394.717514
loss_critic                       1825.143275
loss_actor                         -326.98565
memory_size                        766222.067 

=== epoch 9/10 ===== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:05,  3.00it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   49
episode_length                     183.918367
returns                            -15.332606
return_std                          79.986303
average_reward                      -0.088971
round_time             0 days 00:10:17.826237
episodes_test                            14.0
episode_length_test                674.785714
returns_test                       476.082658
return_std_test                    300.868619
average_reward_test                   0.71105
round_time_test        0 days 00:00:10.878706
round_time_total       0 days 00:10:17.827529
loss_total                        1400.476296
loss_critic                       1832.384471
loss_actor                        -327.156523
memory_size                        768023.459 

=== epoch 9/10 ===== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:47,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   41
episode_length                     221.585366
returns                            -25.444715
return_std                          95.970294
average_reward                      -0.122011
round_time             0 days 00:10:14.527284
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       740.113532
return_std_test                     42.324217
average_reward_test                  0.740114
round_time_test        0 days 00:00:11.050824
round_time_total       0 days 00:10:14.528361
loss_total                         1394.73985
loss_critic                       1825.127397
loss_actor                        -326.810462
memory_size                       769679.9225 

=== epoch 9/10 ===== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:45,  3.09it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
episodes                                   46
episode_length                     200.630435
returns                            -18.760349
return_std                          91.883823
average_reward                      -0.101629
round_time             0 days 00:10:17.142753
episodes_test                            10.0
episode_length_test                     904.4
returns_test                        636.12267
return_std_test                    200.242267
average_reward_test                   0.70019
round_time_test        0 days 00:00:10.969325
round_time_total       0 days 00:10:17.143844
loss_total                        1411.396532
loss_critic                        1846.01802
loss_actor                        -327.089553
memory_size                       771515.0685 

=== epoch 9/10 ===== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:22,  2.92it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:16<00:00,  3.24it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
episodes                                   38
episode_length                     258.684211
returns                             -30.01481
return_std                          99.992849
average_reward                      -0.119724
round_time             0 days 00:10:16.899164
episodes_test                            12.0
episode_length_test                757.583333
returns_test                       578.977635
return_std_test                    325.037597
average_reward_test                  0.772019
round_time_test        0 days 00:00:11.009682
round_time_total       0 days 00:10:16.900252
loss_total                        1441.683102
loss_critic                       1883.874598
loss_actor                        -327.083016
memory_size                        773396.315 


<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
=== epoch 10/10 ==== round 1/50 ======================================
  0%|          | 6/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                    3
episode_length                          340.0
returns                            -74.749481
return_std                          98.179932
average_reward                      -0.245159
round_time             0 days 00:10:07.809338
episodes_test                            12.0
episode_length_test                765.166667
returns_test                       578.946919
return_std_test                    291.964274
average_reward_test                  0.750106
round_time_test        0 days 00:00:10.924777
round_time_total       0 days 00:10:07.810441
loss_total                        1417.939195
loss_critic                       1854.290257
loss_actor                        -327.465183
memory_size                         775313.18 

=== epoch 10/10 ==== round 2/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                    7
episode_length                     452.428571
returns                            -91.109404
return_std                         102.899844
average_reward                      -0.202685
round_time             0 days 00:10:08.217641
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       734.644731
return_std_test                     52.642136
average_reward_test                  0.734645
round_time_test        0 days 00:00:10.769852
round_time_total       0 days 00:10:08.219070
loss_total                        1378.661467
loss_critic                       1805.568353
loss_actor                        -328.966203
memory_size                        777214.934 

=== epoch 10/10 ==== round 3/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   17
episode_length                     350.176471
returns                            -56.044738
return_std                         101.728854
average_reward                      -0.158816
round_time             0 days 00:10:09.920983
episodes_test                            11.0
episode_length_test                858.090909
returns_test                       651.380109
return_std_test                    237.745771
average_reward_test                  0.762116
round_time_test        0 days 00:00:10.978297
round_time_total       0 days 00:10:09.922317
loss_total                        1433.058594
loss_critic                       1873.685382
loss_actor                        -329.448688
memory_size                        779041.747 

=== epoch 10/10 ==== round 4/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:49,  3.07it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   24
episode_length                     314.708333
returns                            -41.196614
return_std                          97.505728
average_reward                      -0.139916
round_time             0 days 00:10:09.677667
episodes_test                            11.0
episode_length_test                851.545455
returns_test                       595.948775
return_std_test                    228.868011
average_reward_test                  0.699365
round_time_test        0 days 00:00:10.971133
round_time_total       0 days 00:10:09.678945
loss_total                        1454.849362
loss_critic                       1900.867068
loss_actor                        -329.221595
memory_size                        780860.661 

=== epoch 10/10 ==== round 5/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:31,  3.16it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   31
episode_length                     319.225806
returns                            -47.781979
return_std                         105.033744
average_reward                      -0.146533
round_time             0 days 00:10:12.260506
episodes_test                            15.0
episode_length_test                     654.8
returns_test                       465.911818
return_std_test                    308.316268
average_reward_test                  0.714684
round_time_test        0 days 00:00:11.254357
round_time_total       0 days 00:10:12.261582
loss_total                        1423.321153
loss_critic                       1861.445394
loss_actor                        -329.175946
memory_size                        782745.382 

=== epoch 10/10 ==== round 6/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   31
episode_length                     300.064516
returns                            -35.129947
return_std                          99.363816
average_reward                      -0.122934
round_time             0 days 00:10:11.914160
episodes_test                            10.0
episode_length_test                     903.8
returns_test                       655.860432
return_std_test                    210.574597
average_reward_test                  0.722109
round_time_test        0 days 00:00:11.330376
round_time_total       0 days 00:10:11.915332
loss_total                        1407.110938
loss_critic                       1841.444168
loss_actor                        -330.222117
memory_size                       784601.1605 

=== epoch 10/10 ==== round 7/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:18,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   36
episode_length                     267.666667
returns                            -30.099724
return_std                         100.554981
average_reward                       -0.11252
round_time             0 days 00:10:12.251916
episodes_test                            12.0
episode_length_test                     769.0
returns_test                       500.391454
return_std_test                    254.791685
average_reward_test                  0.644578
round_time_test        0 days 00:00:10.911586
round_time_total       0 days 00:10:12.252980
loss_total                        1443.378816
loss_critic                       1887.058927
loss_actor                        -331.341763
memory_size                       786421.4825 

=== epoch 10/10 ==== round 8/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   39
episode_length                     255.871795
returns                            -17.804069
return_std                          93.681589
average_reward                      -0.070944
round_time             0 days 00:10:14.745493
episodes_test                            11.0
episode_length_test                876.636364
returns_test                       663.022646
return_std_test                     220.34873
average_reward_test                  0.759754
round_time_test        0 days 00:00:10.820936
round_time_total       0 days 00:10:14.746587
loss_total                        1414.815973
loss_critic                        1851.66549
loss_actor                        -332.582218
memory_size                       788198.7785 

=== epoch 10/10 ==== round 9/50 ======================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:53,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   38
episode_length                     235.763158
returns                            -15.088315
return_std                          85.563317
average_reward                      -0.081374
round_time             0 days 00:10:14.230204
episodes_test                            15.0
episode_length_test                650.466667
returns_test                       450.324321
return_std_test                    303.734599
average_reward_test                   0.69775
round_time_test        0 days 00:00:10.906903
round_time_total       0 days 00:10:14.231293
loss_total                        1397.729002
loss_critic                       1830.499547
loss_actor                        -333.353309
memory_size                       789922.7785 

=== epoch 10/10 ==== round 10/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:13<00:00,  3.26it/s]
episodes                                   35
episode_length                     259.628571
returns                            -17.034716
return_std                          89.270305
average_reward                      -0.081809
round_time             0 days 00:10:14.526694
episodes_test                            15.0
episode_length_test                655.066667
returns_test                       490.818916
return_std_test                    326.691467
average_reward_test                  0.750919
round_time_test        0 days 00:00:10.729808
round_time_total       0 days 00:10:14.527764
loss_total                        1432.233558
loss_critic                       1873.656692
loss_actor                        -333.459106
memory_size                       791815.6565 

=== epoch 10/10 ==== round 11/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:18,  2.94it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:14<00:00,  3.26it/s]
episodes                                   46
episode_length                     203.521739
returns                              -2.93492
return_std                          73.899013
average_reward                      -0.031524
round_time             0 days 00:10:14.626559
episodes_test                            13.0
episode_length_test                750.461538
returns_test                       589.940664
return_std_test                    291.633332
average_reward_test                  0.786996
round_time_test        0 days 00:00:10.900175
round_time_total       0 days 00:10:14.627665
loss_total                        1415.273044
loss_critic                       1852.652344
loss_actor                        -334.244283
memory_size                       793544.9705 

=== epoch 10/10 ==== round 12/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:17<00:00,  3.24it/s]
episodes                                   42
episode_length                     232.738095
returns                            -10.184864
return_std                          89.269365
average_reward                      -0.042203
round_time             0 days 00:10:18.205160
episodes_test                            11.0
episode_length_test                838.909091
returns_test                       626.577198
return_std_test                    258.427867
average_reward_test                  0.744207
round_time_test        0 days 00:00:10.974837
round_time_total       0 days 00:10:18.206340
loss_total                        1403.830459
loss_critic                       1838.592705
loss_actor                        -335.218641
memory_size                        795326.762 

=== epoch 10/10 ==== round 13/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:26,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:15<00:00,  3.25it/s]
episodes                                   44
episode_length                         222.75
returns                             -8.730936
return_std                          89.150703
average_reward                      -0.035755
round_time             0 days 00:10:15.753499
episodes_test                            13.0
episode_length_test                745.076923
returns_test                       588.711472
return_std_test                    266.223361
average_reward_test                  0.793478
round_time_test        0 days 00:00:10.856724
round_time_total       0 days 00:10:15.754601
loss_total                        1443.644765
loss_critic                       1888.576771
loss_actor                        -336.083389
memory_size                       797067.1925 

=== epoch 10/10 ==== round 14/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:07,  3.28it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:11<00:00,  3.27it/s]
episodes                                   46
episode_length                     204.586957
returns                              -2.64247
return_std                          87.893529
average_reward                      -0.026347
round_time             0 days 00:10:12.070225
episodes_test                            10.0
episode_length_test                     911.3
returns_test                       720.180051
return_std_test                    203.197696
average_reward_test                  0.790473
round_time_test        0 days 00:00:10.752420
round_time_total       0 days 00:10:12.071361
loss_total                        1398.198878
loss_critic                       1831.838033
loss_actor                        -336.357865
memory_size                        798810.431 

=== epoch 10/10 ==== round 15/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:09<00:00,  3.28it/s]
episodes                                   50
episode_length                         187.46
returns                             -2.458393
return_std                          81.117156
average_reward                      -0.012004
round_time             0 days 00:10:09.643756
episodes_test                            14.0
episode_length_test                695.714286
returns_test                       566.893752
return_std_test                    323.368983
average_reward_test                  0.818813
round_time_test        0 days 00:00:10.928088
round_time_total       0 days 00:10:09.644843
loss_total                        1416.348929
loss_critic                       1854.794831
loss_actor                        -337.434815
memory_size                       800558.6645 

=== epoch 10/10 ==== round 16/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:42,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:05<00:00,  3.30it/s]
episodes                                   38
episode_length                     232.947368
returns                            -15.693468
return_std                          93.029293
average_reward                      -0.088285
round_time             0 days 00:10:05.829325
episodes_test                            12.0
episode_length_test                822.416667
returns_test                       626.825435
return_std_test                     256.44014
average_reward_test                  0.763188
round_time_test        0 days 00:00:10.863194
round_time_total       0 days 00:10:05.830810
loss_total                        1440.496426
loss_critic                       1885.021005
loss_actor                        -337.602021
memory_size                       802437.7655 

=== epoch 10/10 ==== round 17/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:44,  2.83it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:07<00:00,  3.29it/s]
episodes                                   40
episode_length                          248.3
returns                            -19.753244
return_std                          90.060023
average_reward                      -0.078017
round_time             0 days 00:10:07.892245
episodes_test                            16.0
episode_length_test                  603.1875
returns_test                       454.668372
return_std_test                    335.586752
average_reward_test                  0.757777
round_time_test        0 days 00:00:11.061820
round_time_total       0 days 00:10:07.893304
loss_total                        1449.473937
loss_critic                       1896.453186
loss_actor                        -338.443189
memory_size                       804318.1025 

=== epoch 10/10 ==== round 18/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:48,  3.08it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:05<00:00,  3.30it/s]
episodes                                   49
episode_length                     201.163265
returns                            -19.447034
return_std                          80.532126
average_reward                       -0.09604
round_time             0 days 00:10:06.359822
episodes_test                            10.0
episode_length_test                     902.1
returns_test                       643.925073
return_std_test                    230.340948
average_reward_test                  0.714884
round_time_test        0 days 00:00:11.234229
round_time_total       0 days 00:10:06.361128
loss_total                        1456.484783
loss_critic                        1905.46546
loss_actor                        -339.438072
memory_size                       805940.2435 

=== epoch 10/10 ==== round 19/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:05<00:00,  3.30it/s]
episodes                                   49
episode_length                     190.938776
returns                            -15.756966
return_std                          74.046621
average_reward                       -0.09069
round_time             0 days 00:10:05.803080
episodes_test                            10.0
episode_length_test                     996.2
returns_test                       761.430299
return_std_test                     50.297257
average_reward_test                  0.764932
round_time_test        0 days 00:00:11.094009
round_time_total       0 days 00:10:05.804407
loss_total                         1472.24226
loss_critic                       1925.173297
loss_actor                        -339.482017
memory_size                       807551.9455 

=== epoch 10/10 ==== round 20/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:54,  3.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:01<00:00,  3.33it/s]
episodes                                   46
episode_length                     200.304348
returns                            -18.957837
return_std                          84.013203
average_reward                      -0.105843
round_time             0 days 00:10:01.940920
episodes_test                            13.0
episode_length_test                756.307692
returns_test                       569.919649
return_std_test                    281.298232
average_reward_test                  0.757579
round_time_test        0 days 00:00:10.752843
round_time_total       0 days 00:10:01.941989
loss_total                        1444.739555
loss_critic                       1890.840593
loss_actor                        -339.664734
memory_size                       809429.7335 

=== epoch 10/10 ==== round 21/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:21,  3.21it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:59<00:00,  3.34it/s]
episodes                                   56
episode_length                     160.589286
returns                             -9.811294
return_std                          68.104134
average_reward                      -0.075958
round_time             0 days 00:09:59.632292
episodes_test                            10.0
episode_length_test                    1000.0
returns_test                       677.272145
return_std_test                     68.895647
average_reward_test                  0.677272
round_time_test        0 days 00:00:10.695811
round_time_total       0 days 00:09:59.633366
loss_total                        1449.671203
loss_critic                       1897.046419
loss_actor                        -339.829794
memory_size                       811152.6925 

=== epoch 10/10 ==== round 22/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:28,  3.17it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:57<00:00,  3.35it/s]
episodes                                   53
episode_length                     181.566038
returns                            -17.368224
return_std                          77.950718
average_reward                      -0.093486
round_time             0 days 00:09:57.752626
episodes_test                            15.0
episode_length_test                655.133333
returns_test                       502.573472
return_std_test                    308.834766
average_reward_test                  0.772583
round_time_test        0 days 00:00:10.730843
round_time_total       0 days 00:09:57.753710
loss_total                        1444.370206
loss_critic                       1890.673713
loss_actor                        -340.843958
memory_size                       813008.3145 

=== epoch 10/10 ==== round 23/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:41,  3.11it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   33
episode_length                     275.181818
returns                            -36.819209
return_std                          96.532879
average_reward                      -0.122935
round_time             0 days 00:09:55.722777
episodes_test                            15.0
episode_length_test                653.333333
returns_test                       505.201304
return_std_test                    325.881854
average_reward_test                  0.772994
round_time_test        0 days 00:00:10.847597
round_time_total       0 days 00:09:55.724044
loss_total                        1478.260093
loss_critic                       1933.260236
loss_actor                        -341.740607
memory_size                       814903.5515 

=== epoch 10/10 ==== round 24/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:21,  2.93it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:59<00:00,  3.34it/s]
episodes                                   41
episode_length                      228.97561
returns                            -25.338559
return_std                           86.59159
average_reward                      -0.120101
round_time             0 days 00:10:00.058626
episodes_test                            16.0
episode_length_test                  585.0625
returns_test                       445.215508
return_std_test                    356.510791
average_reward_test                  0.762668
round_time_test        0 days 00:00:10.727218
round_time_total       0 days 00:10:00.059701
loss_total                        1454.389518
loss_critic                       1903.656919
loss_actor                        -342.680227
memory_size                       816667.1915 

=== epoch 10/10 ==== round 25/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:34,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:57<00:00,  3.35it/s]
episodes                                   39
episode_length                      254.25641
returns                            -32.513281
return_std                          93.774241
average_reward                      -0.123543
round_time             0 days 00:09:57.533124
episodes_test                            13.0
episode_length_test                754.923077
returns_test                       569.657959
return_std_test                    243.228757
average_reward_test                  0.755176
round_time_test        0 days 00:00:10.891250
round_time_total       0 days 00:09:57.534351
loss_total                        1455.727481
loss_critic                       1905.089319
loss_actor                        -341.719994
memory_size                       818459.3575 

=== epoch 10/10 ==== round 26/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:27,  3.18it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:55<00:00,  3.36it/s]
episodes                                   30
episode_length                     291.666667
returns                            -29.171226
return_std                          97.580908
average_reward                      -0.119949
round_time             0 days 00:09:56.237234
episodes_test                            12.0
episode_length_test                    816.25
returns_test                       573.641212
return_std_test                    249.525886
average_reward_test                   0.70675
round_time_test        0 days 00:00:10.973587
round_time_total       0 days 00:09:56.238495
loss_total                        1458.353926
loss_critic                       1908.286344
loss_actor                        -341.375882
memory_size                       820355.5995 

=== epoch 10/10 ==== round 27/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:58,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:02<00:00,  3.32it/s]
episodes                                   31
episode_length                     287.870968
returns                            -30.173252
return_std                         103.656641
average_reward                      -0.122566
round_time             0 days 00:10:02.847844
episodes_test                            13.0
episode_length_test                707.615385
returns_test                       544.886904
return_std_test                    334.009987
average_reward_test                   0.77144
round_time_test        0 days 00:00:10.899351
round_time_total       0 days 00:10:02.849027
loss_total                        1446.176663
loss_critic                       1893.329167
loss_actor                        -342.433476
memory_size                       822237.1715 

=== epoch 10/10 ==== round 28/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:23,  3.20it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:59<00:00,  3.34it/s]
episodes                                   38
episode_length                     251.315789
returns                            -26.618891
return_std                         104.030644
average_reward                      -0.096808
round_time             0 days 00:09:59.642522
episodes_test                            12.0
episode_length_test                787.333333
returns_test                       604.780135
return_std_test                    262.622878
average_reward_test                  0.761612
round_time_test        0 days 00:00:10.725107
round_time_total       0 days 00:09:59.643597
loss_total                        1451.622055
loss_critic                       1900.216418
loss_actor                        -342.755537
memory_size                        824070.545 

=== epoch 10/10 ==== round 29/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<11:02,  3.01it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:58<00:00,  3.34it/s]
episodes                                   29
episode_length                     315.724138
returns                            -37.802664
return_std                         110.078962
average_reward                      -0.128854
round_time             0 days 00:09:58.750189
episodes_test                            16.0
episode_length_test                  600.1875
returns_test                       472.664352
return_std_test                    352.657174
average_reward_test                  0.786123
round_time_test        0 days 00:00:10.949190
round_time_total       0 days 00:09:58.751453
loss_total                        1419.983604
loss_critic                       1860.727275
loss_actor                        -342.991194
memory_size                       825853.1775 

=== epoch 10/10 ==== round 30/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:19,  3.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:00<00:00,  3.33it/s]
episodes                                   34
episode_length                     257.352941
returns                            -28.615872
return_std                         100.236458
average_reward                      -0.116193
round_time             0 days 00:10:01.062756
episodes_test                            13.0
episode_length_test                708.538462
returns_test                       533.724684
return_std_test                    335.481006
average_reward_test                  0.747889
round_time_test        0 days 00:00:10.854935
round_time_total       0 days 00:10:01.064238
loss_total                        1455.123078
loss_critic                       1904.943018
loss_actor                        -344.156807
memory_size                       827638.9825 

=== epoch 10/10 ==== round 31/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<12:19,  2.70it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:27<00:00,  2.91it/s]
episodes                                   36
episode_length                     263.527778
returns                            -34.856215
return_std                          94.767713
average_reward                      -0.130113
round_time             0 days 00:11:28.466748
episodes_test                            10.0
episode_length_test                     901.7
returns_test                       720.668071
return_std_test                    239.531347
average_reward_test                  0.798489
round_time_test        0 days 00:00:10.847742
round_time_total       0 days 00:11:28.467801
loss_total                        1440.247264
loss_critic                       1886.728647
loss_actor                        -345.678399
memory_size                       829485.0995 

=== epoch 10/10 ==== round 32/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:18,  3.23it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:02<00:00,  3.32it/s]
episodes                                   34
episode_length                     260.205882
returns                            -28.405052
return_std                          84.607667
average_reward                      -0.124651
round_time             0 days 00:10:02.892642
episodes_test                            14.0
episode_length_test                674.785714
returns_test                       539.401436
return_std_test                    327.863168
average_reward_test                  0.804961
round_time_test        0 days 00:00:10.850188
round_time_total       0 days 00:10:02.893734
loss_total                        1439.985087
loss_critic                       1886.770676
loss_actor                        -347.157399
memory_size                        831385.858 

=== epoch 10/10 ==== round 33/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:57,  3.03it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:21<00:00,  3.22it/s]
episodes                                   54
episode_length                     172.259259
returns                            -13.511642
return_std                          75.782549
average_reward                      -0.092653
round_time             0 days 00:10:21.611367
episodes_test                            16.0
episode_length_test                  598.1875
returns_test                       453.194436
return_std_test                    343.165776
average_reward_test                  0.758139
round_time_test        0 days 00:00:10.749668
round_time_total       0 days 00:10:21.612606
loss_total                        1498.289987
loss_critic                       1959.936802
loss_actor                        -348.297408
memory_size                       833046.3095 

=== epoch 10/10 ==== round 34/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:51,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:05<00:00,  3.01it/s]
episodes                                   51
episode_length                     180.431373
returns                              -14.5919
return_std                          75.621203
average_reward                      -0.088125
round_time             0 days 00:11:05.821422
episodes_test                            13.0
episode_length_test                711.538462
returns_test                       518.044143
return_std_test                     312.29978
average_reward_test                  0.734688
round_time_test        0 days 00:00:12.014257
round_time_total       0 days 00:11:05.822912
loss_total                        1511.292775
loss_critic                       1975.954882
loss_actor                        -347.355768
memory_size                       834544.5855 

=== epoch 10/10 ==== round 35/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<16:15,  2.05it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [12:22<00:00,  2.69it/s]
episodes                                   53
episode_length                     171.754717
returns                            -11.187403
return_std                          71.143685
average_reward                      -0.076652
round_time             0 days 00:12:23.443655
episodes_test                            10.0
episode_length_test                     904.3
returns_test                       723.311551
return_std_test                    234.857441
average_reward_test                  0.795749
round_time_test        0 days 00:00:13.206544
round_time_total       0 days 00:12:23.444760
loss_total                        1442.970337
loss_critic                       1890.442128
loss_actor                        -346.916954
memory_size                       836411.5195 

=== epoch 10/10 ==== round 36/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:25,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [11:19<00:00,  2.94it/s]
episodes                                   57
episode_length                     146.929825
returns                             -6.675314
return_std                          68.111477
average_reward                      -0.075246
round_time             0 days 00:11:20.148618
episodes_test                            11.0
episode_length_test                826.454545
returns_test                        621.17125
return_std_test                    282.657442
average_reward_test                  0.749381
round_time_test        0 days 00:00:11.124404
round_time_total       0 days 00:11:20.149698
loss_total                        1496.113629
loss_critic                       1956.917756
loss_actor                        -347.103004
memory_size                        838137.086 

=== epoch 10/10 ==== round 37/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:04,  3.30it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:00<00:00,  3.33it/s]
episodes                                   67
episode_length                     131.402985
returns                             -3.743104
return_std                          62.590747
average_reward                      -0.051758
round_time             0 days 00:10:01.388469
episodes_test                            13.0
episode_length_test                724.384615
returns_test                       593.186676
return_std_test                    342.548677
average_reward_test                   0.81997
round_time_test        0 days 00:00:10.847556
round_time_total       0 days 00:10:01.389537
loss_total                        1464.903726
loss_critic                       1917.853033
loss_actor                         -346.89364
memory_size                        839813.153 

=== epoch 10/10 ==== round 38/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<11:06,  2.99it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:58<00:00,  3.34it/s]
episodes                                   50
episode_length                         196.44
returns                            -19.710456
return_std                          76.424046
average_reward                      -0.098959
round_time             0 days 00:09:58.755161
episodes_test                            11.0
episode_length_test                822.909091
returns_test                       609.659588
return_std_test                    284.980086
average_reward_test                  0.729808
round_time_test        0 days 00:00:10.963947
round_time_total       0 days 00:09:58.756408
loss_total                         1471.85899
loss_critic                       1926.320924
loss_actor                        -345.988868
memory_size                        841548.646 

=== epoch 10/10 ==== round 39/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:33,  3.15it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:02<00:00,  3.32it/s]
episodes                                   55
episode_length                     176.763636
returns                            -12.282636
return_std                          68.299322
average_reward                      -0.070899
round_time             0 days 00:10:02.604930
episodes_test                            11.0
episode_length_test                832.090909
returns_test                       691.593223
return_std_test                     302.31919
average_reward_test                  0.830685
round_time_test        0 days 00:00:10.865985
round_time_total       0 days 00:10:02.606018
loss_total                        1447.226006
loss_critic                       1895.580457
loss_actor                         -346.19193
memory_size                        843267.167 

=== epoch 10/10 ==== round 40/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:36,  3.14it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:59<00:00,  3.34it/s]
episodes                                   54
episode_length                      168.62963
returns                             -6.056107
return_std                          62.279659
average_reward                      -0.051408
round_time             0 days 00:10:00.123449
episodes_test                            13.0
episode_length_test                712.384615
returns_test                       563.098237
return_std_test                    329.693805
average_reward_test                  0.795713
round_time_test        0 days 00:00:10.867139
round_time_total       0 days 00:10:00.124530
loss_total                        1472.897735
loss_critic                       1927.816153
loss_actor                        -346.776065
memory_size                       845002.8665 

=== epoch 10/10 ==== round 41/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:52,  3.06it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [09:59<00:00,  3.34it/s]
episodes                                   52
episode_length                     172.173077
returns                             -0.854241
return_std                          58.708297
average_reward                      -0.021063
round_time             0 days 00:09:59.745526
episodes_test                            15.0
episode_length_test                634.333333
returns_test                       517.972751
return_std_test                    357.400921
average_reward_test                  0.809725
round_time_test        0 days 00:00:10.772852
round_time_total       0 days 00:09:59.746638
loss_total                        1447.885477
loss_critic                       1896.799723
loss_actor                        -347.771639
memory_size                       846772.4125 

=== epoch 10/10 ==== round 42/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<10:56,  3.04it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:00<00:00,  3.33it/s]
episodes                                   47
episode_length                     206.191489
returns                             -4.120803
return_std                           63.95614
average_reward                      -0.023855
round_time             0 days 00:10:00.736135
episodes_test                            19.0
episode_length_test                510.736842
returns_test                       412.929133
return_std_test                    371.731934
average_reward_test                  0.811965
round_time_test        0 days 00:00:10.806177
round_time_total       0 days 00:10:00.737566
loss_total                        1491.708979
loss_critic                       1951.475066
loss_actor                        -347.355515
memory_size                       848618.3425 

=== epoch 10/10 ==== round 43/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:39,  3.12it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [10:00<00:00,  3.33it/s]
episodes                                   39
episode_length                     235.974359
returns                            -10.517477
return_std                          74.179555
average_reward                      -0.057934
round_time             0 days 00:10:01.024837
episodes_test                            17.0
episode_length_test                582.176471
returns_test                       468.281524
return_std_test                    369.855008
average_reward_test                  0.809679
round_time_test        0 days 00:00:10.869965
round_time_total       0 days 00:10:01.025933
loss_total                        1449.213048
loss_critic                       1898.431807
loss_actor                         -347.66212
memory_size                        850403.466 

=== epoch 10/10 ==== round 44/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 5/2000 [00:01<10:24,  3.19it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [13:11<00:00,  2.53it/s]
episodes                                   33
episode_length                     280.060606
returns                            -24.115634
return_std                          89.607998
average_reward                      -0.078109
round_time             0 days 00:13:12.235863
episodes_test                            10.0
episode_length_test                     907.0
returns_test                       725.743647
return_std_test                    219.350933
average_reward_test                  0.793196
round_time_test        0 days 00:00:10.894910
round_time_total       0 days 00:13:12.237426
loss_total                        1457.269172
loss_critic                       1908.875357
loss_actor                        -349.155691
memory_size                       852318.3855 

=== epoch 10/10 ==== round 45/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:22,  2.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:24<00:00,  2.31it/s]
episodes                                   33
episode_length                          284.0
returns                            -29.376575
return_std                          92.019572
average_reward                      -0.104688
round_time             0 days 00:14:24.795878
episodes_test                            18.0
episode_length_test                554.444444
returns_test                        446.70301
return_std_test                    353.048366
average_reward_test                  0.804728
round_time_test        0 days 00:00:12.054853
round_time_total       0 days 00:14:24.797195
loss_total                        1467.287101
loss_critic                       1921.811064
loss_actor                        -350.808885
memory_size                        854154.827 

=== epoch 10/10 ==== round 46/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:39,  2.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:24<00:00,  2.31it/s]
episodes                                   29
episode_length                     325.172414
returns                            -46.267589
return_std                         103.937556
average_reward                      -0.140368
round_time             0 days 00:14:24.702285
episodes_test                            10.0
episode_length_test                     990.8
returns_test                       806.615924
return_std_test                     57.154826
average_reward_test                  0.819834
round_time_test        0 days 00:00:11.828623
round_time_total       0 days 00:14:24.703567
loss_total                        1462.716169
loss_critic                        1916.46984
loss_actor                        -352.298655
memory_size                        856013.374 

=== epoch 10/10 ==== round 47/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:50,  2.24it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:31<00:00,  2.30it/s]
episodes                                   30
episode_length                     300.266667
returns                            -40.333935
return_std                         102.397565
average_reward                       -0.14141
round_time             0 days 00:14:32.012581
episodes_test                            12.0
episode_length_test                768.416667
returns_test                        594.34093
return_std_test                    304.510302
average_reward_test                  0.778714
round_time_test        0 days 00:00:11.978515
round_time_total       0 days 00:14:32.013862
loss_total                        1447.522365
loss_critic                       1897.317015
loss_actor                        -351.656371
memory_size                        857831.274 

=== epoch 10/10 ==== round 48/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<14:58,  2.22it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:24<00:00,  2.31it/s]
episodes                                   30
episode_length                          301.0
returns                            -35.725954
return_std                          99.361874
average_reward                      -0.125104
round_time             0 days 00:14:25.257796
episodes_test                            15.0
episode_length_test                640.066667
returns_test                       467.768386
return_std_test                    281.663648
average_reward_test                  0.736742
round_time_test        0 days 00:00:12.058136
round_time_total       0 days 00:14:25.259044
loss_total                        1481.640595
loss_critic                       1940.253927
loss_actor                        -352.812868
memory_size                       859660.2495 

=== epoch 10/10 ==== round 49/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 4/2000 [00:01<14:38,  2.27it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:32<00:00,  2.29it/s]
episodes                                   43
episode_length                     221.674419
returns                            -19.331972
return_std                          83.488249
average_reward                      -0.098132
round_time             0 days 00:14:33.152854
episodes_test                            13.0
episode_length_test                754.384615
returns_test                       618.482605
return_std_test                    321.793388
average_reward_test                  0.824649
round_time_test        0 days 00:00:12.099387
round_time_total       0 days 00:14:33.154322
loss_total                        1456.269472
loss_critic                       1908.776782
loss_actor                        -353.759901
memory_size                        861449.684 

=== epoch 10/10 ==== round 50/50 =====================================
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
  0%|          | 3/2000 [00:01<14:21,  2.32it/s]/<ANONYMIZED PATH>/python3.12/site-packages/gymnasium/envs/registration.py:519: DeprecationWarning: [33mWARN: The environment Ant-v4 is out of date. You should consider upgrading to version `v5`.[0m
  logger.deprecation(
100%|██████████| 2000/2000 [14:23<00:00,  2.32it/s]
<GE123_Delay<NormalizeActionWrapper<Float64ToFloat32<CompatWrapper<NoisyActionWrapper<TimeLimit<OrderEnforcing<PassiveEnvChecker<AntEnv<Ant-v4>>>>>>>>>>
episodes                                   38
episode_length                     234.394737
returns                            -17.305703
return_std                          82.705877
average_reward                      -0.100584
round_time             0 days 00:14:23.860547
episodes_test                            15.0
episode_length_test                621.666667
returns_test                        516.57727
return_std_test                    382.707849
average_reward_test                  0.829054
round_time_test        0 days 00:00:11.950107
round_time_total       0 days 00:14:23.861966
loss_total                         1495.43139
loss_critic                       1957.754031
loss_actor                        -353.859312
memory_size                       863200.9885 


